WILT: A Multi-turn, Memorization-Robust Inductive Logic Benchmark for LLMs