Optimal pure exploration algorithms for bandit learning in matching markets slides