This website gives information about part I of the course Advanced Topics in Stochastic Operations Research: Multi-armed bandit theory and applications. LNMB info: https://www.lnmb.nl/pages/courses/phdcourses/ATS2324.html

When and where:

November 20 on location

November 27 online

December 4 online

December 11 online

December 18 online. 

Material: notes, and the book [BA] Bandit Algorithms, Tor Lattimore and Csaba Szepesvári, Cambridge University Press

Assignment: To pass this part of the course, you need to write a research proposal about a topic related to multi-armed bandit type algorithms, and give feedback to other proposals. Deadlines are January 31, 2024 for the proposal, and February 21, 2024 for the feedback. See below for details. 


Contents:

Lecture 1: introduction to MAB, explore-then-commit policies


Lecture 2: upper-confidence bound policies, EXP3, adversarial bandits


Lecture 3: Regret lower bounds


Lecture 4: Bayesian bandits, Gittins index, Thompson sampling


Lecture 5: dynamic pricing and learning



Assignment details:

To pass this part of the course, the assignment is to write a research proposal about a topic related to multi-armed bandit type algorithms, and give feedback to other proposals.

The proposal should describe:


Remarks:


You will then be asked to give constructive feedback for ≤3 other proposals. For each proposal assigned to you, write ≤1 page in which you evaluate the proposal w.r.t. the following criteria: novelty/originality, scientific/societal relevance, feasibility, quality of writing. Discuss strengths and weaknessnes, and give concrete suggestions for improvement.

Your grade for this part of the course will be based both on your proposal and the quality of your feedback.