Publications
2022
Chloé Rouyer, Dirk van der Hoeven, Nicolò Cesa-Bianchi, Yevgeny Seldin. A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs. NeurIPS, 2022. [arXiv]
Saeed Masoudian, Julian Zimmert, Yevgeny Seldin. A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback. NeurIPS 2022. [arXiv]
Yi-Shan Wu, Yevgeny Seldin. Split-kl and PAC-Bayes-split-kl Inequalities. NeurIPS, 2022. [arXiv]
Nikolaos Nomikos, Mohammad Sadegh Talebi, Themistoklis Charalambous, Risto Wichman. Bandit-based power control in full-duplex cooperative relay networks with strict-sense stationary and non-stationary wireless communication channels. IEEE Open Journal of the Communications Society 3: 366-378, 2022. [doi]
2021
Mohammad Sadegh Talebi, Anders Jonsson, Odalric Maillard. Improved Exploration in Factored Average-Reward MDPs. AISTATS, 2021. [paper]
Yi-Shan Wu, Yevgeny Seldin, Andres R Masegosa, Christian Igel, Stephan Sloth Lorenzen. Chebyshev-Cantelli PAC-Bayes-Bennett Inequality for the Weighted Majority Vote. NeurIPS, 2021. [paper]
Yi-Shan Wu, Yi-Te Hong, Chi-Jen Lu. Lifelong Learning with Branching Experts. ACML, 2021. [paper]
Chloé Rouyer, Yevgeny Seldin, and Nicolò Cesa-Bianchi. An algorithm for stochastic and adversarial bandits with switching costs. ICML, 2021. [arXiv]
Yijie Zhang and Herke van Hoof. Deep coherent exploration for continuous control. ICML, 2021. [paper]
Saeed Masoudian and Yevgeny Seldin. Improved analysis of robustness of the Tsallis-INF algorithm to adversarial corruptions in stochastic multiarmed bandits. COLT, 2021. [arXiv]