Papers
Markov Chain Variance Estimation: A Stochastic Approximation Approach
Preliminary version: Policy Evaluation for Variance in Average Reward Reinforcement Learning -- ICML 2024.
Extended version: arXiv.
Shubhada Agrawal, Prashanth L.A., Siva Theja Maguluri
Optimal Top-Two Method for Best Arm Identification and Fluid Analysis
Advances in Neural Information Processing Systems (NeurIPS), 2024
Agniv Bandyopadhyay, Sandeep Juneja, Shubhada Agrawal
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
Algorithmic Learning Theory (ALT), 2024
Shubhada Agrawal, Timothée Mathieu, Debabrota Basu, Odalric-Ambrym Maillard
ACM SIGMETRICS Performance Evaluation Review, 2023
AAMAS Workshop on Autonomous Agents for Social Good, 2023
Extended version available.
Daksh Mittal, Sandeep Juneja, Shubhada Agrawal
Optimal Best-Arm Identification Methods for Tail-Risk Measures
Advances in Neural Information Processing Systems (NeurIPS), 2021
Shubhada Agrawal, Wouter M. Koolen, Sandeep Juneja
34th Annual Conference on Learning Theory (COLT), 2021
Shubhada Agrawal, Sandeep Juneja, Wouter M. Koolen
Indian Control Conference (ICC), 2021
Shubhada Agrawal, Sandeep Juneja, Wouter M. Koolen
Algorithmic Learning Theory (ALT), 2020
Shubhada Agrawal, Sandeep Juneja, Peter Glynn
Erratum incorporated in the arXiv version
Journal of the Indian Institute of Science 100: 809-847, 2020
Shubhada Agrawal, Siddharth Bhandari, Anirban Bhattacharjee, Anand Deo, Narendra M. Dixit, Prahladh Harsha, Sandeep Juneja, Poonam Kesarwani, Aditya Krishna Swamy, Preetam Patil, Nihesh Rathod, Ramprasad Saptharishi, Sharad Shriram, Piyush Srivastava, Rajesh Sundaresan, Nidhin Koshy Vaidhiyan and Sarath Yasodharan
Enhanced Indexing for Risk-Averse Investors using Relaxed Second-Order Stochastic Dominance
Journal of Optimization and Engineering 18: 407-442, 2017
Amita Sharma, Shubhada Agrawal, Aparna Mehra
Preprints, Under Submission, and Working Drafts
Concentration of General Stochastic Approximation Under Markovian Noise
In preparation
Shubhada Agrawal, Siva Theja Maguluri, Martin Zubeldia
Eventually LIL Regret: Almost Sure loglog(T) Regret for a Sub-Gaussian Mixture on Unbounded Data
Under submission
Shubhada Agrawal, Aaditya Ramdas
Best Arm Identification for Bandits with Shifting Means
Under submission
Lukas Zierahn, Wouter M. Koolen, Shubhada Agrawal, Dirk van der Hoeven, Christina Katsimerou
On Stopping Times of Power-one Sequential Tests: Tight Lower and Upper Bounds
Preprint
Shubhada Agrawal, Aaditya Ramdas
Optimal Best-Arm Identification in Bandits with Access to Offline Data
Preprint
Shubhada Agrawal, Sandeep Juneja, Karthikeyan Shanmugam, Arun Sai Suggala
Thesis
PhD Thesis: Bandits with Heavy Tails: Algorithms, Analysis & Optimality (BibTeX) (talk)