Publications
Papers
Policy Evaluation for Variance in Risk-Sensitive Average Reward Reinforcement Learning
Accepted for publication at ICML 2024
Shubhada Agrawal, Prashanth L A, Siva Theja Maguluri
CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
Algorithmic Learning Theory (ALT), 2024
Shubhada Agrawal, Timothée Mathieu, Debabrota Basu, Odalric-Ambrym Maillard
ACM SIGMETRICS Performance Evaluation Review, 2023
AAMAS Workshop on Autonomous Agents for Social Good, 2023
Extended version available here
Daksh Mittal, Sandeep Juneja, Shubhada Agrawal
Optimal Best-Arm Identification Methods for Tail-Risk Measures
Advances in Neural Information Processing Systems (NeurIPS), 2021
Shubhada Agrawal, Wouter M. Koolen, Sandeep Juneja
34th Annual Conference on Learning Theory (COLT), 2021
Shubhada Agrawal, Sandeep Juneja, Wouter M. Koolen
Indian Control Conference (ICC), 2021
Shubhada Agrawal, Sandeep Juneja, Wouter M. Koolen
Algorithmic Learning Theory (ALT), 2020
Shubhada Agrawal, Sandeep Juneja, Peter Glynn
Erratum incorporated in the arXiv version
Journal of the Indian Institute of Science 100: 809-847, 2020
Shubhada Agrawal, Siddharth Bhandari, Anirban Bhattacharjee, Anand Deo, Narendra M. Dixit, Prahladh Harsha, Sandeep Juneja, Poonam Kesarwani, Aditya Krishna Swamy, Preetam Patil, Nihesh Rathod, Ramprasad Saptharishi, Sharad Shriram, Piyush Srivastava, Rajesh Sundaresan, Nidhin Koshy Vaidhiyan and Sarath Yasodharan
Enhanced Indexing for Risk-Averse Investors using Relaxed Second-Order Stochastic Dominance
Journal of Optimization and Engineering 18: 407-442, 2017
Amita Sharma, Shubhada Agrawal, Aparna Mehra
Preprints and Working Drafts
Optimal Top-Two Method for Best Arm Identification and Fluid Analysis
Preprint, under submission
Agniv Bandyopadhyay, Sandeep Juneja, Shubhada Agrawal
Optimal Best-Arm Identification in Bandits with Access to Offline Data
Preprint
Shubhada Agrawal, Sandeep Juneja, Karthikeyan Shanmugam, Arun Sai Suggala
Concentration of Contractive Stochastic Approximation Under Markovian Noise with Applications in Reinforcement Learning
In preparation
Shubhada Agrawal, Siva Theja Maguluri, Martin Zubeldia
Thesis
PhD Thesis: Bandits with Heavy Tails: Algorithms, Analysis & Optimality