GAR-meets-RAG Paradigm for Zero-Shot Information Retrieval. Daman Arora, Anush Kini, Sayak Ray Chowdhury, Nagarajan Natarajan, Gaurav Sinha, Amit Sharma. 2023. Link
Active Preference Optimization for Sample-Efficient RLHF. Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury. European Conference on Machine Learning (ECML-PKDD), 2025. Link
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift. S. Son, W. Bankes, Sayak Ray Chowdhury, B. Paige, I. Bogunovic. International Conference on Machine Learning (ICML), 2025. Link
Communication Efficient, Secure, and Private Multi-Party Deep Learning. Sankha Das, Sayak Ray Chowdhury, Nishanth Chandran, Divya Gupta, Rahul Sharma, Satya Lokam. Proceedings on Privacy Enhancing Technologies Symposium (PoPETS), 2025. Link
Provably Robust DPO: Aligning Language Models with Noisy Feedback. Sayak Ray Chowdhury, Anush Kini, Nagarajan Natarajan. International Conference on Machine Learning (ICML), 2024. Link
OAK: Enriching Document Representations using Auxiliary Knowledge for Extreme Classification. Shikhar Mohan, Deepak Saini, Anshul Mittal, Sayak Ray Chowdhury, Bhawna Paliwal, Jian Jiao, Manish Gupta, Manik Varma. International Conference on Machine Learning (ICML), 2024. Link
Differentially Private Federated Linear Contextual Bandits. Xingyu Zhou, Sayak Ray Chowdhury. International Conference on Learning Representations (ICLR), 2024. Link
Differentially Private Reward Estimation with Preference Feedback. Sayak Ray Chowdhury, Xingyu Zhou, Nagarajan Natarajan. International Conference on Artificial Intelligence and Statistics (AISTATS), 2024. Link
Combinatorial Categorized Bandits with Expert Rankings. Sayak Ray Chowdhury, Gaurav Sinha, Nagarajan Natarajan, Amit Sharma. Conference on Uncertainty in Artificial Intelligence (UAI), 2023. Link
Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards. Yulian Wu, Sayak Ray Chowdhury, Xingyu Zhou, Di Wang. International Conference on Machine Learning (ICML), 2023. Link
Bregman Deviations of Generic Exponential Families. Sayak Ray Chowdhury, Patrick Saux, Odalric-Ambrym Maillard, Aditya Gopalan. 36th Annual Conference on Learning Theory (COLT), 2023. Link
Distributed Differential Privacy in Multi-armed Bandits. Sayak Ray Chowdhury, Xingyu Zhou. International Conference on Learning Representations (ICLR), 2023. Link
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference. Debangshu Banerjee, Avishek Ghosh, Sayak Ray Chowdhury, Aditya Gopalan. International Conference on Artificial Intelligence and Statistics (AISTATS), 2023. Link
Value Function Approximations via Kernel Embeddings for No-Regret Reinforcement Learning. Sayak Ray Chowdhury, Rafael Oliveira. Asian Conference on Machine Learning (ACML), 2022. Link
Model Selection in Reinforcement Learning with General Function Approximations. Avishek Ghosh, Sayak Ray Chowdhury. European Conference on Machine Learning (ECML-PKDD), 2022. Link
Shuffle Private Linear Contextual Bandits. Sayak Ray Chowdhury, Xingyu Zhou. International Conference on Machine Learning (ICML), 2022. Link
Differentially Private Regret Minimization in Episodic Markov Decision Processes. Sayak Ray Chowdhury, Xingyu Zhou. AAAI Conference on Artificial Intelligence (AAAI), 2022. Link
Reinforcement Learning in Parametric MDPs with Exponential Families. Sayak Ray Chowdhury, Aditya Gopalan, Odalric-Ambrym Maillard. International Conference on Artificial Intelligence and Statistics (AISTATS), 2021. Link
No-regret Algorithms for Multi-task Bayesian Optimization. Sayak Ray Chowdhury, Aditya Gopalan. International Conference on Artificial Intelligence and Statistics (AISTATS), 2021. Link
Adaptive Control of Differentially Private Linear Quadratic Systems. Sayak Ray Chowdhury, Xingyu Zhou, Ness Shroff. IEEE International Symposium on Information Theory (ISIT), 2021. Link
Active Learning of Conditional Mean Embeddings via Bayesian Optimisation. Sayak Ray Chowdhury, Rafael Oliveira, Fabio Ramos. Conference on Uncertainty in Artificial Intelligence (UAI), 2020. Link
Bayesian Optimization under Heavy-tailed Payoffs. Sayak Ray Chowdhury, Aditya Gopalan. Neural Information Processing Systems (NeurIPS), 2019. Link
Online Learning in Kernelized Markov Decision Processes. Sayak Ray Chowdhury, Aditya Gopalan. International Conference on Artificial Intelligence and Statistics (AISTATS), 2019. Link
On Kernelized Multi-armed Bandits. Sayak Ray Chowdhury, Aditya Gopalan. International Conference on Machine Learning (ICML), 2017. Link
Misspecified Linear Bandits. Avishek Ghosh, Sayak Ray Chowdhury, Aditya Gopalan. AAAI Conference on Artificial Intelligence (AAAI), 2017. Link
Active Preference Optimization for Sample Efficient RLHF. Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury. ICML Workshop on Theoretical Foundations of Foundational Models, 2024. Also appeared in Adaptive Learning in Complex Environments Workshop, TTIC Chicago, 2024.
Provably Robust DPO: Aligning Language Models with Noisy Feedback. Sayak Ray Chowdhury, Anush Kini, Nagarajan Natarajan. ICLR Workshop on Mathematical and Empirical understanding of Foundational Models, 2024.
Differentially Private Reward Estimation from Preference-based Feedback. Sayak Ray Chowdhury, Xingyu Zhou. ICML Workshop on The Many Facets of Preference-Based Learning, 2023. Also appeared in Theory and Practice of Differential Privacy (TPDP), Boston University, 2023.
On Differentially Private Federated Linear Contextual Bandits. Xingyu Zhou, Sayak Ray Chowdhury. ICML Workshop on Federated Learning and Analytics, 2023. Also appeared in Theory and Practice of Differential Privacy (TPDP), Boston University, 2023.
Online Contextual Learning with Limited Feedback. Sayak Ray Chowdhury, Aditya Gangrade, Ashok Cutkosky, Venkatesh Saligrama. ICML Workshop on Adaptive Experimental Design and Active Learning in the Real World, 2022. Link
Online Learning in Kernelized Markov Decision Processes. Sayak Ray Chowdhury, Aditya Gopalan. NeurIPS workshop on Infer to Control: Probabilistic Reinforcement Learning and Structured Control, 2018.
On Batch Bayesian Optimization. Sayak Ray Chowdhury, Aditya Gopalan. NeurIPS workshop on All of Bayesian Nonparametrics, 2018. Link
PhD Thesis: Online Reinforcement Learning in Large and Structured Environments. Sayak Ray Chowdhury. Department of ECE, Indian Institute of Science. July 2021.
Masters Thesis: A Game Theoretic Approach to Robust Optimization. Sayak Ray Chowdhury. Department of CSA, Indian Institute of Science. June 2015.