Tejas Bodas (Computer Systems Group)
Saketh Ayyagari: Model free reinforcement learning for non-stationary environments
Harshit Lalwani: Reinforcement learning for LDPC/message passing
Kavin Aravindan: Kernel selection for Gaussian Processes.
Kshitij Mishra: Tabular and Deep Reinforcement learning for Gittins index
Shikhar Saksena: CGD: Modifying the loss landscape by gradient regularization
Aks Kanodia: Gaussian Processes for dynamic pricing and optimal execution
Aanvik Bhatnagar: RL for optimal multiple stopping problems
Mrudani Pimpalkhare: Optimal timing of Systematic Investment Plans
Manitej Sriram: Kernel selection for Gaussian Processes
Shreeprabhas Varma: Optimal timing of Systematic Investment Plans
Sohan Thummala: Change point detection with MMD
Sriyansh Suryadevara: RL for optimal multiple stopping problems
Shivani Kulkarni: Merton's portfolio theory and Discrete models of Financial Markets.
Nishita Kannan: Merton's Portfolio Optimization.
Pranav Agrawal, B.Tech Honors: Reinforcement learning for Dynamic pricing and learning
Anush Anand, B.Tech Honors: Bayesian Optimization for dynamic pricing .
S Akash (Intern, IIT Patna): MAQL: Speeding up Q-learning with a model assist.
Basab Ghosh (Intern, IISc): Var/Cvar minimization and Risk sensitive RL.
Sukhjinder Kumar (Intern, IIITH Alumni): Policy gradient for Markovian Bandits.
Shivam Sood (BTP): Diffusion models for Reinforcement learning.
Kunal Jain, Dual Degree (Independent Study): Bayesian Optimization for Composite functions
Activities of the Group are supported by the ANRF Matrics and IIITH Seed grant. See this page for some recent publications from the group.
All affiliations are IIIT Hyderabad, unless specified.