CS7.603: Topics in Reinforcement Learning (4 credits, since Spring 23)
MA601: Probability and Statistics (4 credits, since Monsoon 22)
CS3.307: Performance modeling for computer systems (2 credits, since Spring 22)
EE309: Introduction to communication systems (Spring 20, 3 credits, @ IIT Dharwad)
EE 216: Communication lab (Spring 20, 3 credits, @ IIT Dharwad)
EE 407: Course on Stochastic Processes (Autumn 20, 6 credits, @ IIT Dharwad)
Kavin A, Dual degree: Gaussian Processes and Bayesian Optimization for Reinforcement learning.
Aanvik Bhatnagar, BTech Honors: RL for optimal stopping problems
Shikhar Saxsena, Dual degree: CGD: Modifying the loss landscape by gradient regularization
Kshitij Mishra, Dual degree: Tabular and Deep Reinforcement learning for Gittins index.
Shivani Kulkarni: Reinforcement learning for Portfolio optimization.
Swaroop C: Reinforcement learning for dynamic pricing with (s,S) inventory control.
Aryaman Mahajan: Convergence properties of first and second order optimization algorithms.
Pranav Agrawal, B.Tech Honors: Reinforcement learning for Dynamic pricing and learning
Anush Anand, B.Tech Honors: Bayesian Optimization for dynamic pricing.
S Akash (Intern, IIT Patna): MAQL: Speeding up Q-learning with a model assist.
Basab Ghosh (Intern, IISc): Var/Cvar minimization and Risk sensitive RL.
Sukhjinder Kumar (Intern, IIITH Alumni): Policy gradient for Markovian Bandits.
Pratham Thakkar & Nipun Tulsian (BTP): Model free reinforcement learning in non-stationary environments.
Aman Raj (BTP): Data driven estimation in queues.
Shivam Sood (BTP): Diffusion models for Reinforcement learning.
Meka Sai Mukund (Independent Study) : Surrogate Gradient Descent using GP.
Anurag Gupta, B.Tech (Independent Study): RL algorithms for dynamic pricing and learning.
Kunal Jain, Dual Degree (Independent Study): Bayesian Optimization for Composite functions
Naman Ahuja, Dual Degree (Independent Study): Topics in Revenue Management and Pricing
Rohit Reddy (BTP stage 2): Generative Adversarial Networks for simulating Markov chains.
Srinath Maddineni (BTP Stage 2): Convergent DQN with Spectral Normalization
Romaharshan Pusapati, Vamsi Krishna, Sudheer Reddy (BTP Stage 2): Notes on Regret Analysis for Bandits
Paritosh Pankaj (Intern, IIT Kanpur) : Bayesian Optimization for Dynamic pricing and learning.
Harshit Dhankhar (Intern, IIT Patna): Tabular and Deep Reinforcement learning for Gittins index.
Aditya Iyer (Intern, Manipal): Generative Adversarial Networks for simulating random variables.
Aryan C. and Kushagra K: First order Bayesian Optimization for RL tasks (R&D project @ IIT Dharwad, co-supervised with Prof. K.J. Prabuchandran)
Utkarsh K: Bayesian optimization using the first order (gradient) conditions. (Summer project + BTP @ IIT Dharwad in 2021, co-supervised with Prof. K.J. Prabuchandran)
Aryan C. and Kushagra K. : Scheduling in queues with deadline. (Summer project @ IIT Dharwad in 21)
Venkata D. B. & Deepak N. : Modelling the COVID-19 pandemic: The SAQR model (BTP@ IIT Dharwad in 2020)