Kishan Panaganti Badrinath
Open-Source Lectures
Results in Multi-armed Bandit literature
Proof sketch of the converse result in the Classical Multi-armed Bandit problem
Proof sketch of the Upper Confidence Bound (UCB) algorithm
Proof sketch of the Thompson Sampling algorithm using Beta priors
Paper Explained series for Offline Reinforcement Learning
Dataset for benchmarking Offline/Batch RL algorithms
Model-based Offline Reinforcement Learning algorithm
Model-based Offline Policy Optimization algorithm
Critic Regularized Regression
Fitted Value/Policy Iteration algorithm for Offline RL