Jian Qian's Homepage

Jian Qian's Homepage

Hi! I am a final-year Ph.D. student at MIT EECS. It is my great pleasure to be advised by Sasha Rakhlin. My research primarily focuses on the intersection between machine learning theory and interactive decision making ranging from online learning, bandits to reinforcement learning.

Email: jianqian@hku.hk; [Google Scholar]

Publications

(α-β denotes alphabetical ordering, * denotes equal contribution)

Preprints

To bootstrap or to rollout? An optimal and adaptive interpolation

(α-β) W Mou, J Qian

arXiv, 2024

Refined Risk Bounds for Unbounded Losses via Transductive Priors

(α-β) J Qian, A Rakhlin, N Zhivotovskiy

arXiv, 2024

The Statistical Complexity of Interactive Decision Making

(α-β) DJ Foster, S Kakade, J Qian, A Rakhlin

arXiv, 2021

Conference Papers

Bridging multiple worlds: multi-marginal optimal transport for causal partial-identification problem

(α-β) Z Gao, S Ge, J Qian

AISTATS, 2025

Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability

(α-β) F Chen, DJ Foster, Y Han, J Qian, A Rakhlin, Y Xu

NeurIPS, 2024 (spotlight)

Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff

J Qian, H Hu, D Simchi-Levi

NeurIPS, 2024

Online Estimation via Offline Estimation: An Information-Theoretic Framework

(α-β) DJ Foster, Y Han, J Qian, A Rakhlin

NeurIPS, 2024

How Does Variance Shape the Regret in Contextual Bandits?

(α-β) Z Jia, J Qian, A Rakhlin, C Wei

NeurIPS, 2024

The Non-linear F-Design and Applications to Interactive Learning

(α-β) A Agarwal, J Qian, A Rakhlin, T Zhang

ICML, 2024

Model-free reinforcement learning with the decision-estimation coefficient

(α-β) DJ Foster, N Golowich, J Qian, A Rakhlin, A Sekhari

NeurIPS, 2023

Convex and Non-Convex Optimization under Generalized Smoothness

H Li*, J Qian*, Y Tian, A Rakhlin, A Jadbabaie

NeurIPS, 2023 (spotlight)

Byzantine-robust federated linear bandits

(α-β) A Jadbabaie, H Li, J Qian, Y Tian

CDC, 2022

Robust Learning Under Clean-label Attack

(α-β) A Blum, S Hanneke, J Qian, H Shao

COLT, 2021

Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes

Y Tian*, J Qian*, S Sra

NeurIPS, 2020 (spotlight)

Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes

J Qian, R Fruit, M Pirotta, A Lazaric

NeurIPS, 2019

Importance Resampling for Off-policy Prediction

M Schlegel, W Chung, D Graves, J Qian, M White

NeurIPS, 2019

Technical Note

Concentration inequalities for multinoulli random variables

J Qian, R Fruit, M Pirotta, A Lazaric

arXiv, 2020

Teaching

Dynamic Programming & Reinforcement Learning (Spring 2022), Teaching Assistant.

Page updated

Google Sites

Report abuse