(α-β denotes alphabetical ordering, * denotes equal contribution)To bootstrap or to rollout? An optimal and adaptive interpolation
(α-β) W Mou, J Qian
arXiv, 2024
Refined Risk Bounds for Unbounded Losses via Transductive Priors
(α-β) J Qian, A Rakhlin, N Zhivotovskiy
arXiv, 2024
The Statistical Complexity of Interactive Decision Making
(α-β) DJ Foster, S Kakade, J Qian, A Rakhlin
arXiv, 2021
Bridging multiple worlds: multi-marginal optimal transport for causal partial-identification problem
(α-β) Z Gao, S Ge, J Qian
AISTATS, 2025
Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability
(α-β) F Chen, DJ Foster, Y Han, J Qian, A Rakhlin, Y Xu
NeurIPS, 2024 (spotlight)
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
J Qian, H Hu, D Simchi-Levi
NeurIPS, 2024
Online Estimation via Offline Estimation: An Information-Theoretic Framework
(α-β) DJ Foster, Y Han, J Qian, A Rakhlin
NeurIPS, 2024
How Does Variance Shape the Regret in Contextual Bandits?
(α-β) Z Jia, J Qian, A Rakhlin, C Wei
NeurIPS, 2024
The Non-linear F-Design and Applications to Interactive Learning
(α-β) A Agarwal, J Qian, A Rakhlin, T Zhang
ICML, 2024
Model-free reinforcement learning with the decision-estimation coefficient
(α-β) DJ Foster, N Golowich, J Qian, A Rakhlin, A Sekhari
NeurIPS, 2023
Convex and Non-Convex Optimization under Generalized Smoothness
H Li*, J Qian*, Y Tian, A Rakhlin, A Jadbabaie
NeurIPS, 2023 (spotlight)
Byzantine-robust federated linear bandits
(α-β) A Jadbabaie, H Li, J Qian, Y Tian
CDC, 2022
Robust Learning Under Clean-label Attack
(α-β) A Blum, S Hanneke, J Qian, H Shao
COLT, 2021
Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes
Y Tian*, J Qian*, S Sra
NeurIPS, 2020 (spotlight)
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes
J Qian, R Fruit, M Pirotta, A Lazaric
NeurIPS, 2019
Importance Resampling for Off-policy Prediction
M Schlegel, W Chung, D Graves, J Qian, M White
NeurIPS, 2019
Concentration inequalities for multinoulli random variables
J Qian, R Fruit, M Pirotta, A Lazaric
arXiv, 2020