Publications

Google scholar page


Submitted Papers

Quantizer Design for Finite Model Approximations, Model Learning, and Quantized Q-Learning for MDPs with Unbounded Spaces

arXiv:2510.04355 

Sensitivity of Filter Kernels and Robustness Bounds to Transition and Measurement Kernel Perturbations in Partially Observable Stochastic Control

arXiv:2508.10658 

Learning POMDPs with Linear Function Approximation and Finite Memory 

arXiv:2505.14879

Refined Bounds on Near Optimality Finite Window Policies in POMDPs and Their Reinforcement Learning 

arXiv:2409.04351 

Q-Learning for Continuous State and Action MDPs under Average Cost Criteria 

arXiv:2308.07591  


Journal Publications

Learning with Linear Function Approximations in Mean-Field Control

Journal of Machine Learning Research, vol. 26, no. 192, pp. 1-53, 2025

 arXiv:2408.00991 

Finite Approximations for Mean Field Type Multi Agent Control and Their Near Optimality

Applied Mathematics and Optimization, vol. 92(7), 2025

 arXiv:2211.09633

 Special Issue in Honor of Peter Caines's 80th Birthday, Editors: Minyi Huang and Ji-Feng Zhang, 2025 (pdf)

Approximation Schemes for POMDPs and Their Near Optimality 

arXiv:2410.02895

Infinite Horizon Average Cost Optimality for Mean-Field Control,  

SIAM Journal on Control and Optimization, vol. 62(5), pp. 2776-2806, 2024 

arXiv:2309.11744 

Average Cost Optimality of Partially Observed MDPs: Contraction of Non-linear Filters and Existence of Optimal Solutions,  

SIAM Journal on Control and Optimization, vol. 62(6), pp. 2859-2883,2024

arXiv:2312.14111

Q-Learning for Stochastic Control under General Information Structures and Non-Markovian Environments,  

Transactions on Machine Learning Research, 2024, (Featured Certification),   

OpenReview  

Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability, 

Mathematics of Operations Research, vol. 48, no. 4, pp. 2066-2093, 2023. 

arXiv:2103.12158

Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes, 

Journal of Machine Learning Research, vol. 23, no. 11, pp. 1-46, 2022. 

(paper link) 

Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity, 

Journal of Machine Learning Research, vol. 24, no. 199, pp. 1-34, 2023. 

(paper link)

SIAM Journal on Mathematics of Data Science, vol. 5, no. 3, pp. 615-638, 2023. 

arXiv:2203.07499 

Robustness to Incorrect Models and Adaptive Learning in Average-Cost Optimal Stochastic Control

Automatica, vol. 139(3):110179, (2022) (Editor's Choice)

arXiv:2003.05769

Robustness to Incorrect System Models in Stochastic Control

SIAM Journal on Control and Optimization, vol. 58(2), pp. 1144–1182, 2020 

arXiv:1803.06046

Weak Feller Property of Non-linear Filters, 

Systems and Control Letters, vol. 134, pp. 104512  Dec. 2019 

arXiv:1812.05509

Robustness to Incorrect Priors in Partially Observed Stochastic Control

SIAM Journal on Control and Optimization, v. 57(3), pp. 1929–1964, 2019 

arXiv:1803.05103


Book Chapters


Conference Publications