Search this site
Embedded Files
Miao Liu's Homepage
  • Home
  • Publications
Miao Liu's Homepage
  • Home
  • Publications
  • More
    • Home
    • Publications

Google Scholar Profile

2025

  • Contextual Bandits for Large-Scale Structured Discrete Constrained Optimization Problems. P Harsha, C Subramanian, N Abe, S Subramanian, A Ba, K A Fernández Román, M L Garrido, M Liu, A Lozano, C Narayanaswami, NeurIPS 2025 Workshop MLxOR: Mathematical Foundations and Operational Integration of Machine Learning for Uncertainty-Aware Decision-Making [link]

  • Meta-D2AG: Causal Graph Learning with Interventional Dynamic Data. T Gao, S Lu, J Lee, N Nelson, D Bhattacharjya, Y Yu, M Liu, NeurIPS2025 [link]

  • Bridging the Gap: Unifying HCI & ML Perspectives on Mutual Theory of Mind.  Z Ashktorab, D Bouneffouf, K Brimijoin, R Bellamy, M Campbell, A Goldberg, G E Gonzalez, S Houde, M Liu, D S Moran, M Riemer and J D Weisz,  the Workshop on Generative AI and Theory of Mind in Communicating Agent at IJCAI 2025 [link]

  • Position: Theory of Mind Benchmarks are Broken for Large Language Models. M Riemer, Z Ashktorab, D Bouneffouf, P Das, M Liu J D Weisz and M Campbell, ICML2025 [link]

  • Evaluating the Prompt Steerability of Large Language Models. E Miehling, M Desmond, K N Ramamurthy, E M Daly, K R Varshney, E Farchi, P Dognin, J Rios, D Bouneffouf, M Liu, P Sattigeri, NAACL 2025 [link]

  • A Generalist Hanabi Agent.  A V Sudhakar, H Nekoei, M Reymond, M Liu, J Rajendran, Sarath Chandar, ICLR2025 [link]

  • Q-function Decomposition with Intervention Semantics for Factored Action Spaces. J Lee, T Gao, E Nelson, M Liu, D Bhattacharjya, S Lu, AISTATS2025 [link]

  • Contextual Value Alignment. P Dognin, J Rios, R Luss, P Sattigeri, M Liu, I Padhi, M Riemer, M Nagireddy, K Varshney, D Bouneffouf, ICASSP2025 [link]

2024

  • Evaluating the Prompt Steerability of Large Language Models. E Miehling, M Desmond, K Ramamurthy, E Daly, P Dognin, J Rios, D Bouneffouf, M Liu, Pluralistic Alignment Workshop at NeurIPS 2024 [link]

  • SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning.  S Zhang, H Fernando, M Liu, K Murugesan, S Lu, P Chen, T Chen, M Wang, ICML 2024 [link]

  • ComVas: Contextual Moral Values Alignment System. I Padhi, P Dognin, J Rios, R Luss, S Achintalwar, M Riemer, M Liu, P Sattigeri, M Nagireddy, K Varshney, Djallel Bouneffouf,  IJCAI2024 Demo Track [link]

  • Multi-agent text-based Hanabi challenge. H Nekoei, A V Sudhakar, J Rajendran, M Liu, S Chandar, ICLR 2024 Workshop on Generative Models for Decision Making [link]

  • Variance Reduction Can Improve Trade-off In Multi-Objective Learning. H Fernando, L Chen, S Lu, P Chen, M Liu, S Chaudhury, K Murugesan, G Liu, M Wang, T Chen, ICASSP2024 [link]

  • A Neuro-Symbolic Approach to Multi-Agent RL for Interpretability and Probabilistic Decision Making, C Subramanian, M Liu , N Khan, J Lenchner , A Amarnath, S Swaminathan, R Riegel , A Gray [link] 


2023

  • On the Convergence and Sample Complexity Analysis of Deep Q-Networks with e-Greedy Exploration. S Zhang, M Wang, H Li, M Liu, P Chen, S Lu, S Liu, K Murugesan, S Chaudhury, In Proc. of 37th Conference on Neural Information Processing Systems (NeurIPS), 2023 [link]

  • Towards Few-shot Coordination: Revisiting Ad-Hoc Teamplay Challenge in the Game of Hanabi. H Nekoei, X Zhao, J Rajendran, M Liu, S Chandar. In Proc. of the 2nd Conference on Lifelong Learning Agents (CoLLAs) 2023 [paper]

  • Model-free Causal Reinforcement Learning with Causal Diagrams. J Lee, T Gao, E Nelson, M Liu, D Bhattacharjya, The 1st International Workshop on Knowledge-Based Compositional Generalization (KBCG) at IJCAI 2023 [paper]

  • A Neuro-Symbolic Approach to Runtime Optimization in Resource Constrained Heterogeneous Systems. C K Subramanian, S Swaminathan, M Liu, M Longinos, A Amarnath, K Swaminathan, M Cochet, K Roman, and P Bose, Neuro-Symbolic Agents Workshop at IJCAI2023 [paper]

  • Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Approach. H D Fernando, H Shen, M Liu, S Chaudhury, K Murugesan and T Chen. In Proc. of the 11th International Conference on Learning Representations (ICLR), 2023 [paper][notable-top-5%]

  • Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks.  S Zhang, M Wang, P Chen, S Liu, S Lu and M Liu. In Proc. of the 11th International Conference on Learning Representations (ICLR), 2023 [paper]

  • Local Explanations for Reinforcement Learning.  R. Luss, A. Dhurandhar, and M Liu. In Proc. of the 37th AAAI Conference on Artificial Intelligence (AAAI), 2023 [paper]

2022

  • Influencing Long-Term Behavior in Multiagent Reinforcement Learning. D K Kim, M Riemer, M Liu, J N Foerster, M Everett, C Sun, G Tesauro, J P How,  In Proc. of 36th Conference on Neural Information Processing Systems (NeurIPS), 2022 [paper][MIT News]

  • Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria. D K Kim, M Riemer, M Liu, J N Foerster, G Tesauro, J P How.  Workshop on Strategic Multi-agent Interactions: Game Theory for Robot Learning and Decision Making at CoRL 2022 [link]

  • Learning Multi-Objective Curricula for Robotic Policy Learning. J Kang, M Liu, A Gupta, C Pal, X Liu, and J Fu. In Proc. of the 6th Annual Conference on Robot Learning (CoRL), 2022 [link]

  • Cost-Efficient Reinforcement Learning for Optimal Trade Execution on Dynamic Market Environment. D Chen, Y Zhu, M Liu, and J Li. In the 3rd ACM International Conference on AI in Finance (ICAIF), 2022 [link]

  • IDYNO: Learning Nonparametric DAGs from Interventional Dynamic Data. T Gao, D Bhattacharjya, E Nelson, M Liu, and Y Yu. In Proc. of the 39th International Conference on Machine Learning (ICML), 2022 [paper][link]

  • Linerizing Contextual Bandits with Latent State Dynamics. E Nelson, D. Bhattacharjya, T. Gao, M Liu, D. Bouneffouf, and P Poupart. In Proc. of Conference on Uncertainty in Artificial Intelligence (UAI), 2022 [paper][link]

  • Context-specific representation abstraction for deep option learning. M Abdulhai, D K Kim, M Riemer, M Liu, G Tesauro, and J. P. How. In Proc. of the AAAI, 2022 [paper][code]

2021

  • A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. D K Kim, M Liu, M Riemer, C Sun, M Abdulhai, G Habibi, S Lopez-Cot, G Tesauro and J P How., in Proc. of the 38th International Conference on Machine Learning (ICML), 2021 [paper][code]

  • RL Generalization in a Theory of Mind Game Through a Sleep Metaphor. T Malloy, T Klinger, M Liu, G Tesauro, M Riemer, and C R Sims. In Proc. of the AAAI, 2021 [paper]

  • Capacity-Limited Decentralized Actor-Critic for Multi-Agent Games. T Malloy, C R Sims, T Klinger, M. Liu, M. Riemer, and G Tesauro. In Proc. of the IEEE Conference on Games (CoG), 2021 [paper]

  • Modeling Capacity-Limited Decision Making Using a Variational Autoencoder. T Malloy, T. Klinger, M Liu, G Tesauro, M Riemer, and C R Sims. In Proc. of the Annual Meeting of the Cognitive Science Society, 2021 [paper]

  • Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games. T Malloy, T Klinger, M Liu, M Riemer, G Tesauro, C R Sims, AAAI-21 Workshop on Reinforcement Learning in Games [paper]

  • AI Planning Annotation in Reinforcement Learning: Options and Beyond. J Lee, M Katz, D J Agravante, M Liu, T Klinger, M Campbell, S Soharbi, and G Tesauro. In ICAPS21 workshop on Bridging the Gap Between AI Planning and Reinforcement Learning [paper]

2020

  • Learning Hierarchical Teaching Policies for Cooperative Agents. D K Kim, M Liu, S Omidshafiei, S Lopez-Cot, M Riemer, G Habibi, G Tesauro, S Mourad, M Campbell, and J P How. In the Proc. of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2020 [paper]

  • On the Role of Weights Sharing. M Riemer, I Cases, C Rosenbaum, M Liu, and G Tesauro. In Proc. of the 34th AAAI Conference on Artificial Intelligence (AAAI), 2020 [paper]

  • Deep RL with information constrained policies: Generalization in continuous control.  T Malloy, C R Sims, T Klinger, M Liu, M Riemer, G Tesauro [paper]

2019

  • Automatic Pan-Tilt Camera Control for Learning Dirichlet Process Gaussian Process (DPGP) Mixture Models of Multiple Moving Targets. H Wei, P Zhu, M Liu, J P How, and S Ferrari. IEEE Transactions on Automatic Control,64(1):159-173, 2019 [link]

  • Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference. M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, and G Tesauro. In Proc. of the 7th International Conference on Learning Representations (ICLR), 2019 [paper][link][code][Media report]

  • Learning to Teach in Cooperative Multiagent Reinforcement Learning. S Omidshafiei, D K Kim, M Liu, G Tesauro, M Riemer, C Amato, M Campbell, and J P How. In Proc. of the 33th AAAI Conference on Artificial Intelligence (AAAI), 2019 [paper][MIT News, IBM Blog][Outstanding Student Paper Honorable Mention]

2018

  • Gaussian Processes for Learning and Control — Tutorial with Examples. M Liu, G Chowdhary, B da Silva, S Liu, and J P How. Control Systems Magazine, IEEE, September 2018 [Link]

  • Learning Abstract Options. M Riemer, M Liu, and G Tesauro. Neural Information Processing Systems (NeurIPS), 2018 [paper]

  • Eigenoption Discovery through the Deep Successor Representation.  M C Machado, C Rosenbaum, X Guo, M Liu, G Tesauro, and M S Campbell. In Proc of the 6th International Conference on Learning Representations (ICLR), 2018 

2017

  • The Eigenoption Critic Framework. M Liu, M C Machado, G Tesauro and M S Campbell, In NeurIPS Workshop on Hierarchical Reinforcement Learning, 2017 [paper]

  • Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions. M Liu, K. Sivakumar, S Omidshafiei, C Amato and J P How. In Proc. of the 30th IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017 [paper][link]

  • Socially Aware Motion Planning with Deep Reinforcement Learning. Y Chen, M Everett, M Liu, and J P How, In Proc. of the 30th IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017 [paper][link][MIT News, Siliconrepublic, Neowin][Best Student Paper Award and Finalist-Best Paper Award on Cognitive Robotics]

  •  Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning. Y Chen, M. Liu, M. Everett and J P How. In Proc. of the IEEE International Conference on Robotics and Automation (ICRA), 2017 [paper] [Finalist-Best Multi-Robot System Paper]

  • Semantic-level Decentralized Multi-Robot Decision-Making using Probabilistic Macro-Observations. S Omidshafiei, S Liu, M. Everett and B T Lopez, C Amato, M Liu, J P How, and J. Vian. In Proc. of the IEEE International Conference on Robotics and Automation (ICRA), 2017 [paper]

  • Scalable Accelerated Decentralized MultiRobot Policy Search in Continuous Observation Spaces. S Omidshafiei, C Amato, M Liu, J P How and J. Vian. In the IEEE International Conference on Robotics and Automation (ICRA), 2017 [paper]

  • A Quickest Change Detection Approach for Dynamic Decision Making in Nonstationary Environments, T Benerjee, M Liu, and J P How. In Proc. of the American Control Conference (ACC), 2017 [paper]

2016

  • Learning for Multiagent Decentralized Control in Large, Partially-Observable Stochastic Environments. M Liu, C Amato, E Anesta, J D Griffith, and J P How. In Proc. of the 30th  AAAI Conference on Artificial Intelligence (AAAI), 2016 [link]

  • Motion Planning with Diffusion Maps. Y Chen, S Liu, M Liu, J Miller, and J P How. In Proc. of The IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2016 [paper]

  •  Augmented Dictionary Learning for Sparse Representations of Trajectories with Application to Motion Prediction. Y Chen, M Liu, and J P How. In Proc. of the IEEE International Conference on Robotics and Automation (ICRA), 2016 [paper]

  • Predictive Modeling of Pedestrian Motion Patterns with Bayesian Nonparametrics. Y Chen, M Liu, S Liu, J Miller, and J P How. AIAA Guidance, Navigation, and Control Conference (GNC), 2016. [link][paper]

2015

  •  Stick-Breaking Policy Learning in DECPOMDPs. M Liu, C Amato, X Liao, L Carin, and J P How. In Proc. of the 24rd International Joint Conference on Artificial Intelligence (IJCAI), 2015 [paper][longer version]

  • Learning for Multiagent Decentralized Control in Large Partially-Observable Stochastic Environments. M Liu, C Amato, E Anesta, J D Griffith, and J P How. In Proc. of the 2nd Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM), 2015  

  • Policy Based Reinforcement Learning in DEC-POMDPs with Bayesian Nonparametrics. M Liu and J P How. NeurIPS Workshop on Learning, Inference and Control of Multi-Agent Systems, 2015 [Best Poster Award]

2014 and Before

  • Efficient Bayesian Nonparametric Methods for Model-Free Reinforcement Learning in Centralized and Decentralized Sequential Environments, M Liu, Duke University PhD Thesis, 2014. [link]

  • Off-policy Reinforcement Learning with Gaussian processes. G Chowdhary, M Liu, R C Grande, T J Walsh, L Carin and J P How.  IEEE/CAA Journal of Automatica Sinica, Volume: 1, Issue: 3, July 2014. [link]

  • Dynamic Clustering via Asymptotics of the Dependent Dirichlet Process Mixture.T Campbell, M Liu, B Kulis, J P How, and L Carin., in Proc. of Neural Information Processing Systems (NeurIPS), 2013. [paper]

  • Online Expectation Maximization for Reinforcement Learning in POMDPs, M Liu, X Liao, and L Carin. In Proc. of the 23rd Int’l Joint Conf. on Artificial Intelligence (IJCAI), 2013. [paper]

  • Transfer Learning for Reinforcement Learning with Dependent Dirichlet Process and Gaussian Process. M Liu, G Girish, J P How, and L Carin. NeurIPS Workshop on Bayesian Nonparametric Models for Reliable Planning and Decision-Making Under Uncertainty, 2012. [paper, poster]

  • Infinite Regionalized Policy Representation. M Liu, X Liao, and L Carin. In Proc. of the 28th Int’l Conf. on Machine Learning (ICML), 2011. [paper]

  • Uterine Contraction Modeling and Simulation. M Liu, L Belfore, Y Shen, and M. Scerbo. In Selected Papers Presented at MODSIM World 2009 Conference and Expo, pages 135-140. NASA, Virginia Beach, Virginia, US, 2010. [link]

  • Multi-frame Super Resolution Based on Block Motion Vector Processing and Kernel Constrained Convex Set Projection. M Liu and Y Shen. In Proceedings of SPIE Visual Communications and Image Processing (VCIP), volume 7257, page 72571J, 2009 [link]

  •  A Near Optimum Detection in Alpha-Stable Impulse Noise. X Li, Y Jiang, and M Liu, In IEEE Int’l Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 3305-3308, 2009. [link]

  • Bi-parameter CGM model for approximation of alpha-stable PDF. XT Li, J Sun, LW Jin, and M Liu. IEE Letter V44, Issue 18, August 2008 [link]

  • A Survey of Computerized Fetal Heart Rate Monitoring and Interpretation Techniques. M Li, Y Shen, and M. Scerbo.  Modeling and Simulation Capstone Conference, 2008 [pdf] 

  •  New Edge Detection Based on Pyramid-Structure Wavelet Transform. S Yi, H Cao, X Li, and M Liu. In SPIE Proc. Visual Information Processing XV, page 62460U, 2006. [link]

  • A New Texture Representation with Multi-scale Wavelet Feature. S Yi, H Cao, X Li, and M Liu. In SPIE Proc. Visual Information Processing XV, page 62460X, 2006. [link]

  • A zero-watermarking algorithm based on DWT and chaotic modulation. H Cao, H Xiang, X. Li, M Liu, S Yi, and F. Wei. In Defense and Security Symposium, pages 624716-624716. International Society for Optics and Photonics, 2006.  [link]

  • Super Resolution Reconstruction Based on Motion Estimation Error and Edge Adaptive Constraints. M Liu, H Cao, X Li, and S Yi, In SPIE Proc. Visual Information Processing XV, page 62460B, 2006. [link]

Patents

  • Online Learning System with Contextual Bandits Feedback and Latent State Dynamics. E. Nelson, D. Bouneffouf, D. Bhattacharjya, T. Gao and M Liu P20220497

  • A System and Method for Understanding Reinforcement Learning Policies by Identifying Strategic States. R Luss, A Dhurandhar and M Liu, P202103551

  • A System and Method for​ Integrating AI Planners and Reinforcement Learning Agents through AI planning annotation in Reinforcement Learning. J Lee, M Katz, D J Agravante, M Liu, T Klinger, M Campbell, S Soharbi. P202102362

  • System and Method for Optimal Trade Execution. Y Zhu, M Liu, P Dey, N Gaur and A Gray. P202006957 

  • Dialog Agent for Conducting Task-Oriented Computer-based Communications. M S Campbell, M Liu, B Srivastava. US Patent 10740370

  • A Super-Resolution Video Reconstruction Method. L Xiong, H Cao, M Liu, Chinese Patent CN1863272A 

Google Sites
Report abuse
Page details
Page updated
Google Sites
Report abuse