Minghan Li
Postdoc Research Fellow at Harvard University
Minghan Li
Postdoc Research Fellow at Harvard University
I am a PostDoc Research Fellow at Harvard Medical School under the guidance of Prof. Mengyu Wang. In Harvard AI and Robotic lab, my research focuses on building world model for dynamic environments, including visual understanding, generation, and editing, physcial AI and AI for science. During this period, my PI and I co-organized the Harvard AI & Robotics Seminar Series, which brings together researchers to discuss advances in artificial intelligence and robotics.
🎓 In Aug. 2024, I received my PhD degree in the Department of Computing at the Hong Kong Polytechnic University (HKPolyU), advised by Prof. Lei Zhang. PhD thesis link: Exploring Spatiotemporal Consistency and Unified Frameworks for Video Segmentation.
🎓In 2019, I received my Master’s degree in the School of Mathematics and Statistics at Xi'an JiaoTong University under the supervision of Prof. Deyu Meng.
Research Interests
Visual Intelligence: Image/video understanding, generation, editing
High-level generation: image and video editing, personalization
Mid-level recognition: video grounding, detection, segmentation
Low-level vision: video rain/snow removal, low-light enhancement
Physcial AI: perception/understanding → 3D/4D reconstruction → interaction
AI for Science: vision language models for medical data
Work Experience
Sep/2024 -- now Postdoc research fellow @ Harvard AI & Robotics Lab, Harvard Medical School
Aug/2023 -- Sep/2024 Research intern @ OPPO Research Institute
May/2019 -- Nov/2019 Research assistant @ Hong Kong Polytechnic University
May/2018 -- Sep/2018 Research intern @ DAMO Academy, Alibaba
Mar/2020 -- Nov/2021 Research intern @ DAMO Academy, Alibaba
Presentation & Teaching & Mentorship
AI in Medicine @ Graduate Course at Harvard Medical School (Boston, USA)
ARVO 2025 Oral Presentation (Slat Lake City, USA)
Valse 2024 Oral Presentation (Chongqing, China)
Kempner Institute of Harvard University (Boston, USA)
CVPR 2024 (Seattle, USA)
Valse 2018 (Dalian, China)
Awards
Jun/2025 CVPR 2025 Outstanding Reviewer (< 5%)
May/2025 ARVO 2025 travel grant
Jun/2019 Graduate with honor (Master)
Nov/2017 National Third Prize of Graduate Mathematical Modeling Competition
Nov/2014 National First Prize of National Undergraduate Mathematical Competition
Jul/2016 Excellent graduation thesis (Bachelor)
Invited Presentations
3D 视觉工坊 Sep-18-2025, Video Editing Benchmark
AI Time Sep-12-2025, Video Editing Benchmark
Twelve Labs Unified Video Segmentation and Video Object Segmentation
VALSE 2024 Outstanding Student Forum (优秀学生论坛)
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018-now
The IEEE/CVF International Conference on Computer Vision (ICCV) 2018-now
The European Conference on Computer Vision (ECCV) 2018-now
The International Conference on Learning Representations (ICLR) 2025
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NIPS) 2025
The Association for the Advancement of Artificial Intelligence (AAAI) 2024
The International Joint Conference on Artificial Intelligence (IJCAI) 2025
Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
IEEE Transactions on Image Processing (TIP)
Pattern Recognition (PR)
Engineering Applications of Artificial Intelligence (EAAI)
Expert Systems with Applications (ESWA)
Image and Vision Computing
Signal Processing