You only see what I want you to see.
You only see what I want you to see.
🎉 I am a PostDoc Research Fellow at Harvard Medical School under the guidance of Prof. Mengyu Wang. In Harvard AI and Robotic lab, my research focuses on video understanding, generation, and editing.
🎓 In Aug. 2024, I received my PhD degree in the Department of Computing at the Hong Kong Polytechnic University (PolyU), advised by Prof. Lei Zhang.
🎓In 2019, I received my Master’s degree in the School of Mathematics and Statistics at Xi'an JiaoTong University under the supervision of Prof. Deyu Meng.
Research Interests
GenAI: Image/video understanding, generation, editing 💥💥💥
AI for Science: vision language models for medical data 💥
Recognition: video detection, segmentation and understanding
Low-level vision: video rain/snow removal, low-light enhancement
Work Experience
09/2024-now, Postdoc research fellow in Harvard Medical School.
08/2023-09/2024, I worked as a Research Intern in OPPO Research Institute.
05/2019-11/2019, I worked as a research assistant in the Hong Kong Polytechnic University.
05/2018-09/2018 and 03/2020-11/2021, I worked as a Research Intern in DAMO Academy, Alibaba.
Publications
Chenxi Xie*, Minghan Li* (co-first author), Shuai Li, Yuhui Wu, Qiaosi Yi, Lei Zhang
Chenxi Xie*, Minghan LI* (co-first author), Hui Zeng, Jun Luo, and Lei Zhang
Chaodong Xiao*, Minghan LI* (co-first author), Zhengqiang Zhang, Deyu Meng, Lei Zhang
BoxVIS: Video Instance Segmentation with Box Annotations
Minghan LI, and Lei Zhang
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos (CVPR 2023)
One-to-Few Label Assignment for End-to-End Dense Detection (CVPR 2023)
Shuai LI, Minghan LI, Ruihuang LI, Chenhang He, and Lei Zhang
Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization (CVPR 2022 Oral)
Yabin Zhang, Minghan LI, Ruihuang LI, Kui Jia, and Lei Zhang
Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation (CVPR 2021)
Minghan LI, Shuai LI, Lida LI, and Lei Zhang
Survey on rain removal from videos or a single image (Information sciences 2022)
Hong Wang, Yichen Wu, Minghan LI, Qian Zhao, and Deyu Meng
Online Rain/Snow Removal from Surveillance Videos (TIP 2021)
Minghan LI, Xiangyong Cao, Qian Zhao, Lei Zhang and Deyu Meng
Awards
06/2025 CVPR 2025 Outstanding Reviewer (< 5%)
05/2025 ARVO 2025 travel grant
06/2019 Graduate with honor (Master)
11/2017 National Third Prize of Graduate Mathematical Modeling Competition
11/2014 National First Prize of National Undergraduate Mathematical Competition
07/2016 Excellent graduation thesis (Bachelor)
Invited Presentations
3D 视觉工坊, Sep-18-2025, Video Editing Benchmark
AI Time, Sep-12-2025, Video Editing Benchmark
Twelve Labs: Unified Video Segmentation and Video Object Segmentation
VALSE 2024: Outstanding Student Forum (优秀学生论坛)
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018-now
The IEEE/CVF International Conference on Computer Vision (ICCV) 2018-now
The European Conference on Computer Vision (ECCV) 2018-now
The International Conference on Learning Representations (ICLR) 2025
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NIPS) 2025
The Association for the Advancement of Artificial Intelligence (AAAI) 2024
The International Joint Conference on Artificial Intelligence (IJCAI) 2025
Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
IEEE Transactions on Image Processing (TIP)
Pattern Recognition (PR)
Engineering Applications of Artificial Intelligence (EAAI)
Expert Systems with Applications (ESWA)
Image and Vision Computing
Signal Processing
Presentation & Teaching