You only see what I want you to see.
You only see what I want you to see.
I am a PostDoc Research Fellow at Harvard Medical School under the guidance of Prof. Mengyu Wang. In Harvard AI and Robotic lab, my research focuses on building world model for dynamic environments, including video understanding, generation, and editing, Embodied AI and AI for science. During this period, my PI and I co-organized the Harvard AI & Robotics Seminar Series, which brings together researchers to discuss advances in artificial intelligence and robotics.
🎓 In Aug. 2024, I received my PhD degree in the Department of Computing at the Hong Kong Polytechnic University (PolyU), advised by Prof. Lei Zhang.
🎓In 2019, I received my Master’s degree in the School of Mathematics and Statistics at Xi'an JiaoTong University under the supervision of Prof. Deyu Meng.
Research Interests
GenAI: Image/video understanding, generation, editing 💥💥💥
High-level generation: image and video editing, personalization
Mid-level recognition: video grounding, detection, segmentation
Low-level vision: video rain/snow removal, low-light enhancement
EmAI: understanding → reconstruction → interaction
AI for Science: vision language models for medical data 💥
Medical imaging: Data privacy, demography fairness, federated learning
LLMs for Genomics: Gene-Gene interaction, Gene-to-drug
Work Experience
09/2024-now, Postdoc research fellow in Harvard AI & Robotics Lab, Harvard Medical School.
08/2023-09/2024, I worked as a Research Intern in OPPO Research Institute.
05/2019-11/2019, I worked as a research assistant in the Hong Kong Polytechnic University.
05/2018-09/2018 and 03/2020-11/2021, I worked as a Research Intern in DAMO Academy, Alibaba.
Presentation & Teaching & Mentorship
AI in Medicine @ Graduate Course at Harvard Medical School (Boston, USA)
ARVO 2025 Oral Presentation (Slat Lake City, USA)
Valse 2024 Oral Presentation (Chongqing, China)
Kempner Institute of Harvard University (Boston, USA)
CVPR 2024 (Seattle, USA)
Valse 2018 (Dalian, China)
Awards
Jun/2025 CVPR 2025 Outstanding Reviewer (< 5%)
May/2025 ARVO 2025 travel grant
Jun/2019 Graduate with honor (Master)
Nov/2017 National Third Prize of Graduate Mathematical Modeling Competition
Nov/2014 National First Prize of National Undergraduate Mathematical Competition
Jul/2016 Excellent graduation thesis (Bachelor)
Invited Presentations
3D 视觉工坊, Sep-18-2025, Video Editing Benchmark
AI Time, Sep-12-2025, Video Editing Benchmark
Twelve Labs: Unified Video Segmentation and Video Object Segmentation
VALSE 2024: Outstanding Student Forum (优秀学生论坛)
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018-now
The IEEE/CVF International Conference on Computer Vision (ICCV) 2018-now
The European Conference on Computer Vision (ECCV) 2018-now
The International Conference on Learning Representations (ICLR) 2025
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NIPS) 2025
The Association for the Advancement of Artificial Intelligence (AAAI) 2024
The International Joint Conference on Artificial Intelligence (IJCAI) 2025
Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
IEEE Transactions on Image Processing (TIP)
Pattern Recognition (PR)
Engineering Applications of Artificial Intelligence (EAAI)
Expert Systems with Applications (ESWA)
Image and Vision Computing
Signal Processing