Welcome to my homepage! I am Haoang Li (pronounced as "Horn Lee"). I am currently an Assistant Professor at The Hong Kong University of Science and Technology (Guangzhou).
My research interests lie in 3D computer vision and robotics. Previously, I mainly focused on geometric SLAM and 3D reconstruction. Currently, my students and I are working on the following directions:
Embodied AI: 1) Vision-language action/manipulation; 2) Vision-language navigation; 3) Humanoid perception and control
AI-generated content: 3D/4D scene synthesis
3D/4D reconstruction and rendering: 1) Gaussian splatting/NeRF; 2) Dynamic SLAM; 3) Human avatar
We are working closely with Prof. Hesheng Wang from Shanghai Jiao Tong University.
I serve as an Associate Editor of ICRA, a Registration Chair of IROS 2025, and a reviewer for top-tier journals and conferences.
Our work was selected as a CVPR 2025 Best Paper Candidate. We also won the ICCV 2021 Doctoral Consortium Award, third place in the ICRA 2025 RoboDrive Challenge, and the CVPR 2021 Outstanding Reviewer Award, among others.
We are continuously looking for Ph.D. students, Research Assistants, and Interns/Visiting Students to work on the above directions. For details, please refer to Xiaohongshu (小红书).
Email: haoang.li.cuhk@gmail.com
Assistant Professor, Thrust of Robotics and Autonomous Systems/Thrust of Intelligent Transportation, The Hong Kong University of Science and Technology (Guangzhou), February 2024 – Present
Leader of the Intelligent Robot Perception and Navigation (IRPN) Lab
Postdoc, Department of Informatics, Technical University of Munich, August 2022 – February 2024
Supervisor: Prof. Daniel Cremers
Ph.D., Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, August 2018 – July 2022
Supervisor: Prof. Yun-Hui Liu
Visiting Ph.D., Department of Computer Science, ETH Zurich, November 2021 – February 2022
Supervisor: Prof. Marc Pollefeys
M.Eng., School of Remote Sensing and Information Engineering, Wuhan University, September 2016 – June 2018
Supervisor: Prof. Jian Yao
B.Eng., School of Remote Sensing and Information Engineering, Wuhan University, September 2012 – June 2016
Please refer to Google Scholar for the full publication list. Download links for code and/or datasets are available in the papers.
"J": Journal, "C": Conference, "U": Under-review/Pre-print
[C8] B. Liao, Z. Zhao, H. Li, Y. Zhou, Y. Zeng, H. Li, P. Liu, "Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World", IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[J2] H. Li, Y. Mai, M. Gao, J. He, Z. Liu, H. Wang, "Large-Scale LiDAR-Based Loop Closing via Combination of Equivariance and Invariance on SE(3)", IEEE/ASME Transactions on Mechatronics (TMECH).
[J1] K. Chen, J. Cao, Y. Li, H. Li, J. Ma, "G2-SDF: Geometry-Guided Neural Signed Distance Fields for Scalable and Detailed Reconstruction", IEEE Robotics and Automation Letters (RAL).
[C7] W. Song, J. Chen, P. Ding, H. Zhao, W. Zhao, Z. Zhong, Z. Ge, J. Ma, H. Li, "Accelerating Vision-language-action Model Integrated with Action Chunking via Parallel Decoding", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[C6] Z. Bi, K. Chen, C. Zheng, Y. Li, H. Li, J. Ma, "Interactive Navigation for Legged Manipulators with Learned Arm-Pushing Controller", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[C5] H. Liu, S. Guo, P. Mai, J. Cao, H. Li, J. Ma, "RoboDexVLM: Visual Language Model-Enabled Task Planning and Motion Control for Dexterous Robot Manipulation", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[C4] R. Wang, Y. Ma, Y. Yao, S. Tao, H. Li, Z. Zhu, Y. Liu, X. Zuo, "L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[C3] T. Li, T. Huai, L. Zhen, Y. Gao, H. Li, X. Zheng, "SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[C2] G. Jiang, T. Zhang, D. Li, Z. Zhao, H. Li, M. Li, H. Wang, "STG-Avatar: Animatable Human Avatars via Spacetime Gaussian", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[C1] T. Wu, Y. Miao, Z. Li, H. Zhao, K. Dang, J. Su, L. Yu, H. Li, "EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting", International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI).
[U4] W. Song, J. Chen, P. Ding, Y. Huang, H. Zhao, D. Wang, H. Li, "CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding".
[U3] W. Song, J. Chen, W. Li, X. He, X. Zheng, Z. Liu, H. Wang, H. Li, "A Dual-system Vision-Language-Action Model for Rational Manipulation".
[U2] H. Yan, P. Hou, Z. Zhong, X. Zheng, Z. Liu, H. Wang, H. Li, "CARE: Contextually-Aligned and Realistic 4D Scene Generation from a Single Image and Text".
[U1] Z. Zhong, J. Lu, X. Liu, R. Yu, X. Zheng, Z. Liu, H. Wang, H. Li, "Spatial-Aware and Viewpoint-Robust Vision-Language Navigation of Mobile Robots".
[J2] T. Huang, H. Li, L. Peng, Y. Liu, Y.-H. Liu, "Efficient and Robust Point Cloud Registration via Heuristics-Guided Parameter Search", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).
[C3] Z. Zhong, J. Cao, S. Gu, S. Xie, L. Luo, H. Zhao, G. Zhou, H. Li, Z. Yan, "Structured-NeRF: Hierarchical Scene Graph with Neural Representation", European Conference on Computer Vision (ECCV).
[C2] B. Liao, Z. Zhao, L. Chen, H. Li, D. Cremers, P. Liu, "GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation", European Conference on Computer Vision (ECCV).
[J1] J. Ham, M. Kim, S. Kang, K. Joo, H. Li, P. Kim, "San Francisco World: Leveraging Structural Regularities of Slope for 3-DoF Visual Compass", IEEE Robotics and Automation Letters (RAL).
[C1] L. Cheng, J. Hu, H. Yan, M. Gladkova, T. Huang, Y.-H. Liu, D. Cremers, H. Li, "Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[U2] H. Li, X. Meng, X. Zuo, Z. Liu, H. Wang, D. Cremers, "PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments".
[U1] W. Li, W. Chen, S. Qian, J. Chen, D. Cremers, H. Li, "DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair".
[C13] Haoang Li*, Jinghu Dong*, Binghui Wen*, Ming Gao*, Tianyu Huang, Yun-Hui Liu, and Daniel Cremers, "DDIT: Semantic Scene Completion via Deformable Deep Implicit Templates," IEEE/CVF International Conference on Computer Vision (ICCV), 2023.
[J3] Haoang Li, Ji Zhao, Jean-Charles Bazin, Pyojin Kim, Kyungdon Joo, Zhenjun Zhao, and Yun-Hui Liu, "Hong Kong World: Leveraging Structural Regularity for Line-based SLAM," IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.
[C12] Tianyu Huang*, Haoang Li*, Kejing He, Congying Sui, Bin Li, and Yun-Hui Liu, "Learning Accurate 3D Shape Based on Stereo Polarimetric Imaging," IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
[J2] Haoang Li, Ji Zhao, Jean-Charles Bazin, and Yun-Hui Liu, "Quasi-globally Optimal and Near/True Real-time Vanishing Point Estimation in Manhattan World," IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022.
[C11] Wen Chen*, Haoang Li*, Qiang Nie, and Yun-Hui Liu, "Deterministic Point Cloud Registration via Novel Transformation Decomposition," IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
[C10] Haoang Li*, Kai Chen*, Pyojin Kim, Kuk-Jin Yoon, Zhe Liu, Kyungdon Joo, and Yun-Hui Liu, "Learning Icosahedral Spherical Probability Map Based on Bingham Mixture Model for Vanishing Point Estimation," IEEE/CVF International Conference on Computer Vision (ICCV), 2021.
[C9] Haoang Li, Kai Chen, Ji Zhao, Jiangliu Wang, Pyojin Kim, Zhe Liu, and Yun-Hui Liu, "Learning to Identify Correct 2D-2D Line Correspondences on Sphere," IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
[J1] Haoang Li, Ji Zhao, Jean-Charles Bazin, and Yun-Hui Liu, "Robust Estimation of Absolute Camera Pose via Intersection Constraint and Flow Consensus," IEEE Transactions on Image Processing (TIP), 2020.
[C8] Haoang Li, Pyojin Kim, Ji Zhao, Kyungdon Joo, Zhipeng Cai, Zhe Liu, and Yun-Hui Liu, "Globally Optimal and Efficient Vanishing Point Estimation in Atlanta World," European Conference on Computer Vision (ECCV), 2020.
[C7] Haoang Li, Ji Zhao, Jean-Charles Bazin, Wen Chen, Zhe Liu, and Yun-Hui Liu, "Quasi-globally Optimal and Efficient Vanishing Point Estimation in Manhattan World," IEEE/CVF International Conference on Computer Vision (ICCV), oral presentation, 2019.
[C6] Haoang Li, Wen Chen, Ji Zhao, Jean-Charles Bazin, Lei Luo, Zhe Liu, and Yun-Hui Liu, "Robust and Efficient Estimation of Absolute Camera Pose for Monocular Visual Odometry," IEEE International Conference on Robotics and Automation (ICRA), 2020.
[C5] Haoang Li, Ji Zhao, Jean-Charles Bazin, Wen Chen, Kai Chen, and Yun-Hui Liu, "Line-based Absolute and Relative Camera Pose Estimation in Structured Environments," IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019.
[C4] Haoang Li, Yazhou Xing, Ji Zhao, Jean-Charles Bazin, Zhe Liu, and Yun-Hui Liu, "Leveraging Structural Regularity of Atlanta World for Monocular SLAM," IEEE International Conference on Robotics and Automation (ICRA), 2019.
[C3] Haoang Li, Ji Zhao, Jean-Charles Bazin, Lei Luo, Junlin Wu, and Jian Yao, "Robust Camera Pose Estimation via Consensus on Ray Bundle and Vector Field," IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018.
[C2] Haoang Li, Jian Yao, Jean-Charles Bazin, Xiaohu Lu, Yazhou Xing, and Kang Liu, "A Monocular SLAM System Leveraging Structural Regularity in Manhattan World," IEEE International Conference on Robotics and Automation (ICRA), 2018.
[C1] Haoang Li, Jian Yao, Xiaohu Lu, and Junlin Wu, "Combining Points and Lines for Camera Pose Estimation and Optimization in Monocular Visual Odometry," IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017.