Professor
State Key Laboratory of Media Convergence and Communication, School of Information and Communication Engineering
Communication University of China
qimao@cuc.edu.cn
I am now a professor at the School of Information and Communication Engineering and State Key Laboratory of Media Convergence and Communication, Communication University of China.
I have obtained my P.H.D degree from Peking University in July, 2021. (Institute of Digital Media which is directed by Prof. Wen Gao)
I received the B.E. degree in Digital Media Technology and B.A. degree in Journalism in 2016 from Communication University of China.
My current research interests lie in
AIGC (Image/Video Controllable Generation and Editing)
Image/Video Compression based on Generative Models.
I am supervised by Prof. Siwei Ma and have been a visiting Ph.D. student at Vision and Learning Lab at University of California, Merced, under the supervision of Prof. Ming-Hsuan Yang
I am lucky to have opportunities to work with Prof. Mike Zheng Shou (NUS), Dr. Hsin-Ying Lee (Snap research), Dr. Hung-Yu Tseng (Meta), Dr.Jia-Bin Huang (University of Maryland), Dr. Shiqi Wang (City University of Hong Kong), Dr. Xinfeng Zhang (University of Chinese Academy of Sciences), and Dr. Shanshe Wang (Peking University).
Prospective students: I am always actively looking for strong and self-motivated PhD/MS/Undergraduate students to join our group! If you are interested in working with me, please feel free to drop me an email with your research interests and vita. Follow our group's WeChat official account:cuc-mipg.
[1] Junlong Gao, Zhimeng Huang, Qi Mao(*), Siwei Ma, Chuanmin Jia, Exploring Multimodal Knowledge for Image Compression via Large Foundation Models. IEEE Transactions on Image Processing (2025).
[2] Yuanhang Li, Qi Mao(*), Lan Chen, Zhen Fang, Lei Tian, Xinyan Xiao, Libiao Jin, Hua Wu. StarVid: Enhancing Semantic Alignment in Video Diffusion Models via Spatial and SynTactic Guided Attention Refocusing. IEEE Transactions on Multimedia (2025). (Accepted)
[3] Qi Mao, Lan Chen, Yuchao Gu, Zhen Fang, and Mike Zheng Shou. MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance. In Proceedings of the 32nd ACM International Conference on Multimedia(2024).
[4] Qi Mao, Chongyu Wang, Meng Wang, Shiqi Wang, Ruijie Chen, Libiao Jin, Siwei Ma. Scalable Face Image Coding via StyleGAN Prior. Towards Compression for Human-Machine Collaborative Vision. IEEE Transactions on Image Processing (2023).
[5] Q. Mao, and, S. Ma, Enhancing Style-Guided Image-to-Image Translation via Self-Supervised Metric Learning, IEEE Transactions on Multimedia (TMM), 2023.
6] J. Chang, J. Zhang, J. Li, S. Wang, Q. Mao, C. Jia, S. Ma, and W. Gao, Semantic-Aware Visual Decomposition for Image Coding, International Journal of Computer Vision (IJCV) , 2023.
[7] J. Chang, Z. Zhao, C. Jia, S. Wang, L. Yang, Q. Mao, J. Zhang, S. Ma, Conceptual compression via deep structure and texture synthesis, IEEE Transactions on Image Processing (TIP), 2022.[Paper]
[8] Q. Mao, H.-Y. Tseng, H.-Y. Lee, J.-B. Huang, S. Ma, and M.-H. Yang, Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors, International Journal of Computer Vision (IJCV) , 2022. [Paper] [Project]
[9] H.-Y. Lee*, H.-Y. Tseng*, Q. Mao*, J.-B. Huang, Y.-D. Lu, M. K. Singh, and M.-H. Yang, DRIT++: Diverse Image-to-Image Translation via Disentangled Representations, International Journal of Computer Vision (IJCV) ,2020.(* equal contribution)
[10] Q. Mao*, H.-Y. Lee*, H.-Y. Tseng*, S. Ma, and M.-H. Yang, Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis, 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, USA, Jun., 2019. (* equal contribution)[Paper] [Project]