Selected Publications
Conferences
Chong Zhang*, Yukun Ma*, Qian Chen, Wen Wang, Shengkui Zhao, Zexu Pan, Hao Wang, Chongjia Ni, Trung Hieu Nguyen, Kun Zhou, Yidi Jiang, Chaohong Tan, Zhifu Gao, Zhihao Du, Bin Ma. 2025
Yidi Jiang, Qian Chen, Shengpeng Ji, Yu Xi, Wen Wang, Chong Zhang, Xianghu Yue, ShiLiang Zhang, Haizhou Li.
ACL 2025 Main.
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter-Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024.
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Qian Chen, Wen Wang, Eng Siong Chng, Bin Ma.
Proc. INTERSPEECH 2023.
Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma.
Proc. INTERSPEECH 2023.
Zhao Yang, Dianwen Ng, Chong Zhang, Xiao Fu, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma, Jizhong Zhao.
Proc. INTERSPEECH 2023.
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng and Bin Ma.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
Yukun Ma, Trung Hieu Nguyen, Jinjie Ni, Wen Wang, Qian Chen, Chong Zhang and Bin Ma.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
Journals/Transactions
C. Zhang, P. Lim, A. K. Qin, and K. C. Tan
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), vol. 28, pp. 2306-2318, Oct. 2017.
C. Zhang, K. C. Tan, H. Li, and G. S. Hong
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), vol. 30, no. 1, pp. 109-122, Jan. 2019.
Y. Ma, C. Zhang*, Q. Chen, W. Wang and B. Ma
IEEE Signal Processing Letters, vol. 31, pp. 1740-1744, 2024, doi: 10.1109/LSP.2024.3419719.
Technical Reports
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation [paper][code]
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction [paper]