Chong ZHANG
Hi! I am a Senior Algorithm Engineer in the speech lab at Alibaba. I received my Ph.D. and Master degree from National University of Singapore in 2018 and 2012, respectively, under the supervision of Prof. Tan Kay Chen and Prof. Li Haizhou, and received my bachelor degree from Harbin Institute of Technology in 2011. My research interests include speech recognition, spoken language processing, multimodal representation learning, machine learning, artificial intelligence.
📂 Google Scholar 📗 ORCiD 📎 Linkedin 📑 OpenReview 📑 Semantic Scholar ✉️ chong.zhang AT alibaba-inc.com
Education
2013 - 2018 Ph.D., National University of Singapore
Supervisors: Chair Professor Tan Kay Chen (IEEE Fellow) and Presidential Chair Professor Li Haizhou (IEEE Fellow, ISCA Fellow, FSEng)
2011 - 2012 M.Sc., National University of Singapore
2007 - 2011 B.Eng, Harbin Institute of Technology
News
2023-12-22 ICASSP2024|通义实验室语音团队入选论文速览
2023-12-20 EMNLP2023|通义实验室语音团队入选论文解析
2023-06-13 INTERSPEECH2023|达摩院语音实验室入选论文全况速览
2023-03-01 ICASSP2023|达摩院语音实验室入选论文全况速览
Recent Preprints [Google Scholar]
Selected Publications
Conferences
Are Soft Prompts Good Zero-shot Learners for Speech Recognition? [paper]
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter-Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma.
ICASSP 2024.
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Qian Chen, Wen Wang, Eng Siong Chng, Bin Ma.
Proc. INTERSPEECH 2023.
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention [paper]
Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma.
Proc. INTERSPEECH 2023.
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition
Zhao Yang, Dianwen Ng, Chong Zhang, Xiao Fu, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma, Jizhong Zhao.
Proc. INTERSPEECH 2023.
A Unified Recognition and Correction Model under Noisy and Accent Speech Conditions
Zhao Yang, Dianwen Ng, Chong Zhang, Rui Jiang, Wei Xi, Yukun Ma, Chongjia Ni, Jizhong Zhao, Bin Ma, Eng Siong Chng.
Proc. INTERSPEECH 2023.
deHuBERT: Disentangling Noise In A Self-Supervised Model For Robust Speech Recognition [paper]
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng and Bin Ma.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , 2023.
Contrastive Speech Mixup For Low-Resource Keyword Spotting [paper]
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng and Bin Ma.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , 2023.
Auxiliary Pooling Layer For Spoken Language Understanding [paper]
Yukun Ma, Trung Hieu Nguyen, Jinjie Ni, Wen Wang, Qian Chen, Chong Zhang and Bin Ma.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , 2023.
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization [paper]
Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, and Bin Ma.
2022 Asia Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference, 2022.
Journals/Transactions
C. Zhang, P. Lim, A. K. Qin, and K. C. Tan
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), vol. 28, pp. 2306–2318, Oct 2017.
C. Zhang, K. C. Tan, H. Li, and G. S. Hong
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), vol. 30, no. 1, pp. 109-122, Jan. 2019.
Professional Services
Senior Member, IEEE (The Institute of Electrical and Electronics Engineers)
Invited Reviewer
Journals/Transactions Reviewing: TNNLS, TKDE, TASLP, TPAMI, TETCI, Neurocomputing, MSSP, etc.
Conferences Reviewing: ICASSP ('22, '23), Interspeech ('21, '22), EMNLP ('23), CEC ('16 - '21,'23), IJCNN ('16, '22), etc.