Si-ioi (Herman) NG
Postdoctoral Scholar at College of Health Solutions, Arizona State University
I'm a postdoctoral scholar working with Prof. Visar Berisha and Prof. Julie Liss at the College of Health Solutions at Arizona State University (ASU). I obtained my B.Eng. and Ph.D. from the Department of Electronic Engineering, The Chinese University of Hong Kong (CUHK), working with Prof. Tan Lee. My research combines spoken-language processing, clinical speech science, and AI to assess various health conditions.
Contact: siioing@asu.edu; LinkedIn: Here
ASU (Co-lecturer): EEE 598 - Special Topics: Speech and Audio Processing and Perception (My lectures: Link).
CUHK (Graduate Teaching Assistant):
ENGG 2030 - Signals and Systems
ELEG 2201 - Digital Circuits and Systems; ELEG 2202 - Fundamentals of Electric Circuits
ELEG 4998, ELEG 4999 - Final Year Projects
Conference organizing committee: ISCSLP 2026 Tutorial Co-Chair
Journal reviewer: Computer Speech & Language, Journal of Speech, Language, and Hearing Research (JSLHR), The Journal of Prevention of Alzheimer's Disease, Frontiers in Education
Conference reviewer: ICASSP 2025-, Interspeech 2024-, ISCSLP 2024-, IJCNN 2025
Mentor: 11th Doctoral Consortium, Interspeech 2025
Panelist: Special Session on "Connecting Speech Science & Technology for Children's Speech", Interspeech 2025
Responsible development and translation of clinical speech analytics, at Interspeech 2024 Tutorial session (Link)
Responsible development of clinical speech analytics, at University of Pennsylvania (2024), at CUHK (2025)
Automatic extraction of content information units for picture description tasks, at University of Wisconsin-Madison (2025)
Ph.D Thesis
S. -I. Ng, “Automatic Detection of Speech Sound Disorder in Cantonese-speaking Pre-School Children,” PhD Thesis, The Chinese University of Hong Kong, 2023.
S. -I. Ng, L. Xu, I. Siegert, N. Cummins, N. R. Benway, J. Liss and V. Berisha, “An End-to-End Overview of Clinical Speech AI”, IEEE Transactions on Audio, Speech and Language Processing, vol. 34, pp. 1016-1048, 2026 (Overview Paper).
S. -I. Ng, C. W. -Y. Ng, J. Wang and T. Lee, “Automatic Detection of Speech Sound Disorder in Cantonese-speaking Pre-School Children,” in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 4355-4368, 2024.
S. -I. Ng, C. W. -Y. Ng and T. Lee, “A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children,” in Proc. Interspeech, 2023, pp. 4643–4647.
S. -I. Ng, C. W. -Y. Ng, J. Wang and T. Lee, “Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations,” in Proc. Interspeech, 2022, pp. 2853–2857.
S. -I. Ng, C. W. -Y. Ng, J. Li and T. Lee, “Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding,” in Proc. Interspeech, 2021, pp. 2931–2935.
S. -I. Ng and T. Lee, “Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder,” in Proc. Interspeech, 2020, pp. 4476–4480. (Best Student Paper Award Finalist)
S. -I. Ng*, C. W. -Y. Ng*, J. Wang, T. Lee, K. Y. -S. Lee and M. C. -F. Tong, “CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment,” in Proc. Interspeech, 2020, pp. 424–428. (*Co-First Author)
J. Wang, S. -I. Ng, D. Tao, C. W. -Y. Ng and T. Lee, “A study on acoustic modeling for child speech based on multi-task learning,” in Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP), 2018, pp. 389–393.
Speech AI for assessing neurodegenerative disease
S. -I. Ng, P. S. Ambadi, P. Kadambi, K. D. Mueller, J. Liss and V. Berisha, “Characterizing Maximum Achievable Performance in Clinical Speech AI” (Under Preparation)
S. -I. Ng, P. S. Ambadi, K. D. Mueller, J. Liss and V. Berisha, “Automated Extraction of Spatio-Semantic Graphs for Identifying Cognitive Impairment,” Proc. ICASSP, 2025, pp. 1-5.
S. -I. Ng, L. Xu, K. D. Mueller, J. Liss and V. Berisha, “Segmental and Suprasegmental Speech Foundation Models for Classifying Cognitive Risk Factors: Evaluating Out-of-the-Box Performance”, Proc. Interspeech, 2024, pp. 917-921.
Speech production under physical stress
S. -I. Ng, R. Ma, T. Lee and R. K. -W. Sum, “Acoustical Analysis of Speech Under Physical Stress in Relation to Physical Activities and Physical Literacy,” in Proc. Speech Prosody, 2022, pp. 200–204.
R. Ma, S. -I. Ng, T. Lee, Y. J. Yang, and R. K. -W. Sum, "Validation of A Speech Database for Assessing College Students' Physical Competence under The Concept of Physical Literacy," in International Journal of Environmental Research and Public Health, 19(12), 7046, 2022.
Speaker Verification
J. Li, S. -I. Ng and T. Lee, “Improving Text-Independent Speaker Verification with Auxiliary Speakers Using Graph,” Proc. Automatic Speech Recognition and Understanding Workshop (ASRU), 2021, pp. 198-205.
I play cello and piano. I was a cellist in the Macau Youth Symphony Orchestra. I performed at Musica Riva Festival (Italy), Festival Orchestre Giovanili (Italy), The Prodigy Collective (Australia) and Flânderies Musicales de Reims (France). J. S. Bach, Brahms and Chopin are my favourite composers.
Other hobbies outside research and classical music include basketball, hiking/camping, coffee brewing, and photography.