Mentors
Prof. Hyung-yi Lee
Dr. Hyung-yi Lee is a Professor in the Department of Electrical Engineering and the Department of Computer Science & Information Engineering at National Taiwan University (NTU), Taipei, Taiwan. He received both his M.S. and Ph.D. degrees from NTU.
His research focuses on advancing machine learning techniques, particularly deep learning, for speech and language processing. He and his team have made significant strides in developing self-supervised learning methods that reduce the reliance on annotated data, enabling more efficient training of speech recognition and understanding systems. Their work at NTU centers on developing language understanding and speech processing technologies using deep learning.
Prof. Beena Ahmed
Dr. Beena Ahmed is an Associate Professor in Signal Processing at the School of Electrical Engineering and Telecommunications, UNSW Sydney. She completed her B.Sc. in Electrical Engineering from UET Lahore in 1993 and her Ph.D. from UNSW in 2004. Before rejoining UNSW in 2017, she served as an Assistant Professor at Texas A&M University at Qatar. Her academic career is marked by significant recognition, including awards such as the Superstar of STEM (2019) and multiple best paper and teaching awards.
Dr. Ahmed’s research lies at the intersection of healthcare and engineering. She focuses on applying machine learning and signal processing to areas such as speech therapy, mental stress tracking, and cognitive monitoring. Her work addresses low-resource clinical domains using transfer learning and unsupervised methods. She has received international funding for projects involving wearable health technologies, speech disorder diagnostics, and sleep disorder detection. Her efforts have also led to the founding of Say66, a startup using AI to make speech therapy more accessible.
Prof. Helena Moniz
Dr. Helena Moniz is a prominent researcher in computational linguistics, speech processing, and responsible AI, with over 111 publications. She currently serves as the President of both the European (2021–) and International (2023–) Associations for Machine Translation, and is an Assistant Professor at the University of Lisbon. She also chairs the Ethics Committee at the Center for Responsible AI and coordinates the Bridge AI project.
As a linguist, Dr. Moniz has contributed to over 20 international projects at INESC-ID/CLUL since 2000 and led a long-term collaboration with Unbabel on linguistic quality assurance. She has received multiple awards, such as the 2007 Research Prize from the Portuguese Linguistics Society and best system awards at WASSA 2024 and DSTC 2023. Her international research experience spans visits to Columbia University, Trinity College Dublin, and Dublin City University. Her core interests lie in prosody, translation, post-editing, and ethics in AI.
Dr. Ahmed Ali
Dr. Ahmed Ali is a seasoned expert in speech processing with over two decades of experience in research and engineering. As of 2024, he serves as a Principal Researcher at the Saudi Data and AI Authority (SDAIA), where he contributes to the advancement of AI-driven solutions, particularly in the field of speech and language technologies.
Prior to joining SDAIA, Dr. Ali was a Principal Engineer at the Qatar Computing Research Institute (QCRI) from 2011 to 2024. During his tenure at QCRI, he played a pivotal role in large-scale projects in multilingual speech recognition, natural language processing, and language technologies for Arabic and other under-resourced languages. He is also associated with academic institutions such as Qatar Foundation (QF) and Hamad Bin Khalifa University (HBKU).
Dr. Mathew Magimai Doss
Dr. Mathew Magimai Doss has been a Senior Research Scientist at Idiap Research Institute, Martigny, Switzerland, since April 2007. He is also a co-founder of AudioSearch Sarl, Martigny, Switzerland, which was founded in April 2010.
He received his Master of Science (M.S.) by research in computer science and engineering from the Indian Institute of Technology, Madras, India, in 1999, and his pre-doctoral diploma and Docteur ès Sciences (Ph.D.) from Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland, in 2000 and 2005, respectively. He was a postdoctoral fellow at the International Computer Science Institute (ICSI), Berkeley, USA, from April 2006 to March 2007.
Dr. Rohan Kumar Das
Rohan Kumar Das worked as a Project Scientist at Assam Science Technology and Environment Council from 2010 to 2011. He received his Ph.D. from the Indian Institute of Technology (IIT) Guwahati in 2017, focusing on speaker verification using short utterances for practical, application-oriented systems. After completing his doctoral studies, he joined Kovid Research Labs (now acquired by Kaliber Labs) as a Data Scientist, contributing to speech analytics-based application services. Later that year, he joined the Human Language Technology Laboratory at the National University of Singapore as a Research Fellow, where he led the speaker verification group's research until March 2021. He is currently a Research and Development (R&D) Manager at Fortemedia, Singapore division, where he leads the research and product development unit.
Dr. Tanvina Patel
Tanvina Patel is a Senior Researcher at the Erasmus Medical Center (EMC), in collaboration with Delft University of Technology (TU Delft), the Netherlands. She also works as a Machine Learning Engineer at DataQueue, Netherlands.
She completed her postdoctoral research at Delft University of Technology, where she worked on inclusive ASR with the aim of improving the usability of ASR systems. Her other research interests include speech signal analysis and anti-spoofing for voice biometrics. Prior to her postdoc, she was a Data Scientist at Cogknit Semantics Pvt. Ltd., Bangalore, for more than four years. At Cogknit, she developed speech recognition systems (for both data-rich and low-resourced languages) and various speech technology applications that enhance human-computer interaction and solve market problems.
Her selected achievements include the TU Delft submission to the GramVaani ASR Challenge 2022, which ranked 3rd in the 'Self-Supervised Category', and Cogknit's ASR submission, which ranked 2nd at the Microsoft Low Resource Challenge in Indian Languages at INTERSPEECH'18. As part of her Ph.D., she participated in the ASVspoof 2015 Challenge, held at INTERSPEECH 2015 in Dresden, Germany, and developed the best-performing countermeasure for detecting natural vs. spoofed speech. She has more than 30 publications and is a reviewer for INTERSPEECH, ICASSP, Computer Speech & Language (Elsevier), Speech Communication, and Transactions on Information Forensics & Security.
Dr. Si-ioi Ng
Dr. Si-ioi Ng is currently a Postdoctoral Scholar at Arizona State University under the supervision of Prof. Visar Berisha. He received his Ph.D. in Electronic Engineering from The Chinese University of Hong Kong (CUHK) in 2023, where his doctoral research focused on the automatic detection of speech sound disorders in Cantonese-speaking pre-school children. He also earned his B.Eng. in Electronic Engineering from CUHK in 2018. His research interests include clinical speech analytics, pathological speech processing, and speech under physical stress.
Throughout his academic journey, Dr. Ng has been actively involved in teaching, mentoring undergraduate and postgraduate theses, and contributing to speech technology challenges and conferences such as INTERSPEECH, ASRU, and SLT. He has published widely in speech-related domains, with a particular emphasis on child speech and disordered speech recognition. His accomplishments are also reflected in numerous scholarships, travel grants, and innovation awards, and he is skilled in tools such as Kaldi, ESPnet, HuggingFace, and PyTorch.
Dr. Dayana Ribas
Dr. Dayana Ribas is a Scientific Researcher with the ViVoLab group at the University of Zaragoza, Spain. She received her Ph.D. in Digital Signal Processing in 2016, with doctoral research focused on robust speaker recognition in noisy environments. She also earned her M.Sc. in Signals and Systems and her B.Sc. in Telecommunication Engineering in Cuba. Her research interests include robust speech processing, speaker recognition, noise compensation for speech signals, and realism in speech processing.
Dr. Ribas has built an extensive international career, including a postdoctoral fellowship at INRIA in France and an invited researcher position at the University of Eastern Finland (UEF). She is an active contributor to the speech community, serving as a guest editor for the Speech Communication journal and as a reviewer for premier journals and conferences such as IEEE Transactions on Audio, Speech and Language Processing, INTERSPEECH, and ICASSP. She has also held organizational roles, such as Special Session Chair for INTERSPEECH 2016.