Organizers

Dr. Titouan Parcollet

Titouan is a Research Scientist at the Samsung AI Research center in Cambridge (UK) and a visiting scholar at the Cambridge Machine Learning Systems Lab at the University of Cambridge (UK). Previously, he was an Associate Professor in computer science at the Laboratoire Informatique d’Avignon (LIA) at Avignon University (FR). He was also a senior research associate at the University of Oxford (UK) within the Oxford Machine Learning Systems group. He received his PhD in computer science from the University of Avignon (FR), in partnership with Orkis, focusing on quaternion neural networks, automatic speech recognition, and representation learning. His current work involves efficient speech recognition, federated learning, and self-supervised learning. He is also currently collaborating with the University of Montréal (Mila, QC, Canada) as the co-leader of the SpeechBrain project.

Dr. Paola Garcia

Paola joined Johns Hopkins University in 2018, after extensive research experience in academia and industry, including highly regarded laboratories at Agnitio and Nuance Communications. She led a team of 20+ researchers from four of the best laboratories worldwide in far-field speech diarization and speaker recognition under the auspices of the JHU summer workshop 2019 in Montreal, Canada. She was also a researcher at Tec de Monterrey, Campus Monterrey, Mexico for ten years. She was a visiting scholar at Georgia Institute of Technology (2009) and Carnegie Mellon (2011). Recently, she has been working on children’s speech, including child speech recognition and diarization in day-long recordings. She collaborates with DARCLE.org and CCWD, which analyze child-centered speech. She is also part of the JHU CHiME5, CHiME6, SRE18, SRE19, SRE20, and SRE21 teams. Her interests include diarization, speech recognition, speaker recognition, machine learning, and language processing.

Assoc. Prof. Xie Chen

Xie is currently a Tenure-Track Associate Professor in the Department of Computer Science and Engineering at Shanghai Jiao Tong University (SJTU), China. He obtained his Bachelor's degree in the Electronic Engineering department from Xiamen University in 2009, a Master's degree in the Electronic Engineering department from Tsinghua University in 2012, and a Ph.D. degree in the Information Engineering department at Cambridge University (U.K.) in 2017. Prior to joining SJTU, he worked at Cambridge University as a Research Associate from 2017 to 2018, and in the speech and language research group at Microsoft as a senior and principal researcher from 2018 to 2021. His main research interest lies in deep learning, especially its application to speech processing, including speech recognition and synthesis.

Dr. Marcely Zanon Boito

Marcely is currently a post-doctoral researcher at NAVER Labs Europe.

She holds two bachelor's degrees, in Computer Science (UFRGS, Brazil) and Information Systems Engineering (Grenoble INP, France), and she received her Master's degree in Artificial Intelligence from the University Grenoble Alpes (France) in 2017. She received her Ph.D. in computer science from the same institution in 2021. Prior to joining NAVER Labs, she worked as a post-doctoral researcher at Avignon University.

Her main research interests are self-supervised learning models for speech processing, speech translation, and NLP and speech processing for under-resourced languages. She is currently part of the Horizon Europe UTTER project.

Dr. Po-Yao (Bernie) Huang

Bernie is a research scientist at Facebook AI Research (FAIR) Labs. He received his Ph.D. degree from the School of Computer Science at Carnegie Mellon University in 2021. Prior to his Ph.D. studies, he worked as a researcher at MediaTek, building speech enhancement and recognition algorithms for mobile devices. His current research interest is self-supervised multimodal machine learning. He is particularly interested in bridging speech, computer vision, and natural language processing for the tasks of audio event detection, multimodal machine translation, cross-modal retrieval, and large-scale multimodal data mining.

Prof. Yannick Estève

Yannick received his M.S. (1998) in computer science from Aix-Marseilles University and his Ph.D. (2002) from Avignon University, France.

He joined Le Mans Université (LIUM lab) in 2003 as an associate professor and became a full professor in 2010. He moved to Avignon University in 2019 and has been the head of the Computer Science Laboratory of Avignon (LIA) since 2020. He has authored or co-authored more than 150 journal and conference papers in speech and language processing.

Dr. Tara Sainath

Tara Sainath received her S.B., M.Eng., and PhD in Electrical Engineering and Computer Science (EECS) from MIT. After her PhD, she spent five years in the Speech and Language Algorithms group at IBM T.J. Watson Research Center before joining Google Research. She served as a Program Chair for ICLR in 2017 and 2018, and she has co-organized numerous special sessions and workshops for speech and machine learning conferences. She has also served as a member of the IEEE Speech and Language Processing Technical Committee (SLTC) and as an Associate Editor for IEEE/ACM Transactions on Audio, Speech, and Language Processing. She is an IEEE and ISCA Fellow, and she is the recipient of the 2021 IEEE SPS Industrial Innovation Award as well as the 2022 IEEE SPS Signal Processing Magazine Best Paper Award. She is currently a Principal Research Scientist at Google, working on applications of deep neural networks for automatic speech recognition.
