Activities

Education:

Ph.D. from Waseda University, School of Science and Engineering, February 2006, Supervisors: Profs. T. Kobayashi, Y. Matsuyama, Y. Sagisaka, and T. Matsushima
Master of Science from Waseda University, March 2001, Supervisor: Prof. Ichiro Ohba
Bachelor of Science from Waseda University, March 1999, Supervisor: Prof. Ichiro Ohba

Work Experience:

January 2021 - present: "Associate Professor," Language Technologies Institute at Carnegie Mellon University, Pittsburgh PA, USA
April 2025 - present: "Visiting Professor," Center for Language AI Research, Tohoku University, Japan
April 2025 - present: "Visiting Researcher," NHK Science & Technology Research Laboratories, Japan
September 2022 - present: "Courtesy Faculty," Department of Electrical and Computer Engineering at Carnegie Mellon University, Pittsburgh PA, USA
January 2021 - 2025: "Adjunct Associate Professor," Center for Language and Speech Processing at John Hopkins University, Baltimore MD, USA
July 2017 – December 2020: "Associate Research Professor," Department of Electrical and Computer Engineering at John Hopkins University, Baltimore MD, USA
October 2014 – June 2017: "Senior Principal Research Scientist" (formerly, Senior Principal Member Research Staff) at Mitsubishi Electric Research Laboratories (MERL), Cambridge MA, USA (Group manager: Dr. Anthony Vetro. Team leader: Dr. John R. Hershey)
April 2013 – September 2014: “Principal Member Research Staff” at Mitsubishi Electric Research Laboratories (MERL), Cambridge MA, USA (Group manager: Dr. Anthony Vetro. Team leader: Dr. John R. Hershey)
January 2012 – March 2013: “Visiting Member Research Staff” at Mitsubishi Electric Research Laboratories (MERL), Cambridge MA, USA (Group manager: Dr. Anthony Vetro. Team leader: Dr. John R. Hershey)
July 2011 – November 2011: “Research Scientist” at Nippon Telegraph and Telephone (NTT) Communication Science Laboratories, Signal Processing Research Group, Kyoto, Japan (Group leader: Dr. Atsushi Nakamura)
January 2009 – March 2009: “Visiting Scholar” at Georgia Institute of Technology, Atlanta, USA. (Supervisor: Professor Biing-Hwang (Fred) Juang)
April 2001 – June 2011: “Researcher” at Nippon Telegraph and Telephone (NTT) Communication Science Laboratories, Signal Processing Research Group, Kyoto, Japan (Group leader: Dr. Shoji Makino -> Dr. Masato Miyoshi -> Dr. Atsushi Nakamura)

Teaching:

"11751/18781 Speech Recognition and Understanding," Carnegie Mellon University, USA, Fall 2022--2024
"11492/11692/18495: Speech Technology for Conversational AI" Carnegie Mellon University, USA, Spring 2024
"11492/11692/18495: Speech Processing" Carnegie Mellon University, USA, Spring 2023
"11737: Multilingual Natural Language Processing," Carnegie Mellon University, USA, Spring 2022 with Graham Neubig and Alan W Black
"11751/18781 Speech Recognition and Understanding," Carnegie Mellon University, USA, Fall 2021 with Ian Lane
"DSTA: Multilingual Natural Language Processing," Carnegie Mellon University, USA, Spring 2021 with Graham Neubig and Alan W Black
"EN. 601.667: Introduction to Human Language Technology", Johns Hopkins University, USA, Fall 2019, 2020 with Philipp Koehn and other CLSP faculties
"End-to-end speech recognition" JHU Summer School on Human Language Technology (June 20, 2018) with Takaaki Hori
"EN. 520.666: Information Extraction", Johns Hopkins University, USA, Spring 2018--2020
"Machine Learning for Speech Processing," Department of Electronic Engineering, Tsinghua University, China, December 2012

Seminar:

"Scaling Multilingual Speech Recognition: From a Handful to Thousands of Languages," University of Southern California (host: Prof. Shri Narayanan), October 2025
"Case Study: How to Manage the Study of Foundation Models in Academia?" Nuts & Bolts Session at TTIC summer workshop on Foundations of Speech and Audio Foundation Models, September 2025
"Scaling Multilingual Speech Recognition: From a Handful to Thousands of Languages," Tohoku University (host: Prof. Jun Suzuki), July 2025
"Open Whisper-Style Speech Models: Transparency, Scalability, and Advancing Explainability," NTT Communication Science Laboratories (host: Dr. Marc Delcroix), July 2025
"Open Whisper-Style Speech Models: Transparency, Scalability, and Advancing Explainability," Kyoto University (host: Prof. Tatsuya Kawahara), June 2025
"Open Whisper-Style Speech Models: Transparency, Scalability, and Advancing Explainability," INESC-ID (host: Prof. Alberto Abad), June 2025
"Towards a robustly-evaluated and open ecosystem of audio foundation models," with Chris Donahue, Sony AI (host: Dr. Yuki Mitsufuji), May 2025
"Open Whisper-Style Speech Models: Transparency, Scalability, and Advancing Explainability," TTIC guest lecture (host: Prof. Karen Livescu), May 2025
"Open Whisper-Style Speech Models: Transparency, Scalability, and Advancing Explainability," Workshop on NL and Interactive Systems at Apple, May 2025
"Open Whisper-Style Speech Models: Transparency, Scalability, and Advancing Explainability," UC Berkeley (host: Prof. Gopala K. Anumanchipalli), May 2025
"Open Whisper-Style Speech Models: Transparency, Scalability, and Advancing Explainability," Conversational AI Reading Group at Mila, Febulary 2025
"Speech Self-Supervised Learning Using One Million Hours of Multilingual Data Across Four Thousand Languages," NLP Colloquium, Japan, January 2025.
"Reproducing Large Speech Foundation Models, " Ohio State University (host: Prf. Deliang Wang)
"Reproducing Large Speech Foundation Models," Chulalongkorn University, August 2024 (host: Prof. Ekapol Chuangsuwanich)
"Unifying Speech Processing Applications with Speech Foundation Models," DSTA Faculty Speaker Series, July 2024
"Toward Explainable Speech Foundation Models," The University of Rochester ECE Distinguished Lecture Series, April 2024 (host: Prof. Zhiyao Duan)
"Toward Explainable Speech Foundation Models," invited speaker at the IEEE ICASSP 2023 workshop Explainable Machine Learning for Speech and Audio, April 2024
"Unifying Speech Processing Applications with Speech Foundation Models," Amazon, January 2024 (host: Dr. Jin Liu)
"Explainable End-to-End Neural Networks for Far-Field Conversation Recognition,” Academia Sinica, Taiwan, December 2023 (host: Prof. Yu Tsao)
"Variational Bayesian Learning," invited speaker at the Symposium for Celebrating 40 Years of Bayesian Learning in Speech and Language Processing and Beyond, December 2023
"Attempts to reproduce large pre-trained speech models on an academic computing scale," Apple, September 2023 (host: Dr. Ahmed Abdelaziz)
"Attempts to reproduce large pre-trained speech models on an academic computing scale," Music and Audio Workshop, June 2023
"Attempts to reproduce large pre-trained speech models on an academic computing scale," panel talk at the IEEE ICASSP 2023 workshop Self-supervision in Audio, Speech and Beyond (SASB), June 2023
"Compositional framework for spoken language processing," Amazon Alexa Academy Distinguished Speaker Series, May 2023 (host: Jing Liu)
"CHiME Speech Separation and Recognition Challenge," ASJ Special Session, March 2023
"Controllable and Explainable End-to-End Speech Translation," Tencent, Dec 2022 (host: Dr. Raymond Yu)
"Controllable and Explainable End-to-End Speech Translation," SIG SLT Seminar, November 2022
"Explainable End-to-End Neural Networks for Far-Field Conversation Recognition," UT Austin, November 2022 (host. Prof. David Harwath)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," AIRS Company, Hyundai Motor Group, September 2022 (host: Dr. Byeong-Yeol Kim)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," Meta, August 2022
"Streaming Transformer for Speech Recognition, Understanding, and Translation," Nvidia, June 2022 (host: Dr. Boris Ginsburg)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," Amazon, April 2022 (host: Dr. Tao Zhang)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," Pindrop, April 2022 (host: Dr. Ganesh Sivaraman)
"End-to-End Speech Recognition and Spoken Language Understanding," JPMC Monthly Speaker Series, Febulary 2022 (host: Dr. Chandra Dhir)
"Simplifying Speech Recognition with Non-Autoregressive Modeling," Meta (FB) Speech & Audio Summit, December 2021
Panelist of "Convergence of Machine Learning and Signal Processing," APSIPA US Local Chapter, November 2021 (host: Prof. Anthony Kuh)
"Multi-Speaker Conversation Recognition based on End-to-End Neural Networks," Georgia Tech CSIP Seminar, November 2021
"Simplifying Speech Recognition with Non-Autoregressive Modeling," Amazon Pittsburgh, October 2021 (host: Dr. Markus Müller)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," Tencent Seattle AI Lab Online Talk, July 2021 (host: Dr. Meng Yu)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," Speech Science Forum at UCL, May 2021
"Introduction of ESPnet, end-to-end speech processing toolkit: new features, broadened applications, performance improvements, and future challenges," AIST AI Seminar, March 2021 (host: Dr. Jun Ogata)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," CUED Speech Group Seminars, Febluary 2021 (host: Prof. Kate Knill)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," Indiana University Bloomington Computer Science Department Colloquium, October 2020 (host: Prof. Donald S. Williamson)
"End-to-End Speech Processing: From Pipeline to Integrated Architecture," CMU, USA, October 2019 (host: Prof. Florian Metze)
"End-to-End Speech Processing: From Pipeline to Integrated Architecture," IBM, USA, August 2019 (host: Dr. Xiaodong Cui)
"End-to-End Speech Processing: From Pipeline to Integrated Architecture," University of Sheffield, UK, May 2019 (host: Prof. Jon Barker)
"Multichannel End-to-End Speech Recognition,” Toyota Technological Institute at Chicago (TTIC), USA, November 2018 (host: Prof. Karen Livescu)
"Multichannel End-to-End Speech Recognition,” Paderborn University, Germany, October 2018 (host: Prof. Reinhold Häb-Umbach)
"Hybrid CTC/Attention Architecture for End-to-End Speech Recognition," National Chiao Tung University, Taiwan, October 2018 (host: Prof. Jen-Tzung Chien)
"Multichannel End-to-End Speech Recognition,” Academia Sinica, Taiwan, October 2018 (host: Prof. Yu Tsao)
"Distant-Talk Speech Processing Toward Natural Conversation Understanding,” Johns Hopkins University, USA, January 2017 (host: Prof. Najim Dehak)
"Pushing the envelope at both ends — beamforming acoustic models and joint CTC/Attention schemes for end-to-end ASR,” Tokyo Institute of Technology, Japan, December. 2016 (host: Prof. Koichi Shinoda)
"Recent activities of distant talk speech recognition,” Nanyang Technological University, Singapore, March. 2016 (host: Dr. Xiong Xiao)
"Recent activities of distant talk speech recognition,” Shanghai Jiao Tong University, China, March. 2016 (host: Prof. Kai Yu)
“Practical Bayesian methods for speech and language processing,” Brno University of Technology, Czech Republic, September. 2015 (host: Prof. Jan "Honza" Cernocky)
“Recent trends in far-field speech recognition,” Nara Institute of Science and Technology, Japan, March. 2015 (host: Profs. Kevin Duh and Satoshi Nakamura)
“Recent trends on noise robust speech recognition,” Tokyo Institute of Technology, Japan, October. 2014 (host: Prof. Takahiro Shinozaki)
“Bayesian Learning for Speech and Language Processing,” The Institute of Statistical Mathematics, Japan, March. 2013 (host: Prof. Tomoko Matsui)
“Bayesian Learning for Speech and Language Processing,” Ruhr-Universität Bochum, Germany, Sep. 2013 (host: Prof. Dorothea Kolossa)
“Bayesian Learning for Speech and Language Processing,” National Taiwan University, Taiwan, July. 2013 (host: Prof. Lin-shan Lee)
“Structural Bayesian linear regression for hidden Markov models” Speech Signal Processing Workshop 2013, Taiwan, June. 2013 (host: Prof. Jen-Tzung Chien)
The 4th Young Researchers Forum on ALAGIN Speech Processing Session, Japan, Dec. 2012 (host: Prof. Daisuke Saito)
“Bayesian Learning for Speech and Language Processing,” BBN Technology, USA, Jun. 2012 (host: Dr. Ivan Bulyko)
“Speech Recognition by Tracking Acoustical and Linguistic Environments,” MIT CSAIL Seminar, USA, April 2012 (host: Prof. Jim Glass)
“Bayesian Linear Regression For Hidden Markov Model Based On Optimizing Variational Bounds,” NICT, Japan, Nov. 2011 (host: Dr. Chiori Hori)
“Bayesian Linear Regression For Hidden Markov Model Based On Optimizing Variational Bounds,” Tsinghua University, China, Sep 2011 (host: Prof. Zhijian Ou)
“Incremental adaptation of speech recognition based on macroscopic time evolution system,” Seminar on Human Language Technology and Pattern Recognition, RWTH Aachen University, Germany, Sep 2011 (host: Prof. Ralf Schlueter)
“Incremental adaptation of speech recognition based on macroscopic time evolution system,” Seminar on Tokuda & Lee Laboratory, Nagoya Institute of Technology, Japan, June 2011 (host: Prof. Keiichi Tokuda)
“Incremental adaptation of speech recognition based on macroscopic time evolution system,” Seminar on Machine Intelligence Laboratory, Cambridge University Department of Engineering, UK, May 2011 (host: Dr. Kai Yu)
“Speech recognition based on a Bayesian approach,” Seminar on NHK Science & Technology Laboratories, Japan, September 2009 (host: Dr. Toru Imai)
“Variational Bayesian estimation and clustering for speech recognition,” Georgia tech. CSIP seminar, USA, February 2009 (host: Prof. Fred Juang).

Award and Notable Achievement:

Co-authored paper achieved the IEEE ICASSP Best Student Paper Award in 2026
Co-authored paper achieved the IEEE SPS Young Author Best Paper Award in 2025
Co-authored paper achieved the ISCA Interspeech Best Student Paper Award in 2025
Best Paper Award at IEEE SLT in 2024
EMNLP Best Paper Award in 2024
Judges' Award at the DCASE 2024 Challenge Task 6
ISCA Interspeech Best Paper Award in 2024
Ranked 1st place at the DCASE 2024 Challenge Task 6
Outstanding Reviewer Recognition at IEEE ICASSP in 2023
Co-authored paper achieved the IEEE Ganesh N. Ramaswamy Memorial Student Grant (2023)
Ranked 1st place at the DCASE 2023 Challenge Task 6A
6 papers are recognized as one of the top 3% of all papers accepted at IEEE ICASSP in 2023
Ranked 1st place at the STOP challenge tasks 1 and 3 (2023)
Best student paper at IEEE SLT in 2022
Best paper candidates (2 papers) at IEEE SLT in 2022
Ranked 1st place at The 2nd Clarity Enhancement Challenge (2022)
Computer Speech & Language Best Review Paper Award (2022)
Ranked 1st place at the IWSLT 2022 Dialect Speech Translation task (2022)
Ranked 1st place at the L3DAS22 challenge task 1 (2022)
Computer Speech & Language Best Review Paper Award (2021)
Ranked 2nd place at The DIHARD III challenge task 1 & 2 full set in 2021
Ranked 1st place at The DCASE 2020 challenge task 4 in 2020
Facebook Research Award (Towards On-Device AI) in 2020
Tencent AI Lab Rhino-Bird Gift Fund in 2020
ASRU best paper award at the IEEE ASRU in 2019
ASRU best paper candidate (2 papers) at the IEEE ASRU in 2019
Google Faculty Research Award in 2019
Tencent AI Lab Rhino-Bird Gift Fund in 2018
Facebook Research Award (Speech and Audio Technology for Voice Interaction and Video Understanding) in 2018
Co-authored paper achieved the ISCA Interspeech Best Student Paper Award in 2018
Ranked 1st place at The First DIHARD Speech Diarization Challenge track1 in 2018
Best paper candidates (2 papers) and best student paper candidate at the IEEE ASRU in 2017
Ranked 3rd place at 4th CHiME Challenge 6 channel track in 2016
Ranked 2nd place at 5th Dialog State Tracking Challenge (DSTC5) in 2016
Ranked 2nd place at 3rd CHiME Challenge in 2015
Ranked 1st place at REVERB Challenge Official Data Track (2nd overall) in 2014
Ranked 1st place at 2nd CHiME Challenge Track 2 in 2013
Co-authored paper achieved the IEEE ICASSP 2012 Student Paper Award in 2012
Best In-Category Nominee at the International Semantic Web Conference in 2011
Ranked 1st place at 1st CHiME Challenge in 2011
Co-authored paper achieved the IEEE Signal Processing Society Japan Chapter Student Paper Award in 2010
The TELECOM System Technology Award from the Telecommunications Advancement Foundation in 2006
The Itakura Award from the Acoustical Society of Japan (ASJ) in 2006
The Best Paper Award from the Institute of Electronics, Information, and Communication Engineers of Japan (IEICE) in 2004
The Awaya Award from the Acoustical Society of Japan (ASJ) in 2003

Academic Activity:

Membership

1. Fellow "for contributions to speech recognition technology," the Institute of Electrical and Electronics Engineers (IEEE)
2. Fellow "For wide ranging, fundamental contributions to research and leadership in speech recognition technologies," the International Speech Communication Association (ISCA)
3. Member, the Acoustical Society of Japan (ASJ)
4. Member, the Institute of Electronics, Information, and Communication Engineers of Japan (IEICE)

Organizer/Committee Member

1. General Chair, IEEE International Conference on Audio, Speech, and Language Intelligence (ASLI) 2027
2. Co-organizer, "SALMA 2026: Speech and Audio Language Models Workshop (2nd Edition)," at EMNLP 2026 Satellite Workshop
3. Lead Area Chair, INTERSPEECH 2025 (2025.10 -- 2027.09)
4. Co-organizer, "The Joint Workshop on HSCMA and CHiME 2026," at ICASSP 2026 Satellite Workshop
5. Co-Organizers & Academic Partners, "The 2025 LRAC Challenge," at ICASSP 2026 Satellite Workshop
6. Challenge co-organizer, "URGENT Challenge Universality, Robustness, and Generalizability of speech EnhancemeNT systems," at ICASSP 2026 Grand Challenge
7. Evaluation Committee for the Best Paper Award of the Speech Communication in 2025
8. Challenge co-organizer, "MISP 2025 challenge," at Interspeech 2025
9. Area Chair, IJCAI-25 (2024.11 -- 2025.08)
10. Chair, the IEEE Signal Processing Society Speech and Language Technical Committee (IEEE SLTC) (2025.1 - 2026.12)
11. Challenge co-organizer, "The ML-SUPERB 2.0 Challenge: Towards Fair and Robust Speech Processing," at Interspeech 2025
12. Challenge co-organizer, "URGENT Challenge Universality, Robustness, and Generalizability of speech EnhancemeNT systems," at Interspeech 2025
13. Lead Area Chair, INTERSPEECH 2025 (2024.09 -- 2025.08)
14. Special session co-organizer, "Self-Supervised Learning for Conversational Speech Processing," at ICASSP 2025
15. General Co-Chair, IEEE ICASSP 2028 (2024.05 -- 2028.05)
16. NCSA’s DeltaAI External Advisory Committee (2024.04--)
17. Co-organizer, "Audio Imagination" at NeurIPS 2024 Workshop
18. Co-organizer, "CHiME 2024 Workshop" at Insterpseech Sattelite Workshop 2024
19. Co-organizer, "SynData4GenAI" at Insterpseech Sattelite Workshop 2024
20. Challenge co-organizer, "Speech Processing Using Discrete Speech Units" at Interspeech 2024
21. Special session co-organizer, "Spoken Language Models for Universal Speech Processing" at Interspeech 2024
22. Lead Area Chair, INTERSPEECH 2024 (2023.11 -- 2024.09)
23. Vice chair, the IEEE Signal Processing Society Speech and Language Technical Committee (IEEE SLTC) (2024.1 - 2024.12)
24. Senior Area Chair "Speech and Multimodality" in EMNLP 2023
25. ISCA Fellows Selection Committee (2023--24)
26. Area Chair "Speech and Multimodality" in ACL 2023
27. Co-Area Chair, INTERSPEECH 2023 (2022.10 -- 2023.08)
28. Member, the IEEE Signal Processing Society Speech and Language Technical Committee (IEEE SLTC) (2023.1 - 2025.12)
29. Organizing Committee, SANE 2022 - Speech and Audio in the Northeast (2022.10)
30. Special session co-organizer, "Resource-efficient Real-time Neural Speech Separation," at ICASSP 2023
31. International Liaison Committee for 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (2022.9 - 2023.12)
32. TPC Co-Chairs, 14th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (2022.5 -- 2022.11)
33. Area Chair, 2022 International Workshop on Acoustic Signal Enhancement (IWAENC 2022) (2022.5-2022.10)
34. Technical Chairs, 2022 IEEE Spoken Language Technology Workshop (SLT 2022) (2021.10 - 2023.1)
35. Publication Chair, Interspeech 2024
36. Special session co-organizer, "Non-Autoregressive Sequential Modeling for Speech Processing" at Interspeech 2021
37. Special session co-organizer, "Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing (ConferencingSpeech 2021)" at Interspeech 2021
38. TPC Chairs, 13th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (2020.4 -- 2020.12)
39. Co-organizer, NeurIPS 2020 workshop "Self-Supervised Learning for Speech and Audio Processing"
40. Area Chair, IEEE Spoken Language Technology Workshop (IEEE SLT 2021) (2020.8 -- 2021.1)
41. Senior Program Committee Member, the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021) (2020.8 -- 2021.2)
42. Area Chair, the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021) (2020.8 -- 2021.4)
43. Senior Program Committee Member, the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI 2020) (2019.11 -- 2020.7)
44. Area Chair, the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) (2019.8 -- 2020.2)
45. APSIPA Distinguished Lecturers (2019-2020)
46. IEEE Signal Processing Society Liaison for 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (2018.9 - 2019.12)
47. Organizing Committee, 2018 SANE 2018 - Speech and Audio in the Northeast (2018.5-2018.10)
48. Member, the IEEE Signal Processing Society Machine Learning for Signal Processing Technical Committee (IEEE MLSP) (2018.1 - 2020.12)
49. Senior Member, Co-Leader, "Multilingual End-to-end ASR for Incomplete Data" at 2018 Jelinek Summer Workshop on Speech and Language Technology (2017.12--2018.8)
50. Area Chair "Vision, Robotics, Multimodal, Grounding and Speech" in ACL 2018
51. Special session co-organizer, "Multi-Microphone Speech Recognition" at ICASSP 2018.
52. Program Chair, the 2017 IEEE International Workshop on Machine Learning for Signal Processing (MLSP'17) (2016.9 - 2017.9)
53. Technical Chair, IEEE ASRU 2017 (2016.5--2017.12)
54. Organizing Committee, the 4th International Workshop on Speech Processing in Everyday Environments (CHiME 2016), (2016.5 - 2016.9)
55. Senior Affiliates, "Building Speech Recognition System from Untranscribed Data" at 2016 Jelinek Summer Workshop on Speech and Language Technology (2015.12--2016.8)
56. Chair, Industrial Membership Committee, APSIPA (2016-)
57. Organizing Committee, the 4th CHiME Speech Separation and Recognition Challenge (2016.4-2016.9)
58. Member, IEEE CIS Task Force on Computational Audio Processing (2015.12-)
59. Interactive Peer Reviewer, the 2016 Frederick Jelinek Memorial Summer Workshop on Speech and Language Technology (JSALT 2016), (2015.11)
60. Senior Member, "Far-Field Speech Enhancement and Recognition in Mismatched Settings" at 2015 Jelinek Summer Workshop on Speech and Language Technology (2014.12--2015.8)
61. Special session co-organizer, "Robust Speech Processing using Observation Uncertainty and Uncertainty Propagation" at Interspeech (2015)
62. Organizing Committee, The 3rd CHiME Speech Separation and Recognition Challenge at IEEE ASRU 2015 (2014.7-2015.12)
63. Sponsorship Chair, IEEE ASRU 2015 (2014.6-2015.12)
64. TPC member, Machine Learning Applications in Speech Processing Symposium in the 2nd IEEE Global Conference on Signal and Information Processing (Global SIP) (2013.12 - 2014.12)
65. Industrial Membership Committee, APSIPA (2014.1 - 2017.12)
66. Member, the APSIPA Speech, Language, and Audio Technical Committee (APSIPA SLA TC) (2014.1 - 2019.12)
67. Member, the IEEE Signal Processing Society Speech and Language Technical Committee (IEEE SLTC) (2014.1 - 2019.12)
68. Organizing Committee, the 2012 SANE 2012 - Speech and Audio in the Northeast (2012.7-2012.10)
69. Organizing Committee, the 2012 IEEE AASP `CHiME' SpeechSeparation and Recognition Challenge (2012.3-2013.3)
70. Publication Chair, the 2012 International Workshop on Statistical Machine Learning for Speech Processing (IWSML2012)
71. Executive Committee Member, the 2nd Young Researchers Forum on ALAGIN Speech Processing Session (2011.2 - 2011.3)
72. Panelist in IPSJ/2009-SLP-79, Tokyo, Japan (2009.12)
73. Program Committee Member, the 12th Workshop on Information-Based Induction Sciences (IBIS 2009) (2008.12 - 2009.11)

Editor

1. Associate Edier, EURASIP Signal Processing Open (2026.06 --)
2. Guest Editor, IEEE JSTSP Special issue on Deep Multimodal Speech Enhancement and Separation (2024.10 -- 2025.3)
3. Lead Guest Editor, Elsevier Computer Speech and Language (2024.8 --)
4. Senior Area Editors, IEEE/ACM Transactions on Audio Speech and Language Processing (2022.5 -- 2025.6)
5. Guest Editor, IEEE JSTSP Special issue on Self-Supervised Learning for Speech and Audio Processing (2021.4 -- 2022.1)
6. Guest Editor, Elsevier Computer Speech and Language (2020.10 -- 2021.10)
7. Subject Editor, Elsevier Speech Communication (2019.4 --)
8. Lead Guest Editor, IEEE JSTSP Special issue on Far-Field Speech Processing in the Era of Deep Learning (2018.5 -- 2019.8)
9. Associate Editor, APSIPA Transactions on Signal and Information Processing (2018.4 -- )
10. Associate Editor, IEICE Transactions on Information and Systems (2016.7 -- 2020)
11. Guest Editor, Elsevier Computer Speech and Language (2015.12 -- 2016.11)
12. Guest Associate Editor, IEICE Transactions on Information and Systems (2013.4 -- 2014.6)
13. Associate Editor, IEEE Transactions on Audio, Speech, and Language Processing (ASLP). (2012.4 -- 2015.4)

Session Chair

1. IEEE MLSP (2017)
2. ISCA Interspeech (2015, 2016, 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025)
3. IEEE ICASSP (2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023, 2024, 2025, 2026)
4. IEEE ASRU (2015, 2017, 2021, 2025)
5. IEEE SLT (2021, 2022)
6. Acoustical Society of Japan Meeting (2004.3-2010)

Reviewer:

Conference Review

1. Asian Conference on Machine Learning (ACML)
2. Association for Computational Linguistics (ACL)
3. Conference on Language Modeling (COLM)
4. DCASE workshop
5. Empirical Methods in Natural Language Processing (EMNLP)
6. European Chapter of the Association for Computational Linguistics (EACL)
7. European Conference on Computer Vision (ECCV)
8. European Signal Processing Conference (EUSIPCO)
9. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
10. IEEE Global Conference on Signal and Information Processing (GlobalSIP)
11. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
12. IEEE International Workshop on Machine Learning for Signal Processing (MLSP)
13. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
14. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
15. International Conference on Learning Representations (ICLR)
16. International Conference on Machine Learning (ICML)
17. International Conference on Pattern Recognition (ICPR)
18. International Conference on Artificial Neural Networks (ICANN)
19. International Joint Conferences on Artificial Intelligence (IJCAI)
20. International Joint Conference on Neural Networks (IJCNN)
21. International Symposium on Chinese Spoken Language Processing (ISCSLP)
22. International Workshop on Acoustic Signal Enhancement (IWAENC)
23. International Workshop on Statistical Machine Learning for Speech Processing (IWSML)
24. ISCA INTERSPEECH
25. ISCA Young Female Researchers in Speech Workshop (YFRSW)
26. Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA)
27. Neural Information Processing Systems (NeurIPS)
28. REVERB workshop

Journal/Transaction Review

1. APSIPA Transactions on Signal and Information Processing
2. Computer Speech and Language
3. EURASIP Journal on Advances in Signal Processing
4. IEEE Journal of Selected Topics in Signal Processing
5. IEEE Signal Processing Letters
6. IEEE Signal Processing Magazine
7. IEEE Transaction on Audio, Speech and Language Processing
8. IEEE Transaction on Emerging Topics in Computational Intelligence
9. IEEE Transaction on Signal Processing
10. Information Processing Society of Japan (IPSJ) Transactions
11. Institute of Electronics, Information and Communication Engineers (IEICE) Transactions
12. Journal of Signal Processing Systems
13. Journal of Statistical Planning and Inference
14. Journal of the Acoustical Society of America
15. Journal of the Acoustical Society of Japan
16. Speech Communication
17. Transactions of the Association for Computational Linguistics (TACL)
18. Transactions on Machine Learning Research (TMLR)

External PhD Thesis Review/Committee

1. Ryan Whetten (University of Avignon)
2. Pu Wang (KU Leuven)
3. Sreyan Ghosh (University of Maryland)
4. Kohei Saijo (Waseda University)
5. Julius Richter (University of Hamburg)
6. Salah Zaiem (IP Paris)
7. Wen-Chin Huang (Nagoya University)
8. Yusuke Fujita (Waseda University)
9. Yosuke Higuchi (Waseda University)
10. Salvador Medina Maza (CMU)
11. Billy Li (CMU)
12. Ngoc Quan Pham (Karlsruhe Institute of Technology)
13. Albert Zeyer (RWTH Aachen University)
14. Chunting Zhou (CMU)
15. Shucong Zhang (University of Edinburgh)
16. Thai Son Nguyen (Karlsruhe Institute of Technology)
17. Ruizhi Li (JHU)
18. Nelson Enrique Yalta Soplin (Waseda University)
19. Suyoun Kim (CMU)
20. Tomoki Hayashi (Nagoya University)
21. Dung Tien Tran (INRIA)
22. Antti Hurmalainen (Tampere University of Technology)

Joint Research Projects:

Anuttacon (2025 – )
Sesami (2025 – )
DSTA (2024 – )
Amazon (2024 – )
Apple (2023.5--)
NAVER/LINE (2021.12--2022.11)
Honda Research Institute, Japan (2021.9--2025)
JHU HLTCOE (2021.9--)
Google, USA (2021.9--2022.11)
Facebook, USA (2021.9--2024)
Hyundai, Korea (2021.7--2024)
MIT Lincoln Laboratry, USA (2021.2--)
ASAPP, USA (2020.5--2024)
NAVER, Korea (2020.5--2021.4)
Sony, Tokyo, Japan (2019.7--)
Yahoo! Japan, Tokyo, Japan (2018.9--)
Hitachi Corporation, Tokyo, Japan (2018.7--2023)
NTT Communication Science Laboratories, Kyoto, Japan. (2017.12 -- )
Professor Takahiro Shinozaki at Tokyo Institute of Technology, Tokyo, Japan. (2014.4 – 2018.3)
Professor Shigeki Sagayama at the University of Tokyo, Tokyo, Japan. (2008.7 – 2011.11)
Professor Tetsunori Kobayashi at Waseda University, Tokyo, Japan. (2009.9 – 2010.2)
Professor Shigeru Katagiri at Doshisha University, Kyoto, Japan. (2010. 4 – 2011.11)
Professor Nobuyuki Minematsu at the University of Tokyo, Tokyo, Japan. (2010.7 – 2011.3)
Professor Biing-Hwang (Fred) Juang at Georgia Institute of Technology, Atlanta, USA. (2010.11 – 2011.11)

Collaborators:

At CMU

Visiting faculty

2023. 09 - 2024. 06: Karen Livescu (TTI-Chicago)

Post-doc

2021. 08 - 2024. 07: Zhong-Qiu Wang

2022. 03 - 2024. 08: Soumi Maiti
2023. 05 - 2024. 09: Jee-Weon Jung
2023. 10 -: Samuele Cornell
2024. 02 - 2025. 05: Hye-jin Shim

CMU student

2025. 08 -: Jaeyeon Kim (co-supervisor)
2025. 08 -: Chien-yu Huang
2024. 08 -: Shikhar Bharadwaj
2024. 08 - 2026. 05: Chyi-Jiunn Lin
2024. 08 -: Masao Someki
2023. 08 -: 2025. 05: Kwanghee Choi
2023. 08 -: Jinchuan Tian
2022. 08 - 2024. 05: Shih-Lun Wu
2022. 08 -: William Chen
2022. 05 - 2026. 04: Li-wei Chen (co-supervisor)
2022. 04 - 2024. 01: Jessica Huynh (co-supervisor)
2022. 02 - 2024. 06: Muqiao Yang (co-supervisor)
2021. 09 - 2025. 05: Yifan Peng
2021. 09 - 2023. 08: Dan Berrebbi
2021. 08 - 2023. 05: Xinjian Li
2021. 08 - 2026. 01: Jiatong Shi
2021. 05 - 2026. 03: Siddhant Arora
2021. 01 - 2024. 06: Xuankai Chang
2020. 09 - 2026. 05: Brian Yan
2022. 01 - 2022. 12: Dorsa Zeinali
2021. 09 - 2022. 12: Karthik Ganesan (MIIS directed study)
2022. 01 - 2022. 08: Debayan Ghosh
2021. 09 - 2022. 08: Sujay Suresh Kumar (MIIS directed study)
2021. 01 - 2022. 08: Peter Wu (co-supervisor)
2021. 01 - 2022. 08: Chaitanya Narisetty
2020. 09 - 2022. 08: Siddharth Dalmia (co-supervisor)

Visitor

2026. 05 - 2027. 05: Kuang-Da Wang (National Yang Ming Chiao Tung University)
2026. 05 - 2026. 08: Thanapat Trachu (University of Southern California)
2026. 02 - 2026. 07: Dahee Yang (Hanyang University)
2026. 01 - 2026. 05: Alexander Polok (Brno University of Technology)
2025. 12 - 2026. 06: Xun Gong (Shanghai Jiao Tong University)
2025. 08 - 2025. 12: Haoran Wang (Shanghai Jiao Tong University)
2025. 05 - 2025. 11: Ji Hoon Kim (KAIST)
2025. 04 - 2026. 03: Bo-Hao Su (National Tsing Hua University)
2025. 02 - 2025. 07: Jialu Li (UIUC)
2025. 01 - 2025. 04: Pu Wang (KU Leuven)
2024. 10 - 2024. 12: Carlos Carvalho (Instituto Superior Técnico (IST))
2024. 10 - 2025. 01: Holger Severin Bovbjerg (Aalborg University)
2024. 11 - 2024. 12: Junyi Peng (Brno University of Technology)
2024. 08 - 2024. 12: Kalvin Chang (CMU)
2024. 08 - 2024. 11: Yoshiaki Band (AIST)
2024. 07 - 2024. 11: Shih-Heng (Stan) Wang (National Taiwan University)
2024. 04 - 2024. 12: Shuichiro Shimizu (Kyoto University)
2023. 11 - 2024. 04: Chenda Li (Shanghai Jiao Tong University)
2023. 09 - 2024. 08: Yihan Wu (Renmin University)
2023. 08 - 2023. 11: Min Su Kim (KAIST)
2023. 04 - 2023. 07: Kohei Saijo (Waseda University)
2023. 02 - 2024. 02: Wangyou Zhang (Shanghai Jiao Tong University)
2022. 11 - 2023. 01: Takaaki Saeki (University of Tokyo)
2022. 04 - 2022. 09: Samuele Cornell (Universita Politecnica delle Marche)
2022. 03 - 2022. 06: Yoshiki Masuyama (Tokyo Metropolitan University)
2022. 03 - 2022. 06: Yosuke Higuchi (Waseda University)
2021. 12 - 2022. 12: Yosuke Kashiwagi (Sony)
2021. 08 - 2021. 12: Yen-Ju Lu (Academia Sinica)
2021. 07 - 2022. 07: Yushi Ueda (Japan Patent Office)

At JHU

JHU student

2020. 08 - 2021. 08: Tianzi Wang (CUHK)
2019. 09 - 2021. 09: Jiatong Shi (transferred to CMU)

2019. 09 - 2020. 12: Xuankai Chang (transferred to CMU)
2018. 07 - 2019. 06: Zhiqi Wang
2017. 12 - 2020. 12: Matthew Wiesner (co-supervisor, JHU HLTCOE)
2017. 10 - 2022. 05: Matthew Maciejewski (co-supervisor, Amazon)
2017. 09 - 2018. 12: Szu-Jui Chen
2017. 09 - 2022. 05:: Aswin Shanmugam Subramanian

Visitor

2020. 01 - 2021. 01: Pengcheng Guo (Northwestern Polytechnical University)
2019. 12 - 2020. 03: Yosuke Higuchi (Waseda University)
2019. 12 - 2020. 12: Jing Shi (Chinese Academy of Science)
2019. 07 - 2019. 10: Katsuki Inoue (Okayama University)
2018. 11 - 2019. 05: Murali Karthick Baskar (Brno University of Technology)
2018. 09 - 2018. 12: Xuankai Chang (Shanghai Jiao Tong University)
2018. 08 - 2018. 09: Hirofumi Inaguma (Kyoto University)
2018. 07 - 2020. 03: Yusuke Fujita (Hitachi Ltd.)
2018. 04 - 2018. 09: Nelson Enrique Yalta Soplin (Waseda University)

At MERL

2016. 11 - 2016. 01: Tsubasa Ochiai (Doshisha University -> NTT Labs)
2016. 08 - 2016. 11: Tomoki Hayashi (Nagoya University)
2016. 05 - 2016. 08: Zhong Meng (Georgia Institute of Technology -> Microsoft)
2016. 05 - 2016. 08: Suyoun Kim (Carnegie Mellon University -> Facebook)
2015. 09 - 2015. 12: Morteza Shahriari (University of Florida)
2014. 11 - 2015. 05: Zhuo Chen (Columbia University -> Microsoft)
2014. 09 - 2014. 11: Ahmed Hussen Abdelaziz (Ruhr-Universität Bochum -> ICSI -> Apple)
2014. 06 - 2014. 09: Yi Luan (University of Washington)
2013. 09 - 2014. 02: Felix Weninger (Technische Universität München -> Nuance)
2013. 06 - 2013. 08: Hao Tang (Toyota Technological Institute at Chicago -> MIT -> University of Edinburgh)
2012. 12 - 2013. 02: Koichiro Yoshino (Kyoto University -> NAIST)

At NTT

2011. 05 - 2011. 08: Ekapol Chuangsuwanich (Massachusetts Institute of Technology -> Chulalongkorn University)
2011. 01 - 2011. 03: Yasuhisa Fujii (Toyohashi University of Technology -> Google)
2010. 01 - 2010. 08: Jose Domingo Esparza Garcia (Technical University of Cartagena -> Ulm University)
2010. 01 - 2010. 03: Daisuke Saito (University of Tokyo)
2009. 08: Denis Babani (NAIST -> OwnerIQ, Inc.)
2009. 08 - 2009. 09, 2010. 08 - 2010. 09: Naohiro Tawara (Waseda University -> NTT Labs )
2008. 10 - 2008. 12: Yotaro Kubo (Waseda University -> NTT Labs -> Amazon -> Google)
2007. 12 - 2008. 11: David Cournapeau (Kyoto University -> Enthought)
2007. 08: Kenta Nishiki (University of Tokyo -> NTT Labs)
2006. 01 - 2006. 08: Rasa Narkeviciute (Kaunas University of Technology -> AEA Technology)
2006. 01 - 2006. 02: Hirokazu Kameoka (University of Tokyo -> NTT Labs/University of Tokyo)
2005. 07 - 2006. 05: Wesley William Arnquist (University of Washington -> Costco Wholesale)
2003. 08: Toshiaki Kubo (Waseda University -> Mitsubishi Electric)
2003. 04 - 2004. 03, 2008.10: Atsushi Sako (Ryukoku University -> Kobe University -> Nintendo)

Google Sites

Report abuse