Activities
Education:
Ph.D. from Waseda University, School of Science and Engineering, February 2006, Supervisors: Profs. T. Kobayashi, Y. Matsuyama, Y. Sagisaka, and T. Matsushima
Master of Science from Waseda University, March 2001, Supervisor: Prof. Ichiro Ohba
Bachelor of Science from Waseda University, March 1999, Supervisor: Prof. Ichiro Ohba
Work Experience:
September 2022 - present: "Courtesy Faculty," Department of Electrical and Computer Engineering at Carnegie Mellon University, Pittsburgh PA, USA
January 2021 - present: "Associate Professor," Language Technologies Institute at Carnegie Mellon University, Pittsburgh PA, USA
January 2021 - present: "Adjunct Associate Professor," Center for Language and Speech Processing at Johns Hopkins University, Baltimore MD, USA
July 2017 – December 2020: "Associate Research Professor," Department of Electrical and Computer Engineering at Johns Hopkins University, Baltimore MD, USA
October 2014 – June 2017: "Senior Principal Research Scientist" (formerly, Senior Principal Member Research Staff) at Mitsubishi Electric Research Laboratories (MERL), Cambridge MA, USA (Group manager: Dr. Anthony Vetro. Team leader: Dr. John R. Hershey)
April 2013 – September 2014: “Principal Member Research Staff” at Mitsubishi Electric Research Laboratories (MERL), Cambridge MA, USA (Group manager: Dr. Anthony Vetro. Team leader: Dr. John R. Hershey)
January 2012 – March 2013: “Visiting Member Research Staff” at Mitsubishi Electric Research Laboratories (MERL), Cambridge MA, USA (Group manager: Dr. Anthony Vetro. Team leader: Dr. John R. Hershey)
July 2011 – November 2011: “Research Scientist” at Nippon Telegraph and Telephone (NTT) Communication Science Laboratories, Signal Processing Research Group, Kyoto, Japan (Group leader: Dr. Atsushi Nakamura)
January 2009 – March 2009: “Visiting Scholar” at Georgia Institute of Technology, Atlanta, USA. (Supervisor: Professor Biing-Hwang (Fred) Juang)
April 2001 – June 2011: “Researcher” at Nippon Telegraph and Telephone (NTT) Communication Science Laboratories, Signal Processing Research Group, Kyoto, Japan (Group leader: Dr. Shoji Makino -> Dr. Masato Miyoshi -> Dr. Atsushi Nakamura)
Teaching:
"11492/11692/18495: Speech Technology for Conversational AI" Carnegie Mellon University, USA, Spring 2024
"11492/11692/18495: Speech Processing" Carnegie Mellon University, USA, Spring 2023
"11751/18781 Speech Recognition and Understanding," Carnegie Mellon University, USA, Fall 2022, Fall 2023
"11737: Multilingual Natural Language Processing," Carnegie Mellon University, USA, Spring 2022 with Graham Neubig and Alan W Black
"11751/18781 Speech Recognition and Understanding," Carnegie Mellon University, USA, Fall 2021 with Ian Lane
"DSTA: Multilingual Natural Language Processing," Carnegie Mellon University, USA, Spring 2021 with Graham Neubig and Alan W Black
"EN. 601.667: Introduction to Human Language Technology", Johns Hopkins University, USA, Fall 2020 with Philipp Koehn and other CLSP faculties
"EN. 601.667: Introduction to Human Language Technology", Johns Hopkins University, USA, Fall 2019 with Philipp Koehn and other CLSP faculties
"End-to-end speech recognition" JHU Summer School on Human Language Technology (June 20, 2018) with Takaaki Hori
"EN. 520.666: Information Extraction", Johns Hopkins University, USA, Spring 2020
"EN. 520.666: Information Extraction", Johns Hopkins University, USA, Spring 2019
"EN. 520.666: Information Extraction", Johns Hopkins University, USA, Spring 2018
"Machine Learning for Speech Processing," Department of Electronic Engineering, Tsinghua University, China, December 2012
Seminar:
"Toward Explainable Speech Foundation Models," The University of Rochester ECE Distinguished Lecture Series , April 2024 (host: Prof. Zhiyao Duan)
"Toward Explainable Speech Foundation Models," invited speaker at the IEEE ICASSP 2023 workshopExplainable Machine Learning for Speech and Audio, April 2024
"Unifying Speech Processing Applications with Speech Foundation Models," Amazon, January 2024 (host: Dr. Jin Liu)
"Explainable End-to-End Neural Networks for Far-Field Conversation Recognition,” Academia Sinica, Taiwan, December 2023 (host: Prof. Yu Tsao)
"Variational Bayesian Learning," invited speaker at the Symposium for Celebrating 40 Years of Bayesian Learning in Speech and Language Processing and Beyond, December 2023
"Attempts to reproduce large pre-trained speech models on an academic computing scale," Apple, September 2023 (host: Dr. Ahmed Abdelaziz)
"Attempts to reproduce large pre-trained speech models on an academic computing scale," Music and Audio Workshop, June 2023
"Attempts to reproduce large pre-trained speech models on an academic computing scale," panel talk at the IEEE ICASSP 2023 workshop Self-supervision in Audio, Speech and Beyond (SASB), June 2023
"Compositional framework for spoken language processing," Amazon Alexa Academy Distinguished Speaker Series, May 2023 (host: Jing Liu)
"CHiME Speech Separation and Recognition Challenge," ASJ Special Session, March 2023
"Controllable and Explainable End-to-End Speech Translation," Tencent, Dec 2022 (host: Dr. Raymond Yu)
"Controllable and Explainable End-to-End Speech Translation," SIG SLT Seminar, November 2022
"Explainable End-to-End Neural Networks for Far-Field Conversation Recognition," UT Austin, November 2022 (host. Prof. David Harwath)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," AIRS Company, Hyundai Motor Group, September 2022 (host: Dr. Byeong-Yeol Kim)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," Meta, August 2022
"Streaming Transformer for Speech Recognition, Understanding, and Translation," Nvidia, June 2022 (host: Dr. Boris Ginsburg)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," Amazon, April 2022 (host: Dr. Tao Zhang)
"Toward End-to-End Neural Modeling for Multi-Speaker Conversation Recognition," Pindrop, April 2022 (host: Dr. Ganesh Sivaraman)
"End-to-End Speech Recognition and Spoken Language Understanding," JPMC Monthly Speaker Series, Febulary 2022 (host: Dr. Chandra Dhir)
"Simplifying Speech Recognition with Non-Autoregressive Modeling," Meta (FB) Speech & Audio Summit, December 2021
Panelist of "Convergence of Machine Learning and Signal Processing," APSIPA US Local Chapter, November 2021 (host: Prof. Anthony Kuh)
"Multi-Speaker Conversation Recognition based on End-to-End Neural Networks," Georgia Tech CSIP Seminar, November 2021
"Simplifying Speech Recognition with Non-Autoregressive Modeling," Amazon Pittsburgh, October 2021 (host: Dr. Markus Müller)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," Tencent Seattle AI Lab Online Talk, July 2021 (host: Dr. Meng Yu)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," Speech Science Forum at UCL, May 2021
"Introduction of ESPnet, end-to-end speech processing toolkit: new features, broadened applications, performance improvements, and future challenges," AIST AI Seminar, March 2021 (host: Dr. Jun Ogata)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," CUED Speech Group Seminars, Febluary 2021 (host: Prof. Kate Knill)
"Tackling Multispeaker Conversation Processing based on Speaker Diarization and Multispeaker Speech Recognition," Indiana University Bloomington Computer Science Department Colloquium, October 2020 (host: Prof. Donald S. Williamson)
"End-to-End Speech Processing: From Pipeline to Integrated Architecture," CMU, USA, October 2019 (host: Prof. Florian Metze)
"End-to-End Speech Processing: From Pipeline to Integrated Architecture," IBM, USA, August 2019 (host: Dr. Xiaodong Cui)
"End-to-End Speech Processing: From Pipeline to Integrated Architecture," University of Sheffield, UK, May 2019 (host: Prof. Jon Barker)
"Multichannel End-to-End Speech Recognition,” Toyota Technological Institute at Chicago (TTIC), USA, November 2018 (host: Prof. Karen Livescu)
"Multichannel End-to-End Speech Recognition,” Paderborn University, Germany, October 2018 (host: Prof. Reinhold Häb-Umbach)
"Hybrid CTC/Attention Architecture for End-to-End Speech Recognition," National Chiao Tung University, Taiwan, October 2018 (host: Prof. Jen-Tzung Chien)
"Multichannel End-to-End Speech Recognition,” Academia Sinica, Taiwan, October 2018 (host: Prof. Yu Tsao)
"Distant-Talk Speech Processing Toward Natural Conversation Understanding,” Johns Hopkins University, USA, January 2017 (host: Prof. Najim Dehak)
"Pushing the envelope at both ends — beamforming acoustic models and joint CTC/Attention schemes for end-to-end ASR,” Tokyo Institute of Technology, Japan, December. 2016 (host: Prof. Koichi Shinoda)
"Recent activities of distant talk speech recognition,” Nanyang Technological University, Singapore, March. 2016 (host: Dr. Xiong Xiao)
"Recent activities of distant talk speech recognition,” Shanghai Jiao Tong University, China, March. 2016 (host: Prof. Kai Yu)
“Practical Bayesian methods for speech and language processing,” Brno University of Technology, Czech Republic, September 2015 (host: Prof. Jan "Honza" Cernocky)
“Recent trends in far-field speech recognition,” Nara Institute of Science and Technology, Japan, March 2015 (host: Profs. Kevin Duh and Satoshi Nakamura)
“Recent trends on noise robust speech recognition,” Tokyo Institute of Technology, Japan, October 2014 (host: Prof. Takahiro Shinozaki)
“Bayesian Learning for Speech and Language Processing,” The Institute of Statistical Mathematics, Japan, March 2013 (host: Prof. Tomoko Matsui)
“Bayesian Learning for Speech and Language Processing,” Ruhr-Universität Bochum, Germany, Sep. 2013 (host: Prof. Dorothea Kolossa)
“Bayesian Learning for Speech and Language Processing,” National Taiwan University, Taiwan, July 2013 (host: Prof. Lin-shan Lee)
“Structural Bayesian linear regression for hidden Markov models,” Speech Signal Processing Workshop 2013, Taiwan, June 2013 (host: Prof. Jen-Tzung Chien)
The 4th Young Researchers Forum on ALAGIN Speech Processing Session, Japan, Dec. 2012 (host: Prof. Daisuke Saito)
“Bayesian Learning for Speech and Language Processing,” BBN Technology, USA, Jun. 2012 (host: Dr. Ivan Bulyko)
“Speech Recognition by Tracking Acoustical and Linguistic Environments,” MIT CSAIL Seminar, USA, April 2012 (host: Prof. Jim Glass)
“Bayesian Linear Regression For Hidden Markov Model Based On Optimizing Variational Bounds,” NICT, Japan, Nov. 2011 (host: Dr. Chiori Hori)
“Bayesian Linear Regression For Hidden Markov Model Based On Optimizing Variational Bounds,” Tsinghua University, China, Sep 2011 (host: Prof. Zhijian Ou)
“Incremental adaptation of speech recognition based on macroscopic time evolution system,” Seminar on Human Language Technology and Pattern Recognition, RWTH Aachen University, Germany, Sep 2011 (host: Prof. Ralf Schlueter)
“Incremental adaptation of speech recognition based on macroscopic time evolution system,” Seminar on Tokuda & Lee Laboratory, Nagoya Institute of Technology, Japan, June 2011 (host: Prof. Keiichi Tokuda)
“Incremental adaptation of speech recognition based on macroscopic time evolution system,” Seminar on Machine Intelligence Laboratory, Cambridge University Department of Engineering, UK, May 2011 (host: Dr. Kai Yu)
“Speech recognition based on a Bayesian approach,” Seminar on NHK Science & Technology Laboratories, Japan, September 2009 (host: Dr. Toru Imai)
“Variational Bayesian estimation and clustering for speech recognition,” Georgia Tech CSIP Seminar, USA, February 2009 (host: Prof. Fred Juang)
Award and Notable Achievement:
Ranked 1st place at the DCASE 2024 Challenge Task 6
Outstanding Reviewer Recognition at IEEE ICASSP in 2023
Co-authored paper received the IEEE Ganesh N. Ramaswamy Memorial Student Grant (2023)
Ranked 1st place at the DCASE 2023 Challenge Task 6A
6 papers recognized as being in the top 3% of all papers accepted at IEEE ICASSP in 2023
Ranked 1st place at the STOP challenge tasks 1 and 3 (2023)
Best student paper at IEEE SLT in 2022
Best paper candidates (2 papers) at IEEE SLT in 2022
Ranked 1st place at The 2nd Clarity Enhancement Challenge (2022)
Computer Speech & Language Best Review Paper Award (2022)
Ranked 1st place at the IWSLT 2022 Dialect Speech Translation task (2022)
Ranked 1st place at the L3DAS22 challenge task 1 (2022)
Computer Speech & Language Best Review Paper Award (2021)
Ranked 2nd place at The DIHARD III challenge task 1 & 2 full set in 2021
Ranked 1st place at The DCASE 2020 challenge task 4 in 2020
Facebook Research Award (Towards On-Device AI) in 2020
Tencent AI Lab Rhino-Bird Gift Fund in 2020
ASRU best paper award at the IEEE ASRU in 2019
ASRU best paper candidate (2 papers) at the IEEE ASRU in 2019
Google Faculty Research Award in 2019
Tencent AI Lab Rhino-Bird Gift Fund in 2018
Facebook Research Award (Speech and Audio Technology for Voice Interaction and Video Understanding) in 2018
Co-authored paper received the ISCA Interspeech Best Student Paper Award in 2018
Ranked 1st place at The First DIHARD Speech Diarization Challenge track1 in 2018
Best paper candidates (2 papers) and best student paper candidate at the IEEE ASRU in 2017
Ranked 3rd place at 4th CHiME Challenge 6 channel track in 2016
Ranked 2nd place at 5th Dialog State Tracking Challenge (DSTC5) in 2016
Ranked 2nd place at 3rd CHiME Challenge in 2015
Ranked 1st place at REVERB Challenge Official Data Track (2nd overall) in 2014
Ranked 1st place at 2nd CHiME Challenge Track 2 in 2013
Co-authored paper received the IEEE ICASSP 2012 Student Paper Award in 2012
Best In-Category Nominee at the International Semantic Web Conference in 2011
Ranked 1st place at 1st CHiME Challenge in 2011
Co-authored paper received the IEEE Signal Processing Society Japan Chapter Student Paper Award in 2010
The TELECOM System Technology Award from the Telecommunications Advancement Foundation in 2006
The Itakura Award from the Acoustical Society of Japan (ASJ) in 2006
The Best Paper Award from the Institute of Electronics, Information, and Communication Engineers of Japan (IEICE) in 2004
The Awaya Award from the Acoustical Society of Japan (ASJ) in 2003
Academic Activity:
Membership
Fellow, the Institute of Electrical and Electronics Engineers (IEEE)
Fellow, the International Speech Communication Association (ISCA)
Member, the Acoustical Society of Japan (ASJ)
Member, the Institute of Electronics, Information, and Communication Engineers of Japan (IEICE)
Organizer/Committee Member
General Co-Chair, IEEE ICASSP 2028 (2024.05 -- 2028.05)
NCSA’s DeltaAI External Advisory Committee (2024.04--)
Special session co-organizer, "Spoken Language Models for Universal Speech Processing" at Interspeech 2024
Area Chair, INTERSPEECH 2024 (2023.11 -- 2024.09)
Vice chair, the IEEE Signal Processing Society Speech and Language Technical Committee (IEEE SLTC) (2024.1 - 2024.12)
Senior Area Chair "Speech and Multimodality" in EMNLP 2023
ISCA Fellows Selection Committee (2023--24)
Area Chair "Speech and Multimodality" in ACL 2023
Co-Area Chair, INTERSPEECH 2023 (2022.10 -- 2023.08)
Member, the IEEE Signal Processing Society Speech and Language Technical Committee (IEEE SLTC) (2023.1 - 2025.12)
Organizing Committee, SANE 2022 - Speech and Audio in the Northeast (2022.10)
Special session co-organizer, "Resource-efficient Real-time Neural Speech Separation," at ICASSP 2023
International Liaison Committee for 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (2022.9 - 2023.12)
TPC Co-Chairs, 14th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (2022.5 -- 2022.11)
Area Chair, 2022 International Workshop on Acoustic Signal Enhancement (IWAENC 2022) (2022.5-2022.10)
Technical Chairs, 2022 IEEE Spoken Language Technology Workshop (SLT 2022) (2021.10 - 2023.1)
Publication Chair, Interspeech 2024
Special session co-organizer, "Non-Autoregressive Sequential Modeling for Speech Processing" at Interspeech 2021
Special session co-organizer, "Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing (ConferencingSpeech 2021)" at Interspeech 2021
TPC Chairs, 13th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (2020.4 -- 2020.12)
Co-organizer, NeurIPS 2020 workshop "Self-Supervised Learning for Speech and Audio Processing"
Area Chair, IEEE Spoken Language Technology Workshop (IEEE SLT 2021) (2020.8 -- 2021.1)
Senior Program Committee Member, the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021) (2020.8 -- 2021.2)
Area Chair, the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021) (2020.8 -- 2021.4)
Senior Program Committee Member, the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI 2020) (2019.11 -- 2020.7)
Area Chair, the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) (2019.8 -- 2020.2)
APSIPA Distinguished Lecturer (2019-2020)
IEEE Signal Processing Society Liaison for 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (2018.9 - 2019.12)
Organizing Committee, SANE 2018 - Speech and Audio in the Northeast (2018.5-2018.10)
Member, the IEEE Signal Processing Society Machine Learning for Signal Processing Technical Committee (IEEE MLSP) (2018.1 - 2020.12)
Senior Member, Co-Leader, "Multilingual End-to-end ASR for Incomplete Data" at 2018 Jelinek Summer Workshop on Speech and Language Technology (2017.12--2018.8)
Area Chair "Vision, Robotics, Multimodal, Grounding and Speech" in ACL 2018
Special session co-organizer, "Multi-Microphone Speech Recognition" at ICASSP 2018.
Program Chair, the 2017 IEEE International Workshop on Machine Learning for Signal Processing (MLSP'17) (2016.9 - 2017.9)
Technical Chair, IEEE ASRU 2017 (2016.5--2017.12)
Organizing Committee, the 4th International Workshop on Speech Processing in Everyday Environments (CHiME 2016), (2016.5 - 2016.9)
Senior Affiliates, "Building Speech Recognition System from Untranscribed Data" at 2016 Jelinek Summer Workshop on Speech and Language Technology (2015.12--2016.8)
Chair, Industrial Membership Committee, APSIPA (2016-)
Organizing Committee, the 4th CHiME Speech Separation and Recognition Challenge (2016.4-2016.9)
Member, IEEE CIS Task Force on Computational Audio Processing (2015.12-)
Interactive Peer Reviewer, the 2016 Frederick Jelinek Memorial Summer Workshop on Speech and Language Technology (JSALT 2016), (2015.11)
Senior Member, "Far-Field Speech Enhancement and Recognition in Mismatched Settings" at 2015 Jelinek Summer Workshop on Speech and Language Technology (2014.12--2015.8)
Special session co-organizer, "Robust Speech Processing using Observation Uncertainty and Uncertainty Propagation" at Interspeech (2015)
Organizing Committee, The 3rd CHiME Speech Separation and Recognition Challenge at IEEE ASRU 2015 (2014.7-2015.12)
Sponsorship Chair, IEEE ASRU 2015 (2014.6-2015.12)
TPC member, Machine Learning Applications in Speech Processing Symposium in the 2nd IEEE Global Conference on Signal and Information Processing (Global SIP) (2013.12 - 2014.12)
Industrial Membership Committee, APSIPA (2014.1 - 2017.12)
Member, the APSIPA Speech, Language, and Audio Technical Committee (APSIPA SLA TC) (2014.1 - 2019.12)
Member, the IEEE Signal Processing Society Speech and Language Technical Committee (IEEE SLTC) (2014.1 - 2019.12)
Organizing Committee, SANE 2012 - Speech and Audio in the Northeast (2012.7-2012.10)
Organizing Committee, the 2012 IEEE AASP 'CHiME' Speech Separation and Recognition Challenge (2012.3-2013.3)
Publication Chair, the 2012 International Workshop on Statistical Machine Learning for Speech Processing (IWSML2012)
Executive Committee Member, the 2nd Young Researchers Forum on ALAGIN Speech Processing Session (2011.2 - 2011.3)
Panelist in IPSJ/2009-SLP-79, Tokyo, Japan (2009.12)
Program Committee Member, the 12th Workshop on Information-Based Induction Sciences (IBIS 2009) (2008.12 - 2009.11)
Editor
Senior Area Editor, IEEE/ACM Transactions on Audio, Speech, and Language Processing (2022.5 - 2025.6)
Guest Editor, IEEE JSTSP Special issue on Self-Supervised Learning for Speech and Audio Processing (2021.4 -)
Guest Editor, Computer Speech and Language (2020.10--2021.10)
Subject Editor, Elsevier Speech Communication (2019.4 -- 2021.4)
Lead Guest Editor, IEEE JSTSP Special issue on Far-Field Speech Processing in the Era of Deep Learning (2018.5 - 2019.8)
Associate Editor, APSIPA Transactions on Signal and Information Processing (2018.4 -- )
Associate Editor, IEICE Transactions on Information and Systems (2016.7--2020)
Guest Editor, Computer Speech and Language (2015.12--2016.11)
Guest Associate Editor, IEICE Transactions on Information and Systems (2013.4-2014.6)
Associate Editor, IEEE Transactions on Audio, Speech, and Language Processing (ASLP) (2012.4-2015.4)
Session Chair
IEEE MLSP (2017)
ISCA Interspeech (2015, 2016, 2018, 2019, 2020, 2021, 2022, 2023)
IEEE ICASSP (2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023)
IEEE ASRU (2015, 2017, 2021)
IEEE SLT (2021, 2022)
Acoustical Society of Japan Meeting (2004.3-2010)
Reviewer:
Conference Review
Asian Conference on Machine Learning (ACML)
Association for Computational Linguistics (ACL)
Conference on Language Modeling (COLM)
DCASE workshop
Empirical Methods in Natural Language Processing (EMNLP)
European Chapter of the Association for Computational Linguistics (EACL)
European Signal Processing Conference (EUSIPCO)
IEEE Global Conference on Signal and Information Processing (GlobalSIP)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
IEEE International Workshop on Machine Learning for Signal Processing (MLSP)
IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
International Conference on Learning Representations (ICLR)
International Conference on Machine Learning (ICML)
International Conference on Pattern Recognition (ICPR)
International Conference on Artificial Neural Networks (ICANN)
International Joint Conferences on Artificial Intelligence (IJCAI)
International Joint Conference on Neural Networks (IJCNN)
International Symposium on Chinese Spoken Language Processing (ISCSLP)
International Workshop on Acoustic Signal Enhancement (IWAENC)
International Workshop on Statistical Machine Learning for Speech Processing (IWSML)
ISCA INTERSPEECH
ISCA Young Female Researchers in Speech Workshop (YFRSW)
Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA)
Neural Information Processing Systems (NeurIPS)
REVERB workshop
Journal/Transaction Review
APSIPA Transactions on Signal and Information Processing
Computer Speech and Language
EURASIP Journal on Advances in Signal Processing
IEEE Journal of Selected Topics in Signal Processing
IEEE Signal Processing Letters
IEEE Signal Processing Magazine
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Transactions on Emerging Topics in Computational Intelligence
IEEE Transactions on Signal Processing
Information Processing Society of Japan (IPSJ) Transactions
Institute of Electronics, Information and Communication Engineers (IEICE) Transactions
Journal of Signal Processing Systems
Journal of Statistical Planning and Inference
Journal of the Acoustical Society of America
Journal of the Acoustical Society of Japan
Speech Communication
External PhD Thesis Review/Committee
Salah Zaiem (IP Paris)
Wen-Chin Huang (Nagoya University)
Yusuke Fujita (Waseda University)
Yosuke Higuchi (Waseda University)
Salvador Medina Maza (CMU)
Billy Li (CMU)
Ngoc Quan Pham (Karlsruhe Institute of Technology)
Albert Zeyer (RWTH Aachen University)
Chunting Zhou (CMU)
Shucong Zhang (University of Edinburgh)
Thai Son Nguyen (Karlsruhe Institute of Technology)
Ruizhi Li (JHU)
Nelson Enrique Yalta Soplin (Waseda University)
Suyoun Kim (CMU)
Tomoki Hayashi (Nagoya University)
Dung Tien Tran (INRIA)
Antti Hurmalainen (Tampere University of Technology)
Joint Research Projects:
Apple (2023.5--)
NAVER/LINE (2021.12--2022.11)
Honda Research Institute, Japan (2021.9--)
JHU HLTCOE (2021.9--)
Google, USA (2021.9--2022.11)
Facebook, USA (2021.9--)
Hyundai, Korea (2021.7--)
MIT Lincoln Laboratory, USA (2021.2--)
ASAPP, USA (2020.5--)
NAVER, Korea (2020.5--2021.4)
Sony, Tokyo, Japan (2019.7--)
Yahoo! Japan, Tokyo, Japan (2018.9--)
Hitachi Corporation, Tokyo, Japan (2018.7--)
NTT Communication Science Laboratories, Kyoto, Japan. (2017.12 -- )
Professor Takahiro Shinozaki at Tokyo Institute of Technology, Tokyo, Japan. (2014.4 – 2018.3)
Professor Shigeki Sagayama at the University of Tokyo, Tokyo, Japan. (2008.7 – 2011.11)
Professor Tetsunori Kobayashi at Waseda University, Tokyo, Japan. (2009.9 – 2010.2)
Professor Shigeru Katagiri at Doshisha University, Kyoto, Japan. (2010.4 – 2011.11)
Professor Nobuyuki Minematsu at the University of Tokyo, Tokyo, Japan. (2010.7 – 2011.3)
Professor Biing-Hwang (Fred) Juang at Georgia Institute of Technology, Atlanta, USA. (2010.11 – 2011.11)
Collaborators:
At CMU
Visiting faculty
2023. 09 -- 2024. 06 : Karen Livescu (TTI-Chicago)
Post-doc
2021. 08- : Zhong-Qiu Wang
2022. 03- : Soumi Maiti
2023. 05- : Jee-Weon Jung
2023. 10- : Samuele Cornell
2024. 02- : Hye-jin Shim
CMU student
2023. 08-: Kwanghee Choi
2023. 08-: Jinchuan Tian
2022. 08 -: Shih-Lun Wu
2022. 08 -: William Chen
2022. 05 -: Li-wei Chen (co-supervisor)
2022. 04 -: Jessica Huynh (co-supervisor)
2022. 02 -: Muqiao Yang (co-supervisor)
2021. 09 -: Yifan Peng
2021. 09 - 2023. 08: Dan Berrebbi
2021. 08 - 2023. 05: Xinjian Li
2021. 08 -: Jiatong Shi
2021. 05 -: Siddhant Arora
2021. 01 -: Xuankai Chang
2020. 09 -: Brian Yan
2022. 01 - 2022. 12: Dorsa Zeinali
2021. 09 - 2022. 12: Karthik Ganesan (MIIS directed study)
2022. 01 - 2022. 08: Debayan Ghosh
2021. 09 - 2022. 08: Sujay Suresh Kumar (MIIS directed study)
2021. 01 - 2022. 08: Peter Wu (co-supervisor)
2021. 01 - 2022. 08: Chaitanya Narisetty
2020. 09 - 2022. 08: Siddharth Dalmia (co-supervisor)
Visitor
2024. 04 - 2024.12: Shuichiro Shimizu (Kyoto University)
2023. 11 - 2024.04: Chenda Li (Shanghai Jiao Tong University)
2023. 09 - 2024.08: Yihan Wu (Renmin University)
2023. 08 - 2023.11: Min Su Kim (KAIST)
2023. 04 - 2023.07: Kohei Saijo (Waseda University)
2023. 02 - 2024.02: Wangyou Zhang (Shanghai Jiao Tong University)
2022. 11 - 2023.01: Takaaki Saeki (University of Tokyo)
2022. 04 - 2022.09: Samuele Cornell (Universita Politecnica delle Marche)
2022. 03 - 2022.06: Yoshiki Masuyama (Tokyo Metropolitan University)
2022. 03 - 2022.06: Yosuke Higuchi (Waseda University)
2021. 12 - 2022.12: Yosuke Kashiwagi (Sony)
2021. 08 - 2021.12: Yen-Ju Lu (Academia Sinica)
2021. 07 - 2022.07: Yushi Ueda (Japan Patent Office)
At JHU
JHU student
2020. 08 - 2021.08: Tianzi Wang (CUHK)
2019. 09 - 2021.09: Jiatong Shi (transferred to CMU)
2019. 09 - 2020. 12: Xuankai Chang (transferred to CMU)
2018. 07 - 2019. 06: Zhiqi Wang
2017. 12 - 2020. 12: Matthew Wiesner (co-supervisor, JHU HLTCOE)
2017. 10 - 2022. 05: Matthew Maciejewski (co-supervisor, Amazon)
2017. 09 - 2018. 12: Szu-Jui Chen
2017. 09 - 2022. 05: Aswin Shanmugam Subramanian
Visitor
2020. 01 - 2021. 01: Pengcheng Guo (Northwestern Polytechnical University)
2019. 12 - 2020. 03: Yosuke Higuchi (Waseda University)
2019. 12 - 2020. 12: Jing Shi (Chinese Academy of Sciences)
2019. 07 - 2019. 10: Katsuki Inoue (Okayama University)
2018. 11 - 2019. 05: Murali Karthick Baskar (Brno University of Technology)
2018. 09 - 2018. 12: Xuankai Chang (Shanghai Jiao Tong University)
2018. 08 - 2018. 09: Hirofumi Inaguma (Kyoto University)
2018. 07 - 2020. 03: Yusuke Fujita (Hitachi Ltd.)
2018. 04 - 2018. 09: Nelson Enrique Yalta Soplin (Waseda University)
At MERL
2016. 11 - 2016. 01: Tsubasa Ochiai (Doshisha University -> NTT Labs)
2016. 08 - 2016. 11: Tomoki Hayashi (Nagoya University)
2016. 05 - 2016. 08: Zhong Meng (Georgia Institute of Technology -> Microsoft)
2016. 05 - 2016. 08: Suyoun Kim (Carnegie Mellon University -> Facebook)
2015. 09 - 2015. 12: Morteza Shahriari (University of Florida)
2014. 11 - 2015. 05: Zhuo Chen (Columbia University -> Microsoft)
2014. 09 - 2014. 11: Ahmed Hussen Abdelaziz (Ruhr-Universität Bochum -> ICSI -> Apple)
2014. 06 - 2014. 09: Yi Luan (University of Washington)
2013. 09 - 2014. 02: Felix Weninger (Technische Universität München -> Nuance)
2013. 06 - 2013. 08: Hao Tang (Toyota Technological Institute at Chicago -> MIT -> University of Edinburgh)
2012. 12 - 2013. 02: Koichiro Yoshino (Kyoto University -> NAIST)
At NTT
2011. 05 - 2011. 08: Ekapol Chuangsuwanich (Massachusetts Institute of Technology -> Chulalongkorn University)
2011. 01 - 2011. 03: Yasuhisa Fujii (Toyohashi University of Technology -> Google)
2010. 01 - 2010. 08: Jose Domingo Esparza Garcia (Technical University of Cartagena -> Ulm University)
2010. 01 - 2010. 03: Daisuke Saito (University of Tokyo)
2009. 08: Denis Babani (NAIST -> OwnerIQ, Inc.)
2009. 08 - 2009. 09, 2010. 08 - 2010. 09: Naohiro Tawara (Waseda University -> NTT Labs )
2008. 10 - 2008. 12: Yotaro Kubo (Waseda University -> NTT Labs -> Amazon -> Google)
2007. 12 - 2008. 11: David Cournapeau (Kyoto University -> Enthought)
2007. 08: Kenta Nishiki (University of Tokyo -> NTT Labs)
2006. 01 - 2006. 08: Rasa Narkeviciute (Kaunas University of Technology -> AEA Technology)
2006. 01 - 2006. 02: Hirokazu Kameoka (University of Tokyo -> NTT Labs/University of Tokyo)
2005. 07 - 2006. 05: Wesley William Arnquist (University of Washington -> Costco Wholesale)
2003. 08: Toshiaki Kubo (Waseda University -> Mitsubishi Electric)
2003. 04 - 2004. 03, 2008.10: Atsushi Sako (Ryukoku University -> Kobe University -> Nintendo)