Tomoki Toda

Top Page

[Japanese| English]

First Name : Tomoki
Last Name : Toda
Birthday : January 18, 1977
Blood Type : O
Home Town : Ama-shi, Aichi, Japan
Job : Professor
Hobby : Car, Guitar, Baseball, Reading (manga...), Exercise
Biography
CV

Research

- Speech Processing
  - - Speech Conversion: Voice conversion, voice quality control, articulatory controllable speech modification & non-native speech correction
    - Speech Synthesis: Statistical waveform generation, statistical parametric speech synthesis & concatenative speech synthesis
    - Speech Analysis: Speech parameter estimation & para-/non-linguistic information analysis
    - Speech Recognition: Acoustic modeling & body-conducted speech recognition
    - Spoken Dialogue: Statistical dialogue management & response generation
    - Speech Translation: Para-/non-linguistic translation
    - Augmented Speech Production: Speaking-aid, body-conducted speech enhancement & voice changer
    - Speech Assessment: Speech quality assessment & anti-spoofing
- Music Processing
  - - Singing Voice Analysis: Singing voice quality estimation & description
    - Singing Voice Generation: Singing voice conversion, singing voice synthesis & singing-aid
    - Music Analysis: Acoustic feature estimation, similarity learning, score transcription & score following
    - Music Signal Separation: Monaural/stereo music source separation & vocal extraction
    - Music Generation: Drum pattern modeling, polyphonic music composition
- Sound Environment Processing
  - - Sound Event Recognition: Polyphonic sound event detection and symbolization & audio captioning
    - Anomalous Sound Detection: Generative & discriminative anomalous sound detection
    - Multichannel Sound Signal Processing: Microphone array processing, air-/body-conducted sound signal processing & noise reduction

Recent Talks

2025/12: Invited talk at Symposium on Speech & Behavior Informatics
- "Lessons learned from research in speech signal processing" [Slides]
2025/08: Survey talk at INTERSPEECH 2025
- "Recent advances and future directions in voice conversion" [Slides]
2024/07: Invited talk at USTC Frontier Forum on Intelligent Speech Analysis and Generation
- "Voice conversion techniques to separately control static and dynamic speech characteristics" [Slides]
2024/04: Invited talk at RASDAP 2024 "Challenges and Opportunities of Speech Technology Research in the Era of Large Models"
- "Challenges in leveraging large models for augmented speech production" [Slides]

Contact

Global Research Institute for Mobility in Society,
Institutes of Innovation for Future Society,
Nagoya University

Furo-cho, Chikusa-ku, Nagoya, 464-8601, JAPAN

E-mail: toda.tomoki.v6__at__f.mail.nagoya-u.ac.jp (Please replace __at__ with @.)

TEL: +81-52-789-4346

[Graduate School of Informatics]

[Nagoya University]

Page updated

Google Sites

Report abuse

Top Page

Research

Speech Processing

Music Processing

Sound Environment Processing

Recent Talks

Contact