コーパス (Corpus)
YODAS: Youtube-Oriented Dataset for Audio and Speech
Multi-lingual AudioCaps: Multi-lingual datasets for audio captioning (2023)
JVNV: Japanese phoneme-balanced speech corpus with verbal and non-verbal vocalization (2023)
Coco-Nut: In-the-wild Japanese speech corpus for prompt-based TTS (2023)
JTubeSpeech-ASV: Japanese corpus for automatic speaker verification (2023)
Laughterscape: in-the-wild Japanese laughter (2023)
DCASE2023-task7: Foley sound synthesis (2023)
CPJD: Crowdsourced Parallel Speech Corpus of Japanese Dialect (2023)
JNV: Japanese Nonverbal Vocalization corpus (2023)
STUDIES2: Complaint handling and Attentive Listening Lines Speech (2023)
Tohoku-folktale: Tohoku region folktale corpus (2022)
jaCappella: Japanese a Cappella Vocal Ensemble Corpus (2022)
STUDIES: Japanese empathetic dialogue speech corpus
SMASH:third-person audio commentaries on gameplay (2022)
SpeedSpeech-JA-2022: Japanese speech with a variety of speaking rate (2022)
JLecSponSpeech: Japanese spontaneous speech corpus (2022)
JECS: Japanese-English bilingual code-switching corpus (2022)
tri-jek: Japanese-English-Korean tri-lingual corpus (2021)
JSSS-misc: misc tasks of JSSS corpus (2021)
JTubeSpeech: Corpus of Japanese speech collected from YouTube (2021)
J-MAC: Japanese multi-speaker audiobook corpus (2021)
J-KAC: Japanese Kamishibai and audiobook corpus (2021)
JMD: Japanese multi-dialect corpus (2021)
JSSS: Japanese multi-style (summarization and simplification) corpus (2020)
RWCP-SSD-Onomatopoeia: onomatopoeic word dataset for environmental sounds (2020)
Life-m: landmark image-themed music corpus (2020)
PJS: Phoneme-balanced Japanese singing voice corpus (2020)
JVS-MuSiC: Japanese multi-speaker singing-voice corpus (2020)
JVS: Japanese multi-speaker voice corpus (2019)
JSUT-book: audiobook corpus by a single Japanese speaker (2021)
JSUT-vi: vocal imitation corpus by a single Japanese speaker (2018)
JSUT-song: singing voice corpus by a single Japanese singer (2018)
JSUT: a large-scaled corpus of reading-style Japanese speech by a single speaker (2017)