JLecSponSpeech

JLecSponSpeech (Japanese lecture spontaneous speech corpus)

JLecSponSpeech is a corpus of transcriptions of lectures given at the University of Tokyo. The transcriptions include annotations such as disfluency tags, so the corpus can be used for research on spontaneous speech synthesis and related topics.

Download

You can download the corpus from here.

Description

The corpus contains transcriptions of the University of Tokyo lecture videos listed under the link below. Each transcription includes the following information.

Disfluency tags (disfluency type and timing information)
- word fragments, mispronunciations, misstatements, rephrasings, repetitions, filled pauses, pauses, lengthening, laughter
- the tag format is <tag_name　start_time　end_time>disfluency_text</tag_name>
Transcription comments (for example, passages that were hard to hear)
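
The tag format above can be read with a regular expression. The following is a minimal sketch, not an official loader for this corpus: the tag name `FP`, the time values, and the sample sentence are hypothetical, and the helper names `extract_disfluencies` and `strip_tags` are introduced here only for illustration. Times are kept as strings because the time unit is not stated in this README, and nested tags are not handled.

```python
import re
from typing import List, NamedTuple


class Disfluency(NamedTuple):
    tag: str    # tag name, e.g. a filled-pause label (actual names are not listed here)
    start: str  # start time, kept as text because the unit is an assumption
    end: str    # end time, kept as text for the same reason
    text: str   # the disfluent text enclosed by the tag


# \s also matches the full-width space (U+3000), so both ASCII and
# full-width separators between the tag name and the times are accepted.
TAG_RE = re.compile(
    r"<(?P<tag>[^\s<>/]+)\s+(?P<start>[^\s<>]+)\s+(?P<end>[^\s<>]+)>"
    r"(?P<text>.*?)</(?P=tag)>",
    re.DOTALL,
)


def extract_disfluencies(line: str) -> List[Disfluency]:
    """Return every non-nested disfluency tag found in one transcript line."""
    return [
        Disfluency(m["tag"], m["start"], m["end"], m["text"])
        for m in TAG_RE.finditer(line)
    ]


def strip_tags(line: str) -> str:
    """Remove the tag markup but keep the disfluent text itself."""
    return TAG_RE.sub(lambda m: m["text"], line)


# Hypothetical sample line; "FP" and the times are made up for illustration.
sample = "今日は<FP 1.20 1.55>えーと</FP>音声合成の話です"
print(extract_disfluencies(sample))
print(strip_tags(sample))  # 今日はえーと音声合成の話です
```

Keeping the tag name, times, and text together in one record makes it straightforward to filter by disfluency type later, for example to collect only filled pauses.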

Links to lecture videos

The lecture videos are listed here.

Terms of use

This corpus may be used only for non-commercial research purposes.

Contributors

Yuta Matsunaga / 松永裕太 (the University of Tokyo) [main contributor]
Takaaki Saeki / 佐伯高明 (the University of Tokyo)
Shinnosuke Takamichi / 高道慎之介 (the University of Tokyo)
Hiroshi Saruwatari / 猿渡洋 (the University of Tokyo)

Citation

Please cite the following paper.

Yuta Matsunaga, Takaaki Saeki, Shinnosuke Takamichi, Hiroshi Saruwatari, "Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis," in Proc. APSIPA ASC, Nov. 2022, pp. 1898-1903.