J-KAC: Japanese Kamishibai and audiobook corpus

ダウンロード (download)

Download from here.

This corpus consists of voices of Japanese Kamishibai (picture card show) and audiobooks. The specifications are as follows.

Voices by a professional speaker: The speaker is acting in a variety of styles and characters.
Structured text: Sentences are structured in chapters, sections, etc.
- character name: annotation: Spoken phrases are annotated with the character name.
Illustration: Illustrations of the Kamishibais and books are also stored.

The voice duration is 9 hours, and the number of works is 42.

本コーパスは，紙芝居とオーディオブックの朗読音声からなります．スペックは以下のとおりです．

音声の長さは9時間で，作品数は42です．

Research only (visit the download page to see the details.)

研究のみ (詳細はダウンロードページを参照)

丹治尚子

Naoko Tanji

高道慎之介，中田亘，郡山知樹，丹治尚子，井島勇祐，増村亮，猿渡洋，"J-KAC：日本語オーディオブック・紙芝居朗読音声コーパス," 情報処理学会研究報告， Jun. 2021. (日本語)
Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Naoko Tanji, Yusuke Ijima, Ryo Masumura and Hiroshi Saruwatari, "Audiobook Speech Synthesis Conditioned by Cross-Sentence Context-Aware Word Embeddings," Proc. The 11th ISCA SSW, Online, Aug. 2021.

本コーパスの構築は，以下のプロジェクトを受けて実施したものです．

Google Sites

Report abuse