TTS Training Data

https://ee.iisc.ac.in/limmitsdataset/

The creation of the dataset was supported by Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) on behalf of the German Federal Ministry for Economic Cooperation and Development (BMZ).

The dataset contains recordings from male and female speakers in the following languages:

- English (Indian)

- Kannada

- Bengali

- Chhattisgarhi

- Hindi

- Telugu

- Marathi


This results in a TTS corpus of 14 speakers across 7 Indian languages. We share 40 hours of data from each speaker, resulting in a challenge corpus of 560 hours (14 speakers × 40 hours).
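
The corpus size stated above can be checked with a few lines of Python. This is a minimal sketch, not part of the dataset release: the names `LANGUAGES`, `GENDERS`, `HOURS_PER_SPEAKER`, and `corpus_summary` are hypothetical, and the code assumes one male and one female speaker per language, which is what 14 speakers over 7 languages implies.

```python
# Illustrative sketch only; these names are hypothetical and not shipped with the dataset.
LANGUAGES = [
    "English (Indian)", "Kannada", "Bengali", "Chhattisgarhi",
    "Hindi", "Telugu", "Marathi",
]
GENDERS = ["Male", "Female"]   # assumption: one speaker of each gender per language
HOURS_PER_SPEAKER = 40         # figure stated in the dataset description


def corpus_summary(languages=LANGUAGES, genders=GENDERS,
                   hours_per_speaker=HOURS_PER_SPEAKER):
    """Count languages and speakers, and total the shared hours of audio."""
    speakers = [(lang, gender) for lang in languages for gender in genders]
    return {
        "languages": len(languages),
        "speakers": len(speakers),
        "total_hours": len(speakers) * hours_per_speaker,
    }


if __name__ == "__main__":
    print(corpus_summary())
```

The same function can be reused to estimate the size of a subset, for example `corpus_summary(languages=["Hindi", "Telugu"])` for a two-language experiment.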