The SpeeD-TB Project

Speech Datasets and Models for Tibeto-Burman Languages

Funded by the Ministry of Electronics and Information Technology, Govt. of India

We are hiring! Click here to apply [Deadline: May 16, 2022]

Objective of the Project

The main objective of the project is to build a speech dataset of at least 1,200 hours consisting of around 200 hours in 6 Indian languages from the Tibeto-Burman language family - Toto, Chokri, Nyishi, Kok Borok, Bodo and Meetei. The project will also prepare phone sets, language models and baseline models for speech recognition in these languages.

The Project is to be implemented by the consortium of

Council for Strategic and Defense Research, New Delhi and Dr. Bhimrao Ambedkar University, Agra (Consortium Leader)
Manipur University, Imphal
Tezpur University, Tezpur
Indian Institute of Technology, Kharagpur
UnReaL-TecE LLP, Agra
Karya Inc., Gurugram
Panlingua Language Processing LLP, New Delhi

Page updated

Google Sites

Report abuse

The SpeeD-TB Project

Speech Datasets and Models for Tibeto-Burman Languages

Funded by the Ministry of Electronics and Information Technology, Govt. of India

We are hiring! Click here to apply [Deadline: May 16, 2022]

Objective of the Project

The Project is to be implemented by the consortium of

Get involved: