Speech Lab, IIT Madras, presents the Automatic Speech Recognition (ASR) in Hindi challenge. This challenge is a part of the National Language Translation Mission funded by MeitY. It aims towards helping and encouraging the advancement of ASR in Indian Languages. We plan to have a series of challenges with increasing difficulty in different Indian languages, and release appropriate data with each challenge. In the first few challenges, we will release everything including source codes etc, so that start-ups/Universities/Research-Labs without previous experience in ASR can also participate and get familiar with it.
Recent advancements in Speech technology have shown that ASR systems can work on par with humans. To build an efficient ASR system, it would require large amounts of training data and high-end computational resources.
However, when it comes to Indian languages, not everyone, especially academic institutions and startups, have access to these resources. As a part of this challenge, we will be releasing speech data in Hindi. Everyone who participates in this challenge will then be free to use this data for research purposes.
The data set comprises of Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and then volunteers were asked to read them. It covers genres like politics, sports, entertainment, etc. The following data sets will be released for this challenge:
Train set - 40 hours
Development set - 5 hours
Evaluation set - 5 hours
Lexicon, baseline models, results and recipes to replicate the baseline experiments will also be made available.
Closed Hindi-ASR Challenge: Only the training data distributed as part of the challenge can be used to train the models (both acoustic and language models). Please do not use dev set data.
Open Hindi-ASR Challenge: You can use any external/additional data to train the acoustic and language models.
How to Participate
Enroll yourself by registering on this link: Register Now!
Registering on the above link provides access to the user license and download the training and test data for Hindi challenge
Submit results: Use submission portal submit your results.
The submission portal will open on 21st of August 4th of September and closes at midnight on 30th of August 13th of September (midnight anywhere in the world, i.e., 12pm UTC on August 30 September 13)
Submissions should include the ASR output produced by the system and a brief description of the system (see the documentation for further instructions about formatting and the submission procedure)
Participating teams can submit a maximum of 10 submissions per team
Results will be displayed on a leader board throughout the period that the submission site is open
Baseline results and recipes
Release of training data (40 hours), development data (5 hours), lexicon and baseline system: July 6, 2020
Evaluation data released and opening of submission site: August 21, 2020 September 4, 2020
Closing of submission site: August 30, 2020 September 13, 2020 (midnight anywhere in the world, i.e., 12pm UTC on August 30, 2020 September 13, 2020)
Announcement of results: September 4, 2020 September 18, 2020
About Speech Lab IITM
Speech lab IIT Madras is headed by Prof. S. Umesh and is part of the Dept. of Electrical Engg. Our focus is on building state of the art speech recognition systems, especially in Indian languages. Our research interests are in low-resource modelling, multilingual speech recognition and speaker normalisation.