How can I participate and/or access the database?
Ans) Enrol by registering through this link: Register Here
How do I submit the challenge results?
Ans) Individual submission <LINK>
Team submission <LINK>
Can participants use pre-trained acoustic and/or language models?
Ans)
Closed Challenge: No pretrained models or external data may be used. Only the 100 hours of Gramvaani data are allowed.
Self Supervised Closed Challenge: Participants can use the 1000 hours of Gramvaani data we are sharing to develop a pretrained model built only on Gramvaani data. No other pretrained models or data may be used. Both the 100 hours and the 1000 hours of Gramvaani data are allowed here.
Open Challenge: Any other external data, models, etc. may be used.
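The three track rules above can be summarised as a small lookup table. The Python sketch below is illustrative only: the names `TRACK_RULES` and `is_allowed` and the resource labels are hypothetical and not part of any official challenge tooling. It also assumes that the Open Challenge permits the Gramvaani data in addition to external resources.

```python
# Hypothetical resource labels; these are not official challenge identifiers.
GRAMVAANI_100H = "gramvaani_100h"    # the 100 hours of Gramvaani data
GRAMVAANI_1000H = "gramvaani_1000h"  # the 1000 hours of Gramvaani data
EXTERNAL = "external"                # any external data or pretrained model

# Resources permitted on each track, as described in the answer above.
# Assumption: the Open Challenge also allows both Gramvaani sets.
TRACK_RULES = {
    "closed": {GRAMVAANI_100H},
    "self_supervised_closed": {GRAMVAANI_100H, GRAMVAANI_1000H},
    "open": {GRAMVAANI_100H, GRAMVAANI_1000H, EXTERNAL},
}


def is_allowed(track, resources):
    """Return True if every resource in `resources` is permitted on `track`."""
    return set(resources) <= TRACK_RULES[track]
```

For example, `is_allowed("closed", [GRAMVAANI_100H, EXTERNAL])` is false because external resources are limited to the Open Challenge.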
Can participants use external data to train the model?
Ans) Yes, but only in the Open Challenge track.
Can participants use techniques such as data augmentation, speech enhancement, etc. with the ASR system?
Ans) Yes, any approach can be used with the ASR system.
Can participants use the provided data outside the challenge?
Ans) Yes. After the challenge is complete, the data can be used for other research activities, but the data must be cited.
What is the sampling rate composition of the test set?
Ans) The sampling rate distribution follows that of the training set.
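To see which sampling rates to expect, participants can measure the distribution on their copy of the training set. The sketch below is a minimal example, assuming the audio is available as uncompressed WAV files; the function name `sampling_rate_distribution` is hypothetical, and other audio formats would need a different reader.

```python
import wave
from collections import Counter


def sampling_rate_distribution(wav_paths):
    """Return {sampling_rate_hz: fraction_of_files} for the given WAV files."""
    counts = Counter()
    for path in wav_paths:
        # Only the header is needed to read the sampling rate.
        with wave.open(str(path), "rb") as wav_file:
            counts[wav_file.getframerate()] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {rate: n / total for rate, n in counts.items()}
```

The result maps each sampling rate found to the share of files recorded at that rate.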
Can a participant build their own lexicon?
Ans) Yes, they can build their own lexicon.
Can participants use external unlabelled data in the Self Supervised Closed Challenge?
Ans) No. External data of any type can be used only in the Open Challenge track.
Can participants use pseudo-labelling in the Self Supervised Closed Challenge?
Ans) Yes.
Is a dictionary available?
Ans) Yes, a dictionary is available on the baseline GitHub page. Please check the baseline tab of the webpage.
Is the transcription of the 100-hour training data noisy?
Ans) Yes, it is noisy.
Can I use data from another challenge for training?
Ans) Yes for the Open Challenge track, but not for the closed tracks.
Can participants use the lexicon file provided with the baseline model for challenges 1 and 2 (the closed challenges)? The lexicon file contains more words than Train+Dev.
Ans) Yes, they can.
How many submissions are allowed from an organization?
Ans) No more than 3 submissions per organization are allowed for each track.
Is it allowed to add the Dev set data to the Train set?
Ans) Yes, you can. You can use the Dev and Train data to train a model and evaluate it on the Eval set. However, do not use that model to evaluate the Dev data.
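The rule in this answer amounts to never scoring a model on a split it was trained on. A minimal sketch of that check follows; the split labels and the function name `valid_scoring_splits` are hypothetical and used only for illustration.

```python
# Hypothetical split labels for the Train, Dev, and Eval sets.
ALL_SPLITS = {"train", "dev", "eval"}


def valid_scoring_splits(training_splits):
    """Return the splits a model may be scored on: those it was not trained on."""
    training_splits = set(training_splits)
    unknown = training_splits - ALL_SPLITS
    if unknown:
        raise ValueError(f"unknown splits: {sorted(unknown)}")
    return ALL_SPLITS - training_splits
```

A model trained on Train and Dev together can therefore be scored only on Eval, while a model trained on Train alone can still be scored on Dev.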