Challenge

Challenge tracks

To encourage broad participation from research communities across the world and to bring out novel approaches, the challenge has 4 tracks. Track 1 restricts participants to the data shared through this special session and invites them to try different modelling techniques and architectures. In tracks 2 and 3, participants may use existing resources or model checkpoints (beyond what is shared in this special session) for either the acoustic model (AM) or the language model (LM), while building the other using in-corpus data (shared in this special session) only. These tracks may help in understanding the respective roles of the AM and the LM in building a reliable automatic speech recognition (ASR) system in a low-resource setting. Finally, track 4 allows participants to use any existing resources along with the in-corpus audio and text to get the best performance on the blind test set. All 4 tracks are summarized in the table below:

In-corpus = approx. 1000 hours of dialect-balanced supervised audio + text, along with approx. 100k dialect-labelled sentences

Challenge Timeline

Registration opens - 27 May 2023

Dataset (train+dev) shared - 27 May 2023

Baselines release - 28 May 2023

Dataset (test) sharing - 28 June 2023

Challenge submission opens - 30 June 2023

Paper submission deadline - 10 July 2023 (originally 03 July 2023)

Final challenge submission deadline - 10 July 2023 (originally 05 July 2023)

Challenge acceptance results - 12 July 2023 (originally 07 July 2023)

Paper revision deadline - 17 July 2023 (originally 10 July 2023)

Note: Anywhere on Earth (AoE) time is used for deadlines
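
For participants converting a deadline to their own time zone, the following is a minimal sketch rather than an official tool. It assumes that a deadline listed as a date runs until the end of that calendar day in AoE (UTC-12:00); the page does not state a time of day, so that cutoff is an assumption. The function name `aoe_deadline_utc` is hypothetical, and 10 July 2023 is used only as an example date taken from the timeline.

```python
from datetime import datetime, timedelta, timezone

# AoE corresponds to the fixed offset UTC-12:00.
AOE = timezone(timedelta(hours=-12), name="AoE")


def aoe_deadline_utc(year: int, month: int, day: int) -> datetime:
    """Return the last second of the given AoE calendar date, expressed in UTC.

    Assumption: the deadline is the end of the listed day (23:59:59 AoE).
    """
    end_of_day_aoe = datetime(year, month, day, 23, 59, 59, tzinfo=AOE)
    return end_of_day_aoe.astimezone(timezone.utc)


if __name__ == "__main__":
    # Example date from the timeline: 10 July 2023.
    print(aoe_deadline_utc(2023, 7, 10).isoformat())
    # prints 2023-07-11T11:59:59+00:00
```

Under this assumption, a 10 July 2023 AoE deadline ends at 11:59:59 UTC on 11 July 2023; calling `.astimezone()` with a local time zone on the result gives the local equivalent.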
