ASR Strong Baseline

We provide the following baseline, consisting of a pre-trained version of the Wav2Vec 2.0 model and strongly recommend that participants use this model for transfer learning:

https://huggingface.co/Edresson/wav2vec2-large-xlsr-coraa-portuguese

Step by step to train and test the model:

https://github.com/Edresson/Wav2Vec-Wrapper/tree/main/Papers/SE%26R-Challenge#train-your-own-model-using-transfer-learning-of-baseline

Details of our baseline can be found in the following pre-print: https://arxiv.org/abs/2110.15731

CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese

Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio

Page updated

Google Sites

Report abuse