Speech corpora

Arpod corpus is a speech corpus is built for language and dialect identification. It can be downloaded from:

https://github.com/computational-linguistics-department/Spoken-Language-and-Topic-Identification-Datasets

More details can be found in:

Khaled Lounnas, Mourad Abbas, Mohamed Lichouri. Building a Speech Corpus based on Arabic Podcasts for Language and Dialect Identification, 3rd International Conference on Natural Language and Speech Processing (ICNLSP2019m Italy, 2019.