Speech corpora
Arpod corpus is a speech corpus is built for language and dialect identification. It can be downloaded from:
https://github.com/computational-linguistics-department/Spoken-Language-and-Topic-Identification-Datasets
More details can be found in:
Khaled Lounnas, Mourad Abbas, Mohamed Lichouri. Building a Speech Corpus based on Arabic Podcasts for Language and Dialect Identification, 3rd International Conference on Natural Language and Speech Processing (ICNLSP2019m Italy, 2019.