#31 – Fine-tuning Whisper with Spontaneous Persian Speech (SPS) (online)
Behnoosh Namdarzadeh, Nicolas Ballier
#6 – Small Language Models for Less-Resourced Languages in a Real-World Scenario: The Case for Catalan
Roser Saurí, Josep Sànchez-Ferreres, Lluís Padró, Josep Carmona
#7 – AmazoniaNLP: A Survey of Extreme Low-Resource Languages in the Peruvian-Brazilian Amazon
Rodolfo Joel Zevallos, Fabrício Carraro, John E. Ortega
#33 – AfriVoices-KE: A Multilingual Speech Dataset for Kenyan Languages (online)
Lilian Wanzare, Cynthia Jayne Amol, Ezekiel Maina, Nelson Odhiambo, Hope Kerubo, Leila Misula, Vivian Oloo, Rennish Mboya, Edwin Onkoba, Edward Ombui, Joseph Muguro, Ciira wa Maina, Andrew Kipkebut, Alfred Omondi Otom, Ian Ndung'u Kang'ethe, Angela Wambui Kanyi, Brian Gichana Omwenga
#36 – BiST: A Gold Standard Bangla–English Bilingual Corpus for Sentence Structure and Tense Classification with Inter-Annotator Agreement (online)
Abdullah Al Shafi, Swapnil Kundu Argha, M. A. Moyeen, Abdul Muntakim, Shoumik Barman Polok
#19 – Urdu-CLEVR: A Novel Benchmark for Visual Reasoning in an Under-Resourced Linguistic Context (online)
Sohail Ashraf, Adeel Zafar, Slawomir Nowaczyk, Ahthasham Sajid
#23 – HuNeBR: A Multitask Benchmark to Evaluate LLMs' Understanding of Northeastern Brazilian Portuguese Humor (online)
José Matheus do Nascimento Gama, David Candeia Maia, Leandro Balby Marinho, Fabio Morais, João Brunet
#21 – Open Machine Translation for Esperanto
Ona de Gibert, Lluís de Gibert Atienza