#38 – How Well Do Large Language Models Reason in Under-Resourced Languages? Evidence from Vietnamese
Tuan Anh Do, Jelke Bloem
#42 – Register Sensitivity in Scalar MT Evaluation: Evidence from Spanish–Basque Informal Discourse
Nora Aranberri
#1 – Corpus-Linguists' Little Helpers? Evaluating LLMs for Linguistic Annotation (online)
Petra Bago, Virna Karlić
#45 – LLM as a Morphological Disambiguator for Belarusian (online)
Vladislav Poritski, Oksana Volchek, Ilia Afanasev
#9 – Interlinear Glosses as a Multilingual Pivot for Machine Translation: An Updated Study on Turkish with Restricted Resources
Volkan Ozer, Shu Okabe, Alexander Fraser
#37 – Benchmarking Multilingual LLM Translation Accuracy for Fuzhounese (online)
Sue Zheng, Jelke Bloem
#11 – Evaluating Nepali NER and POS Tagging Models on the Achhami Dialect (online)
Samikshya Dhamala, Rishav Beejukchhen, Subresh Thakulla, Supriya Khadka
#16 – From LLM Prompts to Acoustic Baselines: A Scalable Pipeline for Under-Resourced Disfluent Code-Mixed Speech
Anuran Mitra, Anirvan Chakravarty, Tapabrata Mondal, Sivaji Bandyopadhyay
#46 – Beyond Fine-Tuning: Procrustes Alignment of Multilingual Embeddings for Low-Resource Cross-Lingual Retrieval
Ali Faheem, Muhammad Hammad, Faizad Ullah, Ahmed Hassan, Fezan Rasool, Asim Karim
Papers: #14, #15, #18, #24, #26, #30, #32, #34, #35, #43, #44
Omnilinguality: Scaling AI to any language
Marta Ruiz Costa-Jussà (Meta)
#40 – Towards a General Theory of Linguistic Diversity (online)
Steven Bird
#3 – Language Identification for Low-Resource Formosan Languages (online)
Henry Gagnier
#25 – Rebelòt: Datasets and Token-Level Language Identification for Lombard-Italian-English Code-Mixing
Edoardo Signoroni, Emma Bednaříková, Pavel Rychly