Accepted Papers
Archival
CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models
Yuhang Wang, Yanxu Zhu, Chao Kong, Shuyu Wei, Xiaoyuan Yi, Xing Xie, Jitao Sang
Conformity, Confabulation, and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration
Razan Baltaji, Babak Hemmatian, Lav R. Varshney
Synchronizing Approach in Designing Annotation Guidelines for Multilingual Datasets: A COVID-19 Case Study Using English and Japanese Tweets
Kiki Ferawati, Wan Jou She, Shoko Wakamiya, Eiji Aramaki
CRAFT: Extracting and Tuning Cultural Instructions from the Wild
Bin Wang, Geyu Lin, Zhengyuan Liu, Chengwei Wei, Nancy F. Chen
Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models?
Yuu Jinnai
Do Multilingual Large Language Models Mitigate Stereotype Bias?
Shangrui Nie, Michael Fromm, Charles Welch, Rebekka Görge, Akbar Karimi, Joan Plepi, Nazia Afsan Mowmita, Nicolas Flores-Herr, Mehdi Ali
Sociocultural Considerations in Monitoring Anti-LGBTQ+ Content on Social Media
Sidney Gig-Jan Wong
Are Generative Language Models Multicultural? A Study on Hausa Culture and Emotions using ChatGPT
Ibrahim Said Ahmad, Shiran Dudy, Resmi Ramachandranpillai, Kenneth Church
Computational Language Documentation: Designing a Modular Annotation and Data Management Tool for Cross-cultural Applicability
Alexandra O'Neil, Daniel Glen Swanson, Shobhana Lakshmi Chelliah
Non-Archival
Examining the Dialect Robustness of Language Models for Conversation Understanding
Dipankar Srirag, Aditya Joshi
CULTURE-GEN: Natural Language Prompts Reveal Uneven Country Presence in Language Models
Huihan Li, Liwei Jiang, Jena D. Hwang, Hyunwoo Kim, Sebastin Santy, Taylor Sorensen, Bill Yuchen Lin, Nouha Dziri, Xiang Ren, Yejin Choi
K2EVAL: Harnessing the Evaluation of Linguistic Fluency and Ethnolinguistic Knowledge in Korean
Guijin Son, Hyunwoo Ko, Hoyoung Lee, Seunghyeok Hong, Yewon Kim, Jungwoo Kim
NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models
Abhinav Sukumar Rao, Akhila Yerukola, Vishwa Shah, Katharina Reinecke, Maarten Sap
Assessing LLMs' Ability to Navigate Cultural Knowledge Conflicts
Guijin Son, Hanwool Lee, Seonkyu Lim, Suzie Oh, Eunji Kim, Minchang Kim, Sangyub Lee, Seunghyeok Hong, Mingyou Sung, Dasol Choi, Yoonseo Han, Jeehyun Lee, Ilgyun Jeong, SangWon Baek
RSA+C3: Cross-Cultural Communication using RSA in Codenames
Isadora White, Sashrika Pandey, Michelle Pan
BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
Junho Myung, Nayeon Lee, Yi Zhou, Jiho Jin, Rifki Afina Putri, Eunsu Kim, Dimosthenis Antypas, Hsuvas Borkakoty, Hwaran Lee, Victor Gutierrez Basulto, Mohammad Taher Pilehvar, Carla Perez-Almendros, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh
From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets
Manuel Tonneau, Diyi Liu, Samuel Fraiberger, Ralph Schroeder, Scott A. Hale, Paul Röttger
Bridging Background Knowledge Gaps in Translation with Automatic Explicitation
HyoJung Han, Jordan Lee Boyd-Graber, Marine Carpuat
CIC: A Framework for Culturally-Aware Image Captioning
Youngsik Yun, Jihie Kim
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures
Agrima Seth, Sanchit Ahuja, Kalika Bali, Sunayana Sitaram
KoBBQ: Korean Bias Benchmark for Question Answering
Jiho Jin, Jiseon Kim, Nayeon Lee, Haneul Yoo, Alice Oh, Hwaran Lee
Perceptions of Language Technology Failures from South Asian English Speakers
Faye Holt, William Barr Held, Diyi Yang
CreoleVal: Multilingual Multitask Benchmarks for Creoles
Heather Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen, Marcell Richard Fekete, Esther Ploeger, Li Zhou, Ruth-Ann Armstrong, Abee Eijansantos, Catriona Malau, Hans Erik Heje, Ernests Lavrinovičs, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam de Lhoneux, Daniel Hershcovich, Michel DeGraff, Anders Søgaard, Johannes Bjerva
A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism
Brian Thompson, Mehak Preet Dhaliwal, Peter Frisch, Tobias Domhan, Marcello Federico
Does Mapo Tofu Contain Coffee? Probing LLMs for Food-related Cultural Knowledge
Li Zhou, Taelin Karidi, Nicolas Garneau, Yong Cao, Wanlong Liu, Wenyu Chen, Daniel Hershcovich
Towards Measuring and Modeling “Culture” in LLMs: A Survey
Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Shivdutt Singh, Ashutosh Dwivedi, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury
Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese
Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh
D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation
Aida Mostafazadeh Davani, Mark Diaz, Dylan K Baker, Vinodkumar Prabhakaran