Accepted Papers

Archival

CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models

Yuhang Wang, Yanxu Zhu, Chao Kong, Shuyu Wei, Xiaoyuan Yi, Xing Xie, Jitao Sang


Conformity Confabulation and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration

Razan Baltaji, Babak Hemmatian, Lav R. Varshney


Synchronizing Approach in Designing Annotation Guidelines for Multilingual Datasets: A COVID-19 Case Study Using English and Japanese Tweets

Kiki Ferawati, Wan Jou She, Shoko Wakamiya, Eiji Aramaki


CRAFT: Extracting and Tuning Cultural Instructions from the Wild

Bin Wang, Geyu Lin, Zhengyuan Liu, Chengwei Wei, Nancy F. Chen


Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models?

Yuu Jinnai


Do Multilingual Large Language Models Mitigate Stereotype Bias?

Shangrui Nie, Michael Fromm, Charles Welch, Rebekka Görge, Akbar Karimi, Joan Plepi, Nazia Afsan Mowmita, Nicolas Flores-Herr, Mehdi Ali


Sociocultural Considerations in Monitoring Anti-LGBTQ+ Content on Social Media

Sidney Gig-Jan Wong


Are Generative Language Models Multicultural? A Study on Hausa Culture and Emotions using ChatGPT

Ibrahim Said Ahmad, Shiran Dudy, Resmi Ramachandranpillai, Kenneth Church


Computational Language Documentation: Designing a Modular Annotation and Data Management Tool for Cross-cultural Applicability

Alexandra O'Neil, Daniel Glen Swanson, Shobhana Lakshmi Chelliah


Non-Archival

Examining the Dialect Robustness of Language Models for Conversation Understanding

Dipankar Srirag, Aditya Joshi


CULTURE-GEN: Natural Language Prompts Reveal Uneven Country Presence in Language Models

Huihan Li, Liwei Jiang, Jena D. Hwang, Hyunwoo Kim, Sebastin Santy, Taylor Sorensen, Bill Yuchen Lin, Nouha Dziri, Xiang Ren, Yejin Choi


K2EVAL: Harnessing the Evaluation of Linguistic Fluency and Ethnolinguistic Knowledge in Korean

Guijin Son, Hyunwoo Ko, Hoyoung Lee, Seunghyeok Hong, Yewon Kim, Jungwoo Kim


NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models

Abhinav Sukumar Rao, Akhila Yerukola, Vishwa Shah, Katharina Reinecke, Maarten Sap


Assessing LLMs' Ability to Navigate Cultural Knowledge Conflicts

Guijin Son, Hanwool Lee, Seonkyu Lim, Suzie Oh, Eunji Kim, Minchang Kim, Sangyub Lee, Seunghyeok Hong, Mingyou Sung, Dasol Choi, Yoonseo Han, Jeehyun Lee, Ilgyun Jeong, SangWon Baek


RSA+C3: Cross-Cultural Communication using RSA in Codenames

Isadora White, Sashrika Pandey, Michelle Pan


BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

Junho Myung, Nayeon Lee, Yi Zhou, Jiho Jin, Rifki Afina Putri, Eunsu Kim, Dimosthenis Antypas, Hsuvas Borkakoty, Hwaran Lee, Victor Gutierrez Basulto, Mohammad Taher Pilehvar, Carla Perez-Almendros, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh


From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets

Manuel Tonneau, Diyi Liu, Samuel Fraiberger, Ralph Schroeder, Scott A. Hale, Paul Röttger


Bridging Background Knowledge Gaps in Translation with Automatic Explicitation

HyoJung Han, Jordan Lee Boyd-Graber, Marine Carpuat


CIC: A Framework for Culturally-Aware Image Captioning

Youngsik Yun, Jihie Kim


DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures

Agrima Seth, Sanchit Ahuja, Kalika Bali, Sunayana Sitaram


KoBBQ: Korean Bias Benchmark for Question Answering

Jiho Jin, Jiseon Kim, Nayeon Lee, Haneul Yoo, Alice Oh, Hwaran Lee


Perceptions of Language Technology Failures from South Asian English Speakers

Faye Holt, William Barr Held, Diyi Yang


CreoleVal: Multilingual Multitask Benchmarks for Creoles

Heather Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen, Marcell Richard Fekete, Esther Ploeger, Li Zhou, Ruth-Ann Armstrong, Abee Eijansantos, Catriona Malau, Hans Erik Heje, Ernests Lavrinovičs, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam de Lhoneux, Daniel Hershcovich, Michel DeGraff, Anders Søgaard, Johannes Bjerva


A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism

Brian Thompson, Mehak Preet Dhaliwal, Peter Frisch, Tobias Domhan, Marcello Federico


Does Mapo Tofu Contain Coffee? Probing LLMs for Food-related Cultural Knowledge

Li Zhou, Taelin Karidi, Nicolas Garneau, Yong Cao, Wanlong Liu, Wenyu Chen, Daniel Hershcovich


Towards Measuring and Modeling “Culture" in LLMs: A Survey

Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Shivdutt Singh, Ashutosh Dwivedi, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury


Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese

Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh


D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation

Aida Mostafazadeh Davani, Mark Diaz, Dylan K Baker, Vinodkumar Prabhakaran