The "Shared Task on Natural Language Understanding of Devanagari Script Languages" at CHIPSAL@COLING 2025 addresses key challenges in processing languages written in Devanagari script. In multilingual contexts, accurate language identification is critical, so Subtask A, Devanagari Script Language Identification, asks systems to identify which language a given Devanagari-script text is written in. Hate speech detection is another important part of understanding social dynamics, especially in online spaces. Subtask B, Hate Speech Detection, aims to determine whether a given text contains hate speech, using datasets annotated for the presence or absence of such content. Building on this, Subtask C, Targets of Hate Speech Identification, focuses on identifying the specific targets of hate speech, such as individuals, organizations, or communities. Together, the three subtasks cover language identification, hate speech detection, and hate speech target identification for Devanagari-script languages.
Subtask A (Devanagari Script Language Identification): Given a sentence in Devanagari script, the goal is to determine which of five languages it is written in: Nepali, Marathi, Sanskrit, Bhojpuri, or Hindi. This task addresses the critical need for accurate language identification in multilingual contexts.
Subtask B (Hate Speech Detection in Devanagari Script Language): Given a text, the goal is to identify whether it contains hate speech. The dataset for this subtask will have binary annotations for the presence of hate speech.
Subtask C (Target Identification for Hate Speech in Devanagari Script Language): The goal of this subtask is to identify the targets of hate speech in a given hateful text. The text is annotated for "individual", "organization", and "community" targets.
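As a rough illustration of the label spaces described above, the following Python sketch models one record per text and adds a simple script check. It is not the official data format: the `Example` class, its field names, the `devanagari_ratio` helper, the exact label spellings, and the assumption of a single target per text are all hypothetical. The only external fact relied on is that the main Unicode Devanagari block spans U+0900 to U+097F.

```python
from dataclasses import dataclass
from typing import Optional

# Label sets as listed in the subtask descriptions. The exact string
# spellings used in the released data files are an assumption.
LANGUAGES = ("Nepali", "Marathi", "Sanskrit", "Bhojpuri", "Hindi")
TARGETS = ("individual", "organization", "community")


def devanagari_ratio(text: str) -> float:
    """Share of non-whitespace characters in the Devanagari block (U+0900-U+097F)."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    in_block = sum(1 for c in chars if "\u0900" <= c <= "\u097f")
    return in_block / len(chars)


@dataclass(frozen=True)
class Example:
    """One hypothetical record; each label is filled only for its subtask."""
    text: str
    language: Optional[str] = None   # Subtask A: one of LANGUAGES
    is_hate: Optional[bool] = None   # Subtask B: binary label
    target: Optional[str] = None     # Subtask C: one of TARGETS (single target assumed)

    def __post_init__(self) -> None:
        # Reject labels outside the sets named in the task description.
        if self.language is not None and self.language not in LANGUAGES:
            raise ValueError(f"unknown language: {self.language!r}")
        if self.target is not None and self.target not in TARGETS:
            raise ValueError(f"unknown target: {self.target!r}")
        # Subtask C targets are defined only for hateful text.
        if self.target is not None and self.is_hate is False:
            raise ValueError("a target label requires hateful text")
```

A check such as `devanagari_ratio(text)` only confirms the script, not the language, which is why Subtask A needs a trained classifier: all five languages share the same character block.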
To participate in the shared task, please join our CodaLab competition.
More about the shared task: https://github.com/therealthapa/chipsal24
Training & Evaluation data available: August 19, 2024
Test data available: September 27, 2024
Testing phase starts: September 27, 2024
Testing phase ends: October 17, 2024
System Description Paper submissions due: November 10, 2024 (extended from November 3, 2024)
Notification to authors after review: December 3, 2024
Camera-ready: December 13, 2024
CHIPSAL Workshop: January 19, 2025
Surendrabikram Thapa (Virginia Tech, USA)
Kritesh Rauniyar (Delhi Technological University, India)
Farhan Ahmad Jafri (Jamia Millia Islamia, India)
Surabhi Adhikari (Columbia University, USA)
Kengatharaiyer Sarveswaran (University of Jaffna, Sri Lanka)
Bal Krishna Bal (Kathmandu University, Nepal)
Usman Naseem (Macquarie University, Australia)
For questions about the shared task, please contact rauniyark11@gmail.com