Shared Tasks

Shared Task 1: Gender Rewriting

    • Description: The task of gender rewriting refers to generating alternatives to a given sentence to match different target user gender contexts (e.g., a female speaker with a male listener, a male speaker with a male listener, etc.). This requires changing the grammatical gender (masculine or feminine) of certain words referring to the users (speaker/1st person and listener/2nd person). In this task, we focus on Arabic, a gender-marking morphologically rich language. The task of gender rewriting was introduced by Alhafni et al. (2022).

    • Organizers: Bashar Alhafni, Ossama Obeid, Houda Bouamor, and Nizar Habash.

    • For more information, please visit the shared task website: https://sites.google.com/view/arabic-gender-rewriting/

Shared Task 2: Nuanced Arabic Dialect Identification (NADI)

  • Description: Arabic has a widely varying collection of dialects. Many of these dialects remain under-studied due to the rarity of resources. The goal of the Nuanced Arabic Dialect Identification (NADI) shared task series is to alleviate this bottleneck by providing datasets and modeling opportunities for participants to carry out dialect identification. Dialect identification is the task of automatically detecting the source variety of a given text or speech segment. In addition to nuanced dialect identification at the country level, NADI 2022 also offers a new subtask focused on country-level sentiment analysis. While we invite participation in either of the two subtasks, we hope that teams will submit systems to both tasks (i.e., participate in the two tasks rather than only one task). By offering two subtasks, we hope to receive systems that exploit diverse machine learning architectures. This could include multi-task learning systems as well as sequence-to-sequence architectures in a single model such as the text-to-text Transformer. Other approaches could also be possible. We introduce the two subtasks next.

  • Organizers: Muhammad Abdul-Mageed, Chiyu Zhang, Abdelrahim Elmadany, Nizar Habash, and Houda Bouamor.

  • For more information, please visit the shared task website: https://nadi.dlnlp.ai/

Shared Task 3: Propaganda Detection in Arabic Social Media Text

  • Description: Propaganda is defined as an expression of opinion or action by individuals or groups deliberately designed to influence the opinions or actions of other individuals or groups with reference to predetermined ends. This is achieved by means of well-defined rhetorical and psychological devices. Currently, propaganda (or persuasion) techniques have been commonly used on social media to manipulate or mislead social media users. More detail on the techniques can be found here: https://propaganda.qcri.org/annotations/definitions.html. The goal of the shared task is to build models for identifying such techniques in Arabic social media text (specifically tweets).

  • Organizers: Firoj Alam, Hamdy Mubarak, Wajdi Zaghouani, Preslav Nakov, and Giovanni Da San Martino.

  • For more information, please visit the shared task’s website: https://sites.google.com/view/propaganda-detection-in-arabic/