This is the website for OffensEval 2019: OffensEval: Identifying and Categorizing Offensive Language in Social Media (SemEval 2019 - Task 6).
The competition is now finished. For more information, please consult the OffensEval 2019 report. When referring to the competition, please use the bib entry below:
@inproceedings{zampieri2019semeval,
title={SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval)},
author={Zampieri, Marcos and Malmasi, Shervin and Nakov, Preslav and Rosenthal, Sara and Farra, Noura and Kumar, Ritesh},
booktitle={Proceedings of the 13th International Workshop on Semantic Evaluation},
pages={75--86},
year={2019}
}
Task Description
In OffensEval 2019 we break down offensive content into the following three sub-tasks taking the type and target of offenses into account.
Sub-task A - Offensive language identification;
Sub-task B - Automatic categorization of offense types;
Sub-task C - Offense target identification.
Data
The dataset used in OffensEval 2019 is the Offensive Language Identification dataset (OLID). OLID contains 14,200 English tweets annotated using a hierarchical three-level annotation model.
For more information, please check the dataset paper. To download the dataset please visit OLID's page.
Dates
28 Nov 2018: Training Data Release
15 Jan 2019: Sub-task A test data release
17 Jan 2019: Submission sub-task A
22 Jan 2019: Sub-task B test data release
24 Jan 2019: Submission sub-task B
29 Jan 2019: Sub-task C test data release
31 Jan 2019: Submission sub-task C
5 Feb 2019: Results announced
10 Mar 2019: System and task description paper submissions due
10 Apr 2019: Author notifications
20 Apr 2019: Camera ready submissions due
Organizers
Marcos Zampieri (University of Wolverhampton, UK)
Shervin Malmasi (Harvard Medical School, USA)
Preslav Nakov (Qatar Computing Research Insitute, Qatar)
Sara Rosenthal (IBM Research, USA)
Noura Farra (Columbia University, USA)
Ritesh Kumar (Bhim Rao Ambedkar University, India)