Home

This is the website for OffensEval, a series of shared tasks on offensive language identification organized at the International Workshop on Semantic Evaluation (SemEval). OffensEval models offensive content using a hierarchical annotation described in Zampieri et al., 2019 focusing on type and target of offensive content.

You can find information about OffensEval previous editions: 
  • OffensEval 2020: Multilingual Offensive Language Identification in Social Media (SemEval-2020 Task 12) [webpage] [report]
  • OffensEval 2019: Identifying and Categorizing Offensive Language in Social Media (SemEval-2019 Task 6) [webpage] [report]

You can also find information about the English datasets used at OffensEval:
  • OLID: Offensive Language Identification Dataset used in OffensEval 2019 [webpage] [paper]
  • SOLID: Semi-Supervised Offensive Language Identification Dataset used in OffensEval 2020 [webpage] [paper]

Finally, on this page, you will find links to datasets in other languages. OffensEval 2020 featured Arabic, Danish, Greek, and Turkish datasets.