Online harassment is becoming prevalent as a specific type of communication on Twitter. Given the huge number of user-generated tweets each day, detecting and possibly limiting this content automatically in real time is becoming a fundamental problem, specifically for female figures who have been harassed for a long time and whom Twitter has been unable to help.
The proposed competition focuses on online harassment on Twitter in English. It has two related tasks. The first is a binary classification task that separates harassment tweets from not_harassment tweets. The second is a multi-class classification of online harassment tweets into three categories: "Indirect harassment", "Sexual harassment" and "Physical harassment".
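As a rough illustration of how the two tasks relate, the sketch below derives the binary label of the first task from the category label of the second. The data layout, the short category identifiers and the function name `binary_label` are assumptions made for this example only; the actual dataset format may differ.

```python
from typing import Optional

# Short identifiers for the three categories named in the task description.
# These exact strings are illustrative, not the dataset's real label names.
HARASSMENT_CATEGORIES = ("indirect", "sexual", "physical")


def binary_label(category: Optional[str]) -> int:
    """Return the task 1 label: 1 for harassment, 0 for not_harassment.

    A tweet with no category (None) is treated as not_harassment.
    """
    if category is None:
        return 0
    if category not in HARASSMENT_CATEGORIES:
        raise ValueError(f"unknown category: {category!r}")
    return 1


# Hypothetical placeholder rows: (tweet text, category or None).
examples = [("tweet A", None), ("tweet B", "sexual"), ("tweet C", "indirect")]

# Task 1 uses every tweet; task 2 uses only the harassment tweets.
task1_labels = [binary_label(category) for _, category in examples]
task2_labels = [category for _, category in examples if category is not None]
```

Under this reading, the second task operates on the subset of tweets that the first task labels as harassment.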
Important dates
Join the SIMAH mailing group: simah_competition_ecmlpkdd2019[at]googlegroups.com
Please note that the Google group will act as the main communication channel between the organizers and the participants.
Please fill in this questionnaire in order to obtain the dataset.
If you do not receive the dataset or have trouble accessing the data, please let us know.
For more information regarding the competition, please refer to the CodaLab website: CodaLab-SIMAH
Organizers: