[Licensed under Creative Common Non-Commercial Share-Alike 4.0 licence CC-BY-NC-SA 4.0]
@InProceedings{KUMAR18.861,
author = {Ritesh Kumar and Aishwarya N. Reganti and Akshit Bhatia and Tushar Maheshwari},
title = "{Aggression-annotated Corpus of Hindi-English Code-mixed Data}",
booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)},
year = {2018},
month = {May 7-12, 2018},
address = {Miyazaki, Japan},
editor = {Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis and Takenobu Tokunaga},
publisher = {European Language Resources Association (ELRA)},
isbn = {979-10-95546-00-9},
language = {english}
}
@inproceedings{kumar-etal-2018-benchmarking,
title = "Benchmarking Aggression Identification in Social Media",
author = "Kumar, Ritesh and
Ojha, Atul Kr. and
Malmasi, Shervin and
Zampieri, Marcos",
booktitle = "Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying ({TRAC}-2018)",
month = aug,
year = "2018",
address = "Santa Fe, New Mexico, USA",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/W18-4401",
pages = "1--11"
}
The workshop includes a shared task on ‘Aggression Identification’. The task will be to develop a classifier that could make a 3-way classification in between ‘Overtly Aggressive’, ‘Covertly Aggressive’ and ‘Non-aggressive’ text data.
We are making available a dataset of 15,000 aggression-annotated Facebook Posts and Comments each in Hindi (in both Roman and Devanagari script) and English for training and validation. We will release additional data for testing your system. Please register here to download the data and participate in the task.
The submitted system will be evaluated on the basis of weighted macro-averaged F-scores. The individual F-score of each class will be weighted by the proportion of the concerned class in the test set and the final F-score will be the average of these individual F-scores of each class.
Dates
Training set release March 13, 2018 [Extended Date]
Test set release April 21,2018 April 25, 2018 [Extended Date]
Submissions due April 24, 2018 April 30, 2018 [Extended Date]
Results announcement April 28, 2018 May 5, 2018 [Extended Date]
System papers deadline May 25, 2018 May 28, 2018 [Extended Date]
Reviews for papers June 20, 2018 June 25, 2018
Camera-ready versions June 30, 2018 July 5, 2018
[Timezone: as long as it’s the date mentioned, anywhere on earth; UTC-12.]
Data will be made publicly available after the end of the competition under Creative Commons Non-Commercial Share-Alike 4.0 licence CC-BY-NC-SA 4.0! Please Click Here to get the dataset used in the task.