For both tracks, we split the data into training and testing partitions. For developing their methods, participants will use the training partition, and subsequently the test partition will be used to evaluate the participant methods and to determine the winner of the challenge. For ranking participants, we will use the F1 measure: macro average F1 measure for the author profiling track, and the F1 on the aggressive class for the aggressiveness identification track.
The training data file is password-protected; to obtain the password you first need to be registered as participant.
In order to provide more elements for the participants' experiments, we make available to all teams the following resources:
Download) .Download).Download last layer, download penultimate layer). The performance of your author profiling solution will be ranked by the average of the F1 measure of gender, residence and the occupation dimensions. We will use the macro average F1 measure.
The performance of your aggressive detection solution will the ranked by the F1 measure on the aggressive class.
Participants are allowed to submit up to two runs for each track: one primary and one secondary. The participants must clearly flag each of the two.
Submissions formatted as described below and sent via email to the account: mex.a3t@gmail.com
Your software has to output for each task of the dataset a corresponding txt file. The file must contain one line per classified instance. Each line looks like this:
"TaskName"\t"IdentifierOfAnInstance"\t"Class"\n
It's important to respect the format with the " character, \t (tabulator) and \n (linux enter). The naming of the output files is up to you, we recommend to use the author and a run's identifier as filename with "txt" as extension.
For the aggressiveness track the possible labels are:
For the author profiling track we need three different files and the possible labels are:
1. For gender identification task:
2. For the location identification task:
3. For the occupation identification task:
4. The name of the outputs files:
Participants of the tasks will be given the opportunity to write a paper that describes their system, resources used, results, and analysis that will be part of the official IberLef-2019 proceedings. The paper is to be FOUR pages long plus two pages at most for references, and are required to be formatted in the Springer LNCS format (see http://www.springer.de/comp/lncs/authors.html).
Papers must be written in English.