The GENIA event extraction (GENIA) task is a main task in BioNLP Shared Task 2011 (BioNLP-ST '11).
For the GENIA task, the task definition remains the same as BioNLP Shared Task 2009 (BioNLP-ST'09). With the unchanged task definition, the purpose of running this task is to measure the progress of the community on the task. In order to avoid over-fitting to the evaluation data, additional training/evaluation data sets will be provided together with those for 2009 Shared Task. As the additional datasets will come from full text articles, the task includes generalization of the technology from abstracts only to full text articles.
homepage of BioNLP-ST'09. Here, we provide abstract of the task definition.
The format "Arg(Type)" indicates that an event takes an argument "Arg" which should identify an entity of type "Type": for example, Localization takes one Theme of protein type. five newly annotated PMC full paper articles are included in each of the training, development, and test sets (15 articles in total). Evaluation will be provided for each of the PubMed and PMC subsets. Note that five PMC full paper articles roughly amount to 150 PubMed abstracts which may not be sufficient for a separate run of training. We expect a kind of adaptation techniques to utilize the PubMed portion for the PMC portion to be useful.
The evaluation methods is this task is described here.
development data set and the test data set.
As BioNLP-ST 2011 data include BioNLP-ST 2009 data, the above evaluation service also can be used for the evaluation of BioNLP Shared Task 2009.