The AGAC Task is part of the BioNLP Open Shared Tasks (BioNLP-OST: http://2019.bionlp-ost.org) and meets the BioNLP-OST standard of quality, originality and data formats.
Participants are welcomed to select tasks provided in AGAC Track. Among three subtasks, Task 1 is a traditional NER for 12 labels, which cultivate molecular phenomena related to gene mutation; Task 2 is a relation extraction task, which capture the thematic roles between entities; while Task 3 is a prediction task for the novel link discovery, which extract triple information among gene, function change, and disease out of the corpus texts.
The results of the AGAC Task will be presented at the BioNLP-OST workshop which is collocated with EMNLP-IJCNPL in Hong-Kong. Participating teams will be invited to submit their system description for publication in the proceedings of the workshop.
11 Mar, 2019. Sample data (50 texts) release.
10 Apr, 2019. Training data (250 texts) release.
12 Jul, 2019. Testing data (1000 texts) release.
12 Jul-19 Jul, 2019. Evaluation period.
19 Aug, 2019. Workshop paper submission due
TBD. Notification of paper acceptance
TBD. Camera ready paper submission
3 or 4 Nov, 2019. Workshop to be collocated with EMNLP-IJCNLP 2019 (Hong Kong)
28 Mar, 2020. Special issue submission due
(As a track in an open-shared task, AGAC track does not require any pre-registration.
Join Gmail group for contacting )
Identifying disease related genes and their mutations has long been a hot spot in biomedical community. One of the application is to help drug discovery by taking these genes as target genes. Therefore, the extraction of mutation-disease knowledge from PubMed is a worthy task. The gene-function change-disease knowledge in this track not only contains the relationship between mutation and disease, but also indicates the function change of the mutation, i.e., gain of function (GOF) and loss of function (LOF).
The purpose of AGAC track is to test the performance of various natural language processing (NLP) approaches on mutation-disease knowledge extraction in AGAC corpus.
250 texts with NER and Rel annotation labels for Task 1 and Task 2. (Click to download)
50 "Gene;Function change;disease" links for Task 3. (Click to download,submit format )
1000 testing data. Link.
94 cancers related "Gene; Function Change; Disease" links for expanding the knowledge discovery based on Task 3, along with abstract level evidences. Link.