The system description paper should let another researcher:
Verify what the system does and how it has been trained.
Reimplement the system to reproduce the results.
Understand the system’s strengths and weaknesses.
The paper will be included in the WANLP 2020 workshop proceedings and needs to use the COLING 2020 templates. Please Download the MS Word and LaTeX templates here: https://coling2020.org/coling2020.zip
The paper is expected to be 4 pages long but you can extend to 6 pages plus any number of pages for reference and supplementary material. Please, note that the review process is not double blind, so anonymity is not required.
Submission Website: submissions should be done via softconf: https://www.softconf.com/coling2020/WANLP2020/
Please, make sure you choose NADI Shared Task Paper as a Submission Type, and the paper length is short paper (Check the screenshot below).
Shared task system paper submissions deadline: July 10, 2020, 11:59PM UTC-12:00 ("anywhere on Earth").
A common structure for system description papers is:
Abstract: four/five sentences highlighting your approach and key results.
Introduction: ¾ a page expanding on the abstract mentioning key background such as why the task is challenging for current modeling techniques and why your approach is interesting/novel.
Data: review of the data you used to train your system. Be sure to mention the size of the training, validation and test sets that you’ve used, and the label distributions, as well as any tools you used for preprocessing data.
System: a detailed description of how the systems were built and trained. If you’re using a neural network, were there pre-trained embeddings, how was the model trained, what hyperparameters were chosen and experimented with? How long did the model take to train, and on what infrastructure? Linking to source code is valuable here as well, but the description should be able to stand alone as a full description of how to reimplement the system. While other paper styles include background as a separate section, it’s fine to simply include citations to similar systems which inspired your work as you describe your system.
Results: a description of the key results of the paper. If you have done extra error analysis into what types of errors the system makes, this is extremely valuable for the reader. Unofficial results from after the submission deadline can be very useful as well.
Discussion: general discussion of the task and your system. Description of characteristic errors and their frequency over a sample of development data. What would you do if you had another 3 months to work on it?
Conclusion: a restatement of the introduction, highlighting what was learned about the task and how to model it.
Please use the following bibtex entry to cite the shared task description paper:
@inproceedings{mageed-etal-2020-nadi,
title ={{NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task}},
author = {Abdul-Mageed, Muhammad and Zhang, Chiyu and Bouamor, Houda and Habash, Nizar},
booktitle ={{Proceedings of the Fifth Arabic Natural Language Processing Workshop (WANLP 2020)}},
year = {2020},
address = {Barcelona, Spain}
}
You can download the bibtext entry here.
Acknowledgment
This page is an adaptation of the guidelines shared by the organizers of the MADAR shared task at WANLP 2019.