WNGT 2020 - DGT Task

WNGT 2020 DGT Shared Task

The WNGT 2020 DGT shared task on "Document-Level Generation and Translation” considers generating textual documents from either structured data or documents in another language or both.

News

2020/04/23

Our submission form is open. Please follow this link.
We have extended the submission deadline to Friday, April 24th, 2020 (23:59:59 anywhere on earth (UTC-12)).

Document-level Generation and Translation Task

Depending on the input / output pairs, there will be three types of systems that we will compare in the shared task:

NLG Track: Systems that take in structured data and output text in the target language.
MT Track: Systems that take in the text in a source language and outputs text in the target language.
NLG+MT Track: Systems that take in structured data and text in the source language and output text in the target language.

In addition, the data, listed below, is English-German, and accordingly, there will be two target languages:

English
German

This results in a total of 6 tracks. Participants can choose to work on a subset of the task or all the tracks.

Diagram of different tracks.

Dataset

We re-use RotoWire English-German dataset from the DGT task at WNGT2019. This is a subset of RotoWire dataset [1] with professional German translations. Each instance in this dataset has three components: Box score of a NBA game, the game summary in English, its German translation by domain experts. Please see here for more details. The downloaded dataset format is described here.

Usable Data Sources

Participants can further utilize the following resources for respective tracks. Systems that use data resources other than the listed resources will be marked as unconstrained and will be compared separately for fairer comparison.

1 & 2. NLG (Data -> En, Data -> De)

RotoWire English-German dataset (train / development sets)
The original RotoWire dataset (train / development sets)
Any of the monolingual resources specified below

3 & 4. MT (En <-> De)

RotoWire English-German dataset (train / development sets)
Any parallel data allowable by the WMT 2019 English-German news task
Any of the monolingual resources specified below

5 & 6. MT+NLG ([Data+En] -> De, [Data+De] -> En)

RotoWire English-German dataset (train / development sets)
The full RotoWire dataset (train / development sets)
Any parallel data allowable by the WMT 2019 English-German news task
Any of the monolingual resources specified below

Monolingual Resources (usable in all tracks)

Any monolingual data allowable by the WMT 2019 English-German news task
Pre-trained word/subword/character embeddings (e.g., GloVe, fastText)
Pre-trained contextualized embeddings (e.g., ELMo, BERT)
Pre-trained language models (e.g., GPT-2)

Evaluation

There will be a baseline system as described in [2]. Systems will be evaluated on the test split of RotoWire English-German dataset according to the following metrics:

Textual Accuracy measures
- BLEU
- ROUGE
Content Accuracy measures [1]
- Content Selection
- Relation Generation
- Content Ordering

Content accuracy evaluation is performed using this tool. Suggestions for evaluation methods are also very welcome.

Submission

Helper tools can be downloaded here.

1. Preparing the format

For all the tracks, we ask the participants to save the generated results in a similar format to the original dataset. Specifically, a submission file should be a single JSON file which contains a list of records with the following fields:

id: ID of each document.
summary: Word-tokenized generated summary.

For example, a valid submission file would look like below:

    {"id": "02_24_16-Cavaliers-Hornets-TheEasternConference-leadingClevelandCavaliers",

     "summary": ["Die", "in", "der", ...]},

    {"id": "01_01_16-Knicks-Bulls-TheChicagoBulls(19",

     "summary": ["Die", "Chicago", "Bulls", ...]},

    {"id": "11_07_16-Pelicans-Warriors-AnthonyDaviscontinuestobe",

     "summary": ["Anthony", "Davis", "ist", ...]},

    {"id": "04_01_16-Cavaliers-Hawks-Inwhatwasahistoric",

     "summary": ["In", "einer", "historischen", ...]},

    {"id": "01_07_17-Thunder-Nuggets-RussellWestbrookrecordedyetanother",

     "summary": ["Russell", "Westbrook", "verzeichnete", ...]},

...

Note that indentation like the example above is not needed for the submission file.

For MT track, we provide a script which converts sentence-by-sentence plaintext outputs into the specified format. Download the helper tools and run the script as follows:

$ python plain2json.py --source-dir /path/to/translations --target-json output.json

where each file in /path/to/translations directory should have one target language sentence per line.

2. Validate the submission file

Download the helper tools and run the validator as follows:

$ python validate_outputs.py /path/to/your/submission/file

Please fix the errors if prompted.

3. Submit

Please follow this submission form.

Timelines

All deadlines are 23:59:59 anywhere on earth (UTC-12).

January 2020: Task announcement, data release
April 10th, 2020: Resource addition cutoff
April 24th, 2020: System submission deadline (Extended)
May 1st, 2020: System descriptions due
May 10th, 2020: System description feedback provided
May 18th, 2020: Camera-ready system descriptions due

References

[1] Sam Wiseman, Stuart Shieber and Alexander Rush. Challenges in Data-to-Document Generation. EMNLP 2017.

[2] Ratish Puduppully, Li Dong, and Mirella Lapata. Data-to-text generation with content selection and planning. AAAI 2019.