This year's Text Translation Task addresses a new translation direction: from English to Czech. We would like participants to investigate translation into a moderately morphologically rich language and to overcome the difficulty of having fewer resources. Furthermore, participants need to consider applying domain and genre adaptation methods, as we will test the translation systems on spoken-style TED talks.
Any other permitted data will be listed here and announced to participants via the evaluation mailing list.
TAR archive file structure:
<UserID>/<Set>.<Task>.<UserID>.primary.xml
/<Set>.<Task>.<UserID>.contrastive1.xml
/<Set>.<Task>.<UserID>.contrastive2.xml
/...
where:
<UserID> = user ID of participant used to download data files
<Set> = IWSLT19.SLT.tst2019
<Task> = <fromLID>-<toLID>
<fromLID>, <toLID> = Language identifiers (LIDs) as given by ISO 639-1 codes; see for example the WIT3 webpage.
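As an illustration of the naming scheme above, the following Python sketch builds the archive member names and packs runs into a TAR archive in memory. It is not an official submission tool: the helper names (task_id, run_path, build_archive), the user ID "user01", and the placeholder XML payloads are assumptions made for this example. The language pair "en-cs" follows from the ISO 639-1 codes for English and Czech.

```python
import io
import tarfile

SET_NAME = "IWSLT19.SLT.tst2019"  # <Set> as given in the structure above


def task_id(from_lid, to_lid):
    """Build <Task> = <fromLID>-<toLID> from two ISO 639-1 codes."""
    return f"{from_lid}-{to_lid}"


def run_path(user_id, task, run, set_name=SET_NAME):
    """Build <UserID>/<Set>.<Task>.<UserID>.<run>.xml for one run."""
    return f"{user_id}/{set_name}.{task}.{user_id}.{run}.xml"


def build_archive(user_id, task, runs):
    """Pack runs (a dict mapping run name to XML bytes) into a TAR archive."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for run, payload in runs.items():
            info = tarfile.TarInfo(name=run_path(user_id, task, run))
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    return buf.getvalue()


if __name__ == "__main__":
    task = task_id("en", "cs")  # English to Czech
    # Hypothetical user ID and placeholder payloads, for illustration only.
    archive = build_archive("user01", task, {
        "primary": b"<placeholder/>",
        "contrastive1": b"<placeholder/>",
    })
    with tarfile.open(fileobj=io.BytesIO(archive)) as tar:
        for name in tar.getnames():
            print(name)
```

In practice the payloads would be the actual run files, and the archive bytes would be written to disk before being mailed.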
The PRIMARY run for each Multilingual Task will be used for the official scoring; nevertheless, CONTRASTIVE runs will be evaluated as well. Runs for different tasks can be included in the same archive. Runs may be re-submitted as long as the mails arrive BEFORE the submission deadline. If the same participant submits multiple TAR archives, only the runs in the most recent submission mail will be used for the IWSLT 2019 evaluation; earlier mails will be ignored.