Data Provided or Permissible

This site lists the training data is permissible for the training of MT systems and language models for ASR.

Provided Data

Other Permissible Data

Parallel:

Monolingual:

TED

LDC

Miscellaneous: