This corpus is a sample extracted from the corpus made available by the annual workshops/conferences on Statistical Machine Translation (WMT, see \url{http://www.statmt.org/}) from the News domain. To this end, 1104 English sentences and their corresponding human translations into Czech, German a...
XGLUE is a new benchmark dataset to evaluate the performance of cross-lingual pre-trained models with respect to cross-lingual natural language understanding and generation. XGLUE is composed of 11 tasks spans 19 languages. For each task, the training data is only available in English. This me...
Filter by:
Bulgarian (12)
German (12)
Dutch; Flemish (11)
English (11)
Spanish; Castilian (10)
Czech (9)
French (9)
Italian (9)
Polish (9)
Portuguese (9)
Latvian (8)
Romanian (8)
Estonian (7)
Finnish (7)
Hungarian (7)
Lithuanian (7)
Swedish (7)
Croatian (6)
Danish (6)
Irish (6)
Maltese (6)
Slovak (6)
Slovenian (6)
Basque (3)
Arabic (1)
Chinese (1)
Hindi (1)
Icelandic (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)
Vietnamese (1)