Multilingual (CEF languages) corpus acquired from website (https://ec.europa.eu/commission/presscorner/) of the EU portal (8th July 2020). It contains 23 TMX files (EN-X, where X is a CEF language) with 151895 TUs in total.
Multilingual (CEF languages) corpus acquired from the website https://antibiotic.ecdc.europa.eu/ . It contains 20981 TUs (in total) for EN-X language pairs, where X is a CEF language.
XGLUE is a new benchmark dataset to evaluate the performance of cross-lingual pre-trained models with respect to cross-lingual natural language understanding and generation. XGLUE is composed of 11 tasks spans 19 languages. For each task, the training data is only available in English. This me...
Filter by:
English (13)
French (13)
Polish (13)
German (11)
Romanian (11)
Spanish; Castilian (11)
Bulgarian (10)
Finnish (10)
Italian (10)
Swedish (10)
Latvian (9)
Dutch; Flemish (8)
Croatian (7)
Czech (7)
Estonian (7)
Hungarian (7)
Lithuanian (7)
Portuguese (7)
Danish (6)
Irish (6)
Maltese (6)
Slovak (6)
Slovenian (6)
Arabic (1)
Chinese (1)
Hindi (1)
Icelandic (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)
Vietnamese (1)