Multilingual (CEF languages) corpus acquired from the website https://antibiotic.ecdc.europa.eu/ . It contains 20981 TUs (in total) for EN-X language pairs, where X is a CEF language.
Multilingual (CEF languages) corpus acquired from website (https://eur-lex.europa.eu/legal-content) of the EU portal (9th July 2020). It contains 23 TMX files (EN-X, X is a CEF language) with 475,931 translation units pairs in total.
XGLUE is a new benchmark dataset to evaluate the performance of cross-lingual pre-trained models with respect to cross-lingual natural language understanding and generation. XGLUE is composed of 11 tasks spans 19 languages. For each task, the training data is only available in English. This me...
Filter by:
English (13)
Polish (13)
French (13)
German (11)
Romanian (11)
Spanish; Castilian (11)
Bulgarian (10)
Finnish (10)
Italian (10)
Swedish (10)
Latvian (9)
Dutch; Flemish (8)
Croatian (7)
Czech (7)
Estonian (7)
Hungarian (7)
Lithuanian (7)
Portuguese (7)
Danish (6)
Irish (6)
Maltese (6)
Slovak (6)
Slovenian (6)
Arabic (1)
Chinese (1)
Hindi (1)
Icelandic (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)
Vietnamese (1)