This corpus is a sample extracted from the corpus made available by the annual workshops/conferences on Statistical Machine Translation (WMT, see \url{http://www.statmt.org/}) from the News domain. To this end, 1104 English sentences and their corresponding human translations into Czech, German a...
Multilingual (CEF languages) corpus acquired from the website https://antibiotic.ecdc.europa.eu/ . It contains 20981 TUs (in total) for EN-X language pairs, where X is a CEF language.
Multilingual (CEF languages) corpus acquired from website (https://eur-lex.europa.eu/legal-content) of the EU portal (9th July 2020). It contains 23 TMX files (EN-X, X is a CEF language) with 475,931 translation units pairs in total.
Filter by:
Dutch; Flemish (13)
Bulgarian (12)
English (11)
German (11)
Portuguese (10)
Spanish; Castilian (10)
Czech (9)
French (8)
Italian (8)
Polish (8)
Latvian (7)
Romanian (7)
Croatian (6)
Danish (6)
Estonian (6)
Finnish (6)
Hungarian (6)
Irish (6)
Lithuanian (6)
Maltese (6)
Slovak (6)
Slovenian (6)
Swedish (6)
Basque (4)
Afrikaans (1)
Arabic (1)
Chinese (1)
Hindi (1)
Icelandic (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)
Vietnamese (1)
Written Language (6)
Multilingual (13)