Multilingual (CEF languages) corpus acquired from website (https://ec.europa.eu/commission/presscorner/) of the EU portal (8th July 2020). It contains 23 TMX files (EN-X, where X is a CEF language) with 151895 TUs in total.
Parallel corpora is a set of parallel texts in the domain of Law and Health, with 1 G per language. Languages: cs-pt, de-pt, en-pt, es-pt, fr-pt, it-pt, and pt-sk.
Filter by:
French (32)
English (30)
German (25)
Spanish; Castilian (24)
Italian (20)
Portuguese (19)
Polish (13)
Romanian (12)
Finnish (11)
Swedish (11)
Bulgarian (10)
Czech (10)
Latvian (9)
Dutch; Flemish (8)
Croatian (7)
Estonian (7)
Hungarian (7)
Lithuanian (7)
Slovak (7)
Danish (6)
Irish (6)
Maltese (6)
Slovenian (6)
Galician (4)
Arabic (2)
Basque (2)
Chinese (2)
Latin (2)
Central Khmer (1)
Hindi (1)
Icelandic (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)
Vietnamese (1)
Social Questions (5)
LAW (4)
INDUSTRY (2)
HEALTH (1)
POLITICS (1)
SOCIAL QUESTIONS (1)
SOCIAL QUESTIONS (1)
TRADE (1)