A set of parallel corpora in the Law and Health domains, with 1 G per language. Language pairs: cs-pt, de-pt, en-pt, es-pt, fr-pt, it-pt, and pt-sk.
Multilingual corpus (CEF languages) acquired from the website https://antibiotic.ecdc.europa.eu/. It contains 20981 translation units (TUs) in total for EN-X language pairs, where X is a CEF language.