COVID-19 ANTIBIOTIC dataset. Multilingual (CEF languages)

Multilingual (CEF languages) corpus acquired from the website https://antibiotic.ecdc.europa.eu/ . It contains 20981 TUs (in total) for EN-X language pairs, where X is a CEF language.

Resource Type:Corpus
Media Type:Text
Languages:Bokmål, Norwegian; Norwegian Bokmål
Bulgarian
Croatian
Czech
Danish
Dutch; Flemish
English
Estonian
Finnish
French
German
Greek, Modern (1453-)
Hungarian
Icelandic
Irish
Italian
Latvian
Lithuanian
Maltese
Moldavian; Moldovan
Polish
Portuguese
Romanian
Slovak
Slovenian
Spanish; Castilian
Swedish
COVID-19 EU presscorner v2 dataset. Multilingual (CEF languages)

Multilingual (CEF languages) corpus acquired from website (https://ec.europa.eu/commission/presscorner/) of the EU portal (8th July 2020). It contains 23 TMX files (EN-X, where X is a CEF language) with 151895 TUs in total.

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
Croatian
Czech
Danish
Dutch; Flemish
English
Estonian
Finnish
French
German
Greek, Modern (1453-)
Hungarian
Irish
Italian
Latvian
Lithuanian
Maltese
Moldavian; Moldovan
Polish
Portuguese
Romanian
Slovak
Slovenian
Spanish; Castilian
Swedish
Parallel corpora

Parallel corpora is a set of parallel texts in the domain of Law and Health, with 1 G per language. Languages: cs-pt, de-pt, en-pt, es-pt, fr-pt, it-pt, and pt-sk.

Resource Type:Corpus
Media Type:Text
Languages:Arabic
Chinese
Czech
English
French
German
Portuguese
Spanish; Castilian
Spanish-English website parallel corpus (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. This is a parallel corpus of bilingual texts crawled fro...

Resource Type:Corpus
Media Type:Text
Languages:English
Spanish; Castilian
QTLeap specialized lexicons

This resource is part of Deliverable 5.7 of the European Comission project QTLeap FP7-ICT-2013.4.1-610516 (http://qtleap.eu). This gazetteer comprises multilingual lexicon entries used for the translation of specific IT domain expressions for Basque, Bulgarian, Czech, Dutch, Engli...

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Basque
Bulgarian
Czech
Dutch; Flemish
English
Portuguese
Spanish; Castilian
QTLeap Specialized lexicons

This resource comprises multilingual lexicon entries used for the translation of specific IT domain expressions. This gazetteer has been collected from four different sources: VLC, LibreOffice and KDE localization projects and IT domain Wikipedia articles.

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Basque
Czech
English
German
Portuguese
Spanish; Castilian
Termcat Neoloteca

Terms that have (more or less) recently been accepted and normalised by Termcat, mixed fields

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Basque
Catalan; Valencian
English
French
Galician
German
Italian
Latin
Portuguese
Spanish; Castilian
Termcat Industry

Industry terms

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Basque
Catalan; Valencian
English
French
German
Italian
Portuguese
Spanish; Castilian
Termcat Social Webs

Terms of Social Webs

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
Galician
Italian
Portuguese
Spanish; Castilian
Termcat Research Thesaurus

Terms of Research Thesaurus

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
German
Italian
Latin
Portuguese
Spanish; Castilian

Order by:

Filter by:

English (39)
French (24)
German (24)
Italian (19)
Czech (17)
Bulgarian (15)
Basque (11)
Finnish (11)
Polish (11)
Swedish (11)
Romanian (10)
Latvian (7)
Slovak (7)
Danish (6)
Irish (6)
Maltese (6)
Arabic (2)
Chinese (2)
Latin (2)
Hindi (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)