EUIPO - list of goods and services German and English (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EUIPO list of goods and services format: TMX

Resource Type:Corpus
Media Type:Text
Languages:English
German
Khresmoi Query Translation Test Data for the Medical Domain version 1.0

This package contains data sets for development and testing of machine translation of medical search short queries between Czech, English, French, and German. The queries come from the general public and from medical experts.

Resource Type:Corpus
Media Type:Text
Languages:Czech
English
French
German
N3-Collection

We publish three novel datasets called N3. N3 will be published using NIF ensuring a greater interoperability to overcome the need for corpus-specific parsers. The data can be downloaded from our project homepage.

Resource Type:Corpus
Media Type:Text
Languages:English
German
NoSta-D: German NER Dataset Train/Dev

Freely available large dataset, manually annotated for German NER. Includes nested span annotations. Source text from German Wikipedia and news. This data set does not contain the test data, which is used for the GermEval 2014 NER task at KONVENS. Test data will be available from September 2014.

Resource Type:Corpus
Media Type:Text
Language:German
COVID-19 ANTIBIOTIC dataset. Multilingual (CEF languages)

Multilingual (CEF languages) corpus acquired from the website https://antibiotic.ecdc.europa.eu/ . It contains 20981 TUs (in total) for EN-X language pairs, where X is a CEF language.

Resource Type:Corpus
Media Type:Text
Languages:Bokmål, Norwegian; Norwegian Bokmål
Bulgarian
Croatian
Czech
Danish
Dutch; Flemish
English
Estonian
Finnish
French
German
Greek, Modern (1453-)
Hungarian
Icelandic
Irish
Italian
Latvian
Lithuanian
Maltese
Moldavian; Moldovan
Polish
Portuguese
Romanian
Slovak
Slovenian
Spanish; Castilian
Swedish
BMVI Publications (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. TMX file with 11555 TUs, bilingual German/English, publi...

Resource Type:Corpus
Media Type:Text
Languages:English
German
BMI Brochures 2011-2015 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...

Resource Type:Corpus
Media Type:Text
Languages:English
German
QTLeap Corpus V1.2

The QTLeap corpus is composed by 4000 question and answer pairs in the domain of computer and IT troubleshooting for both hardware and software. This material was collected using a support service via chat, this implies that the corpus is composed by naturally occurring utterances produced by use...

Resource Type:Corpus
Media Type:Text
Languages:Basque
Bulgarian
Czech
Dutch; Flemish
English
German
Portuguese
Spanish; Castilian
QTLeap LRT-M31-WP4

Treebanks and semantic lexicons for Basque, Bulgarian, Dutch, German and Portuguese. Created within European project QTLeap.

Resource Type:Corpus
Media Type:Text
Languages:Basque
Bulgarian
Dutch; Flemish
German
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Letter of rights for persons arrested on the basis of a ...

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
Dutch; Flemish
English
French
German
Greek, Modern (1453-)
Italian
Latvian
Polish
Romanian

Order by:

Filter by: