NoSta-D: German NER Dataset Train/Dev

Freely available large dataset, manually annotated for German NER. Includes nested span annotations. Source text from German Wikipedia and news. This data set does not contain the test data, which is used for the GermEval 2014 NER task at KONVENS. Test data will be available from September 2014.

Resource Type:Corpus
Media Type:Text
Language:German
Termcat Digital Marketing

Terms for Digital Marketing

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
Galician
German
Italian
Portuguese
Spanish; Castilian
QTLeap LRT-M31-WP4

Treebanks and semantic lexicons for Basque, Bulgarian, Dutch, German and Portuguese. Created within European project QTLeap.

Resource Type:Corpus
Media Type:Text
Languages:Basque
Bulgarian
Dutch; Flemish
German
Luxembourg Museum Websites (de-en) (Processed)

Luxembourg Museum Websites (de-en) (Processed)

Resource Type:Corpus
Media Type:Text
Languages:English
French
German
Termcat Fairs and Congresses

Terms for Fairs and Congresses

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
German
Italian
Portuguese
Spanish; Castilian
BMI Brochures and Website 2016 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual tmx file of German to English translations of ...

Resource Type:Corpus
Media Type:Text
Languages:English
German
BMI Brochures 2011-2015 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...

Resource Type:Corpus
Media Type:Text
Languages:English
German
Termcat Economical Crisis

Economical Crisis terms

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
German
Italian
Portuguese
Spanish; Castilian
QTLeap News Corpus

This corpus is a sample extracted from the corpus made available by the annual workshops/conferences on Statistical Machine Translation (WMT, see \url{http://www.statmt.org/}) from the News domain. To this end, 1104 English sentences and their corresponding human translations into Czech, German a...

Resource Type:Corpus
Media Type:Text
Languages:Basque
Bulgarian
Czech
Dutch; Flemish
English
German
Portuguese
Spanish; Castilian
QTLeap Corpus V1.2

The QTLeap corpus is composed by 4000 question and answer pairs in the domain of computer and IT troubleshooting for both hardware and software. This material was collected using a support service via chat, this implies that the corpus is composed by naturally occurring utterances produced by use...

Resource Type:Corpus
Media Type:Text
Languages:Basque
Bulgarian
Czech
Dutch; Flemish
English
German
Portuguese
Spanish; Castilian

Order by:

Filter by: