Bilingual hr-en parallel corpus from Croatian National Bank website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://www.hnb.hr were crawled, aligned on d...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
Corpus of State-related content from the Latvian Web (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Latvian Web, home pages of ministries and state public s...

Resource Type:Corpus
Media Type:Text
Languages:English
Latvian
Bilingual hr-en parallel corpus from Croatian Mine Action website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://www.hcr.hr website downloaded, aligne...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
Parallel corpus from Parliament of Estonia (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus compiled from contents of website of Par...

Resource Type:Corpus
Media Type:Text
Languages:English
Estonian
Bilingual Bulgarian-English corpus from the National Revenue Agency (BG) (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Bulgarian-English corpus of administrative doc...

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
English
Bilingual resource with Bulgarian strategic documents in the field of telecommunications and broadband (Bulgarian - English) (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual collection of documents in the field of teleco...

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
English
UIMA/U-Compare OpenNLP POS Tagger

This is a UIMA wrapper for the OpenNLP Tokenizer tool. It assigns part-of-speech tags to tokens in English text. The tagset used in from the Penn Treebank). The tool forms part of the in-built library of components provided with the U-Compare platform (Kano et al., 2009; Kano et al., 2011; see se...

Resource Type:Tool / Service
Language:English
The CIEMPIESS Proper-Names Pronouncing Dictionary

Transcriptions in the CIEMPIESS-PNPD are based on a phonetic alphabet called Mexbet. Mexbet was design for the Spanish of Central Mexico and it has several levels of granularity. The CIEMPIESS-PNPD comes in two versions: Mexbet T29 and Mexbet T66. Level T29 of Mexbet means that transcriptions ...

Resource Type:Corpus
Media Type:Text
Language:Spanish; Castilian
The Wixarika-Spanish Parallel Corpus

Wixarika is an indigenous language spoken in central west Mexico by approximately fifty thousand people. For indigenous languages like Wixarika, there is a lack of digital resources in general since native speakers do not necessarily generate a digital fingerprint on public forums. The lack of...

Resource Type:Corpus
Media Type:Text
Language:Spanish; Castilian
Bilingual collection of reports of the Greek Public Power Corporation (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A bilingual collection of translation units extracted fr...

Resource Type:Corpus
Media Type:Text
Languages:English
Greek, Modern (1453-)

Order by:

Filter by: