Parallel corpus from Parliament of Estonia (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus compiled from contents of website of Par...

Resource Type:Corpus
Media Type:Text
Languages:English, Estonian
Parallel corpus from Estonian Ministry of Foreign Affairs (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus from content of Estonian Ministry of For...

Resource Type:Corpus
Media Type:Text
Languages:English, Estonian
Parallel corpus from Estonian Cabinet of Ministers (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus composed from content of Estonian Cabine...

Resource Type:Corpus
Media Type:Text
Languages:English, Estonian
Parallel corpus from Bank of Estonia (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus from content of Bank of Estonia website ...

Resource Type:Corpus
Media Type:Text
Languages:English, Estonian
Parallel corpus (en-pl) from the Export Promotion Portal of Poland (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A parallel corpus constructed from data acquired from th...

Resource Type:Corpus
Media Type:Text
Languages:English, Polish
Parallel corpus (Bulgarian - English) in the public administration domain (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel (bg-en) corpus of 11262 translation units in th...

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian, English
ParaCrawl release 7 Portuguese-English

Portuguese-English parallel corpus from release 7 of the ParaCrawl project, specifically "Broader Web-Scale Provision of Parallel Corpora for European Languages". This version is filtered with BiCleaner with a threshold of 0.5. Data was crawled from the web following robots.txt, as is standard practice....

Resource Type:Corpus
Media Type:Text
Languages:English, Portuguese
Natolin European Centre Dataset (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The Polish-English parallel corpus is composed of three ...

Resource Type:Corpus
Media Type:Text
Language:Polish
National Health Fund Dataset (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The dataset is a 274K-token Polish-English parallel reso...

Resource Type:Corpus
Media Type:Text
Languages:English, Polish
N3-Collection

We publish three novel datasets, collectively called N3. N3 will be published in the NLP Interchange Format (NIF), which improves interoperability and removes the need for corpus-specific parsers. The data can be downloaded from our project homepage.

Resource Type:Corpus
Media Type:Text
Languages:English, German
