Parallel corpora finely aligned (subsentencial granularity)

Text corpus for bilingual concordancing, single- and multi-word translation extraction, machine translation. Languages: cs-pt, de-pt, en-pt, es-pt, fr-pt, it-pt, and pt-sk. Size: 1 G per language (phrases aligned). Domain: Law and Health.

Resource Type:Corpus
Media Type:Text
Languages:Czech
English
French
German
Italian
Portuguese
Slovak
Spanish; Castilian
Letter of rights for persons arrested and or detained (Processed)  

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Collection of transaltion units (1906 in total) in 21 la...

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
English
French
Greek, Modern (1453-)
Latvian
Polish
Romanian
Croatian-English corpus with studies on the challenges to the Croatian Accession to the European Union from the Croatian Institute of Public Finance website (Processed)  

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with studies on the challenges t...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
Bilingual hr-en parallel corpus from the Journal of the Croatian Association of Civil Engineers website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://casopis-gradjevinar.hr were crawled, ...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
Bilingual hr-en parallel corpus from the National and University Library in Zagreb website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://www.nsk.hr were crawled, aligned on d...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
Croatian-English corpus with statistical reports and studies from the Croatian Bureau of Statistics website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with statistical reports and stu...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
Hallituskausi 2007-2011 -- Finnish-English Translation Memory (Processed)

ID: ELRA-W0220 This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The "Hallituskausi 2007–2011" translat...

Resource Type:Corpus
Media Type:Text
Languages:English
Finnish
Parallel corpus from Estonian Cabinet of Ministers (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus composed from content of Estonian Cabine...

Resource Type:Corpus
Media Type:Text
Languages:English
Estonian
Parallel corpus from Bank of Estonia (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel corpus from content of Bank of Estonia website ...

Resource Type:Corpus
Media Type:Text
Languages:English
Estonian
DA-EN Danish Ministry of Higher Education and Science 3 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts Danish-English from the Danish Ministry o...

Resource Type:Corpus
Media Type:Text
Languages:Danish
English

Order by:

Filter by:

Text (445)
Audio (18)
Image (1)