Search and Browse – PORTULAN CLARIN

Bulgarian-English Wikipedia WSD/NED corpus

Bulgarian-English Wikipedia WSD/NED corpus is composed of articles from the Bulgarian version of Wikipedia and their English counterparts.

Resource Type:	Corpus
Media Type:	Text
Languages:	Bulgarian
Languages:	English

Central Statistical Office Dataset (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Two Polish-English publications of the Polish Central St...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Polish

Civil Aviation Regulations (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A collection of parallel Polish-English texts published ...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Polish

Code-switched English-Spanish Tweets

This package contains the collection of tweets described in the LREC 2018 paper: "Collecting Code-Switched Data from Social Media", Gideon Mendels, Victor Soto, Aaron Jaech and Julia Hirschberg, LREC 2018. Please remember to cite this paper if you use this resource. The tagged_tweets_ids file con...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Spanish; Castilian

Compendium The Social Insurance Institution (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A compendium on the Polish Social Insurance Insitution (...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Polish

Convention against Torture and Other Cruel, Inhuman or Degrading Treatment or Punishment - United Nations (French-English-Greek) (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English text of the Convention against Torture and Other...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
	French
	Greek, Modern (1453-)

Convention on the transfer of sentenced persons (English - Greek) (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Convention, additional protocol on the convention, recom...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Greek, Modern (1453-)

Corpus of State-related content from the Latvian Web (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Latvian Web, home pages of ministries and state public s...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Latvian

Corpus on Finance and Economics from Bank of Latvia (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of web site https://makroekonomika.lv/ -- Latvi...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Latvian

COVID-19 ANTIBIOTIC dataset. Bilingual (EN-PT)

Bilingual (EN-PT) corpus acquired from the website https://antibiotic.ecdc.europa.eu/

Resource Type:	Corpus
Media Type:	Text
Languages:	English
Languages:	Portuguese

Order by:

Filter by: