Romanian - English New Criminal Procedure Code (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The New Civil Procedure Code in Romanian and English (bi...

Resource Type:Corpus
Media Type:Text
Languages:English
Romanian
Maltese Acquis Communautaire

This is the Maltese version of the Acquis Communautaire (AC), which is the total body of European Union (EU) law applicable in the EU Member States. It consists of selected texts between the 1950s and today, translated to Maltese.

Resource Type:Corpus
Media Type:Text
Language:Maltese
Laws of Malta - Maltese

The corpus contains the Laws of Malta in Maltese from the official government website. The unannotated raw text files were extracted from the pdf files that can be found on the website.

Resource Type:Corpus
Media Type:Text
Language:Maltese
Bilingual corpus made out of PDF documents from the European Medicines Agency, (EMEA), https://www.ema.europa.eu, (February 2020) (EN-PT)

EN-PT Bilingual corpus made out of PDF documents from the European Medicines Agency, (EMEA), https://www.ema.europa.eu, (February 2020).

Resource Type:Corpus
Media Type:Text
Languages:English
Portuguese
Portuguese-English bilingual corpus from the Portuguese Constitution (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Complete text of the Portuguese Constitution in Portugue...

Resource Type:Corpus
Media Type:Text
Languages:English
Portuguese
Portuguese-English bilingual corpus from Legislation concerning the Portuguese Parliament (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Legislation concerning Portuguese Parliament; three bili...

Resource Type:Corpus
Media Type:Text
Languages:English
Portuguese
Financial Stability Reports from the National Bank of Poland (2015-16) (Processed)  

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Financial Stability Reports from the National Bank of Po...

Resource Type:Corpus
Media Type:Text
Languages:English
Polish
LX-Battig

The LX-Battig was created from Battig test.set (Baroni et al., 2010). This data set has 83 concrete concepts of the following 10 categories: mammals, birds, fish, vegetables, fruit, trees, vehicles, clothes, tools and kitchenware. The categories names and the concepts were translated by two trans...

Resource Type:Corpus
Media Type:Text
Language:Portuguese
CIPM-POS

CIPM-POS is a set of historical, religious, notarial, literary texts in prose and verse, written is medieval portuguese. It contains around 88000 words.

Resource Type:Corpus
Media Type:Text
Language:Portuguese
Georeferenced Tweets

Tweets annotated with geographic coordinates

Resource Type:Corpus
Media Type:Text
Language:English

Order by:

Filter by:

Text (428)
Audio (17)
Image (1)