ExtraGLUE-instruct

ExtraGLUE-instruct is a data set of task examples, instructions, and prompts that integrate instructions and examples, for both the European variant of Portuguese, spoken in Portugal, and the American variant of Portuguese, spoken in Brazil. For each variant, it contains over 170...

Resource Type: Corpus
Media Type: Text
Language: Portuguese

HiEve

A corpus of manually annotated event hierarchies in news stories.

Resource Type: Corpus
Media Type: Text
Language: English

DVPM-browser

DVPM-browser is a browser for the DVPM lexical database of medieval Portuguese.

Resource Type: Tool / Service

National Health Fund Dataset (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The dataset is a 274K-token Polish-English parallel reso...

Resource Type: Corpus
Media Type: Text
Languages: English, Polish

COVID-19 EUROPARL v2 dataset. Bilingual (EN-PT)

Bilingual (EN-PT) corpus acquired from the website (https://www.europarl.europa.eu/) of the European Parliament (9th May 2020)

Resource Type: Corpus
Media Type: Text
Languages: English, Portuguese

Luxembourg Museum Websites (de-en) (Processed)

Resource Type: Corpus
Media Type: Text
Languages: English, French, German

UIMA/U-Compare Stanford Parser

Syntactic parser for English that outputs dependency relations and a part-of-speech tag for each token. The tool is provided as a UIMA component, specifically as a Java archive (jar) file, which can be incorporated within any UIMA workflow. However, it is particularly designed for use in the U-Com...

Resource Type: Tool / Service
Language: English

BMI Brochures and Website 2016 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual TMX file of German-to-English translations of ...

Resource Type: Corpus
Media Type: Text
Languages: English, German

BMI Brochures 2011-2015 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...

Resource Type: Corpus
Media Type: Text
Languages: English, German

COVID-19 Parallel Global Voices dataset. Bilingual (EN-PT)

EN-PT Bilingual COVID-19-related corpus acquired from the website (https://globalvoices.org/) of GlobalVoices (28th April 2020)

Resource Type: Corpus
Media Type: Text
Languages: English, Portuguese