Dataset of Nuanced Assertions on Controversial Issues (NAoCI dataset)

The Dataset of Nuanced Assertions on Controversial Issues (NAoCI) consists of over 2,000 assertions on sixteen different controversial issues. It has over 100,000 judgments of whether people agree or disagree with the assertions, and about 70,000 judgments indicating how strongly peopl...

Resource Type:Corpus
Media Type:Text
Language:English
SIP Publications (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Publications from the Luxembourgish government edited by...

Resource Type:Corpus
Media Type:Text
Languages:English, French, German
Bilingual hr-en parallel corpus from the Journal of the Croatian Association of Civil Engineers website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://casopis-gradjevinar.hr were crawled, ...

Resource Type:Corpus
Media Type:Text
Languages:Croatian, English
General Romanian-English bilingual corpus (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Romanian-English corpus built from a Wikipedia dump.

Resource Type:Corpus
Media Type:Text
Languages:English, Romanian
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with the Rural Development Progr...

Resource Type:Corpus
Media Type:Text
Languages:Croatian, English
English-Estonian corpus from Finnish Information Bank (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. http://www.infopankki.fi - Finland in your language - In...

Resource Type:Corpus
Media Type:Text
Languages:English, Estonian
Inquirições reais

Royal inquiries of 1258 (primarily published in the Portugaliae Monumenta Historica).

Resource Type:Corpus
Media Type:Text
Language:Portuguese
CIPM-POS

CIPM-POS is a set of historical, religious, notarial, and literary texts in prose and verse, written in medieval Portuguese. It contains around 88,000 words.

Resource Type:Corpus
Media Type:Text
Language:Portuguese
