Code-switched English-Spanish Tweets

This package contains the collection of tweets described in the LREC 2018 paper: "Collecting Code-Switched Data from Social Media", Gideon Mendels, Victor Soto, Aaron Jaech and Julia Hirschberg, LREC 2018. Please remember to cite this paper if you use this resource. The tagged_tweets_ids file con...

Resource Type:Corpus
Media Type:Text
Languages:English
Spanish; Castilian
Polish Food 4 & Food Policy Dataset (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A collection of Polish-English translations of the Polis...

Resource Type:Corpus
Media Type:Text
Languages:English
Polish
Macroeconomic Developments (Processed)  

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bulletins of Macroeconomic Developments

Resource Type:Corpus
Media Type:Text
Languages:English
Greek, Modern (1453-)
Bulgarian-English Wikipedia WSD/NED corpus

Bulgarian-English Wikipedia WSD/NED corpus is composed of articles from the Bulgarian version of Wikipedia and their English counterparts.

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
English
Radio Bulgaria WSD/NED corpus

Radio Bulgaria WSD/NED corpus is composed of texts from Bulgarian and English articles from the website of Radio Bulgaria.

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
English
PKN Orlen Dataset (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Dataset of the Polish public sector company PKN Orlen, a...

Resource Type:Corpus
Media Type:Text
Languages:English
Polish
Compendium The Social Insurance Institution (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A compendium on the Polish Social Insurance Insitution (...

Resource Type:Corpus
Media Type:Text
Languages:English
Polish
Czech Banking Association Terminology (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Terms in Czech - English relating to finance

Resource Type:Corpus
Media Type:Text
Languages:Czech
English
Laws of Malta (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Compilation of bilingual Maltese legislation (Maltese-En...

Resource Type:Corpus
Media Type:Text
Languages:English
Maltese
Maltese-English website parallel corpus (Processed)  

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. This is a parallel corpus of bilingual texts crawled fro...

Resource Type:Corpus
Media Type:Text
Languages:English
Maltese

Order by:

Filter by: