Code-switched English-Spanish Tweets

This package contains the collection of tweets described in the LREC 2018 paper: "Collecting Code-Switched Data from Social Media", Gideon Mendels, Victor Soto, Aaron Jaech and Julia Hirschberg, LREC 2018. Please remember to cite this paper if you use this resource. The tagged_tweets_ids file con...

Resource Type:Corpus
Media Type:Text
Languages:English
Spanish; Castilian
EUIPO - list of goods and services Spanish and English (Processed)  

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EUIPO list of goods and services format: TMX

Resource Type:Corpus
Media Type:Text
Languages:English
Spanish; Castilian
Spanish-English website parallel corpus (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. This is a parallel corpus of bilingual texts crawled fro...

Resource Type:Corpus
Media Type:Text
Languages:English
Spanish; Castilian
Memorias de traducción Portal oficial de turismo de España www.spain.info

Translation memory of the official tourism portal of Spain, www.spain.info

Resource Type:Corpus
Media Type:Text
Languages:English
French
German
Italian
Portuguese
Spanish; Castilian
Parallel texts from Swedish National Food Agency (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts in pdf file format. Original in Swedish, ...

Resource Type:Corpus
Media Type:Text
Languages:English
Finnish
French
Polish
Spanish; Castilian
Swedish
Parallel texts from Swedish Labour market agency (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts, all in pdf files, have been gathered fro...

Resource Type:Corpus
Media Type:Text
Languages:English
Finnish
French
German
Romanian
Spanish; Castilian
Swedish
Parallel texts from Swedish Labour market agency. Part 2 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Same as part 1, but with the Readme-file. (Processed)

Resource Type:Corpus
Media Type:Text
Languages:English
Finnish
French
German
Polish
Romanian
Spanish; Castilian
Swedish
Parallel corpora finely aligned (subsentential granularity)

Text corpus for bilingual concordancing, single- and multi-word translation extraction, machine translation. Languages: cs-pt, de-pt, en-pt, es-pt, fr-pt, it-pt, and pt-sk. Size: 1 G per language (phrases aligned). Domain: Law and Health.

Resource Type:Corpus
Media Type:Text
Languages:Czech
English
French
German
Italian
Portuguese
Slovak
Spanish; Castilian
Parallel texts from Swedish Social Security Authority (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts, email templates and forms in pdf file fo...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
Finnish
French
German
Italian
Polish
Romanian
Spanish; Castilian
Swedish
Parallel texts from Swedish Work environment Authority (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts from the Swedish Work Environment authori...

Resource Type:Corpus
Media Type:Text
Languages:Bulgarian
Czech
English
Estonian
Finnish
French
German
Greek, Modern (1453-)
Hungarian
Italian
Latvian
Lithuanian
Polish
Romanian
Spanish; Castilian
Swedish
