Czech Association of Medical Physicists - Physics Glossary (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A dictionary of 3281 terms relating to physics for medic...

Resource Type: Corpus
Media Type: Text
Languages: Czech, English

Khresmoi Query Translation Test Data for the Medical Domain version 1.0

This package contains data sets for developing and testing machine translation of short medical search queries between Czech, English, French, and German. The queries come from the general public and from medical experts.

Resource Type: Corpus
Media Type: Text
Languages: Czech, English, French, German

The Coimisineir Teanga Bilingual Corpus of Reports and Press Releases (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Reports and Press Release data from the Language Commiss...

Resource Type: Corpus
Media Type: Text
Languages: English, Irish

The Coimisineir Teanga Bilingual Corpus of Reference Documents (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. General Reference content from the Language Commissioner...

Resource Type: Corpus
Media Type: Text
Languages: English, Irish

Monolingual documents from the Government of Lithuania (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Monolingual documents received from the Government of th...

Resource Type: Corpus
Media Type: Text
Language: Lithuanian

The Gaois bilingual corpus of English-Irish legislation (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual corpus of English-Irish legislation provided b...

Resource Type: Corpus
Media Type: Text
Languages: English, Irish

Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Slovenian-English corpus with statistical reports from t...

Resource Type: Corpus
Media Type: Text
Languages: English, Slovenian

Secretariat-General parallel corpus SL-EN and EN-SL (part 1) (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English-Slovenian parallel corpus in TMX format from the...

Resource Type: Corpus
Media Type: Text
Languages: English, Slovenian

Secretariat-General parallel corpus SL-EN and EN-SL (part 2) (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English-Slovenian parallel corpus in TMX format from the...

Resource Type: Corpus
Media Type: Text
Languages: English, Slovenian

An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis

An Arabic Twitter data set of 7,503 tweets. The released data contains manual sentiment analysis annotations as well as automatically extracted features, saved in Comma-Separated Values (CSV) and Attribute-Relation File Format (ARFF) files. Due to Twitter privacy restrictions we replaced the orig...

Resource Type: Corpus
Media Type: Text
Language: Arabic
