Summ-it

The corpus was developed as a linguistic resource for research on automatic summarization and its relation to other issues, in order to support studies on discourse processing. Summ-it consists of fifty texts from the science domain, extracted from the Science section of the Brazilian daily newspaper Folha de Sã...

Resource Type: Corpus
Media Type: Text
Language: Portuguese
Parallel texts from Swedish Social Security Authority (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts, email templates and forms in pdf file fo...

Resource Type: Corpus
Media Type: Text
Languages: Croatian
English
Finnish
French
German
Italian
Polish
Romanian
Spanish; Castilian
Swedish
Civil Aviation Regulations (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A collection of parallel Polish-English texts published ...

Resource Type: Corpus
Media Type: Text
Languages: English
Polish
English-Swedish parallel corpus from the web site of the Swedish Migration Board - Migrationsverket (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. All texts have been collected from the website of the ...

Resource Type: Corpus
Media Type: Text
Languages: English
Swedish
SIP Publications (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Publications from the Luxembourgish government edited by...

Resource Type: Corpus
Media Type: Text
Languages: English
French
German
Luxembourg Museum Websites (de-en) (Processed)

Resource Type: Corpus
Media Type: Text
Languages: English
French
German
Radio Bulgaria WSD/NED corpus

Radio Bulgaria WSD/NED corpus is composed of texts from Bulgarian and English articles from the website of Radio Bulgaria.

Resource Type: Corpus
Media Type: Text
Languages: Bulgarian
English
Parallel texts from Swedish Work Environment Authority (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts from the Swedish Work Environment authori...

Resource Type: Corpus
Media Type: Text
Languages: Bulgarian
Czech
English
Estonian
Finnish
French
German
Greek, Modern (1453-)
Hungarian
Italian
Latvian
Lithuanian
Polish
Romanian
Spanish; Castilian
Swedish
CINTIL DependencyBank PREMIUM

CINTIL DependencyBank PREMIUM is a corpus of Portuguese utterances manually annotated with grammatical dependency relations and with part-of-speech, inflection, and lemma information. It is being developed and maintained at the University of Lisbon. The current version is ...

Resource Type: Corpus
Media Type: Text
Language: Portuguese
EUIPO - IP case law French-English (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EUIPO - IP case law (BoA) French-English

Resource Type: Corpus
Media Type: Text
Languages: English
French
