Bilingual collection of reports of the Greek Public Power Corporation (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A bilingual collection of translation units extracted fr...

Resource Type:Corpus
Media Type:Text
Languages:English
Greek, Modern (1453-)
Bilingual Croatian-English Parallel Corpus (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Croatian-English Parallel Corpus of 21340 tran...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
BMVI Publications (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. TMX file with 11555 TUs, bilingual German/English, publi...

Resource Type:Corpus
Media Type:Text
Languages:English
German
BMI Brochures 2011-2015 (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...

Resource Type:Corpus
Media Type:Text
Languages:English
German
Corpus de Produções Escritas de Aprendentes de PL2 (PEAPL2)

A Portuguese as a non-native language learners' corpus of written texts with three independent subcorpora: - Portuguese as a Foreign Language: Subcorpus Português Língua Estrangeira (PEAPL2_PLE) http://teitok2.iltec.pt/peapl2-ple/index.php?action=home - East Timorese Portuguese: Subcorpus T...

Resource Type:Corpus
Media Type:Text
Language:Portuguese
CINTIL DependencyBank PREMIUM

CINTIL DependencyBank PREMIUM is a corpus of Portuguese utterances manually annotated with the representation of grammatical dependency relations and the information of part-of-speech, inflection and lemmas. It is being developed and maintained at the University of Lisbon. The current version is ...

Resource Type:Corpus
Media Type:Text
Language:Portuguese
CINTIL-TreeBank

The CINTIL-TreeBank (Branco et al., 2011) is a corpus of syntactic constituency trees of Portuguese texts composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 sentences; 3,082 tokens). In addition, there are ...

Resource Type:Corpus
Media Type:Text
Language:Portuguese
Archivo dos Açores, dir. Ernesto do Canto, 1.ª série, Ponta Delgada, Vol. 1-12

A publicação Arquivo dos Açores, consagrada como obra de referência para a investigação histórica sobre o arquipélago dos Açores, conta com duas séries, num total de 20 volumes. A primeira série do Arquivo dos Açores, composta por 15 volumes, decorreu entre 1878 e 1959, com grandes interrupções r...

Resource Type:Corpus
Media Type:Text
Language:Portuguese
TreeBankPT

The TreeBankPT (Branco et al., 2011) is a corpus of syntactic constituency trees of the translated news composed of 3,406 sentences and 44,598 tokens taken from the Wall Street Journal. For the creation of this TreeBank we adopted a semi-automatic analysis with a double-blind annotation followed...

Resource Type:Corpus
Media Type:Text
Language:Portuguese
FLY corpus - morpho

FLY Corpus is a corpus composed by 2000 informal letters written in Portuguese, in the years spanning from 1900 to 1974, in the context of war, migration, imprisonment and exile. Each letter is in an XML file with two main parts: (a) the header, which contains metadata about the document (the ...

Resource Type:Corpus
Media Type:Text
Language:Portuguese

Order by:

Filter by:

Text (445)
Audio (18)
Image (1)