Carolina: General Corpus of Contemporary Brazilian Portuguese with provenance and typology information

Carolina is an open corpus for Linguistics and Artificial Intelligence with a robust volume of texts of varied typology in contemporary Brazilian Portuguese (1970-2021).

Resource Type:Corpus
Media Type:Text
Language:Brazilian Portuguese
Manually annotated corpora for teaching and learning purposes of Brazilian Portuguese, Dutch, Estonian, and Slovene

These are manually annotated corpora for teaching and learning purposes of Brazilian Portuguese, Dutch, Estonian, and Slovene, as a contribution to the Manually Annotated Corpora Family available in CLARIN. Sentences are annotated with “problematic” or “non-problematic” labels, from the point of ...

Resource Type:Corpus
Media Type:Text
Languages:Brazilian Portuguese
Dutch
Estonian
Slovene
NomLex-BR

A computational lexicon for Portuguese that provides mappings between verbs and their nominalizations.

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Brazilian Portuguese
Portuguese

Order by: