CINTIL-Corpus Internacional do Português is a linguistically interpreted corpus of Portuguese. At present it is composed of 1 Million annotated tokens, verified by human expert annotators. The annotation comprises information on part-of-speech, open classes lemma and inflection, multi-word expres...
Carolina: General Corpus of Contemporary Brazilian Portuguese with provenance and typology information
Carolina is an open corpus for Linguistics and Artificial Intelligence with a robust volume of texts of varied typology in contemporary Brazilian Portuguese (1970-2021).
1970 -2002 (1)
Human Use (1)
Lexicon Access (1)
Pos Tagging (1)
Plain text / xml (1)