This resource includes a spoken corpus of approximately 300,000 words, covering both formal (152,755 words) and informal (165,838 words) speech, with audio aligned to the orthographic transcription and POS-tag information.
CINTIL (Corpus Internacional do Português) is a linguistically interpreted corpus of Portuguese. At present it comprises 1 million annotated tokens, verified by human expert annotators. The annotation covers part of speech, lemma and inflection for open-class words, multi-word expres...