This resource includes a spoken corpus of approximately 300,000 words, covering both formal (152,755 words) and informal (165,838 words) speech, with aligned sound and orthographic transcription and part-of-speech (POS) tags.
This resource includes a spoken Portuguese corpus, with aligned sound and orthographic transcription, collected from sociolinguistically diverse speakers. It consists of recordings of informal conversations.