The SIMPLE Portuguese Lexicon consists of 10,438 semantically encoded entries, following the PAROLE common encoding standards.
The Spoken Corpus Mozambique contains approximately 121,958 running words of spoken Portuguese from Mozambique. It includes 40 transcriptions of recordings (40 hours in total) made between 1986 and 1987.
Uplug (see Tiedemann, 2003a) is a collection of tools and scripts for processing text corpora, for automatic alignment, and for term extraction from parallel corpora. Several tools have been integrated into Uplug. Pre-processing tools include a sentence splitter, a general tokenizer and wrappers a...