The TreeBankPT (Branco et al., 2011) is a corpus of syntactic constituency trees for translated news text, comprising 3,406 sentences and 44,598 tokens taken from the Wall Street Journal. To create this treebank, we adopted a semi-automatic analysis with double-blind annotation followed...
The CINTIL-LogicalFormBank (Branco, 2009; Branco et al., 2011) is a corpus of semantic dependencies for sentences from Portuguese texts, comprising 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 sentences; 3,082...