We present SETimes.HR ― the first linguistically annotated corpus of Croatian that is freely available for all purposes. The corpus is built on top of the SETimes parallel corpus of nine Southeast European languages and English. It is manually annotated for lemmas, morphosyntactic tags, named ent...
The PropBankPT (Branco et al., 2012) is a set of sentences annotated with their constituency structure and semantic role tags, composed of 3,406 sentences and 44,598 tokens taken from the Wall Street Journal translated. For the creation of this PropBank we adopted a semi-automatic analysis with...
Filter by:
Portugal (4)
Parsing (22)
Pos Tagging (8)
Text Mining (8)
Lemmatization (4)
Annotation (2)
Event Extraction (2)
Other (1)
Text (20)
Text/xml (4)
Plain text (1)