The CINTIL-WordSenses corpus, built upon the CINTIL International Corpus of Portuguese (Barreto et al., 2006), is composed of 23,825 sentences of written Portuguese with open-class terms manually disambiguated and annotated with synset identifiers from the Portuguese MultiWordNet (MWNPT) (Pianti ...
The DeepBankPT (Branco et al., 2010) is a corpus of semantic dependencies of translated texts composed of 3,406 sentences and 44,598 tokens taken from the Wall Street Journal. The DeepBankPT is composed of MRS and AVM representations, derivation tree, and syntactic tree with grammatical and se...
The DepBankPT (Branco et al., 2011a) is a corpus of grammatical dependencies of translated news texts composed of 3,406 sentences and 44,598 tokens taken from the Wall Street Journal. The DepBankPT is aligned to a constituency bank, the TreeBankPT (see Branco et al., 2011b). The key bridging eleme...
The LogicalFormBankPT (Branco, 2009; Branco et al., 2011) is a corpus of semantic dependencies of translated texts composed of 3,406 sentences and 44,598 tokens taken from the Wall Street Journal. The LogicalFormBankPT is composed of MRS representations of each sentence’s semantic relation...
This tool, built to handle the specific orthographic conventions of Portuguese, marks sentence boundaries with <s>…</s> and paragraph boundaries with <p>…</p>. It also unwraps sentences split over different lines. An F-score of 99.94% was obtained when testing o...
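To make the kind of output described above concrete, the following is a minimal Python sketch of boundary marking. It is not the tool's implementation: the function name `mark_boundaries`, the small abbreviation list, and the rule that blank lines separate paragraphs are all assumptions made for illustration, and the sketch ignores many cases a real Portuguese chunker must handle (quotation marks, ellipses, dialogue dashes, and so on).

```python
import re

# Hypothetical and deliberately tiny abbreviation list (an assumption for this
# sketch); a real tool would need a much larger, curated inventory.
ABBREVIATIONS = {"Sr.", "Sra.", "Dr.", "Dra.", "Prof."}


def mark_boundaries(text: str) -> str:
    """Wrap paragraphs in <p>...</p> and sentences in <s>...</s>."""
    marked = []
    # Assumption: one or more blank lines separate paragraphs.
    for paragraph in re.split(r"\n\s*\n", text.strip()):
        # Splitting on any whitespace also unwraps sentences that were
        # broken across several lines.
        tokens = paragraph.split()
        sentences, current = [], []
        for token in tokens:
            current.append(token)
            # A token ending in ., ! or ? closes the sentence unless it is
            # a known abbreviation.
            if token[-1] in ".!?" and token not in ABBREVIATIONS:
                sentences.append(" ".join(current))
                current = []
        if current:
            # Keep any trailing material that lacks final punctuation.
            sentences.append(" ".join(current))
        marked.append("<p>" + "".join(f"<s>{s}</s>" for s in sentences) + "</p>")
    return "\n".join(marked)


if __name__ == "__main__":
    sample = "O Sr. Silva chegou\nontem. Está bem?\n\nSim."
    print(mark_boundaries(sample))
```

In this sketch the period in "Sr." does not close a sentence because the token is in the abbreviation list, while the line break between "chegou" and "ontem." is removed when the paragraph is re-joined.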
LX-Conjugator is a freely available online service for fully-fledged conjugation of Portuguese verbs. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-Conjugator takes a Portuguese infinitive verb form a...
LX-DepParser is a free online service for the syntactic analysis of Portuguese. It allows the automatic parsing of sentences in Portuguese in terms of the grammatical functions of their words. This service was developed and is maintained at the University of Lisbon by the NLX-Speech and Natural ...
LX-Lemmatizer is a freely available online service for fully-fledged lemmatization of Portuguese verbs. It was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Lemmatizer takes a Portuguese verb form and deliv...
LX-NER is a freely available online service for the recognition of expressions for named entities in Portuguese. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-NER takes a segment of Portuguese text an...
LX-Proficiency is an online service for the quantitative analysis of texts along a range of linguistic metrics, and for the estimation of the proficiency level of texts. These quantitative metrics are meant to provide support in the classification of texts according to the proficiency levels i...