UEvora Tagger is a freely available on-line service for tagging sentences written in Portuguese. This service was developed and is maintained at the University of Évora by the VISTA - Video, Image, Speech, and Text Analysis Group of the Department of Informatics.
LX-Syllabifier is a language processing tool for the syllabification of Portuguese text. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Syllabifier performs syllabification following a r...
Bilingual concordancer is a language independent concordancer tool for bilingual concordancing, translation revision, post-editing, etc. Note that the tool is also able to be used as a monolingual concordancer. Several corpora are also included in this resource.
LX-USuite is a tool for shallow processing of Portuguese that adopts the Universal Part-of-Speech (UPOS) tagset and Universal feature bundles, related to the Universal Dependency framework, with an initial performance of 99.06% for POS tagging, 98.75% for featurizer model, and 99.08% for the lemm...
CINTIL Corpus Concordancer is a freely available online concordancing service to support the research usage of the CINTIL Corpus. This concordancer was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics, in coopera...
The OntoLP system is a plug-in for the construction environment of the ontologies Protégé. The plug-in intents to be an assistant for the engineer of ontologies for Portuguese during the execution of initial steps concerning the ontologies construction: extraction of terms which are candidates fo...
Monolingual concordancer is a language independent concordancer tool. Note that the tool is also able to be used as a bilingual concordancer. Several corpora are also included in this resource.
LX-Proficiency is an online service for the quantitative analysis of texts along a range of linguistic metrics, and for the estimation of the proficiency level of texts. These quantitative metrics are meant to provide support in the classification of texts according to the proficiency levels i...
The Computational Linguistics Toolset is a set of tools for computational linguistics. It contains re-usable code for cleaning, splitting, refining, and taking samples from corpora (ICE, Penn, and a native one), for tagging them using the TnT-tagger, for doing permutation statistics on N-grams (u...
LX-Lemmatizer is a freely available online service for fully-fledged lemmatization of Portuguese verbs. It was developed and is maintained at University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Lemmatizer takes a Portuguese verb form and deliv...