Monolingual concordancer is a language-independent concordancing tool that can also be used as a bilingual concordancer. Several corpora are included with this resource.
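As a rough illustration of what a concordancer produces, the following is a minimal keyword-in-context (KWIC) sketch. It is not the implementation of the tool described above. The function name `kwic`, the `width` parameter, and the sample sentence are all hypothetical, and tokenization is reduced to a simple regular expression.

```python
import re


def kwic(text, keyword, width=3):
    """Return keyword-in-context rows as (left context, match, right context).

    Hypothetical helper for illustration only; matching is case-insensitive
    and `width` is the number of context tokens kept on each side.
    """
    # Naive tokenization: runs of word characters (Unicode-aware for str input).
    tokens = re.findall(r"\w+", text)
    rows = []
    for i, tok in enumerate(tokens):
        if tok.lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            rows.append((left, tok, right))
    return rows


# Made-up sample text, not taken from any of the included corpora.
sample = "The cat sat on the mat. The dog chased the cat around the yard."
for left, hit, right in kwic(sample, "cat"):
    print(f"{left:>20} | {hit} | {right}")
```

Because the sketch relies only on word-character tokenization, it makes no language-specific assumptions, which is the sense in which a concordancer of this kind can be language independent.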
LX-NER is a freely available online service for the recognition of expressions for named entities in Portuguese. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-NER takes a segment of Portuguese text an...
LX-Suite is a freely available online service for the shallow processing of Portuguese. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-Suite is composed of a set of shallow processing tools: - LX Sente...
LX-DepParser is a free online service for the syntactic analysis of Portuguese. It allows the automatic parsing of sentences in Portuguese in terms of the grammatical functions of their words. This service was developed and is maintained at the University of Lisbon by the NLX-Speech and Natural ...
LX-Proficiency is an online service for the quantitative analysis of texts along a range of linguistic metrics, and for the estimation of the proficiency level of texts. These quantitative metrics are meant to provide support in the classification of texts according to the proficiency levels i...
LX-Quantitative is an online service for the quantitative analysis of texts along a range of linguistic metrics. This service is based on automatic text processing tools. Hence, the results it returns may not always be fully correct. Its high accuracy rate, though, allows it to provide a useful...
The purpose of the U-Compare platform is to facilitate easy and rapid development and evaluation of NLP and text mining systems. It includes utilities (including a graphical user interface, the U-Compare workbench, see separate record in META-SHARE) to create workflows from individual, interopera...
This tool segments text into lexically relevant tokens, using whitespace as the separator. It was built to handle Portuguese-specific issues in a few non-trivial cases that involve tokenization-ambiguous strings. Note that, in these examples, the | (vertical bar) symbol is use...
MBT is a memory-based tagger-generator and tagger in one. The tagger-generator part can generate a sequence tagger on the basis of a training set of tagged sequences; the tagger part can tag new sequences. MBT can, for instance, be used to generate part-of-speech taggers or chunkers for natural l...
Part-of-speech tagger tuned to biomedical text. The tool is provided as a UIMA component, which forms part of the in-built library of components provided with the U-Compare platform (see separate META-SHARE record) for building and evaluating text mining workflows. The U-Compare Workbench (se...