Bilingual concordancer is a language independent concordancer tool for bilingual concordancing, translation revision, post-editing, etc. Note that the tool is also able to be used as a monolingual concordancer. Several corpora are also included in this resource.
YamCha is a generic, customizable, and open source text chunker oriented toward a lot of NLP tasks, such as POS tagging, Named Entity Recognition, base NP chunking, and Text Chunking. We used it for NP chunking.
SENTER is a SENtence splitTER for Portuguese.
LX-NER is a freely available online service for the recognition of expressions for named entities in Portuguese. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-NER takes a segment of Portuguese text an...
LX-DepParser is a free online service for the syntactic analysis of Portuguese. It allows the automatic parsing of sentences in Portuguese in terms of the grammatical functions of their words. This service was developed and is maintained at the University of Lisbon by the NLX-Speech and Natural ...
LX-TimeAnalyzer is a freely available online service for the extraction of temporal information from Portuguese text. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-TimeAnalyzer extracts temporal inform...
CINTIL Corpus Concordancer is a freely available online concordancing service to support the research usage of the CINTIL Corpus. This concordancer was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics, in coopera...
Syntactic parser for English. Outputs dependency relations. Also outputs parts-of-speech for each token. The tool is provided as a UIMA component, specifically as Java archive (jar) file, which can be incorporated within any UIMA workflow. However, it is particularly designed use in the U-Com...
This is a UIMA wrapper for the OpenNLP Tokenizer tool. It splits English sentences into individual tokens. The tool forms part of the in-built library of components provided with the U-Compare platform (see separate META-SHARE record) for building and evaluating text mining workflows. The U-Comp...
LX Semantic Similarity is an online service for measuring the semantic similarity between words in Portuguese. This service uses the LX-DSemVectors, a distributional semantics model (a.k.a. word embeddings) of the Portuguese language. The model represents each word in its vocabulary by a vecto...