Reddit Dataset Extraction Tool (RDET) is a tool that takes advantage of the resources available at 'pushshift.io' that relate to Reddit comments and submissions and generates new datasets based on any given subreddit.
A coreference solver for Portuguese and Spanish
The Computational Linguistics Toolset is a set of tools for computational linguistics. It contains re-usable code for cleaning, splitting, refining, and taking samples from corpora (ICE, Penn, and a native one), for tagging them using the TnT-tagger, for doing permutation statistics on N-grams (u...
LX-USuite is a tool for shallow processing of Portuguese that adopts the Universal Part-of-Speech (UPOS) tagset and Universal feature bundles, related to the Universal Dependency framework, with an initial performance of 99.06% for POS tagging, 98.75% for featurizer model, and 99.08% for the lemm...
LX-Parser is a freely available on-line service for constituency parsing of Portuguese sentences. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Parser performs a syntactic analysis of P...
Monolingual concordancer is a language independent concordancer tool. Note that the tool is also able to be used as a bilingual concordancer. Several corpora are also included in this resource.
Conta-me Histórias [http://contamehistorias.pt] is a temporal summarization framework of news articles that allows users to explore and revisit events in the past. To select relevant stories of different time-periods, we rely on YAKE! [http://yake.inesctec.pt] a keyword extraction algorithm devel...
LX-SRLabeler is a freely available on-line service for constituency parsing and semantic role labeling of Portuguese sentences. This service was developed and is maintained at University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-SRLabeler is su...
LX-Conjugator is a freely available online service for fully-fledged conjugation of Portuguese verbs. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-Conjugator takes a Portuguese infinitive verb form a...
LX-Translator is a freely available on-line service for translation between Portuguese and Chinese. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. Intrinsic evaluation of the model for the ...