Reddit Dataset Extraction Tool

Reddit Dataset Extraction Tool (RDET) is a tool that takes advantage of the resources available at 'pushshift.io' that relate to Reddit comments and submissions and generates new datasets based on any given subreddit.

Resource Type:Tool / Service
CoRef Resolution

A coreference solver for Portuguese and Spanish

Resource Type:Tool / Service
ComLinToo: The Computational Linguistics Toolset

The Computational Linguistics Toolset is a set of tools for computational linguistics. It contains re-usable code for cleaning, splitting, refining, and taking samples from corpora (ICE, Penn, and a native one), for tagging them using the TnT-tagger, for doing permutation statistics on N-grams (u...

Resource Type:Tool / Service
LX-USuite

LX-USuite is a tool for shallow processing of Portuguese that adopts the Universal Part-of-Speech (UPOS) tagset and Universal feature bundles, related to the Universal Dependency framework, with an initial performance of 99.06% for POS tagging, 98.75% for featurizer model, and 99.08% for the lemm...

Resource Type:Tool / Service
LX-Parser

LX-Parser is a freely available on-line service for constituency parsing of Portuguese sentences. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Parser performs a syntactic analysis of P...

Resource Type:Tool / Service
Monolingual concordancer

Monolingual concordancer is a language independent concordancer tool. Note that the tool is also able to be used as a bilingual concordancer. Several corpora are also included in this resource.

Resource Type:Tool / Service
Tell me Stories - Temporal Summarization framework

Conta-me Histórias [http://contamehistorias.pt] is a temporal summarization framework of news articles that allows users to explore and revisit events in the past. To select relevant stories of different time-periods, we rely on YAKE! [http://yake.inesctec.pt] a keyword extraction algorithm devel...

Resource Type:Tool / Service
LX-SRLabeler

LX-SRLabeler is a freely available on-line service for constituency parsing and semantic role labeling of Portuguese sentences. This service was developed and is maintained at University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-SRLabeler is su...

Resource Type:Tool / Service
LX-Conjugator

LX-Conjugator is a freely available online service for fully-fledged conjugation of Portuguese verbs. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-Conjugator takes a Portuguese infinitive verb form a...

Resource Type:Tool / Service
LX-Translator

LX-Translator is a freely available on-line service for translation between Portuguese and Chinese. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. Intrinsic evaluation of the model for the ...

Resource Type:Tool / Service

Order by:

Filter by:

Text (446)
Audio (18)
Image (1)