FEUP CoRef

FEUP CoRef is a freely available online service for coreference resolution in Portuguese and Spanish. This service was developed and is maintained at the Faculdade de Engenharia da Universidade do Porto Department of Informatics.

Resource Type:Tool / Service
SenseClusters

SenseClusters is a package of (mostly) Perl programs that allows a user to cluster similar contexts together using unsupervised knowledge-lean methods.

Resource Type:Tool / Service
LX-Parser

LX-Parser is a freely available on-line service for constituency parsing of Portuguese sentences. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Parser performs a syntactic analysis of P...

Resource Type:Tool / Service
LX-USuite

LX-USuite is a tool for shallow processing of Portuguese that adopts the Universal Part-of-Speech (UPOS) tagset and Universal feature bundles, related to the Universal Dependency framework, with an initial performance of 99.06% for POS tagging, 98.75% for featurizer model, and 99.08% for the lemm...

Resource Type:Tool / Service
LX-Syllabifier

LX-Syllabifier is a language processing tool for the syllabification of Portuguese text. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Syllabifier performs syllabification following a r...

Resource Type:Tool / Service
Tell me Stories - Temporal Summarization framework

Conta-me Histórias [http://contamehistorias.pt] is a temporal summarization framework of news articles that allows users to explore and revisit events in the past. To select relevant stories of different time-periods, we rely on YAKE! [http://yake.inesctec.pt] a keyword extraction algorithm devel...

Resource Type:Tool / Service
SENTER

SENTER is a SENtence splitTER for Portuguese.

Resource Type:Tool / Service
Language:Portuguese
UIMA/U-Compare GENIA Tokeniser (GENIA Tagger)

Tokenisation is one of the functionalities of the GENIA tagger, which additionally outputs the base forms, part-of-speech tags, chunk tags, and named entity tags. The tagger is specifically tuned for biomedical text such as MEDLINE abstracts. The tool is a UIMA component, which forms part of th...

Resource Type:Tool / Service
Language:English
RudriCo-TOK

RudriCo-TOK is a tokenizer tool that splits contractions. De-contraction rules: 178.

Resource Type:Tool / Service
Language:Portuguese
ixa-pipe-coref-eu

ixa-pipe-coref-eu is a Basque coreference resolution tool, which is an adaptation of Stanford Deterministic Coreference Resolution (http://www-nlp.stanford.edu/downloads/dcoref.shtml). This tool reads a text document annotated with lemmas, named entities and constituents formated in Natural La...

Resource Type:Tool / Service
Language:Basque

Order by:

Filter by:

English (35)
Basque (10)
Maltese (6)
Catalan (3)
Czech (3)
Spanish (3)
Bosnian (1)
French (1)
Serbian (1)
Slovak (1)
Welsh (1)
Grammar (1)
Tagger (1)
Yes (102)
No (15)
Text (88)
Audio (2)
Yes (13)
No (11)
Text (88)
Audio (2)