LX-Chunker

The present tool, that was built to deal with specific issues concerning orthographic conventions adopted for Portuguese, marks sentence boundaries with <s>…</s>, and paragraph boundaries with <p>…</p>. Unwraps sentences split over different lines. A f-score of 99.94% was obtained when testing o...

Resource Type:Tool / Service
Language:Portuguese
LX Semantic Similarity

LX Semantic Similarity is an online service for measuring the semantic similarity between words in Portuguese. This service uses the LX-DSemVectors, a distributional semantics model (a.k.a. word embeddings) of the Portuguese language. The model represents each word in its vocabulary by a vecto...

Resource Type:Tool / Service
UIMA/U-Compare OpenNLP Tokenizer

This is a UIMA wrapper for the OpenNLP Tokenizer tool. It splits English sentences into individual tokens. The tool forms part of the in-built library of components provided with the U-Compare platform (see separate META-SHARE record) for building and evaluating text mining workflows. The U-Comp...

Resource Type:Tool / Service
Language:English
UIMA/U-Compare GENIA Tagger

The GENIA tagger analyzes English sentences and outputs the base forms, part-of-speech tags, chunk tags, and named entity tags. The tagger is specifically tuned for biomedical text such as MEDLINE abstracts. The tool is provided as a UIMA component, which forms part of the in-built library of...

Resource Type:Tool / Service
Language:English
UIMA/U-Compare OpenNLP Sentence Detector

This is a UIMA wrapper for the OpenNLP Sentence Detector tool. It splits English text into individual sentences. The tool forms part of the in-built library of components provided with the U-Compare platform (see separate META-SHARE record) for building and evaluating text mining workflows. ...

Resource Type:Tool / Service
Language:English
YamCha: Yet Another Multipurpose CHunk Annotator

YamCha is a generic, customizable, and open source text chunker oriented toward a lot of NLP tasks, such as POS tagging, Named Entity Recognition, base NP chunking, and Text Chunking. We used it for NP chunking.

Resource Type:Tool / Service
U-Compare Segmentation Service

Web service created by exporting UIMA-based workflow from the U-Compare text mining system. Functionality: Identifies clauses/segments in plain text. Also identifies sentences, tokens, POS tags and lemmas. Tools in workflow: Cafetiere Sentence Splitter (University of Manchester), TTL Tokenizer...

Resource Type:Tool / Service
Language:Romanian
U-Compare syntactic chunking service

Web service created by exporting UIMA-based workflow from the U-Compare text mining system. Functionality: Identifies and categorises syntactic chunks in plain text Tools in workflow: Freeling shallow parser web service (service provided by the PANACEA project) NOTE: The licence provided cove...

Resource Type:Tool / Service
Language:Galician
CINTIL Corpus Concordancer

CINTIL Corpus Concordancer is a freely available online concordancing service to support the research usage of the CINTIL Corpus. This concordancer was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics, in coopera...

Resource Type:Tool / Service
U-Compare Discourse Parsing Service

Web service created by exporting UIMA-based workflow from the U-Compare text mining system. Functionality: Performs discourse parsing on plain text. Also identifies sentences, tokens, parts of speech, lemmas, clauses and coreference chains Tools in workflow: UAIC-POSTagger, UAIC-NPChunker, UAI...

Resource Type:Tool / Service
Language:Romanian

Order by:

Filter by:

Text (446)
Audio (18)
Image (1)