LXGram

LX-Gram is a grammar for the computational processing of Portuguese. It is being developed under the following major design features: - precision: it is a precision grammar delivering accurate, linguistically grounded information of natural language sentences - deep processing: it is a gram...

Resource Type:Tool / Service
LX-Syllabifier

LX-Syllabifier is a language processing tool for the syllabification of Portuguese text. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. LX-Syllabifier performs syllabification following a r...

Resource Type:Tool / Service
CoRef Resolution

A coreference solver for Portuguese and Spanish

Resource Type:Tool / Service
Reddit Dataset Extraction Tool

Reddit Dataset Extraction Tool (RDET) is a tool that takes advantage of the resources available at 'pushshift.io' that relate to Reddit comments and submissions and generates new datasets based on any given subreddit.

Resource Type:Tool / Service
LX-NER

LX-NER is a freely available online service for the recognition of expressions for named entities in Portuguese. It was developed and is maintained by the NLX-Natural Language and Speech Group at the University of Lisbon, Department of Informatics. LX-NER takes a segment of Portuguese text an...

Resource Type:Tool / Service
U-Compare Tokenisation service

Web service created by exporting UIMA-based workflow from the U-Compare text mining system. Functionality: Identifies sentences and tokens in plain text. Tools in workflow: Freeling sentence splitter web service (service provided by the PANACEA project), LX-Tokenizer (web service provided by th...

Resource Type:Tool / Service
Language:Portuguese
Czech to English Machine translation module

Technical Description: http://qtleap.eu/wp-content/uploads/2015/05/Pilot1_technical_description.pdf http://qtleap.eu/wp-content/uploads/2015/05/TechnicalDescriptionPilot2_D2.7.pdf http://qtleap.eu/wp-content/uploads/2016/11/TechnicalDescriptionPilot3_D2.10.pdf

Resource Type:Tool / Service
Languages:Czech
English
LexMan-POSTagger

LexMan-POSTagger is a morphological analyser tool that morphologically tags all words. Size: Lemmas verbs: 12 995; Lemmas nouns and adj: 38 180; Lemmas adverbs: 7 250; Compound words: 35 201. Language: Portuguese.

Resource Type:Tool / Service
Language:Portuguese
CINTIL-Treebank Online Searcher

CINTIL-Treebank Online Searcher is a freely available online service to search and view the constituency and dependency tree of the CINTIL-Treebank. This service was developed and is maintained at University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. ...

Resource Type:Tool / Service
LexMan-ChunkerTokenizer

LexMan-ChunkerTokenizer is a tokenizer and sentence splitter tool. Marks sentence boundaries, multi-word boundaries. Size: Lemmas verbs: 12 995; Lemmas nouns and adj: 38 180; Lemmas adverbs: 7 250; Compound words: 35 201. Language: Portuguese.

Resource Type:Tool / Service
Language:Portuguese

Order by:

Filter by:

English (35)
Basque (10)
Maltese (6)
Catalan (3)
Czech (3)
Spanish (3)
Bosnian (1)
French (1)
Serbian (1)
Slovak (1)
Welsh (1)
Grammar (1)
Tagger (1)
Yes (102)
No (15)
Text (88)
Audio (2)
Yes (13)
No (11)
Text (88)
Audio (2)