Treat is a toolkit for natural language processing and computational linguistics in Ruby. The Treat project aims to build a language- and algorithm- agnostic NLP framework for Ruby with support for tasks such as document retrieval, text chunking, segmentation and tokenization, natural language pa...
The present tool, that was built to deal with Portuguese-specific issues concerning a few non-trivial cases that involve tokenization-ambigous strings, segments text into lexically relevant tokens, using whitespace as the separator. Note that, in these examples, the | (vertical bar) symbol is use...
Filter by:
Portugal (4)
Parsing (22)
Pos Tagging (8)
Text Mining (8)
Lemmatization (4)
Annotation (2)
Event Extraction (2)
Other (1)
Text (20)
Text/xml (4)
Plain text (1)