The present tool, that was built to deal with Portuguese-specific issues concerning a few non-trivial cases that involve tokenization-ambigous strings, segments text into lexically relevant tokens, using whitespace as the separator. Note that, in these examples, the | (vertical bar) symbol is use...
The GENIA tagger analyzes English sentences and outputs the base forms, part-of-speech tags, chunk tags, and named entity tags. The tagger is specifically tuned for biomedical text such as MEDLINE abstracts.
Filter by:
Written Language (12)
Spoken Language (2)
Tool Service (12)
Tool (12)
Tagger (1)
Corpus (12)
Text (12)
Lemmatization (3)
Segmentation (1)
Yes (1)
Text (12)
Text numerical (1)