LexMan-ChunkerTokenizer
Handle: | https://hdl.handle.net/21.11129/0000-000D-F932-2 (persistent URL to this page) |
---|
LexMan-ChunkerTokenizer is a tokenizer and sentence splitter tool. Marks sentence boundaries, multi-word boundaries. Size: Lemmas verbs: 12 995; Lemmas nouns and adj: 38 180; Lemmas adverbs: 7 250; Compound words: 35 201. Language: Portuguese.
DownloadPeople who looked at this resource also viewed the following: