U-Compare Apertium Part-of-Speech Tagging Workflow
Handle: https://hdl.handle.net/21.11129/0000-000B-D35F-D (persistent URL to this page)
This workflow is designed specifically for use in the UIMA-based U-Compare workbench (see separate META-SHARE record). It is distributed in "ucz" format (specific to U-Compare) and can be imported via the "Import Workflow" item in the "Workflows" menu of the U-Compare interface. It includes the "Apertium Mopho" and "Apertium POS" UIMA components, which are not part of U-Compare's component library. These two components are part of the Apertium Machine Translation system.
The purpose of the workflow is to perform tokenisation, morphological analysis and part-of-speech tagging on plain text.
The provided workflow currently operates on a subset of the languages supported by the Apertium system, namely: English, Spanish, Catalan, Galician, Portuguese and Basque.