U-Compare Apertium Part-of-Speech Tagging Workflow

Handle:	https://hdl.handle.net/21.11129/0000-000B-D35F-D (persistent URL to this page)
URL:	http://www.nactem.ac.uk/ucompare/
URL:	http://www.apertium.org/

This is a workflow that is designed especially for use in the UIMA-based U-Compare workbench (see separate META-SHARE record). The workflow is in "ucz" format (specific to U-Compare) and can be imported via the "Import Workflow" item in the "Workflows" menu of the U-Compare interface. It includes the "Apertium Mopho" and "Apertium POS" UIMA components, that are not part of U-Compare's component library. These two components are part the the Apertium Machine Translation system.

The purpose of the workflow is to perform tokenisation, morphological analysis and part of speech tagging on plain text.

The provided workflow can currently operate on a subset of the languages that are supported by the Apertium system, namely: English, Spanish, Calatan, Galician, Portuguese and Basque.

Download

DistributionLicence

GPL

User Nature: Academic

Distribution Access/Medium: Downloadable

Contact Person

Sophia Ananiadou

University of Manchester

Professor

[javascript protected email address]

School of Computer Science

[javascript protected email address]

Tool/Service

Tool

Language Dependent

Input

Media type: Text

Resource type: Corpus

Modality: Written Language

Language: English, Spanish; Castilian, Portuguese, Catalan; Valencian, Galician, Basque

Character encoding: UTF - 8

Output

Media type: Text

Resource type: Corpus

Modality: Written Language

Language: English, Spanish; Castilian, Portuguese, Catalan; Valencian, Galician, Basque

Character encoding: UTF - 8

Annotation type: Lemmatization, Morphosyntactic Annotation - Pos Tagging, Structural Annotation

Segmentation level: Word

Operation

Operating system: Os - Independent

Resource Creation

Resource Creator

University of Manchester

School of Computer Science

University of Manchester

[javascript protected email address]

Funding Project

METANET4U - Enhancing the Linguistic Infrastructure of Europe (METANET4U)

Funding Type: Eu Funds

Metadata

Created: 02/01/2013

Last Updated: 02/15/2013

Metadata Creator

Paul Thompson

University of Manchester

Research Associate

[javascript protected email address]

School of Computer Science

[javascript protected email address]

Usage

Foreseen UseNlp Applications

Use NLP Specific: Morphological Analysis, Pos Tagging, Text Mining

Documentation

Tool Documentation: Online

Document Type: Other

Paul Thompson, U-Compare Apertium Part-of-Speech Tagging Workflow, http://www.nactem.ac.uk/meta-net/Narratives/U-Compare_Apertium_Part_of_Speech_Tagging_workflow.pdf

People who looked at this resource also viewed the following:

People who downloaded this resource also downloaded the following:

Resources from the same project

Resources from the same creators