<?xml version='1.0' encoding='UTF-8'?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2026-05-21T20:31:06Z</responseDate>
  <request verb="GetRecord" metadataPrefix="olac" identifier="8209ff367fd011ec9b5802420a87011bc00c66db3fe94c6b9f0ce1d040550447">https://portulanclarin.net/repository/repository/oaipmh/</request>
  <GetRecord>
    <record>
      <header>
        <identifier>8209ff367fd011ec9b5802420a87011bc00c66db3fe94c6b9f0ce1d040550447</identifier>
        <datestamp>2022-07-11T00:57:15Z</datestamp>
        <setSpec>toolService</setSpec>
        <setSpec>toolService:tool</setSpec>
      </header>
      <metadata>
        <olac:olac xmlns:olac="http://www.language-archives.org/OLAC/1.1/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://www.language-archives.org/OLAC/1.1/dc.xsd http://purl.org/dc/terms/ http://www.language-archives.org/OLAC/1.1/dcterms.xsd http://www.language-archives.org/OLAC/1.1/ http://www.language-archives.org/OLAC/1.1/olac.xsd">
          <dc:title xml:lang="en">LX-UTagger</dc:title>
          <dc:description xml:lang="en">LX-UTagger is a POS tagger for Portuguese that adopts the Universal Part-of-Speech tagset (UPOS), related to the Universal Dependency framework, with an initial performance of 99.06% under a ten-fold cross validation scheme.

It is described in this article:

António Branco, João Ricardo Silva, Luís Gomes and João Rodrigues, 2022, "Universal Grammatical Dependencies for Portuguese with CINTIL Data, LX Processing and CLARIN support", In Proceedings, 13th Conference on Language Resources and Evaluation (LREC2022).

which should be used as its canonical citation, and which interested users are referred for detailed information.

This tagger is trained with its companion CINTIL-UPos corpus, with around 1 Million manually annotated tokens, which can be obtained here: https://hdl.handle.net/21.11129/0000-000E-8B30-F.

You may also be interested in the following related resources that can also be found in this repository:
LX-USuite (https://hdl.handle.net/21.11129/0000-000F-327C-E),
LX-UDParser (https://hdl.handle.net/21.11129/0000-000E-8B31-E),
LX-Suite (https://hdl.handle.net/21.11129/0000-000E-5991-A),
LX-Tagger (https://hdl.handle.net/21.11129/0000-000B-D325-D),
LX-DepParser (https://hdl.handle.net/21.11129/0000-000E-598D-0),
LX-Parser (https://hdl.handle.net/21.11129/0000-000E-5999-2).</dc:description>
          <dc:identifier xsi:type="dcterms:URI">https://hdl.handle.net/21.11129/0000-000E-8B2F-2</dc:identifier>
          <dc:type xsi:type="dcterms:DCMIType">Software</dc:type>
          <dc:subject>language resources, language independent tool</dc:subject>
          <dcterms:license>
	CC-BY-NC-ND
	Restrictions of Use: academic-nonCommercialUse, attribution, noDerivatives
	User Nature: academic
	</dcterms:license>
          <dcterms:rightsHolder>IPR Holder: António Branco, antonio.branco[at]di.fc.ul.pt, University of Lisbon, Faculty of Sciences </dcterms:rightsHolder>
          <dcterms:medium>downloadable</dcterms:medium>
          <dc:contributor xsi:type="olac:role" olac:code="depositor">António Branco, antonio.branco[at]di.fc.ul.pt, University of Lisbon, Faculty of Sciences</dc:contributor>
        </olac:olac>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
