CINTIL-PropBank

Handle: https://hdl.handle.net/21.11129/0000-000B-D300-6 (persistent URL to this page)

The CINTIL-PropBank (Branco et al., 2012) is a corpus of sentences annotated with their constituency structure and semantic role tags. It comprises 10,039 sentences and 110,166 tokens from different sources and domains: 8,861 sentences (101,430 tokens) from news, 399 sentences (3,082 tokens) from novels, and 779 sentences (5,654 tokens) used for regression testing of the computational grammar that supported the annotation of the corpus.
This PropBank was created through semi-automatic analysis, with double-blind annotation followed by adjudication. The resulting dataset contains three information levels: phrase constituency, grammatical functions, and phrase semantic roles.
The main motivation for creating this resource was to build a high-quality dataset with semantic information that could support the development of automatic semantic role labelers for Portuguese.
You may also be interested in the related resources CINTIL-TreeBank, CINTIL-DeepBank, CINTIL-DependencyBank and CINTIL-LogicalFormBank, also available from this repository.
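As a rough illustration of the three information levels mentioned above (phrase constituency, grammatical function, semantic role), the following minimal Python sketch splits a node label into those levels. The hyphen-joined layout CATEGORY-FUNCTION-ROLE, the example tags NP, SJ and ARG1, and the names NodeLabel and parse_label are assumptions made for illustration only; they are not taken from this page. The sample file and the narrative description listed under Documentation are the place to check the actual format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class NodeLabel:
    """One tree-node label split into the three annotation levels."""
    constituency: str                # phrase category, e.g. "NP" (assumed tag)
    function: Optional[str] = None   # grammatical function, e.g. "SJ" (assumed tag)
    role: Optional[str] = None       # semantic role, e.g. "ARG1" (assumed tag)


def parse_label(label: str) -> NodeLabel:
    """Split a hyphen-joined label into its levels.

    Assumed (hypothetical) layout: CATEGORY[-FUNCTION[-ROLE]].
    Missing or empty trailing levels become None.
    """
    parts = [part or None for part in label.split("-")]
    if parts[0] is None or len(parts) > 3:
        raise ValueError(f"unexpected label: {label!r}")
    # Pad to exactly three fields so absent levels are represented as None.
    parts += [None] * (3 - len(parts))
    return NodeLabel(*parts)


if __name__ == "__main__":
    print(parse_label("NP-SJ-ARG1"))
    print(parse_label("VP"))
```

Keeping the three levels as separate fields makes it easy to filter nodes by one level, for example collecting every node that carries a semantic role, without re-parsing label strings.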

Distribution

Licence

MS - NC - No ReD - ND

Licensors:

António Branco

http://www.di.fc.ul.pt/~ahb/

University of Lisbon, Faculty of Sciences

FCUL

Associate Professor with Habilitation

Faculdade de Ciências de Lisboa, Departamento de Informática. Campo Grande, 1749-016 Lisboa, Portugal

1749-016 Lisbon

Tel.: +351 217 500 087

Fax: +351 217 500 084

Department of Informatics

http://nlx.di.fc.ul.pt/

FCUL

Faculdade de Ciências de Lisboa, Departamento de Informática. Campo Grande, 1749-016 Lisboa, Portugal

1749-016 Lisbon

Portugal

Tel.: +351 217 500 087

Fax: +351 217 500 084

Distribution rights holders:

António Branco

IPR Holder

University of Lisbon, Faculty of Sciences

http://nlx.di.fc.ul.pt/

Department of Informatics

University of Lisbon, Faculty of Sciences

FCUL

Faculdade de Ciências de Lisboa, Departamento de Informática. Campo Grande, 1749-016 Lisboa, Portugal

1749-016 Lisbon

Portugal

Tel.: +351 217 500 087

Fax: +351 217 500 084

Contact Person

António Branco

text

Monolingual text corpus

Languages

Portuguese (10,140 Sentences)

Linguality

Linguality type: Monolingual

Text Format

text/xml (10,140 Sentences)

Size

110,166 Tokens

10,040 Sentences

Character encoding

UTF-8 (10,140 Sentences)

Domains

Novels (403 Sentences)

News (8,952 Sentences)

Test Suite (785 Sentences)

Modalities

Written Language

Geographic coverage

Portugal (10,140 Sentences)

Creation

Creation mode: Mixed

Resource Creation

Resource Creator

António Branco

Funding Project

SemanticShare - Resources and Tools for Semantic Processing (SemanticShare - FCT/PTDC/PLP/81157/2006)

URL: http://nlx.di.fc.ul.pt/projects.html

Funding Type: National Funds

Funder: FCT - Fundação para a Ciência e Tecnologia

Funding Country: Portugal

Project duration: 06/01/2006 - 12/31/2010

Metadata

Created: 06/01/2012

Last Updated: 01/06/2021

Source: METANET4U

META-SHARE

Metadata Language: English

Metadata Creator

Catarina Carvalheiro

http://nlx-server.di.fc.ul.pt/~catarina/

University of Lisbon, Faculty of Sciences

FCUL

Researcher

Departamento de Informática NLX - Grupo de Fala e Linguagem Natural, Faculdade de Ciências da Universidade de Lisboa, Edifício C6

1749-016 Lisbon

Tel.: +351 217 500 087

Fax: +351 217 500 084

Version

Version: 1

Last Updated: 06/01/2012

Usage

Foreseen Use - NLP Applications

Use NLP Specific: Parsing, Semantic Role Labelling

Actual Use - NLP Applications

Use NLP Specific: Parsing, Semantic Role Labelling

Documentation

Tool Documentation: Online

Samples Location: https://portulanclarin.net/repository/extradocs/propbanksample.txt

Document Type: Other

Catarina Carvalheiro, CINTIL PropBank Narrative Description, http://portulanclarin.net/repository/extradocs/CINTIL-Propbank.pdf, 2012

Document Type: In Proceedings

António Branco; Catarina Carvalheiro; Sílvia Pereira; Mariana Avelãs; Clara Pinto; Sara Silveira; Francisco Costa; João Silva; Sérgio Castro, A PropBank for Portuguese: the CINTIL-PropBank, http://www.di.fc.ul.pt/~ahb/#publications, Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Document Language: English
