|Handle:||https://hdl.handle.net/21.11129/0000-000B-D34C-2 (persistent URL to this page)|
The DepBankPT (Branco et al., 2011a) is a corpus of grammatical dependencies of the translated news composed of 3,406 sentences and 44,598 tokens taken from the Wall Street Journal.
The DepBankPT is aligned to a constituency bank, the TreeBankPT (see Branco et al., 2011b). The key bridging elements are the grammatical function tags decoring the nodes, in the treebank, and the arcs, in the dependencybank (see http://lxcenter.di.fc.ul.pt/services/en/LXServicesSearcher.html). This means that the DepBankPT was extended from the PropBank PT so that besides the tags for the different dependency relations, the arcs are further decorated with tags indicating the semantic relation at stake.
The main motivation behind the creation of this resource was to build a high quality data set with dependency information that could support the development of a large set of automatic resources and tools for Portuguese for NLP studies.
The development of this resource started under the METANET4U project (at: http://metanet4u.eu/) whose main goal is to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, for speech and language processing, and supports a new generation of exchange facilities for them.
You may also be interested in the related resources DeepBankPT, TreeBankPT, PropBankPT and LogicalFormBankPT, also available from this repository.