CINTIL-Definitions

The corpus presented here is a collection of several tutorials and scientific papers in the field of Information Technology with 603 annotated definitions from Portuguese. The texts were collected from the Web at the beginning of the 2006 and they are organised in 32 files of three different sub-domains with 268,064 tokens: Information Society (91,825 tokens), Information Technology (80,483 tokens), and e-Learning (94,756 tokens).

Download





  • Web




    • Question Answering (QA), Ontology learning, dictionary, and glossary construction.



    People who looked at this resource also viewed the following:
    People who downloaded this resource also downloaded the following:
    Resources from the same project
    Resources from the same creators