GENIA Event Corpus with meta-knowledge annotation

The corpus consists of 1000 MEDLINE abstracts. It is a subset of the original GENIA POS & term corpus, which was selected using the three MeSH terms human, blood cells and transcription factors. In each sentence, three types of information are annotated 1) biomedical terms are identified and assigned categories from the GENIA term ontology. 2) event structures are identified and assigned categories from the GENIA event ontology. 3) Thirdly, detailed information is annotated about how the event should be interpreted, according to its textual context. We call this information meta-knowledge.

Contact Resource Maintainer





People who looked at this resource also viewed the following: