Search and Browse – PORTULAN CLARIN

Radio Bulgaria WSD/NED corpus

The Radio Bulgaria WSD/NED corpus is composed of Bulgarian and English articles from the Radio Bulgaria website.

Resource Type:	Corpus
Media Type:	Text
Languages:	Bulgarian, English

GREC

GREC is a semantically annotated corpus of 240 MEDLINE abstracts (167 on the subject of E. coli species and 73 on the subject of the Human species). It is intended for training IE systems and/or resources that are used to extract events from biomedical literature.

Resource Type:	Corpus
Media Type:	Text
Language:	English

Hesita-POS

Hesita-POS is an annotated corpus of TV news.

Resource Type:	Corpus
Media Type:	Text
Language:	Portuguese

LX-Battig

LX-Battig was created from the Battig test set (Baroni et al., 2010). This data set has 83 concrete concepts from the following 10 categories: mammals, birds, fish, vegetables, fruit, trees, vehicles, clothes, tools and kitchenware. The category names and the concepts were translated by two trans...

Resource Type:	Corpus
Media Type:	Text
Language:	Portuguese
