LX-AP was created from the translation of Almuhareb-Poesio (ap) benchmark (Almuhareb and Poesio, 2005). The original data set was created considering three aspects: POS, frequency and ambiguity. It contains 402 names from 21 categories of WordNet, with 13 to 21 names from each one of those categ...
Carolina is an open corpus for Linguistics and Artificial Intelligence with a robust volume of texts of varied typology in contemporary Brazilian Portuguese (1970-2021).
Filter by:
News (8)
Novels (6)
Test Suite (6)
General (2)
ECONOMICS (1)
Fiction (1)
General (1)
INDUSTRY (1)
Medical History (1)
News articles (1)
Political (1)
SOCIAL QUESTIONS (1)
Science (1)
Human Use (4)
Parsing (5)
Lexicon Access (4)
Annotation (3)
Pos Tagging (3)
Event Extraction (2)
Lemmatization (2)
Text Mining (2)
Summarisation (1)
Text Generation (1)
Text (32)