MotaMot French-Khmer Pivot Database

French-Khmer pivot lexical database

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Central Khmer
French
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EASTIN-CL Multilingual Ontology of Assistive Technology ...

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:English
Estonian
LX-DSemVectors

LX-DSemVectors is distributional lexical semantics model, also known as word embeddings, for Portuguese (Rodrigues et al., 2016). This version, 2.2b, was trained on a corpus of 2 billion tokens and achieved state-of-the-art results on multiple lexical semantic tasks (Rodrigues & Branco, 2018). ...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Portuguese
English-Latvian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EASTIN-CL Multilingual Ontology of Assistive Technology ...

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:English
Latvian
PicName

PicName (see Castro et al., 1997, 1999; Gomes et al., 2006; Neves et al., 1995) is a picture-naming task that can be used to collect spontaneous speech samples and to measure articulation abilities in Portuguese-speaking children. It is an updated version of the Sounds-in-Words task included in t...

Resource Type:Lexical / Conceptual
Media Types:Text
Image
Language:Portuguese
English-Lithuanian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EASTIN-CL Multilingual Ontology of Assistive Technology ...

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:English
Lithuanian
Embeddings for Comparative Probing of Lexical Semantics Theories

Embeddings used in: Branco, António, João Rodrigues, Małgorzata Salawa, Ruben Branco and Chakaveh Saedi, 2020. Comparative Probing of Lexical Semantics Theories for Cognitive Plausibility and Technological Usefulness. In Proceedings of the International Conference on Computational Linguistics (C...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Portuguese
ViPER verb lexical database

ViPER is a verb lexical database with +7,000 verb senses, along with their structural, distributional, and transformational properties. The verb senses are classified based on the main syntactic properties of their construction. Around 70 formal classes have been devised. For each verb sense, its...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Portuguese
Port-AoA Words

Port-AoA Words (Cameirão & Vicente, 2010) is a lexical database containing 7 psycholinguistic characteristics (e.g. neighborhood density, written-word frequency, familiarity, imageability, etc). Standard adult vocabulary.

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Portuguese
AuCoPro - Splitting

The AuCoPro-Splitting dataset contains compounds annotated with their compound boundaries and linking morphemes. The dataset consists of two files, one for Afrikaans and one for Dutch. The annotation was performed according to annotation guidelines as described in Verhoeven, van Zaanen, van Huyss...

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Afrikaans
Dutch; Flemish

Order by:

Filter by:

Text (446)
Audio (18)
Image (1)