English-Latvian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EASTIN-CL Multilingual Ontology of Assistive Technology ...

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:English
Latvian
UIMA/U-Compare Enju parser

Syntactic parser for English. Outputs predicate-argument structures. Also outputs base forms for each token. The tool is provided as a UIMA component, which forms part of the in-built library of components provided with the U-Compare platform (see separate META-SHARE record) for building and...

Resource Type:Tool / Service
Language:English
GENIA Tagger

The GENIA tagger analyzes English sentences and outputs the base forms, part-of-speech tags, chunk tags, and named entity tags. The tagger is specifically tuned for biomedical text such as MEDLINE abstracts.

Resource Type:Tool / Service
Language:English
LX-DSemVectors

LX-DSemVectors is distributional lexical semantics model, also known as word embeddings, for Portuguese (Rodrigues et al., 2016). This version, 2.2b, was trained on a corpus of 2 billion tokens and achieved state-of-the-art results on multiple lexical semantic tasks (Rodrigues & Branco, 2018). ...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Portuguese
DVPM-SynSem

DVPM-SynSem is a lexical database with syntactic and semantic information in Medieval Portuguese. It contains around 3000 verbs.

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Portuguese
Enju parser

Enju is a syntactic parser for English. The grammar used by the parser is based on Head Driven Phrase Structure Grammar (HPSG). Enju can analyse syntactic/semantic structures of English sentences can output phrase structure and predicate-argument structures.

Resource Type:Tool / Service
Language:English
LX-UTagger

LX-UTagger is a POS tagger for Portuguese that adopts the Universal Part-of-Speech tagset (UPOS), related to the Universal Dependency framework, with an initial performance of 99.06% under a ten-fold cross validation scheme. It is described in this article: António Branco, João Ricardo Silv...

Resource Type:Tool / Service
Language:Portuguese
SENTER

SENTER is a SENtence splitTER for Portuguese.

Resource Type:Tool / Service
Language:Portuguese
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with the Rural Development Progr...

Resource Type:Corpus
Media Type:Text
Languages:Croatian
English
English-Finnish corpus from Finnish Information Bank (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. http://www.infopankki.fi - Finland in your language - In...

Resource Type:Corpus
Media Type:Text
Languages:English
Finnish

Order by:

Filter by:

Text (446)
Audio (18)
Image (1)