Search and Browse – PORTULAN CLARIN

Basque-English ParDeepBank

This resource is part of Deliverable 4.6 of the QTLeap FP7 project (contract number 610516). At its current stage of development (15% of the project's intended goal), it comprises 150 sentences (1,416 English tokens and 1,275 Basque tokens). The sentences are excerpts from journalistic text from...

Resource Type:	Corpus
Media Type:	Text
Languages:	Basque
	English

Elhuyar-QTLeap WSD/NED corpus

This corpus was created from documents in the translation memories of the Elhuyar Foundation (obtained via Eleka, a member of the Advisory Board of Potential Users).

Resource Type:	Corpus
Media Type:	Text
Languages:	Basque
	English

ixa-pipe-ned-ukb

ixa-pipe-ned-ukb is a multilingual Named Entity Disambiguation tool. It is based on UKB (http://ixa2.si.ehu.es/ukb/), a graph-based Word Sense Disambiguation tool, and runs over a graph built from the hyperlinks between Wikipedia articles. The input of the tool is ...

Resource Type:	Tool / Service
Languages:	Basque
	English

Basque to English Machine translation module

Technical Description:
http://qtleap.eu/wp-content/uploads/2015/05/Pilot1_technical_description.pdf
http://qtleap.eu/wp-content/uploads/2015/05/TechnicalDescriptionPilot2_D2.7.pdf
http://qtleap.eu/wp-content/uploads/2016/11/TechnicalDescriptionPilot3_D2.10.pdf

Resource Type:	Tool / Service
Languages:	Basque
	English

English to Basque Machine translation module

Resource Type:	Tool / Service
Languages:	Basque
	English

QTLeap Specialized lexicons

This resource comprises multilingual lexicon entries used for translating IT-domain expressions. The gazetteer was collected from four sources: the VLC, LibreOffice, and KDE localization projects, and IT-domain Wikipedia articles.

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Basque
	Czech
	English
	German
	Portuguese
	Spanish; Castilian

U-Compare Apertium Part-of-Speech Tagging Workflow

This workflow is designed specifically for use in the UIMA-based U-Compare workbench (see separate META-SHARE record). It is in "ucz" format (specific to U-Compare) and can be imported via the "Import Workflow" item in the "Workflows" menu of the U-Compare interface. It include...

Resource Type:	Tool / Service
Languages:	Basque
	Catalan; Valencian
	English
	Galician
	Portuguese
	Spanish; Castilian

Termcat Industry

Industry terms

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Basque
	Catalan; Valencian
	English
	French
	German
	Italian
	Portuguese
	Spanish; Castilian

Termcat Neoloteca

Terms from mixed fields that have been accepted and normalised by Termcat fairly recently.

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Basque
	Catalan; Valencian
	English
	French
	Galician
	German
	Italian
	Latin
	Portuguese
	Spanish; Castilian

UIMA/U-Compare Apertium Morphological Analyser

This tool tokenizes text and assigns all possible morphological analyses to each token. These analyses include the base form of the token, its part of speech, and information about number and gender. The morphological analyser is a module of the Apertium machine translation system. The provide...

Resource Type:	Tool / Service
Languages:	Basque
	Catalan
	English
	Galician
	Portuguese
	Spanish
