QTLeap LRT-M31-WP4

Treebanks and semantic lexicons for Basque, Bulgarian, Dutch, German and Portuguese, created within the European project QTLeap.

Resource Type:Corpus
Media Type:Text
Languages:Basque
Bulgarian
Dutch; Flemish
German
UIMA Apertium Translator

This tool translates text from a source language into a target language. It operates on text that has already been tokenised, morphologically analysed and POS-tagged. Target-language tokens are assigned POS tags and morphological analyses. The Apertium Translator is a module of Apertium ma...

Resource Type:Tool / Service
Languages:Basque
Catalan
English
Galician
Portuguese
Spanish
Parallel corpus (en-pl) from the Export Promotion Portal of Poland (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A parallel corpus constructed from data acquired from th...

Resource Type:Corpus
Media Type:Text
Languages:English
Polish
MBT – Memory-Based Tagger-Generator and Tagger

MBT is a memory-based tagger-generator and tagger in one. The tagger-generator part can generate a sequence tagger on the basis of a training set of tagged sequences; the tagger part can tag new sequences. MBT can, for instance, be used to generate part-of-speech taggers or chunkers for natural l...

Resource Type:Tool / Service
EUIPO - list of goods and services German and English (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EUIPO list of goods and services. Format: TMX

Resource Type:Corpus
Media Type:Text
Languages:English
German
COVID-19 EUROPARL dataset v1. Bilingual (EN-PT)

Bilingual (EN-PT) corpus acquired from the website (https://www.europarl.europa.eu/) of the European Parliament (25th April 2020)

Resource Type:Corpus
Media Type:Text
Languages:English
Portuguese
MLSS Chunker

The MLSS (Maltese Language Software Services) Chunker is an online tool which looks for sequences of part-of-speech tags in a tagged text, using one or more patterns that capture the typical members of, say, a noun phrase, and returning a list of chunks from the text that match the searched patte...

Resource Type:Tool / Service
Language:Maltese
MLSS Paragraph Splitter Web service

The paragraph splitter is a web service tool which takes text as input and outputs the identified paragraphs surrounded by tags. The tool is language independent. The download for this resource only contains the narrative description in a Word file. The service has one method which can be invo...

Resource Type:Tool / Service
Language:Maltese
EUIPO - list of goods and services Spanish and English (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. EUIPO list of goods and services. Format: TMX

Resource Type:Corpus
Media Type:Text
Languages:English
Spanish; Castilian
Monolingual documents from the Government of Lithuania (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Monolingual documents received from the Government of th...

Resource Type:Corpus
Media Type:Text
Language:Lithuanian
