Search and Browse – PORTULAN CLARIN

XGLUE benchmark dataset

XGLUE is a new benchmark dataset to evaluate the performance of cross-lingual pre-trained models with respect to cross-lingual natural language understanding and generation. XGLUE is composed of 11 tasks spans 19 languages. For each task, the training data is only available in English. This me...

Resource Type:	Corpus
Media Type:	Text
Languages:	Arabic
	Bulgarian
	Chinese
	Dutch; Flemish
	English
	French
	German
	Greek, Modern (1453-)
	Hindi
	Italian
	Polish
	Portuguese
	Russian
	Spanish; Castilian
	Swahili
	Thai
	Turkish
	Urdu
	Vietnamese

Termcat Research Thesaurus

Terms of Research Thesaurus

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Catalan; Valencian
	English
	French
	German
	Italian
	Latin
	Portuguese
	Spanish; Castilian

Termcat Neoloteca

Terms that have (more or less) recently been accepted and normalised by Termcat, mixed fields

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Basque
	Catalan; Valencian
	English
	French
	Galician
	German
	Italian
	Latin
	Portuguese
	Spanish; Castilian

Termcat Industry

Industry terms

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Basque
	Catalan; Valencian
	English
	French
	German
	Italian
	Portuguese
	Spanish; Castilian

Termcat Fairs and Congresses

Terms for Fairs and Congresses

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Catalan; Valencian
	English
	French
	German
	Italian
	Portuguese
	Spanish; Castilian

Termcat Exotic Wood

Terms of Exotic Wood

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Catalan; Valencian
	English
	French
	German
	Italian
	Portuguese
	Spanish; Castilian

Termcat Economical Crisis

Economical Crisis terms

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Catalan; Valencian
	English
	French
	German
	Italian
	Portuguese
	Spanish; Castilian

Termcat Digital Marketing

Terms for Digital Marketing

Resource Type:	Lexical / Conceptual
Media Type:	Text
Languages:	Catalan; Valencian
	English
	French
	Galician
	German
	Italian
	Portuguese
	Spanish; Castilian

SIP Publications (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Publications from the Luxembourgish government edited by...

Resource Type:	Corpus
Media Type:	Text
Languages:	English
	French
	German

Parallel texts from Swedish Work environment Authority (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts from the Swedish Work Environment authori...

Resource Type:	Corpus
Media Type:	Text
Languages:	Bulgarian
	Czech
	English
	Estonian
	Finnish
	French
	German
	Greek, Modern (1453-)
	Hungarian
	Italian
	Latvian
	Lithuanian
	Polish
	Romanian
	Spanish; Castilian
	Swedish

Order by:

Filter by: