XGLUE benchmark dataset

XGLUE is a new benchmark dataset to evaluate the performance of cross-lingual pre-trained models with respect to cross-lingual natural language understanding and generation. XGLUE is composed of 11 tasks spans 19 languages. For each task, the training data is only available in English. This me...

Resource Type:Corpus
Media Type:Text
Languages:Arabic
Bulgarian
Chinese
Dutch; Flemish
English
French
German
Greek, Modern (1453-)
Hindi
Italian
Polish
Portuguese
Russian
Spanish; Castilian
Swahili
Thai
Turkish
Urdu
Vietnamese
U-Compare Lemmatisation service

Web service created by exporting UIMA-based workflow from the U-Compare text mining system. Functionality: Identifies sentences and tokens in plain text. Parts of speech and lemmas are assigned to tokens. Language is automatically identified amongst the supported languages and language-specific ...

Resource Type:Tool / Service
Languages:English
French
Romanian
Trilingual Documents related to International Judicial Cooperation in Civil Matters (Greek-English-French) (Processed)

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Trilingual (Greek-English-French) documents - standard f...

Resource Type:Corpus
Media Type:Text
Languages:English
French
Greek, Modern (1453-)
Termoteca

Terms from different sciences and industries - ecology, economy, law, sociology, medecine, tourism and computation.

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:English
French
Galician
Portuguese
Spanish; Castilian
Termcat Social Webs

Terms of Social Webs

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
Galician
Italian
Portuguese
Spanish; Castilian
Termcat Research Thesaurus

Terms of Research Thesaurus

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
German
Italian
Latin
Portuguese
Spanish; Castilian
Termcat Neoloteca

Terms that have (more or less) recently been accepted and normalised by Termcat, mixed fields

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Basque
Catalan; Valencian
English
French
Galician
German
Italian
Latin
Portuguese
Spanish; Castilian
Termcat Industry

Industry terms

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Basque
Catalan; Valencian
English
French
German
Italian
Portuguese
Spanish; Castilian
Termcat Fairs and Congresses

Terms for Fairs and Congresses

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
German
Italian
Portuguese
Spanish; Castilian
Termcat Exotic Wood

Terms of Exotic Wood

Resource Type:Lexical / Conceptual
Media Type:Text
Languages:Catalan; Valencian
English
French
German
Italian
Portuguese
Spanish; Castilian

Order by:

Filter by: