Illum Corpus

The full editions of ILLUM from 12/11/2006 to 30/05/2010 (185 issues).

Resource Type:Corpus
Media Type:Text
Language:Maltese
Maltese Acquis Communautaire

This is the Maltese version of the Acquis Communautaire (AC), the total body of European Union (EU) law applicable in the EU Member States. It consists of selected texts from the 1950s to the present, translated into Maltese.

Resource Type:Corpus
Media Type:Text
Language:Maltese
Laws of Malta - Maltese

The corpus contains the Laws of Malta in Maltese, taken from the official government website. The unannotated raw text files were extracted from the PDF files available on that website.

Resource Type:Corpus
Media Type:Text
Language:Maltese
Maltese Wikipedia

This corpus is part of the collection of Wikipedia dumps retrieved from wikipedia.org on April 8, 2010. It comes as two XML files, one containing the Wikipedia articles and the other containing their metadata.

Resource Type:Corpus
Media Type:Text
Language:Maltese
Maltese Wordlist

Wordlist for spell-checking

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Maltese
Maltese Speech Engine Database

Description

Resource Type:Corpus
Media Types:Text, Audio
Language:Maltese
MLRS Corpus

142,397 Maltese texts from 10 genres. The file “corpus.zip” expands into a folder “corpus”, containing the file “tagged.zip”, which expands into the folder “cwb.final”. This folder contains the files:
• filelist.txt
• malti02.academic.txt
• malti02.law.txt
• malti02.literature.txt
• malti...

Resource Type:Corpus
Media Type:Text
Language:Maltese
MALTESE AUTOMATIC COLLOCATIONS DICTIONARY

Lexical Computing Limited, October 2012

This is an Automatic Collocations Dictionary produced by Lexical Computing Limited, for delivery to the EU CESAR project. The method is:
• Take a corpus of the l...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Maltese
Maltese Fiction Wordlist

This is a wordlist which was created from 32 Maltese fiction books. These texts were originally in PDF file format and were converted to txt format. In the next step, the text file was tokenized and a frequency count was performed on the separate tokens. The resulting list (with about 50,000 entr...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Maltese
MLSS Tagger Web Service

The part of speech tagger for Maltese is based on TnT, the statistical part of speech tagger by Thorsten Brants (http://www.coli.uni-saarland.de/~thorsten/tnt/). It was modified for the Maltese Language Resource Server (MLRS) by Albert Gatt (Linguistics Department, University of Malta). The mode...

Resource Type:Tool / Service
Language:Maltese
