MalToBi/SPAN Corpus

Audio corpus: 8 subfolders of .wav files, each containing:
• 2 sound files containing a read story (“The sun and the wind”, one by speaker A and one by speaker B)
• 2 sound files each containing 30 read sentences (one by speaker A and one by speaker B)
• 2 x each of the 30 sentences as a single sound f...

Resource Type:Corpus
Media Type:Audio
Language:Maltese
MLRS Corpus

142,397 Maltese texts from 10 genres. The file “corpus.zip” expands into a folder “corpus”, containing the file “tagged.zip”, which expands into the folder “cwb.final”. This folder contains the files:
• filelist.txt
• malti02.academic.txt
• malti02.law.txt
• malti02.literature.txt
• malti...

Resource Type:Corpus
Media Type:Text
Language:Maltese
Maltese Fiction Wordlist

This is a wordlist which was created from 32 Maltese fiction books. These texts were originally in PDF file format and were converted to txt format. In the next step, the text file was tokenized and a frequency count was performed on the separate tokens. The resulting list (with about 50,000 entr...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Maltese
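
The tokenize-and-count step described for the Maltese Fiction Wordlist can be sketched as follows. This is a minimal illustration, not the procedure the resource's authors used: the function name build_wordlist, the letters-only tokenizer, and the lowercasing are all assumptions made for the example.

```python
import re
from collections import Counter


def build_wordlist(text: str) -> list[tuple[str, int]]:
    """Return (token, frequency) pairs, most frequent first.

    Assumption: a token is any run of Unicode letters, so Maltese
    characters such as ħ, ġ and ż are kept, while hyphens and
    apostrophes split tokens. The real wordlist may have used a
    different tokenizer.
    """
    # [^\W\d_]+ matches one or more letters (word characters minus digits and underscore).
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter(tokens)
    # Sort by descending frequency, then alphabetically for a stable order.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    sample = "Il-kelb u l-qattus. Il-kelb jiġri."
    for token, freq in build_wordlist(sample):
        print(f"{token}\t{freq}")
```

Whether to fold case, and how to treat the hyphenated article (il-, l-), changes the resulting counts, so these choices should be matched to the intended use of the list.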
Maltese Wordlist

Wordlist for spell-checking

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Maltese
Illum Corpus

The full editions of ILLUM from 12/11/2006 to 30/05/2010 (185 issues).

Resource Type:Corpus
Media Type:Text
Language:Maltese
Laws of Malta - Maltese

The corpus contains the Laws of Malta in Maltese, taken from the official government website. The unannotated raw text files were extracted from the PDF files available on that website.

Resource Type:Corpus
Media Type:Text
Language:Maltese
F_Mona_1/ Spoken Newspaper

108 WAV files of spoken Maltese newspaper texts, subdivided into 12 directories, each with a variable number of sentences (sometimes clauses). They come with transcriptions and tables of phoneme durations.

Resource Type:Corpus
Media Type:Audio
Language:Maltese
Chinese Open Wordnet

We are creating a large-scale, freely available semantic dictionary of Mandarin Chinese: the Chinese Open Wordnet, inspired by the Princeton WordNet and the Global WordNet Grid. All relations (hypernyms, meronyms ...) come from Princeton WordNet 3.0. We have enriched the synsets with Chinese lex...

Resource Type:Lexical / Conceptual
Media Type:Text
Language:Mandarin Chinese
