Handle: https://hdl.handle.net/21.11129/0000-000D-FE92-0 (persistent URL to this page)
Corpus of mixed language (French, German,Luxemburguish) sentences from {sc Chamber} (House of Parliament) debate reports manually annotated at segment level with 6 labels : Lux, Fre, Ger, Lux + Fre, Lux + Ger, Lux + Fre + Ger
Back
Download
Distribution Licence CC - BY - SA
Restrictions: Attribution, Share Alike
Download location: hidden
Contact Person
Multilingual text corpus Languages
German
French
Luxembourgish; Letzeburgesch
Linguality Linguality type: Multilingual
Size
Metadata Created: 09/29/2020
Last Updated: 11/18/2020
Usage Foreseen Use Nlp Applications Use NLP Specific: Language Identification
Documentation
Document Type: In Proceedings
Thomas Lavergne, Gilles Adda, Martine Adda-Decker and Lori Lamel,
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish ,
, 2014
Editor: Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis
Publisher: European Language Resources Association (ELRA)
Book Title: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
ISBN: 978-2-9517408-8-4
Document Language:
English