This corpus is a sample extracted from the corpus made available by the annual workshops/conferences on Statistical Machine Translation (WMT, see \url{http://www.statmt.org/}) from the News domain. To this end, 1104 English sentences and their corresponding human translations into Czech, German a...
QTLeap WSD/NED corpus This corpora is part of Deliverable 5.5 of the European Commission project QTLeap FP7-ICT-2013.4.1-610516 (http://qtleap.eu). The texts are Q&A interactions from the real-user scenario (batches 1 and 2). The interactions in this corpus are available in Basque, Bulgar...
Filter by:
Portuguese (12)
Spanish; Castilian (12)
English (12)
Bulgarian (11)
Czech (11)
German (9)
Dutch; Flemish (8)
French (7)
Italian (7)
Polish (6)
Slovak (6)
Basque (5)
Croatian (5)
Danish (5)
Estonian (5)
Finnish (5)
Hungarian (5)
Irish (5)
Latvian (5)
Lithuanian (5)
Maltese (5)
Romanian (5)
Slovenian (5)
Swedish (5)
Arabic (1)
Chinese (1)
Hindi (1)
Icelandic (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)
Vietnamese (1)