The LT Corpus (Literary Corpus) contains approximately 1,781,083 running words of European and Brazilian Portuguese. It includes 70 copyright-free classics (61 Portugal and 9 from Brazil) published before 1940.
The PTPARL Corpus contains approximately 975,806 running words of European Portuguese. It includes 1076 texts consisting of adapted transcriptions of the Portuguese parliament sessions, which were made available in 2004.
Web service created by exporting UIMA-based workflow from the U-Compare text mining system. Functionality: Identifies paragraphs in plain text Tools in workflow: MLRS Paragraph Splitter (University of Malta) NOTE: The licence provided covers the web service only. Tools used to create the workf...
LX-Translator is a freely available on-line service for translation between Portuguese and Chinese. This service was developed and is maintained at the University of Lisbon by the NLX-Natural Language and Speech Group of the Department of Informatics. Intrinsic evaluation of the model for the ...