This corpus was run through BiRoamer https://github.com/bitextor/biroamer to anonymise the Portuguese-English parallel data from release 7 of the ParaCrawl project, specifically "Broader Web-Scale Provision of Parallel Corpora for European Languages". This version is filtered with BiCleaner with ...
Filter by:
English (31)
Portuguese (31)
Spanish; Castilian (16)
Czech (14)
Bulgarian (13)
German (12)
French (10)
Dutch; Flemish (9)
Italian (9)
Polish (7)
Slovak (7)
Basque (6)
Croatian (6)
Danish (6)
Estonian (6)
Finnish (6)
Hungarian (6)
Irish (6)
Latvian (6)
Lithuanian (6)
Maltese (6)
Romanian (6)
Slovenian (6)
Swedish (6)
Arabic (2)
Chinese (2)
Hindi (1)
Icelandic (1)
Russian (1)
Swahili (1)
Thai (1)
Turkish (1)
Urdu (1)
Vietnamese (1)
1996-2011 (1)
Portugal (1)
United Kingdom (1)