EUROPARL Corpus Parallel Corpora: Portuguese-English

EUROPARL

The EUROPARL Corpus (subpart Portuguese-English of the parallel corpora), available at http://www.statmt.org/europarl/, was extracted from the proceedings of the European Parliament (Koehn, 2005). It contains transcriptions of sessions dating back from 1996 to 2011, in a total of approximately 58,324,562 tokens words of European Portuguese (L1) and 49,216,896 tokens of English (translation).

Download











      People who looked at this resource also viewed the following:
      People who downloaded this resource also downloaded the following:
      Resources from the same project
      Resources from the same creators