Public Procurement Dataset 1 (Processed)
|Handle:||https://hdl.handle.net/21.11129/0000-000D-F8F8-4 (persistent URL to this page)|
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu.
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in the XLiFF format.
There are two publications in the collection:
a) Report on functioning of public procurement system in 2009 (raport_uzp_2009.xlf, 1495 segments 65237 words) and
b) Report on functioning of public procurement system in 2010 (raport_uzp_2010.xlf, 1188 segments, 58684 words).
The total size of the collection is 123 921 words in 2683 parallel segments.
It was converted into a 1578-TUs English-Polish resource in TMX format.