Public Procurement Dataset 2 (Processed)
Handle: https://hdl.handle.net/21.11129/0000-000D-F8F9-3 (persistent URL to this page)
This dataset was created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project, see http://lr-coordination.eu.
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in the XLIFF format.
There are two publications in the collection:
a) Report on functioning of public procurement system in 2006 (raport_uzp_2006.xlf, 705 segments, 31141 words) and
b) Report on functioning of public procurement system in 2011 (raport_uzp_2010.xlf, 1434 segments, 73456 words).
The total size of the collection is 104597 words in 2139 parallel segments.
It was converted into an English-Polish resource of 1842 translation units (TUs) in TMX format.
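For readers who want to inspect the TMX resource, the following is a minimal Python sketch of how translation units could be read into sentence pairs with the standard library. It assumes the file follows the usual TMX layout (tu elements containing tuv elements with an xml:lang attribute and a seg child) and that the language codes are "en" and "pl"; neither the file name nor the codes are stated in this record, and the sample below is made up for illustration, not taken from the dataset.

```python
import io
import xml.etree.ElementTree as ET

# xml:lang lives in the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="en", tgt_lang="pl"):
    """Return (source, target) segment pairs from a TMX path or file-like object.

    The default language codes are assumptions, not taken from the record.
    """
    pairs = []
    root = ET.parse(source).getroot()
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            # Older TMX versions used a plain "lang" attribute.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() also collects text nested in inline markup.
                # Region suffixes such as "en-GB" are reduced to "en".
                segs[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


# Tiny made-up sample, not taken from the dataset.
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header srclang="en" datatype="plaintext" segtype="sentence"
          adminlang="en" creationtool="example" creationtoolversion="1"
          o-tmf="none"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Public procurement</seg></tuv>
      <tuv xml:lang="pl"><seg>Zamówienia publiczne</seg></tuv>
    </tu>
  </body>
</tmx>
""".encode("utf-8")

pairs = read_tmx_pairs(io.BytesIO(SAMPLE))
print(len(pairs), pairs[0])
```

Passing the path of the downloaded TMX file instead of the in-memory sample should work the same way; comparing len(pairs) with the stated number of TUs is a quick sanity check on the download.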