Revision 15 as of 2013-01-24 16:32:59

CESCorpus

Manually aligned Polish-English parallel corpus of texts from the Centre for European Studies

This resource contains 1 445 000 words in 58 Polish-English texts (6 CES reports, issues of CES Studies and 22 issues of the CES policy briefs "Point of View"). The texts were aligned manually at the sentence level using the MemoQ software, with care taken to represent non-trivial translation equivalence types[1].

The resource is provided as TEI P5-compliant XML files with custom extensions and in the XLIFF and TMX formats.
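As a rough illustration of how the TMX version might be consumed, the following sketch reads aligned sentence pairs using only the Python standard library. It assumes the usual TMX layout (`tu` units containing `tuv` variants with an `xml:lang` attribute and a `seg` child) and the language codes `pl` and `en`; the function name `read_tmx_pairs` and the inline sample are hypothetical, and the actual corpus files may differ in detail.

```python
import io
import xml.etree.ElementTree as ET

# Expanded name of the xml:lang attribute as ElementTree reports it.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

def read_tmx_pairs(source, src_lang="pl", tgt_lang="en"):
    """Yield (source_segment, target_segment) pairs from a TMX file.

    `source` is a path or a binary file-like object. The language
    codes are assumptions and may need adjusting for the real files.
    """
    tree = ET.parse(source)
    for tu in tree.getroot().iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain lang attribute.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # Normalise e.g. "pl-PL" to "pl" and flatten inline markup.
                segs[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        if src_lang in segs and tgt_lang in segs:
            yield segs[src_lang], segs[tgt_lang]

# Tiny illustrative document (not taken from the corpus files themselves).
SAMPLE_TMX = b"""<tmx version="1.4">
  <header srclang="pl" segtype="sentence"/>
  <body>
    <tu>
      <tuv xml:lang="pl"><seg>Rosja w kryzysie: rok pierwszy</seg></tuv>
      <tuv xml:lang="en"><seg>Russia in crisis: year one</seg></tuv>
    </tu>
  </body>
</tmx>"""

if __name__ == "__main__":
    for pl, en in read_tmx_pairs(io.BytesIO(SAMPLE_TMX)):
        print(pl, "|", en)
```

Passing a file path instead of the `io.BytesIO` object should work the same way, since `ET.parse` accepts either.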

The corpus is licensed under the CC-BY-NC license.

Contents:

| Original title | Translated title | Original words | Translation words |
|---|---|---|---|
| CES Studies | | 391 782 | 491 413 |
| Point of View | | 166 161 | 130 601 |
| CES reports: | | | |
| Ekspansja Gazpromu w UE – kooperacja czy dominacja | Gazprom’s expansion in the EU: co-operation or domination? | 5 017 | 6 276 |
| Problemy z gazem łupkowym w Bułgarii, Czechach i Rumunii | Shale gas in Bulgaria, the Czech Republic and Romania | 6 431 | 8 635 |
| Rosja w kryzysie: rok pierwszy | Russia in crisis: year one | 15 894 | 19 798 |
| Skutki rozszerzenia obszaru Schengen dla wschodnich sąsiadów Unii Europejskiej | Consequences of the Schengen area enlargement for the EU’s Eastern European Neighbours | 7 178 | 8 728 |
| Turcja po rozpoczęciu negocjacji z Unią Europejską - relacje zagraniczne i polityka wewnętrzna | Turkey after the start of negotiations with the European Union – foreign relations and the domestic situation | 44 383 | 55 174 |
| Wyzwania migracyjne w państwach wschodniego sąsiedztwa Unii Europejskiej | Migration challenges in the European Union’s Eastern Neighbourhood | 38 853 | 49 014 |
| CES reports (subtotal) | | 117 756 | 147 625 |
| Total | | 675 699 | 769 639 |

[1] A detailed account of the alignment procedure is provided at the website of the PELCRA Research Group at the University of Łódź.