Diff for "CESCorpus"

Differences between revisions 11 and 12
Revision 11 as of 2013-01-24 16:20:08
Size: 1341
Editor: LukaszDrozdz
Comment:
Revision 12 as of 2013-01-24 16:20:34
Size: 1352
Editor: LukaszDrozdz
Comment:
Line 18 (revision 11): |||| ||'''675 699'''||'''769 639'''||
Line 18 (revision 12): ||'''Total'''|| ||'''675 699'''||'''769 639'''||

Manually aligned Polish-English parallel corpus of Centre for European Studies texts

This resource contains approximately 1 445 000 words in 58 Polish-English texts (6 CES reports, issues of CES Studies and 22 issues of the CES policy briefs "Point of View"). The texts were aligned manually at the sentence level using MemoQ, with care taken to represent non-trivial translation equivalence types[1].

The resource is provided as TEI P5-compliant XML files with custom extensions and in the XLIFF and TMX formats.
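
As an illustration of how the TMX version might be consumed, the following is a minimal sketch that reads sentence pairs from a TMX file using only the Python standard library. It assumes the generic TMX layout (`tu` units containing `tuv` variants with an `xml:lang` attribute and a `seg` child) and the language codes `pl` and `en`; the function name `read_tmx_pairs` and the inline sample are hypothetical and are not taken from the corpus files themselves.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="pl", tgt_lang="en"):
    """Return a list of (source, target) sentence pairs from a TMX file.

    `source` is a path or a binary file object. The language codes are
    assumptions; check the codes actually used in the corpus files.
    """
    tree = ET.parse(source)
    pairs = []
    for tu in tree.getroot().iter("tu"):
        segments = {}
        for tuv in tu.findall("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain "lang".
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # Keep only the primary subtag, so "en-GB" is read as "en".
                segments[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        if src_lang in segments and tgt_lang in segments:
            pairs.append((segments[src_lang], segments[tgt_lang]))
    return pairs


# Hypothetical two-language sample, not an excerpt from the corpus.
SAMPLE = b"""<tmx version="1.4">
  <header/>
  <body>
    <tu>
      <tuv xml:lang="pl"><seg>To jest zdanie.</seg></tuv>
      <tuv xml:lang="en-GB"><seg>This is a sentence.</seg></tuv>
    </tu>
  </body>
</tmx>"""

pairs = read_tmx_pairs(io.BytesIO(SAMPLE))
for polish, english in pairs:
    print(polish, "|", english)
```

Translation units that lack either language are skipped, which keeps the output strictly pairwise; whether the corpus contains such units is not stated here.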

The corpus is licensed under the CC-BY-NC license.

Contents:

||Original title||Translated title||Original words||Translation words||
||CES Studies|| ||391 782||491 413||
||Point of View|| || || ||
||CES reports|| || || ||
||'''Total'''|| ||'''675 699'''||'''769 639'''||

[1] A detailed account of the alignment procedure is available on the website of the PELCRA Research Group at the University of Łódź.