Diff for "PolishDiscourseCorpus"

Differences between revisions 4 and 8 (spanning 4 versions)
Revision 4 as of 2020-12-30 15:19:35
Size: 916
Comment:
Revision 8 as of 2022-02-01 10:47:55
Size: 996
Comment:
Deletions are prefixed with "-". Additions are prefixed with "+".
Line 4:
- The Polish Discourse Corpus is a corpus of discourse relations based on the [[PCC|Polish Coreference Corpus]] as part of the [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL]] project.
+ The following corpus of discourse relations is based on the [[PCC|Polish Coreference Corpus]] as part of the [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL]] project. The annotation of the corpus was completed using [[Discann|Discann annotation tool]].
Line 8:
- Please see the [[attachment:instrukcja-anotacji-metatekstu.pdf|annotation instructions]], in Polish.
+ Please see the [[attachment:instrukcja-anotacji-metatekstu.pdf|annotation instructions]], in Polish (by Celina Heliasz).
Line 16:
- == Downloads ==
+ == Download ==
Line 22:
- == Citing ==
- Please cite:
+ == Publication ==

Polish Discourse Corpus / Polski Korpus Metatekstowy

This corpus of discourse relations, built as part of the CLARIN-PL project, is based on the Polish Coreference Corpus. The corpus was annotated using the Discann annotation tool.

Documentation

Please see the annotation instructions by Celina Heliasz (in Polish).

Licence

Creative Commons Attribution 3.0 Unported License

Download

The corpus is available for download in the form of a zip file containing:

  • 1773 source XML TEI files of the Polish Coreference Corpus
  • metatext.xml file containing descriptions of all relations
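
As a quick sanity check after downloading, the archive contents listed above can be inspected with a short script. This is only a minimal sketch: the archive filename and the function name are hypothetical, and because the internal structure of metatext.xml is not described here, the script makes no assumptions about it and simply counts element tags.

```python
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter
from pathlib import PurePosixPath


def summarize_corpus_archive(zip_path, relations_name="metatext.xml"):
    """Count the XML files in the archive and the element tags in the relations file.

    The function name and return format are illustrative only; they are not
    part of the corpus distribution.
    """
    with zipfile.ZipFile(zip_path) as archive:
        # Every archive member whose name ends in .xml, at any directory depth.
        xml_members = [
            name for name in archive.namelist()
            if name.lower().endswith(".xml")
        ]
        # The relations file is matched by base name, wherever it sits.
        relation_members = [
            name for name in xml_members
            if PurePosixPath(name).name == relations_name
        ]
        source_members = [n for n in xml_members if n not in relation_members]

        # No schema is assumed for the relations file: just count element tags.
        tag_counts = Counter()
        for name in relation_members:
            with archive.open(name) as handle:
                for _, element in ET.iterparse(handle):
                    tag_counts[element.tag] += 1

    return {
        "source_files": len(source_members),
        "relation_files": len(relation_members),
        "relation_tags": dict(tag_counts),
    }
```

Calling `summarize_corpus_archive("corpus.zip")` (with the real path substituted for the hypothetical name) returns a small dictionary. The `source_files` count can be compared with the figure of 1773 stated above, although it may differ if the archive is organised differently from what this sketch assumes.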

Publication

List of publications

Celina Heliasz and Maciej Ogrodniczuk. Eksplicytność a implicytność w świetle analizy korpusowej (meta)tekstu. Linguistica Copernicana, 16:75–100, 2019.