Polish Discourse Corpus / Polski Korpus Metatekstowy
The Polish Discourse Corpus is a corpus of discourse relations annotated on top of the Polish Coreference Corpus and created as part of the CLARIN-PL project.
Documentation
Please see the annotation instructions (available in Polish only).
Licence
Creative Commons Attribution 3.0 Unported License
Downloads
The corpus is available for download as a compressed archive (corpus.tar.gz) containing:
- 1773 source XML TEI files of the Polish Coreference Corpus
- metatext.xml file containing descriptions of all relations
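After downloading, a quick sanity check of the archive contents may be useful. The following is a minimal sketch in Python, not an official tool: the function name summarize_archive is hypothetical, and the internal directory layout of the archive is an assumption (the page does not describe it), so the reported count need not match the figure above exactly.

```python
import tarfile
from pathlib import PurePosixPath


def summarize_archive(path):
    """Count XML members and locate metatext.xml in a tar archive.

    Assumption: every regular file ending in .xml, other than
    metatext.xml, is one of the source TEI files.
    """
    xml_files = []
    metatext = None
    # "r:*" lets tarfile detect the compression (gzip, bz2, xz or none).
    with tarfile.open(path, "r:*") as archive:
        for member in archive.getmembers():
            if not member.isfile():
                continue  # skip directories, links, etc.
            name = PurePosixPath(member.name)
            if name.suffix.lower() != ".xml":
                continue
            if name.name == "metatext.xml":
                metatext = member.name
            else:
                xml_files.append(member.name)
    return {"xml_files": len(xml_files), "metatext": metatext}
```

Calling summarize_archive("corpus.tar.gz") on the downloaded file returns a dictionary with the number of XML files found and the path of metatext.xml inside the archive (or None if it is missing).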
Citing
Please cite: