Locked History Actions

Diff for "Scwad/CDSCorpus"

Differences between revisions 9 and 10
Revision 9 as of 2017-08-02 07:57:09
Size: 708
Comment:
Revision 10 as of 2017-08-09 09:12:13
Size: 1014
Comment:
Deletions are marked like this. Additions are marked like this.
Line 16: Line 16:
Alina Wróblewska and Katarzyna Krasnowska-Kieraś (2017) Polish evaluation dataset for compositional distributional semantics models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 784–792, DOI: doi.org/10.18653/v1/P17-1073.

Polish CDSCorpus

Polish CDSCorpus consists of 10K Polish sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish.

Download

You can have a look at a part of CDSCorpus (1K annotated sentence pairs). If you wish to get the entire CDSCorpus (10K annotated sentence pairs) please contact alina <at> ipipan.waw.pl (replace <at> with @).

Publication

Alina Wróblewska and Katarzyna Krasnowska-Kieraś (2017) Polish evaluation dataset for compositional distributional semantics models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 784–792, DOI: doi.org/10.18653/v1/P17-1073.

Contact

For contacting Alina Wróblewska, please write to the email alina <at> ipipan.waw.pl.