Diff for "Scwad/CDSCorpus"

Differences between revisions 1 and 12 (spanning 11 versions)
Revision 1 as of 2017-04-21 11:52:19
Size: 40
Revision 12 as of 2017-08-09 09:16:27
Size: 1178
#format wiki
#language en
#acl +All:read Default

= Polish CDSCorpus =

Polish CDSCorpus consists of 10K Polish sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish. For more details, please refer to the [[http://www.aclweb.org/anthology/P/P17/P17-1073.pdf|paper]] describing the dataset (Wróblewska and Krasnowska-Kieraś, 2017).

== Download ==

A sample of [[attachment:dataset_1000.csv|CDSCorpus]] (1K annotated sentence pairs) is attached to this page. To obtain the entire CDSCorpus (10K annotated sentence pairs), please contact ''alina'' <at> ''ipipan.waw.pl'' (replace <at> with @).
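
As a rough illustration of how the sample might be used for evaluation, the sketch below reads sentence pairs and correlates a model's relatedness predictions with the human scores. Everything specific here is an assumption, not something stated on this page: the column names (`sentence_A`, `sentence_B`, `relatedness_score`, `entailment_judgment`), the tab delimiter, and the choice of Pearson correlation as the metric. Check the header row of `dataset_1000.csv` and adjust accordingly. The token-overlap scorer is only a toy stand-in for a real compositional distributional semantics model.

```python
import csv
import math


def load_pairs(handle, delimiter="\t"):
    """Read sentence pairs from an open file handle.

    The column names below are hypothetical; replace them with the
    names found in the header row of the actual file.
    """
    reader = csv.DictReader(handle, delimiter=delimiter)
    pairs = []
    for row in reader:
        pairs.append({
            "a": row["sentence_A"],
            "b": row["sentence_B"],
            "relatedness": float(row["relatedness_score"]),
            "entailment": row["entailment_judgment"],
        })
    return pairs


def overlap_score(a, b):
    """Toy baseline: Jaccard overlap of lowercased whitespace tokens."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    union = tokens_a | tokens_b
    if not union:
        return 0.0
    return len(tokens_a & tokens_b) / len(union)


def pearson(xs, ys):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y)


def evaluate(path, delimiter="\t"):
    """Score every pair with the toy baseline and compare to human scores."""
    with open(path, encoding="utf-8", newline="") as handle:
        pairs = load_pairs(handle, delimiter=delimiter)
    predicted = [overlap_score(p["a"], p["b"]) for p in pairs]
    gold = [p["relatedness"] for p in pairs]
    return pearson(predicted, gold)
```

Assuming the attachment has been saved locally, `evaluate("dataset_1000.csv")` would return a single correlation coefficient; swapping `overlap_score` for a real model's similarity function is the only change needed to evaluate that model.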

== Publication ==

Alina Wróblewska and Katarzyna Krasnowska-Kieraś (2017) Polish evaluation dataset for compositional distributional semantics models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 784–792, DOI: doi.org/10.18653/v1/P17-1073.

== Contact ==
To contact Alina Wróblewska, please write to ''alina'' <at> ''ipipan.waw.pl''.
