Polish CDSCorpus
Polish CDSCorpus consists of 10K Polish sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish.
Download
You can have a look at the part of CDSCorpus (1k annotated sentence pairs). If you wish to get the entire CDSCorpus (10k annotated sentence pairs) please contact alina <at> ipipan.waw.pl (replace <at> with @).
Publication
Contact
For contacting Alina Wróblewska, please write to the email alina <at> ipipan.waw.pl.