Polish CDSCorpus

Polish CDSCorpus consists of 10K Polish sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish.

Download

You can have a look at the part of CDSCorpus (1k annotated sentence pairs). If you wish to get the entire CDSCorpus (10k annotated sentence pairs) please contact alina <at> ipipan.waw.pl (replace <at> with @).

Publication

Contact

For contacting Alina Wróblewska, please write to the email alina <at> ipipan.waw.pl.

Scwad/CDSCorpus

Menu

Polish CDSCorpus

Download

Publication

Contact