Differences between revisions 17 and 18

Polish CDSCorpus

The dataset for compositional distributional semantics

Polish CDSCorpus consists of 10K Polish sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish. The dataset was presented at ACL 2017. Please refer to the Wróblewska and Krasnowska-Kieraś (2017) for a detailed description of the resource.

Download

You can have a look at a part of CDSCorpus (1K annotated sentence pairs). If you wish to get the entire CDSCorpus (10K annotated sentence pairs) please contact alina <at> ipipan.waw.pl (replace <at> with @).

People

Alina Wróblewska
Katarzyna Krasnowska-Kieraś
Alicja Dziedzic-Rawska
Bożena Itoya
Magdalena Król
Anna Latusek
Justyna Małek
Małgorzata Michalik
Agnieszka Norwa
Małgorzata Szajbel-Keck
Alicja Walichnowska
Konrad Zieliński
and some other

Publication

Alina Wróblewska and Katarzyna Krasnowska-Kieraś (2017) Polish evaluation dataset for compositional distributional semantics models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 784–792, DOI: doi.org/10.18653/v1/P17-1073.

Contact

For contacting Alina Wróblewska, please write to the email alina <at> ipipan.waw.pl.

Acknowledgments

The building of the resource was supported by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland.

-  ⇤ ← Revision 17 as of 2017-08-09 09:51:11 → 
  Size: 1786
  Editor: AlinaWroblewska
  Comment:
+   ← Revision 18 as of 2017-08-09 09:51:28 → ⇥
  Size: 1797
  Editor: AlinaWroblewska
  Comment:
-Deletions are marked like this.
+Additions are marked like this.
 Line 29:
- * ...
+ * and some other

Diff for "Scwad/CDSCorpus"

Menu