Differences between revisions 11 and 26 (spanning 15 versions)

Polish CDSCorpus

The dataset for compositional distributional semantics

Polish CDSCorpus consists of 10K Polish sentence pairs that are human-annotated for semantic relatedness and entailment. The dataset may be used to evaluate compositional distributional semantics models of Polish. It was presented at ACL 2017. Please refer to Wróblewska and Krasnowska-Kieraś (2017) for a detailed description of the resource.

Dataset

Go to the CDSCorpus repository.

Publication

List of publications

Alina Wróblewska and Katarzyna Krasnowska-Kieraś. Polish evaluation dataset for compositional distributional semantics models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 784–792, Vancouver, Canada, 2017. Association for Computational Linguistics.

Katarzyna Krasnowska-Kieraś and Alina Wróblewska. Empirical linguistic study of sentence embeddings. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5729–5739, Florence, Italy, 2019. Association for Computational Linguistics.

Licence

The resource is distributed under the CC BY-NC-SA 4.0 licence.

Contact

To contact Alina Wróblewska, please write to alina <at> ipipan.waw.pl.

People

Alina Wróblewska
Katarzyna Krasnowska-Kieraś
Alicja Dziedzic-Rawska
Bożena Itoya
Magdalena Król
Anna Latusek
Justyna Małek
Małgorzata Michalik
Agnieszka Norwa
Małgorzata Szajbel-Keck
Alicja Walichnowska
Konrad Zieliński
and others

Acknowledgments

The creation of the resource was supported by SONATA 8 grant no. 2014/15/D/HS2/03486 from the National Science Centre, Poland.

-  Revision 11 as of 2017-08-09 09:14:53
  Size: 1123
  Editor: AlinaWroblewska
  Comment:
+  Revision 26 as of 2021-02-08 13:24:40
  Size: 1786
  Editor: AlinaWroblewska
  Comment:
 Line 7:
-Polish CDSCorpus consists of 10K Polish sentence pairs which are human-annotated for semantic relatedness and entailment. The dataset may be used for the evaluation of compositional distributional semantics models of Polish. For more details, please refer to the paper describing the dataset (Wróblewska and Krasnowska-Kieraś, 2017).
+== The dataset for compositional distributional semantics ==
 Line 9:
-== Download ==
+Polish CDSCorpus consists of 10K Polish sentence pairs that are human-annotated for semantic relatedness and entailment. The dataset may be used to evaluate compositional distributional semantics models of Polish. It was presented at ACL 2017. Please refer to [[http://www.aclweb.org/anthology/P/P17/P17-1073.pdf|Wróblewska and Krasnowska-Kieraś (2017)]] for a detailed description of the resource.
 Line 11:
+== Dataset ==
Go to the [[http://git.nlp.ipipan.waw.pl/Scwad/SCWAD-CDSCorpus|CDSCorpus]] repository.

{{{#!wiki comment
-Line 12:
+Line 16:
+}}}
-Line 15:
+Line 21:
-Alina Wróblewska and Katarzyna Krasnowska-Kieraś (2017) Polish evaluation dataset for compositional distributional semantics models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 784–792, DOI: doi.org/10.18653/v1/P17-1073.
+<<BibMate(key, "wro:kra:17", omitYears=true)>>

<<BibMate(key, "kra:wro:2019", omitYears=true)>>

== Licence ==

The resource is distributed under the [[https://creativecommons.org/licenses/by-nc-sa/4.0/|CC BY-NC-SA 4.0]] licence.
-Line 19:
+Line 32:
+== People ==

 * Alina Wróblewska
 * Katarzyna Krasnowska-Kieraś
 * Alicja Dziedzic-Rawska
 * Bożena Itoya
 * Magdalena Król
 * Anna Latusek
 * Justyna Małek
 * Małgorzata Michalik
 * Agnieszka Norwa
 * Małgorzata Szajbel-Keck
 * Alicja Walichnowska
 * Konrad Zieliński
 * and others

== Acknowledgments ==
The creation of the resource was supported by SONATA 8 grant no. 2014/15/D/HS2/03486 from the National Science Centre, Poland.

Diff for "Scwad/CDSCorpus"
