Locked History Actions

Diff for "CoDeS"

Differences between revisions 52 and 53
Revision 52 as of 2018-02-08 12:55:39
Size: 4107
Editor: PiotrRychlik
Comment:
Revision 53 as of 2018-02-08 12:56:42
Size: 4106
Editor: PiotrRychlik
Comment:
Deletions are marked like this. Additions are marked like this.
Line 49: Line 49:
 * Mykowiecka, A., M. Marciniak and P. Rychlik (2018) Sim Lex-999 for Polish, LREC 2018.  * Mykowiecka, A., M. Marciniak and P. Rychlik (2018) SimLex-999 for Polish, LREC 2018.

CoDeS project

Project factsheet

English name:

Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts

Polish name:

Wykorzystanie metod kompozycyjnej semantyki dystrybucyjnej do identyfikacji i rozróżniania znaczeń w języku polskim

Project type:

A National Science Centre grant 2014/15/B/ST6/05186

Duration:

Aug 2015 ‒ Aug 2018

Project Web page:

http://zil.ipipan.waw.pl/CoDeS

Principal investigator:

Agnieszka Mykowiecka

Project summary

The general aim of the project is to evaluate existing methods and devise new techniques of computational distributional semantics (CDS) in the area of discrimination and disambiguation of senses for Polish. One of the basic problems which has to be solved while analyzing natural language utterances is semantic ambiguity of their elements. Our goal is to elaborate methods determining the meaning of a particular word in the context in which it appears and methods for automatic detection if a word is used in more than one sense in an analyzed text. We want to focus our attention on noun-adjectives constructions and investigate how the meaning of a whole phrase may contribute to creation of sense models for its elements. We also plan to extend these methods to recognize words or phrases which are not used in their literal sense but figuratively. One of the project’s goal is to look for answers for some of general questions, for example, to what degree we will be able to describe sense differences using CDS methods, and which types of models are better suited for such a task and Polish data.

Resources

Polish word embeddings http://dsmodels.nlp.ipipan.waw.pl/ (A. Mykowiecka, M. Marciniak, P. Rychlik, 2017)

DSmodels demo - web service for calculating word similarity using Polish word embeddings

Polish version of SimLex-999 (A. Mykowiecka, M. Marciniak, P. Rychlik, 2018).

Plain text (UTF-8 encoded) of Polish SimLex-999.

Publications