Locked History Actions

Diff for "ZILStart"

Differences between revisions 139 and 140
Revision 139 as of 2018-06-19 11:01:43
Size: 13503
Revision 140 as of 2018-06-19 11:03:01
Size: 13503
Deletions are marked like this. Additions are marked like this.
Line 16: Line 16:
|| http://zil.ipipan.waw.pl/DorotaKomosi%C5%84ska|Dorota Komosińska]], MSc   || [[mailto:dorota.komosinska@gmail.com|dorota.komosinska@gmail.com]] || || [[http://zil.ipipan.waw.pl/DorotaKomosi%C5%84ska|Dorota Komosińska]], MSc || [[mailto:dorota.komosinska@gmail.com|dorota.komosinska@gmail.com]] ||

The Linguistic Engineering Group

The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS).


Core team

Anna Andrzejczuk, PhD


Tomasz Bartosiak, MSc


Zbigniew Gawłowicz


Elżbieta Hajnicz, PhD, Assoc. Prof.


Witold Kieraś, PhD


Łukasz Kobyliński, PhD


Dorota Komosińska, MSc


Katarzyna Krasnowska, MSc


Małgorzata Marciniak, PhD, Assoc. Prof.


Agnieszka Mykowiecka, PhD, Assoc. Prof.


Bartłomiej Nitoń, MSc


Maciej Ogrodniczuk, PhD, Head of the Group


Agnieszka Patejuk, PhD


Adam Przepiórkowski, PhD, Assoc. Prof.


Piotr Rychlik, PhD


Aleksander Wawer, PhD


Marcin Woliński, PhD


Alina Wróblewska, PhD



Konrad Gołuchowski, MSc


Wojciech Jaworski, PhD


Jakub Piskorski, PhD


Piotr Rybak


Filip Stefaniuk


Jakub Szymanik, PhD


Beata Wójtowicz, PhD



The main research areas of the Group

  • (Polish) corpus linguistics; cf. the IPI PAN Corpus of Polish and the National Corpus of Polish,

  • syntactic and semantic parsing of Polish; cf. Spejd and Świgra,

  • extraction of linguistic knowledge from corpora,
  • information extraction,
  • sentiment analysis,
  • morphosyntactic system of Polish,
  • generative linguistic formalisms, esp., HPSG and LFG.

The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.

Current externally funded projects

  • CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)

  • COTHEC (Unified theory of coreference in Polish and its corpus-based verification)

  • Chronofleks (A diachronic formal model of Polish inflection and its implementation)

  • CoDeS (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)

  • DARIAH-PL (Digital Research Infrastructure for the Arts and Humanities)

  • ELRC (European Language Resource Coordination)

  • KORBA (Electronic corpus of 17th and 18th century Polish texts)

  • Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)

  • Parthenos (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies)

  • Scwad (Compositional distributional modelling of Polish language semantics)

  • SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)

Some of our past projects

Publicly available tools and resources

Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.

Some tools (all open source, under GPL; see also CLIP):

  • Świgra – a DCG parser,

  • Spejd – a shallow parsing and disambiguation system,

  • TaKIPI – a morphosyntactic tagger for Polish,

  • PANTERA – a morphosyntactic tagger for Polish,

  • Poliqarp – a corpus indexing and search engine,

  • Poliqarp2 – a new generation corpus indexing and search engine,

  • Dendrarium – a treebank development system (under development),

  • Anotatornia – a system for multi-level manual annotation of corpora (forthcoming),

  • WSDDE – a system for designing and performing Word Sense Disambiguation experiments,

  • Multiservice – web service for various of our tools,

  • TermoPL - multiword terms extraction from text

  • DSmodels - web service for calculating word similarity using Polish word embeddings

  • etc.

Main resources (many more at CLIP):

Other activities

Links to some other activities of the Group:

  • Journal of Language Modelling

  • NLP Seminar at IPI PAN

  • conferences organised by the Group:
    • Intelligent Information Systems series of conferences

    • PolTAL 2014 – 9th International Conference on Natural Language Processing, 17–19 September 2014, Warsaw, Poland

    • TLT14 – 14th International Workshop on Treebanks and Linguistic Theories, 11–12 December 2015, Warsaw, Poland

    • CORBON 2016 – Coreference Resolution Beyond OntoNotes workshop at NAACL 2016 (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US

    • HeadLex16 – Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, 24–29 July 2016, Warsaw, Poland

    • CORBON 2017 – 2nd Workshop on Coreference Resolution Beyond OntoNotes at EACL 2017 (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain

    • CRAC: Computational Models of Reference, Anaphora, and Coreference at NAACL 2017 (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA

    • 16th ArgDiaP conference, part of the WAW 2018 (Warsaw Argumentation Week), 15–16 September 2018, Warsaw