Diff for "ZILStart"

Differences between revisions 115 and 217 (spanning 102 versions)
Revision 115 as of 2017-10-17 23:07:35
Size: 11235
Comment:
Revision 217 as of 2021-09-04 14:38:11
Size: 16296
Comment:
Line 4: Line 4:
The Linguistic Engineering (LE) Group is part of the [[http://www.ipipan.waw.pl/en/dept/dept-ai.html|Department of Artificial Intelligence]] at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[http://www.english.pan.pl/|Polish Academy of Sciences]] (ICS PAS). The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[https://institution.pan.pl/|Polish Academy of Sciences]] (IPI PAN).
Line 10: Line 10:
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/TomaszBartosiak|Tomasz Bartosiak]], MSc || [[mailto:tomasz.bartosiak@gmail.com|tomasz.bartosiak@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD, Assoc. Prof.        || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/WitoldKieras|Witold Kieraś]], PhD (part time) || [[mailto:wkieras@gmail.com|wkieras@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD || [[mailto:lkobylinski@ipipan.waw.pl|lukasz.kobylinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]], MSc (part time) || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD, Assoc. Prof.       || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD, Assoc. Prof.        || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BartlomiejNiton|Bartłomiej Nitoń]], MSc || [[mailto:bartek.niton@gmail.com|bartek.niton@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Head of the Group             || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AgnieszkaPatejuk|Agnieszka Patejuk]], PhD || [[mailto:aep@ipipan.waw.pl|agnieszka.patejuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Assoc. Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD || [[mailto:rychlik@ipipan.waw.pl|piotr.rychlik@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], PhD || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD                   || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], PhD || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD        || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/TomaszBartosiak|Tomasz Bartosiak]], MSc        || [[mailto:tomasz.bartosiak@gmail.com|tomasz.bartosiak@gmail.com]] ||
|| Zbigniew Gawłowicz, BEng || [[mailto:zgawlowicz@gmail.com|zgawlowicz@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD, Assoc. Prof. || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/KonradKaczynski|Konrad Kaczyński]], MSc || [[mailto:konrad.kaczynski@ipipan.waw.pl|konrad.kaczynski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/WitoldKieras|Witold Kieraś]], PhD || [[mailto:witold.kieras@ipipan.waw.pl|witold.kieras@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MateuszKlimaszewski|Mateusz Klimaszewski]], MSc || [[mailto:mk.klimaszewski@gmail.com|mk.klimaszewski@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD        || [[mailto:lkobylinski@ipipan.waw.pl|lukasz.kobylinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/DorotaKomosi%C5%84ska|Dorota Komosińska]], MSc                 || [[mailto:dorota.komosinska@gmail.com|dorota.komosinska@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska-Kieraś]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD, Assoc. Prof. || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD, Assoc. Prof. || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BartlomiejNiton|Bartłomiej Nitoń]], MSc                        || [[mailto:bartek.niton@gmail.com|bartek.niton@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Assoc. Prof., Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AgnieszkaPatejuk|Agnieszka Patejuk]], PhD                      || [[mailto:aep@ipipan.waw.pl|agnieszka.patejuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Full Prof.   || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/PiotrPrzybyla|Piotr Przybyła]], PhD || [[mailto:piotr.przybyla@ipipan.waw.pl|piotr.przybyla@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD        || [[mailto:rychlik@ipipan.waw.pl|piotr.rychlik@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], PhD        || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] ||
|| Grzegorz Wojdyga, MSc || [[mailto:g.wojdyga@ipipan.waw.pl|g.wojdyga@ipipan.waw.pl]] ||
|| Joanna Wołoszyn, PhD || ||
|| [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD, Assoc. Prof. || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD, Assoc. Prof. || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], PhD        || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] ||
Line 30: Line 38:
|| [[http://zil.ipipan.waw.pl/KonradGoluchowski|Konrad Gołuchowski]], MSc (part time) || [[mailto:kodieg@gmail.com|kodieg@gmail.com]] ||
|| [[http://www.mimuw.edu.pl/~wjaworski/|Wojciech Jaworski]], PhD (part time) || [[mailto:wjaworski@mimuw.edu.pl|wjaworski@mimuw.edu.pl]] ||
|| [[http://zil.ipipan.waw.pl/JakubPiskorski|Jakub Piskorski]], PhD (associate) || [[mailto:jakub.piskorski@ipipan.waw.pl|jakub.piskorski@ipipan.waw.pl]] ||
|| Piotr Rybak || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD (part time) || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] ||
|| Jakub Piskorski, PhD || [[mailto:jpiskorski@gmail.com|jpiskorski@gmail.com]] ||
|| Piotr Rybak || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] ||
|| Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] ||
Line 41: Line 47:
 * (Polish) corpus linguistics; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]],
 * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]],
 * (Polish) corpus linguistics ([[http://nkjp.pl/|National Corpus of Polish]]), /* ; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]], */
 * morphosyntactic tagging and lemmatisation of Polish,
 * syntactic and semantic parsing of Polish,
Line 45: Line 52:
 * distributional semantics and compositional distributional semantics,
Line 46: Line 54:
 * morphosyntactic system of Polish,  * credibility assessment of online content,
 /* * morphosyntactic system of Polish, */
Line 53: Line 62:
 * [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]])
 * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]]
 * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]]
 * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
 * [[http://dariah.pl/|DARIAH-PL]]
 * [[http://clip.ipipan.waw.pl/ELRC|ELRC]]
 * [[http://clip.ipipan.waw.pl/KORBA|KORBA]]
 * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
 * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]]
 * [[http://zil.ipipan.waw.pl/Scwad|Scwad]]
 * [[http://synamet.uw.edu.pl/|SYNAMET]]
 * [[http://clip.ipipan.waw.pl/TextLink|TextLink]]
 * [[http://clip.ipipan.waw.pl/CLARIN-PL-3|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]])
 * [[CORMETAN]] (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)
 * [[http://clip.ipipan.waw.pl/CURLICAT|CURLICAT]] (Curated Multilingual Language Resources for CEF AT)
 * [[http://dariah.pl/|DARIAH-PL]] (Digital Research Infrastructure for the Arts and Humanities)
 * [[http://clip.ipipan.waw.pl/ELE|ELE]] (European Language Equality)
 * [[http://clip.ipipan.waw.pl/ELG|ELG]] (European Language Grid)
 * [[http://clip.ipipan.waw.pl/ELRC|ELRC]] (European Language Resource Coordination)
 * [[http://clip.ipipan.waw.pl/KORBA-2|KORBA 2]] (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")
 * [[HOMADOS|HOMADOS]] (Hampering Misinformation by Assessing Credibility of Online Sources)
 * [[http://clip.ipipan.waw.pl/MARCELL|MARCELL]] (Multilingual Resources for CEF.AT in the legal domain)
 * [[http://clip.ipipan.waw.pl/Nexus|Nexus Linguarum]] (European network for Web-centred linguistic data science)
 * [[http://zil.ipipan.waw.pl/Quantifiers|Kwantyfikatory w języku: użycie i znaczenie]] (Quantifiers in Language: Use and Meaning)
 * [[http://zil.ipipan.waw.pl/Scwad|Scwad]] (Compositional distributional modelling of Polish language semantics)
 * [[http://synamet.uw.edu.pl/|SYNAMET]] (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)
Line 67: Line 78:
 * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
 * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS)
 * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]]
 * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]]
 * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]]
 * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources)
 * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation)
 * [[CLARIN|CLARIN]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]], see also [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL 2]])
 * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]]
 * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts)
 * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification)
 * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]
 * [[Information Extraction from Polish free text|Information Extraction from Polish free text]]
 * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]]
 * [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts)
 * [[LT4eL|LT4eL]] (Language Technology for eLearning)
 * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support
 * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet)
 * [[NKJP|NKJP]] (National Corpus of Polish)
 * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim)
 * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
 * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies)
Line 69: Line 101:
 * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim)  * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society)
 * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]]
 * [[http://clip.ipipan.waw.pl/TextLink|TextLink]] (Structuring Discourse in Multilingual Europe)
Line 71: Line 105:
 * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet),
 * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
 * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]],
 * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS),
 * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources),
 * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]],
 * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts),
 * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure),
 * [[NKJP|NKJP]] (National Corpus of Polish),
 * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]],
 * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
 * [[LT4eL|LT4eL]] (Language Technology for eLearning),
 * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]],
 * [[Information Extraction from Polish free text|Information Extraction from Polish free text]],
 * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]],
 * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]],
 * [[HPSG Grammar of Polish|HPSG Grammar of Polish]].
Line 95: Line 112:
 * [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]] – a DCG parser,  * [[http://morfeusz.sgjp.pl/|Morfeusz 2]] – a morphological analyser of Polish,
Line 97: Line 114:
 * [[http://zil.ipipan.waw.pl/%C5%9Awigra|Świgra]] – a DCG parser,
 * [[https://github.com/360er0/COMBO|COMBO]] – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling),
 * [[http://zil.ipipan.waw.pl/Concraft|Concraft]] – a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,
 * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish,
Line 98: Line 119:
 * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish,
Line 102: Line 122:
 * [[http://zil.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming),  * [[http://zil.ipipan.waw.pl/Anotatornia2/|Anotatornia 2]] – an annotation tool geared towards historical corpora,
Line 106: Line 126:
 * [[http://nlp.ipipan.waw.pl/PPJP/|etc.]]  * [[http://dsmodels.nlp.ipipan.waw.pl/sim1.html|DSmodels]] - web service for calculating word similarity using Polish word embeddings
Line 112: Line 133:
 * [[http://nkjp.pl/index.php?page=0&lang=1|National Corpus of Polish]].  * [[http://nkjp.pl/index.php?page=0&lang=1|National Corpus of Polish]],
 * [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]],
 * Polish dependency banks: [[http://zil.ipipan.waw.pl/PDB|PDB]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PDB-UD_current|PDB-UD]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PUD-PL_current|PUD-PL]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/NKJP1M-UD_current|NKJP1M-UD]],
 * [[http://zil.ipipan.waw.pl/PDB/PDBparser|Dependency parsing models for Polish]].
Line 122: Line 146:
 * [[http://poleval.pl/|PolEval]], the evaluation campaign for natural language processing tools for Polish
Line 124: Line 149:
  * [[http://poltal.ipipan.waw.pl/|PolTAL 2014]] – 9th International Conference on Natural Language Processing, 17–19 September 2014, Warsaw, Poland
  * [[http://tlt14.ipipan.waw.pl/|TLT14]] – 14th International Workshop on Treebanks and Linguistic Theories, 11–12 December 2015, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/2016/|CORBON 2016]] – Coreference Resolution Beyond !OntoNotes workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US
  * [[http://headlex16.ipipan.waw.pl/|HeadLex16]] – Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, 24–29 July 2016, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/|CORBON 2017]] – 2nd Workshop on Coreference Resolution Beyond !OntoNotes at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain
  * [[http://poltal.ipipan.waw.pl/|9th International Conference on Natural Language Processing]] (PolTAL 2014), 17–19 September 2014, Warsaw, Poland
  * [[http://tlt14.ipipan.waw.pl/|14th International Workshop on Treebanks and Linguistic Theories]] (TLT14), 11–12 December 2015, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/2016/|Coreference Resolution Beyond OntoNotes]] (CORBON 2016) workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US
  * [[http://headlex16.ipipan.waw.pl/|Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar]] (!HeadLex16), 24–29 July 2016, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/|2nd Workshop on Coreference Resolution Beyond OntoNotes]] (CORBON 2017) at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain
  * [[http://anawiki.essex.ac.uk/dali/crac18/|Computational Models of Reference, Anaphora, and Coreference]] workshop (CRAC) at [[http://naacl2018.org/|NAACL 2018]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA
  * [[https://nlpday.pl/|AI & NLP Workshop Day]], 19 October 2018, Warsaw
  * [[https://sites.google.com/view/crac2019/|Second Workshop on Computational Models of Reference, Anaphora and Coreference]] (CRAC 2019), 6 or 7 June 2019, Minneapolis
  * [[http://www.dynamicsoflanguage.edu.au/lfg-2019/|The 24th International Lexical-Functional Grammar Conference]] (LFG 2019), 8–10 July 2019, Canberra


== Selected publications ==

<<BibMate(author, "Andrzejczuk", "Bartosiak", "Gawłowicz", "Hajnicz", "Kaczyński", "Kieraś", "Klimaszewski", "Kobyliński", "Krasnowska", "Marciniak", "Mykowiecka", "Nitoń", "Ogrodniczuk", "Patejuk", "Przepiórkowski", "Przybyła", "Rychlik", "Wawer", "Wojdyga", "Wołoszyn", "Woliński", "Wójtowicz", "Wróblewska")>>

The Linguistic Engineering Group

The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (IPI PAN).

People

Core team

Anna Andrzejczuk, PhD

anna.andrzejczuk@ipipan.waw.pl

Tomasz Bartosiak, MSc

tomasz.bartosiak@gmail.com

Zbigniew Gawłowicz, BEng

zgawlowicz@gmail.com

Elżbieta Hajnicz, PhD, Assoc. Prof.

elzbieta.hajnicz@ipipan.waw.pl

Konrad Kaczyński, MSc

konrad.kaczynski@ipipan.waw.pl

Witold Kieraś, PhD

witold.kieras@ipipan.waw.pl

Mateusz Klimaszewski, MSc

mk.klimaszewski@gmail.com

Łukasz Kobyliński, PhD

lukasz.kobylinski@ipipan.waw.pl

Dorota Komosińska, MSc

dorota.komosinska@gmail.com

Katarzyna Krasnowska-Kieraś, MSc

katarzyna.krasnowska@ipipan.waw.pl

Małgorzata Marciniak, PhD, Assoc. Prof.

malgorzata.marciniak@ipipan.waw.pl

Agnieszka Mykowiecka, PhD, Assoc. Prof.

agnieszka.mykowiecka@ipipan.waw.pl

Bartłomiej Nitoń, MSc

bartek.niton@gmail.com

Maciej Ogrodniczuk, PhD, Assoc. Prof., Head of the Group

maciej.ogrodniczuk@ipipan.waw.pl

Agnieszka Patejuk, PhD

agnieszka.patejuk@ipipan.waw.pl

Adam Przepiórkowski, PhD, Full Prof.

adam.przepiorkowski@ipipan.waw.pl

Piotr Przybyła, PhD

piotr.przybyla@ipipan.waw.pl

Piotr Rychlik, PhD

piotr.rychlik@ipipan.waw.pl

Aleksander Wawer, PhD

aleksander.wawer@ipipan.waw.pl

Grzegorz Wojdyga, MSc

g.wojdyga@ipipan.waw.pl

Joanna Wołoszyn, PhD

Marcin Woliński, PhD, Assoc. Prof.

marcin.wolinski@ipipan.waw.pl

Beata Wójtowicz, PhD, Assoc. Prof.

beata.wojtowicz@ipipan.waw.pl

Alina Wróblewska, PhD

alina.wroblewska@ipipan.waw.pl

Associates

Jakub Piskorski, PhD

jpiskorski@gmail.com

Piotr Rybak

piotr.cezary.rybak@gmail.com

Jakub Szymanik, PhD

jakub.szymanik@gmail.com

Research

The main research areas of the Group

  • (Polish) corpus linguistics (National Corpus of Polish),

  • morphosyntactic tagging and lemmatisation of Polish,
  • syntactic and semantic parsing of Polish,
  • extraction of linguistic knowledge from corpora,
  • information extraction,
  • distributional semantics and compositional distributional semantics,
  • sentiment analysis,
  • credibility assessment of online content,

  • generative linguistic formalisms, esp., HPSG and LFG.

The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.

Current externally funded projects

  • CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)

  • CORMETAN (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)

  • CURLICAT (Curated Multilingual Language Resources for CEF AT)

  • DARIAH-PL (Digital Research Infrastructure for the Arts and Humanities)

  • ELE (European Language Equality)

  • ELG (European Language Grid)

  • ELRC (European Language Resource Coordination)

  • KORBA 2 (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")

  • HOMADOS (Hampering Misinformation by Assessing Credibility of Online Sources)

  • MARCELL (Multilingual Resources for CEF.AT in the legal domain)

  • Nexus Linguarum (European network for Web-centred linguistic data science)

  • Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)

  • Scwad (Compositional distributional modelling of Polish language semantics)

  • SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)

Some of our past projects

Publicly available tools and resources

Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.

Some tools (all open source, under GPL; see also CLIP):

  • Morfeusz 2 – a morphological analyser of Polish,

  • Spejd – a shallow parsing and disambiguation system,

  • Świgra – a DCG parser,

  • COMBO – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling),

  • Concraft – a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,

  • PANTERA – a morphosyntactic tagger for Polish,

  • TaKIPI – a morphosyntactic tagger for Polish,

  • Poliqarp – a corpus indexing and search engine,

  • Poliqarp2 – a new generation corpus indexing and search engine,

  • Dendrarium – a treebank development system (under development),

  • Anotatornia 2 – an annotation tool geared towards historical corpora,

  • WSDDE – a system for designing and performing Word Sense Disambiguation experiments,

  • Multiservice – a web service for several of our tools,

  • TermoPL – extraction of multiword terms from text,

  • DSmodels – a web service for calculating word similarity using Polish word embeddings.

Main resources (many more at CLIP):

Other activities

Links to some other activities of the Group:

Selected publications

List of publications

2025

Aleksandra Tomaszewska, Dariusz Czerski, Bartosz Żuk, and Maciej Ogrodniczuk. NeoN: A tool for automated detection, linguistic and LLM-driven analysis of neologisms in Polish. In Michael H. Lees, Wentong Cai, Siew Ann Cheong, Yi Su, David Abramson, Jack J. Dongarra, and Peter M. A. Sloot, editors, Computational Science – ICCS 2025, pages 318–326, Cham, 2025. Springer Nature Switzerland.

2024

Katarzyna Krasnowska-Kieraś and Marcin Woliński. Parsing headed constituencies. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12633–12643, Torino, Italy, 2024. ELRA and ICCL.

Adam Przepiórkowski, Magdalena Borysiak, Adam Okrasiński, Bartosz Pobożniak, Wojciech Stempniak, Kamil Tomaszek, and Adam Głowacki. Symmetric dependency structure of coordination: Crosslinguistic arguments from dependency length minimization. In Daniel Dakota, Sarah Jablotschkin, Sandra Kübler, and Heike Zinsmeister, editors, Proceedings of the 22nd Workshop on Treebanks and Linguistic Theories (TLT 2024), pages 11–22, Hamburg, Germany, 2024. Association for Computational Linguistics.

Adam Przepiórkowski, Katarzyna Kuś, Agnieszka Patejuk, and Berke Şenşekerci. You can depend on the symmetry of coordination and that NPs and CPs can be conjoined. Presentation delivered on 5 July 2024 at the “Form and Meaning of Coordination” workshop in Göttingen, Germany (https://www.uni-goettingen.de/de/685553.html), 2024.

Piotr Rybak, Piotr Przybyła, and Maciej Ogrodniczuk. PolQA: Polish question answering dataset. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12846–12855, Torino, Italy, 2024. ELRA and ICCL.

2023

Łukasz Kobyliński, Maciej Ogrodniczuk, Piotr Rybak, Piotr Przybyła, Piotr Pęzik, Agnieszka Mikołajczyk, Wojciech Janowski, Michał Marcińczuk, and Aleksander Smywiński-Pohl. PolEval 2022/23 challenge tasks and results. In Maria Ganzha, Leszek Maciaszek, Marcin Paprzycki, and Dominik Ślęzak, editors, Proceedings of the 18th Conference on Computer Science and Intelligence Systems, volume 35 of Annals of Computer Science and Information Systems, pages 1237–1244, 2023.

Katarzyna Krasnowska-Kieraś and Marcin Woliński. Constituency parsing with spines and attachments. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part I, number 14073 in Lecture Notes in Computer Science, pages 191–205, Cham, 2023. Springer Nature Switzerland.

Maciej Ogrodniczuk, editor. Analiza danych parlamentarnych. Warsztat pokonkursowy, Warsaw, 2023. Institute of Computer Science, Polish Academy of Sciences.

Adam Przepiórkowski and Michał Woźniak. Conjunct lengths in English, Dependency Length Minimization, and dependency structure of coordination. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15494–15512, Toronto, Canada, 2023. Association for Computational Linguistics.

Karol Saputa, Aleksandra Tomaszewska, Natalia Zawadzka-Paluektau, Witold Kieraś, and Łukasz Kobyliński. Korpusomat.eu: A multilingual platform for building and analysing linguistic corpora. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part II, number 14074 in Lecture Notes in Computer Science, pages 230–237, Cham, 2023. Springer Nature Switzerland.

Joanna Wołoszyn, Witold Kieraś, and Marcin Woliński. Sieć powiązań derywacyjnych na materiale Słownika gramatycznego języka polskiego: Propozycja klasyfikacji. LingVaria, 18(2):47–61, 2023.

2022

Maciej Ogrodniczuk, Sameer Pradhan, Anna Nedoluzhko, Vincent Ng, and Massimo Poesio, editors. Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, Gyeongju, Republic of Korea, 2022. Association for Computational Linguistics.

Adam Przepiórkowski. Polyadic cover quantification in heterofunctional coordination. In Daniel Gutzmann and Sophie Repp, editors, Proceedings of Sinn und Bedeutung 26, pages 677–696, 2022.

2021

Maciej Ogrodniczuk and Łukasz Kobyliński, editors. Proceedings of the PolEval 2021 Workshop, Warsaw, 2021. Institute of Computer Science, Polish Academy of Sciences.

Maciej Ogrodniczuk and Piotr Przybyła. PolEval 2021 Task 4: Question Answering Challenge. In Maciej Ogrodniczuk and Łukasz Kobyliński, editors, Proceedings of the PolEval 2021 Workshop, pages 123–136, Warsaw, 2021. Institute of Computer Science, Polish Academy of Sciences.

2019

Katarzyna Krasnowska-Kieraś and Łukasz Kobyliński. Part of speech tagging for Polish. Poznań Studies in Contemporary Linguistics, 55(2):211–237, 2019.

Maciej Ogrodniczuk, Rafał L. Górski, Marek Łaziński, and Piotr Pęzik. From the National Corpus of Polish to the Polish Corpus Infrastructure. Jazykovedný časopis, 70(2):315–323, 2019.

2018

Małgorzata Marciniak, Agnieszka Mykowiecka, and Piotr Rychlik. Recognition of irrelevant phrases in automatically extracted lists of domain terms. Terminology, 24(1):66–90, 2018.

Agnieszka Mykowiecka, Małgorzata Marciniak, and Aleksander Wawer. Literal, metphorical or both? Detecting metaphoricity in isolated adjective-noun phrases. In Beata Beigman Klebanov, Ekaterina Shutova, Patricia Lichtenstein, Smaranda Muresan, and Chee Wee, editors, Proceedings of the Workshop on Figurative Language Processing, pages 27–33. Association for Computational Linguistics, 2018.

Maciej Ogrodniczuk, Joanna Bilińska, Zbigniew Bronk, and Witold Kieraś. Multisłownik: Linking plWordNet-based lexical data for lexicography and educational purposes. In Francis Bond, Takayuki Kuribayashi, Christiane Fellbaum, and Piek Vossen, editors, Proceedings of the 9th Global WordNet Conference (GWC 2018), pages 368–375, Singapore, 2018. University of Tartu.

Piotr Rybak and Alina Wróblewska. Semi-supervised neural system for tagging, parsing and lemmatization. Addendum. In Proceedings of the PolEval 2018 Workshop, pages 49–51. Institute of Computer Science, Polish Academy of Sciences, 2018.

Alina Wróblewska. Polish corpus of annotated descriptions of images. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 2141–2146. European Language Resources Association (ELRA), 2018.

Alina Wróblewska. Results of the PolEval 2018 Shared Task 1: Dependency Parsing. In Proceedings of the PolEval 2018 Workshop, pages 11–24. Institute of Computer Science, Polish Academy of Sciences, 2018.

Magdalena Zawisławska, Marta Falkowska, and Maciej Ogrodniczuk. Verbal synaesthesia in the Polish corpus of synaesthetic metaphors. LaMiCuS, 2:226–253, 2018.

2017

Witold Kieraś and Marcin Woliński. Morfeusz 2 – analizator i generator fleksyjny dla języka polskiego. Język Polski, XCVII(1):75–83, 2017.

Bartłomiej Nitoń and Maciej Ogrodniczuk. Multi-pass sieve coreference resolution system for Polish. In Jorge Gracia, Francis Bond, John P. McCrae, Paul Buitelaar, Christian Chiarcos, and Sebastian Hellmann, editors, Proceedings of the 1st Conference on Language, Data and Knowledge (LDK 2017), number 10318 in Lecture Notes in Artificial Intelligence, pages 222–236. Springer International Publishing, Berlin, 2017.

Adam Przepiórkowski. Argumenty i modyfikatory w gramatyce i w słowniku. Wydawnictwa Uniwersytetu Warszawskiego, Warsaw, 2017.

Adam Przepiórkowski. On the argument–adjunct distinction in the Polish Semantic Syntax tradition. Cognitive Studies / Études Cognitives, 17:1–10, 2017.

Aleksander Wawer and Agnieszka Mykowiecka. Supervised and unsupervised word sense disambiguation on word embedding vectors of unambigous synonyms. In Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications, pages 120–125. Association for Computational Linguistics, 2017.

Alina Wróblewska, Katarzyna Krasnowska-Kieraś, and Piotr Rybak. Towards the evaluation of feature embedding models of the fusional languages. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 420–424, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

2016

Joanna Bilińska, Magdalena Derwojedowa, Witold Kieraś, and Monika Kwiecień. Mikrokorpus polszczyzny 1830–1918. Komunikacja specjalistyczna, 11:149–161, 2016.

Adam Przepiórkowski. How not to distinguish arguments from adjuncts in LFG. In Doug Arnold, Miriam Butt, Berthold Crysmann, Tracy Holloway King, and Stefan Müller, editors, The Proceedings of the Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, pages 560–580, Stanford, CA, 2016. CSLI Publications.

2015

Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors. Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

Katarzyna Krasnowska-Kieraś and Agnieszka Patejuk. Integrating Polish LFG with external morphology. In Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors, Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), pages 134–147, Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

2014

Elżbieta Hajnicz. The procedure of lexico-semantic annotation of Składnica treebank. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2290–2297, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, and Marcin Woliński. Extended phraseological information in a valence dictionary for NLP applications. In Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014), pages 83–91, Dublin, Ireland, 2014. Association for Computational Linguistics and Dublin City University.

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, Marcin Woliński, Filip Skwarski, and Marek Świdziński. Walenty: Towards a comprehensive valence dictionary of Polish. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2785–2792, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Alina Wróblewska. Polish Dependency Parser Trained on an Automatically Induced Dependency Bank. Ph.D. dissertation, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2014.

2013

Elżbieta Hajnicz. Actualising lexico-semantic annotation of Składnica Treebank to modified versions of source resources. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 178–182, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Barbara Lewandowska-Tomaszczyk, Rafał Górski, Marek Łaziński, and Adam Przepiórkowski. The National Corpus of Polish (NKJP). Language use and data analysis. In Irina Kor Chahine and Charles Zaremba, editors, Travaux de slavistique : Actes du VIe congrès de la Slavic Linguistic Society, pages 309–319. Presses Universitaires de Provence, 2013.

Maciej Ogrodniczuk and Michał Lenart. A multi-purpose online toolset for NLP applications. In Elisabeth Métais, Farid Meziane, Mohamed Saraee, Vijay Sugumaran, and Sunil Vadera, editors, Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems, number 7934 in Lecture Notes in Computer Science, pages 392–395. Springer-Verlag, Berlin, Heidelberg, 2013.

Piotr Przybyła. Question Classification for Polish Question Answering. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Proceedings of the 20th International Conference on Language Processing and Intelligent Information Systems (LP&IIS 2013), pages 50–56. Springer-Verlag, 2013.

2012

Szymon Acedański, Adam Slaski, and Adam Przepiórkowski. Machine learning of syntactic attachment from morphosyntactic and semantic co-occurrence statistics. In Proceedings of the ACL 2012 Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages, pages 42–47, Jeju, Republic of Korea, 2012. Association for Computational Linguistics.

Anna Andrzejczuk. Klasyfikacja onomazjologiczna rzeczowników a ich charakterystyka gramatyczna. Nowy sposób opracowania materiału leksykograficznego. PhD thesis, Instytut Języka Polskiego, Polska Akademia Nauk, Cracow, 2012.

Mateusz Kopeć, Rafał Młodzki, and Adam Przepiórkowski. Word Sense Disambiguation in the National Corpus of Polish. Prace Filologiczne, LXIII:155–165, 2012.

Mateusz Kopeć, Rafał Młodzki, and Adam Przepiórkowski. Automatyczne znakowanie sensami słów. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 209–224. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Barbara Lewandowska-Tomaszczyk, Mirosław Bańko, Rafał L. Górski, Marek Łaziński, Piotr Pęzik, and Adam Przepiórkowski. Narodowy Korpus Języka Polskiego: geneza i dzień dzisiejszy. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 3–10. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors. Narodowy Korpus Języka Polskiego. Wydawnictwo Naukowe PWN, Warsaw, 2012.

2011

Anna Andrzejczuk. Dwoje urodzin to brzmi dziwnie. Norma językowa dotycząca połączeń rzeczowników PT z liczebnikami a jej realizacja w tekstach Narodowego Korpusu Języka Polskiego i w tekstach internetowych. Język Polski, XCI(4):273–283, 2011.

Maciej Ogrodniczuk and Mateusz Kopeć. Rule-based coreference resolution module for Polish. In Proceedings of the 8th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2011), pages 191–200, Faro, Portugal, 2011.

Marcin Woliński, Katarzyna Głowińska, and Marek Świdziński. A preliminary version of Składnica—a treebank of Polish. In Zygmunt Vetulani, editor, Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 299–303, Poznań, Poland, 2011.

2010

Małgorzata Marciniak, editor. Anotowany korpus dialogów telefonicznych. Akademicka Oficyna Wydawnicza EXIT, Warsaw, 2010.

Maciej Ogrodniczuk and Adam Przepiórkowski. Linguistic processing chains as Web Services: Initial linguistic considerations, 2010. CLARIN deliverable D5R-3a.

2009

Marco Passarotti, Adam Przepiórkowski, Savina Raynaud, and Frank Van Eynde, editors. Proceedings of the Eighth International Workshop on Treebanks and Linguistic Theories (TLT 8), Milan, Italy, 2009.

Adam Przepiórkowski. A comparison of two morphosyntactic tagsets of Polish. In Violetta Koseska-Toszewa, Ludmila Dimitrova, and Roman Roszko, editors, Representing Semantics in Digital Lexicography: Proceedings of MONDILEX Fourth Open Workshop, pages 138–144, Warsaw, 2009.

Adam Przepiórkowski. TEI P5 as an XML standard for treebank encoding. In Marco Passarotti, Adam Przepiórkowski, Savina Raynaud, and Frank Van Eynde, editors, Proceedings of the Eighth International Workshop on Treebanks and Linguistic Theories (TLT 8), pages 149–160, Milan, Italy, 2009.

2008

Mieczysław A. Kłopotek, Adam Przepiórkowski, Sławomir T. Wierzchoń, and Krzysztof Trojanowski, editors. Intelligent Information Systems. Akademicka Oficyna Wydawnicza EXIT, Warsaw, 2008.

2007

Anna Andrzejczuk. (Nie)tylko w liczbie mnogiej. Rozważania o szeroko rozumianych plurale tantum. LingVaria, 4(2):177–188, 2007. Cracow.

2005

Agnieszka Mykowiecka, Małgorzata Marciniak, and Anna Kupść. Making shallow look deeper: Anaphora and comparisons in medical information extraction. In Zygmunt Vetulani, editor, Proceedings of the 2nd Language & Technology Conference, pages 225–229, Poznań, Poland, 2005.

Dariusz Piechociński and Agnieszka Mykowiecka. Question answering in Polish using shallow parsing. In Radovan Garabík, editor, Computer Treatment of Slavic and East European Languages: Proceedings of the Third International Seminar, Bratislava, Slovakia, 10–12 November 2005, pages 167–173, Bratislava, 2005. VEDA: Vydavateľstvo Slovenskej akadémie vied.

2004

Jakub Piskorski, Peter Homola, Małgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, and Marcin Woliński. Information extraction for Polish using the SProUT platform. In Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, and Krzysztof Trojanowski, editors, Intelligent Information Processing and Web Mining, Advances in Soft Computing, pages 227–236. Springer-Verlag, Berlin, 2004.

2003

Maciej Ogrodniczuk. Rozszerzenie opisów morfologicznych w tekstach korpusu „Słownika frekwencyjnego polszczyzny współczesnej”. In Roman Huszcza and Jadwiga Linde-Usiekniewicz, editors, Prace lingwistyczne dedykowane prof. Jadwidze Sambor, pages 164–168. Wydział Polonistyki Uniwersytetu Warszawskiego, Warsaw, 2003.

2001

Adam Przepiórkowski. arg-st on phrases: Evidence from Polish. In Dan Flickinger and Andreas Kathol, editors, Proceedings of the HPSG 2000 Conference, pages 267–284. CSLI Publications, Stanford, CA, 2001.

2000

Piotr Bański and Adam Przepiórkowski, editors. Proceedings of the First Generative Linguistics in Poland Conference. Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2000.

Adam Przepiórkowski. Optional and multiple Long Distance Genitive of Negation in Polish. In Piotr Bański and Adam Przepiórkowski, editors, Proceedings of the First Generative Linguistics in Poland Conference, pages 135–146, Warsaw, 2000. Institute of Computer Science, Polish Academy of Sciences.

1997

Anna Kupść, Małgorzata Marciniak, and Leonard Bolc. Anaphor binding in Polish. An attempt at an HPSG account. IPI PAN Research Report 836, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1997.

Anna Kupść, Małgorzata Marciniak, and Agnieszka Mykowiecka. Komputerowe przetwarzanie języka naturalnego — wybrane zagadnienia. Informatyka, 1997.

1989

Elżbieta Hajnicz. Formalizacja systemu wnioskowania o zależnościach czasowych między zdarzeniami. IPI PAN Research Report 658, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1989.