Diff for "ZILStart"

Differences between revisions 143 and 260 (spanning 117 versions)
Revision 143 as of 2018-06-19 12:23:08
Size: 13192
Comment:
Revision 260 as of 2025-08-10 10:05:11
Size: 17967
Comment:
Line 4: Line 4:
The Linguistic Engineering (LE) Group is part of the [[http://www.ipipan.waw.pl/en/dept/dept-ai.html|Department of Artificial Intelligence]] at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[http://www.english.pan.pl/|Polish Academy of Sciences]] (ICS PAS). The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[https://institution.pan.pl/|Polish Academy of Sciences]] (IPI PAN).
Line 10: Line 10:
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] ||
Line 12: Line 11:
|| Zbigniew Gawłowicz                                     || [[mailto:zbigniew.gawlowicz@ipipan.waw.pl|zbigniew.gawlowicz@ipipan.waw.pl]] || || [[https://www.diegofeinmann.com/|Diego Feinmann]], PhD || [[mailto:diego.feinmann@ipipan.waw.pl|diego.feinmann@ipipan.waw.pl]] ||
Line 17: Line 16:
|| [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]], MSc        || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska-Kieraś]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] ||
Line 20: Line 19:
|| [[http://zil.ipipan.waw.pl/BartlomiejNiton|Bartłomiej Nitoń]], MSc || [[mailto:bartek.niton@gmail.com|bartek.niton@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Assoc. Prof., Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] ||
Line 23: Line 21:
|| [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Assoc. Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Full Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/PiotrPrzybyla|Piotr Przybyła]], PhD (on postdoctoral fellowship at [[https://www.upf.edu/web/erinia|UPF]]) || [[mailto:piotr.przybyla@ipipan.waw.pl|piotr.przybyla@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MichałRudolf|Michał Rudolf]], PhD || [[mailto:michal@rudolf.waw.pl|michal@rudolf.waw.pl]] ||
Line 25: Line 25:
||<style="border: 3px solid black"> [[https://zil.ipipan.waw.pl/KarolinaSaputa|Karolina Saputa]], BEng || [[mailto:karolsaputa@gmail.com|karolsaputa@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/AleksandraTomaszewska|Aleksandra Tomaszewska]], PhD candidate || [[mailto:aleksandra.tomaszewska@ipipan.waw.pl|aleksandra.tomaszewska@ipipan.waw.pl]] ||
Line 26: Line 28:
|| [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD               || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD, Assoc. Prof. || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] ||
|| Joanna Wołoszyn, PhD || [[mailto:joanna.woloszyn@ipipan.waw.pl|joanna.woloszyn@ipipan.waw.pl]] ||
Line 28: Line 31:
|| [[http://zil.ipipan.waw.pl/SebastianZawada|Sebastian Zawada]], MSc || [[mailto:sebastian.zawada@ipipan.waw.pl|sebastian.zawada@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/NataliaZawadzka|Natalia Zawadzka-Paluektau]], PhD || [[mailto:natalia.zawadzka-paluektau@ipipan.waw.pl|natalia.zawadzka-paluektau@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BartoszŻuk|Bartosz Żuk]], PhD candidate || [[mailto:bartoszzuk.poczta@gmail.com|bartoszzuk.poczta@gmail.com]] ||
Line 32: Line 38:
|| Piotr Rybak || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] ||
|| Filip Stefaniuk || [[mailto:filip.stefaniuk@gmail.com|filip.stefaniuk@gmail.com]] ||
|| Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] ||
|| Grzegorz Wojdyga, MSc || [[mailto:g.wojdyga@gmail.com|g.wojdyga@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD (on leave) || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] ||
|| Wiktor Eźlakowski, MSc || [[mailto:wiktor.ezlakowski@ipipan.waw.pl|wiktor.ezlakowski@ipipan.waw.pl]] ||
|| Sonia Janicka || [[mailto:sonia.janicka@gmail.com|sonia.janicka@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/MateuszKlimaszewski|Mateusz Klimaszewski]], MSc || [[mailto:mk.klimaszewski@gmail.com|mk.klimaszewski@gmail.com]] ||
|| Jakub Piskorski, PhD || [[mailto:jpiskorski@gmail.com|jpiskorski@gmail.com]] ||
|| Piotr Rybak, MSc || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] ||
|| Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/RyszardTuora|Ryszard Tuora]], MSc || [[mailto:ryszardtuora@gmail.com|ryszardtuora@gmail.com]] ||
|| Grzegorz Wojdyga, MSc || [[mailto:g.wojdyga@ipipan.waw.pl|g.wojdyga@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD, Assoc. Prof. || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] ||
Line 43: Line 54:
 * (Polish) corpus linguistics; cf.&nbsp;the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]],
 * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]],
 * extraction of linguistic knowledge from corpora,
 * information extraction,
 * sentiment analysis,
 * morphosyntactic system of Polish,
 * (Polish) corpus linguistics ([[http://nkjp.pl/|National Corpus of Polish]]) /* ; cf.&nbsp;the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]], */
 * morphosyntactic tagging and lemmatisation of Polish
 * syntactic and semantic parsing of Polish
 * extraction of linguistic knowledge from corpora
 * information extraction
 * distributional semantics and compositional distributional semantics
 * sentiment analysis
 * credibility assessment of online content
 * reference and discourse relations
Line 55: Line 69:
 * [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]])
 * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification)
 * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation)
 * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
 * [[http://dariah.pl/|DARIAH-PL]] (Digital Research Infrastructure for the Arts and Humanities)
 * [[http://clip.ipipan.waw.pl/CLARIN-PL-3|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]])
 * [[CORMETAN]] (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)
 * [[http://clip.ipipan.waw.pl/CURLICAT|CURLICAT]] (Curated Multilingual Language Resources for CEF AT)
 * [[http://korpus-dekady.ipipan.waw.pl|Korpus Dekady]] ([[http://dariah.pl/|DARIAH-PL]] — Digital Research Infrastructure for the Arts and Humanities)
 * [[http://clip.ipipan.waw.pl/ELE|ELE]] (European Language Equality)
 * [[http://clip.ipipan.waw.pl/ELG|ELG]] (European Language Grid)
Line 61: Line 76:
 * [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts)  * [[HOMADOS|HOMADOS]] (Hampering Misinformation by Assessing Credibility of Online Sources)
 * [[http://clip.ipipan.waw.pl/KORBA-2|KORBA 2]] (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")
Line 63: Line 79:
 * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies)  * [[http://clip.ipipan.waw.pl/MARCELL|MARCELL]] (Multilingual Resources for CEF.AT in the legal domain)
 * [[http://clip.ipipan.waw.pl/Nexus|Nexus Linguarum]] (European network for Web-centred linguistic data science)
Line 68: Line 85:
 * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS)
 * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]]
 * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]]
 * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]]
 * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources)
 * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation)
 * [[CLARIN|CLARIN]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]], see also [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL 2]])
 * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
 * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]]
 * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts)
 * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification)
 * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]
 * [[Information Extraction from Polish free text|Information Extraction from Polish free text]]
 * [[IPI PAN Corpus|IPI PAN Corpus of Polish]]
 * [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts)
 * [[LT4eL|LT4eL]] (Language Technology for eLearning)
 * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support
 * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet)
 * [[NKJP|NKJP]] (National Corpus of Polish)
 * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatic methods for recognising opinion targets and opinion expressions in Polish)
 * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
 * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies)
 * [[http://clip.ipipan.waw.pl/Readability|Readability]] (Measuring the degree of readability of nonliterary Polish texts)
 * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society)
 * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]]
Line 70: Line 111:
 * [[http://clip.ipipan.waw.pl/Readability|Readability]] (Measuring the degree of readability of nonliterary Polish texts)
 * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
 * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatic methods for recognising opinion targets and opinion expressions in Polish)
Line 74: Line 112:
 * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet),
 * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
 * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]],
 * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS),
 * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources),
 * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]],
 * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts),
 * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure),
 * [[NKJP|NKJP]] (National Corpus of Polish),
 * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]],
 * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
 * [[LT4eL|LT4eL]] (Language Technology for eLearning),
 * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]],
 * [[Information Extraction from Polish free text|Information Extraction from Polish free text]],
 * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]],
 * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]],
 * [[HPSG Grammar of Polish|HPSG Grammar of Polish]].
Line 98: Line 119:
 * [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]] – a DCG parser,  * [[http://morfeusz.sgjp.pl/|Morfeusz 2]] – a morphological analyser of Polish,
Line 100: Line 121:
 * [[http://zil.ipipan.waw.pl/%C5%9Awigra|Świgra]] – a DCG parser,
 * [[https://github.com/360er0/COMBO|COMBO]] – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling),
 * [[http://zil.ipipan.waw.pl/Concraft|Concraft]] — a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,
 * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish,
Line 101: Line 126:
 * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish,
Line 105: Line 129:
 * [[http://zil.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming),  * [[http://zil.ipipan.waw.pl/Anotatornia2/|Anotatornia 2]] – an annotation tool geared towards historical corpora,
Line 110: Line 134:
 * [[http://nlp.ipipan.waw.pl/PPJP/|etc.]]
Line 117: Line 141:
 * [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]].  * [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]],
 * Polish dependency banks: [[http://zil.ipipan.waw.pl/PDB|PDB]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PDB-UD_current|PDB-UD]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PUD-PL_current|PUD-PL]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/NKJP1M-UD_current|NKJP1M-UD]],
 * [[http://zil.ipipan.waw.pl/PDB/PDBparser|Dependency parsing models for Polish]].
Line 127: Line 153:
 * [[http://poleval.pl/|PolEval]], the evaluation campaign for natural language processing tools for Polish
Line 129: Line 156:
  * [[http://poltal.ipipan.waw.pl/|PolTAL 2014]] – 9th International Conference on Natural Language Processing, 17–19 September 2014, Warsaw, Poland
  * [[http://tlt14.ipipan.waw.pl/|TLT14]] – 14th International Workshop on Treebanks and Linguistic Theories, 11–12 December 2015, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/2016/|CORBON 2016]] – Coreference Resolution Beyond !OntoNotes workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US
  * [[http://headlex16.ipipan.waw.pl/|HeadLex16]] – Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, 24–29 July 2016, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/|CORBON 2017]] – 2nd Workshop on Coreference Resolution Beyond !OntoNotes at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain
  * [[http://anawiki.essex.ac.uk/dali/crac18/|CRAC: Computational Models of Reference, Anaphora, and Coreference]] at [[http://naacl2018.org/|NAACL 2017]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA
  * [[http://waw2018.argdiap.pl/argdiap-conference/|16th ArgDiaP conference]], part of the [[http://waw2018.argdiap.pl/|WAW 2018]] (Warsaw Argumentation Week), 15–16 September 2018, Warsaw
  * [[http://poltal.ipipan.waw.pl/|9th International Conference on Natural Language Processing]] (PolTAL 2014), 17–19 September 2014, Warsaw, Poland
  * [[http://tlt14.ipipan.waw.pl/|14th International Workshop on Treebanks and Linguistic Theories]] (TLT14), 11–12 December 2015, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/2016/|Coreference Resolution Beyond OntoNotes]] (CORBON 2016) workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US
  * [[http://headlex16.ipipan.waw.pl/|Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar]] (!HeadLex16), 24–29 July 2016, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/|2nd Workshop on Coreference Resolution Beyond OntoNotes]] (CORBON 2017) at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain
  * [[http://anawiki.essex.ac.uk/dali/crac18/|Computational Models of Reference, Anaphora, and Coreference]] workshop (CRAC) at [[http://naacl2018.org/|NAACL 2018]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA
  * [[https://nlpday.pl/|AI & NLP Workshop Day]], 19 October 2018, Warsaw
  * [[https://sites.google.com/view/crac2019/|Second Workshop on Computational Models of Reference, Anaphora and Coreference]] (CRAC 2019), 6 or 7 June 2019, Minneapolis
  * [[http://www.dynamicsoflanguage.edu.au/lfg-2019/|The 24th International Lexical-Functional Grammar Conference]] (LFG19), 8–10 July 2019, Canberra
  * [[https://lfg20.w.uib.no/|The 25th International Lexical-Functional Grammar Conference]] (LFG20), 23–25 June 2020, online
  * [[https://typo.uni-konstanz.de/lfg2021/|The 26th International Lexical-Functional Grammar Conference]] (LFG21), 13–15 July 2021, online


== Selected publications ==

<<BibMate(author, "Andrzejczuk", "Bartosiak", "Gawłowicz", "Hajnicz", "Kaczyński", "Kieraś", "Klimaszewski", "Kobyliński", "Krasnowska", "Marciniak", "Mykowiecka", "Nitoń", "Ogrodniczuk", "Patejuk", "Przepiórkowski", "Przybyła", "Rychlik", "Wawer", "Wojdyga", "Wołoszyn", "Woliński", "Wójtowicz", "Wróblewska", "Bolc")>>

The Linguistic Engineering Group

The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (IPI PAN).

People

Core team

Tomasz Bartosiak, MSc

tomasz.bartosiak@gmail.com

Diego Feinmann, PhD

diego.feinmann@ipipan.waw.pl

Elżbieta Hajnicz, PhD, Assoc. Prof.

elzbieta.hajnicz@ipipan.waw.pl

Witold Kieraś, PhD

witold.kieras@ipipan.waw.pl

Łukasz Kobyliński, PhD

lukasz.kobylinski@ipipan.waw.pl

Dorota Komosińska, MSc

dorota.komosinska@gmail.com

Katarzyna Krasnowska-Kieraś, MSc

katarzyna.krasnowska@ipipan.waw.pl

Małgorzata Marciniak, PhD, Assoc. Prof.

malgorzata.marciniak@ipipan.waw.pl

Agnieszka Mykowiecka, PhD, Assoc. Prof.

agnieszka.mykowiecka@ipipan.waw.pl

Maciej Ogrodniczuk, PhD, Assoc. Prof., Head of the Group

maciej.ogrodniczuk@ipipan.waw.pl

Agnieszka Patejuk, PhD

agnieszka.patejuk@ipipan.waw.pl

Adam Przepiórkowski, PhD, Full Prof.

adam.przepiorkowski@ipipan.waw.pl

Piotr Przybyła, PhD (on postdoctoral fellowship at UPF)

piotr.przybyla@ipipan.waw.pl

Michał Rudolf, PhD

michal@rudolf.waw.pl

Piotr Rychlik, PhD

piotr.rychlik@ipipan.waw.pl

Karolina Saputa, BEng

karolsaputa@gmail.com

Aleksandra Tomaszewska, PhD candidate

aleksandra.tomaszewska@ipipan.waw.pl

Aleksander Wawer, PhD

aleksander.wawer@ipipan.waw.pl

Marcin Woliński, PhD, Assoc. Prof.

marcin.wolinski@ipipan.waw.pl

Joanna Wołoszyn, PhD

joanna.woloszyn@ipipan.waw.pl

Alina Wróblewska, PhD

alina.wroblewska@ipipan.waw.pl

Sebastian Zawada, MSc

sebastian.zawada@ipipan.waw.pl

Natalia Zawadzka-Paluektau, PhD

natalia.zawadzka-paluektau@ipipan.waw.pl

Bartosz Żuk, PhD candidate

bartoszzuk.poczta@gmail.com

Associates

Anna Andrzejczuk, PhD (on leave)

anna.andrzejczuk@ipipan.waw.pl

Wiktor Eźlakowski, MSc

wiktor.ezlakowski@ipipan.waw.pl

Sonia Janicka

sonia.janicka@gmail.com

Mateusz Klimaszewski, MSc

mk.klimaszewski@gmail.com

Jakub Piskorski, PhD

jpiskorski@gmail.com

Piotr Rybak, MSc

piotr.cezary.rybak@gmail.com

Jakub Szymanik, PhD

jakub.szymanik@gmail.com

Ryszard Tuora, MSc

ryszardtuora@gmail.com

Grzegorz Wojdyga, MSc

g.wojdyga@ipipan.waw.pl

Beata Wójtowicz, PhD, Assoc. Prof.

beata.wojtowicz@ipipan.waw.pl

Research

The main research areas of the Group

  • (Polish) corpus linguistics (National Corpus of Polish)

  • morphosyntactic tagging and lemmatisation of Polish
  • syntactic and semantic parsing of Polish
  • extraction of linguistic knowledge from corpora
  • information extraction
  • distributional semantics and compositional distributional semantics
  • sentiment analysis
  • credibility assessment of online content
  • reference and discourse relations
  • generative linguistic formalisms, esp. HPSG and LFG.

The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.

Current externally funded projects

  • CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)

  • CORMETAN (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)

  • CURLICAT (Curated Multilingual Language Resources for CEF AT)

  • Korpus Dekady (DARIAH-PL — Digital Research Infrastructure for the Arts and Humanities)

  • ELE (European Language Equality)

  • ELG (European Language Grid)

  • ELRC (European Language Resource Coordination)

  • HOMADOS (Hampering Misinformation by Assessing Credibility of Online Sources)

  • KORBA 2 (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")

  • Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)

  • MARCELL (Multilingual Resources for CEF.AT in the legal domain)

  • Nexus Linguarum (European network for Web-centred linguistic data science)

  • Scwad (Compositional distributional modelling of Polish language semantics)

  • SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)

Some of our past projects

Publicly available tools and resources

Here are some of the tools and resources created within our projects. See the CLIP pages for a more exhaustive list of Polish tools and resources, including others developed at ZIL IPI PAN.

Some tools (all open source, under GPL; see also CLIP):

  • Morfeusz 2 – a morphological analyser of Polish,

  • Spejd – a shallow parsing and disambiguation system,

  • Świgra – a DCG parser,

  • COMBO – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling),

  • Concraft – a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,

  • PANTERA – a morphosyntactic tagger for Polish,

  • TaKIPI – a morphosyntactic tagger for Polish,

  • Poliqarp – a corpus indexing and search engine,

  • Poliqarp2 – a new generation corpus indexing and search engine,

  • Dendrarium – a treebank development system (under development),

  • Anotatornia 2 – an annotation tool geared towards historical corpora,

  • WSDDE – a system for designing and performing Word Sense Disambiguation experiments,

  • Multiservice – a web service for several of our tools,

  • TermoPL – multiword term extraction from text,

  • DSmodels – a web service for calculating word similarity using Polish word embeddings.

Main resources (many more at CLIP):

Other activities

Links to some other activities of the Group:

Selected publications

List of publications

2025

Adam Przepiórkowski and Agnieszka Patejuk. Prenominal adverbs, or apparent selectional violations in coordination. Linguistic Inquiry, Early Access:1–29, 2025.

Aleksandra Tomaszewska, Dariusz Czerski, Bartosz Żuk, and Maciej Ogrodniczuk. NeoN: A tool for automated detection, linguistic and LLM-driven analysis of neologisms in Polish. In Michael H. Lees, Wentong Cai, Siew Ann Cheong, Yi Su, David Abramson, Jack J. Dongarra, and Peter M. A. Sloot, editors, Computational Science – ICCS 2025, pages 318–326, Cham, 2025. Springer Nature Switzerland.

2024

Katarzyna Krasnowska-Kieraś and Marcin Woliński. Parsing headed constituencies. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12633–12643, Torino, Italy, 2024. ELRA and ICCL.

Adam Przepiórkowski, Magdalena Borysiak, Adam Okrasiński, Bartosz Pobożniak, Wojciech Stempniak, Kamil Tomaszek, and Adam Głowacki. Symmetric dependency structure of coordination: Crosslinguistic arguments from dependency length minimization. In Daniel Dakota, Sarah Jablotschkin, Sandra Kübler, and Heike Zinsmeister, editors, Proceedings of the 22nd Workshop on Treebanks and Linguistic Theories (TLT 2024), pages 11–22, Hamburg, Germany, 2024. Association for Computational Linguistics.

Adam Przepiórkowski, Katarzyna Kuś, Agnieszka Patejuk, and Berke Şenşekerci. You can depend on the symmetry of coordination and that NPs and CPs can be conjoined. Presentation delivered on 5 July 2024 at the “Form and Meaning of Coordination” workshop in Göttingen, Germany (https://www.uni-goettingen.de/de/685553.html), 2024.

Piotr Rybak, Piotr Przybyła, and Maciej Ogrodniczuk. PolQA: Polish question answering dataset. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12846–12855, Torino, Italy, 2024. ELRA and ICCL.

Agata Savary, Daniel Zeman, Verginica Barbu Mititelu, Anabela Barreiro, Olesea Caftanatov, Marie-Catherine de Marneffe, Kaja Dobrovoljc, Gülsen Eryiğit, Voula Giouli, Bruno Guillaume, Stella Markantonatou, Nurit Melnik, Joakim Nivre, Atul Kr. Ojha, Carlos Ramisch, Abigail Walsh, Beata Wójtowicz, and Alina Wróblewska. UniDive: A COST action on universality, diversity and idiosyncrasy in language technology. In Maite Melero, Sakriani Sakti, and Claudia Soria, editors, Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024, pages 372–382, Torino, Italy, 2024. ELRA and ICCL.

2023

Łukasz Kobyliński, Maciej Ogrodniczuk, Piotr Rybak, Piotr Przybyła, Piotr Pęzik, Agnieszka Mikołajczyk, Wojciech Janowski, Michał Marcińczuk, and Aleksander Smywiński-Pohl. PolEval 2022/23 challenge tasks and results. In Maria Ganzha, Leszek Maciaszek, Marcin Paprzycki, and Dominik Ślęzak, editors, Proceedings of the 18th Conference on Computer Science and Intelligence Systems, volume 35 of Annals of Computer Science and Information Systems, pages 1237–1244, 2023.

Katarzyna Krasnowska-Kieraś and Marcin Woliński. Constituency parsing with spines and attachments. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part I, number 14073 in Lecture Notes in Computer Science, pages 191–205, Cham, 2023. Springer Nature Switzerland.

Maciej Ogrodniczuk, editor. Analiza danych parlamentarnych. Warsztat pokonkursowy, Warsaw, 2023. Institute of Computer Science, Polish Academy of Sciences.

Agnieszka Patejuk. Coordination. In Mary Dalrymple, editor, Handbook of Lexical Functional Grammar, pages 309–374. Language Science Press, Berlin, 2023.

Adam Przepiórkowski and Michał Woźniak. Conjunct lengths in English, Dependency Length Minimization, and dependency structure of coordination. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15494–15512, Toronto, Canada, 2023. Association for Computational Linguistics.

Karol Saputa, Aleksandra Tomaszewska, Natalia Zawadzka-Paluektau, Witold Kieraś, and Łukasz Kobyliński. Korpusomat.eu: A multilingual platform for building and analysing linguistic corpora. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part II, number 14074 in Lecture Notes in Computer Science, pages 230–237, Cham, 2023. Springer Nature Switzerland.

Joanna Wołoszyn, Witold Kieraś, and Marcin Woliński. Sieć powiązań derywacyjnych na materiale Słownika gramatycznego języka polskiego: Propozycja klasyfikacji. LingVaria, 18(2):47–61, 2023.

2022

Włodzimierz Gruszczyński, Dorota Adamiec, Renata Bronikowska, Witold Kieraś, Emanuel Modrzejewski, Aleksandra Wieczorek, and Marcin Woliński. The electronic corpus of 17th- and 18th-century Polish texts. Language Resources and Evaluation, 56(1):309–332, 2022.

Maciej Ogrodniczuk and Katarzyna Kryńska. Evaluating Machine Translation of Latin Interjections in the Digital Library of Polish and Poland-related News Pamphlets. In Yuen-Hsien Tseng, Marie Katsurai, and Hoa N. Nguyen, editors, From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries. ICADL 2022, number 13636 in Lecture Notes in Computer Science, pages 430–439, Cham, 2022. Springer International Publishing.

Maciej Ogrodniczuk, Sameer Pradhan, Anna Nedoluzhko, Vincent Ng, and Massimo Poesio, editors. Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, Gyeongju, Republic of Korea, 2022. Association for Computational Linguistics.

Maciej Ogrodniczuk. Fine-tuning OCR error detection and correction in a Polish corpus of scientific abstracts. In Edward Szczerbicki, Krystian Wojtkiewicz, Sinh Van Nguyen, Marcin Pietranik, and Marek Krótkiewicz, editors, ACIIDS 2022: Recent Challenges in Intelligent Information and Database Systems, number 1716 in Communications in Computer and Information Science (CCIS), pages 450–461. Springer Nature Singapore, 2022.

Adam Przepiórkowski. Polyadic cover quantification in heterofunctional coordination. In Daniel Gutzmann and Sophie Repp, editors, Proceedings of Sinn und Bedeutung 26, pages 677–696, 2022.

Tamás Váradi, Marko Tadić, Svetla Koeva, Maciej Ogrodniczuk, Dan Tufiș, Radovan Garabík, Simon Krek, and Andraž Repar. Curated multilingual language resources for CEF AT (CURLICAT): Overall view. In Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, pages 339–340, Ghent, Belgium, 2022. European Association for Machine Translation.

2021

Małgorzata Marciniak, Agnieszka Mykowiecka, and Piotr Rychlik. Terminology/keyphrase extraction for creation of book indexes in Polish. In Gerd Berget, Mark Michael Hall, Daniel Brenn, and Sanna Kumpulainen, editors, Linking Theory and Practice of Digital Libraries, pages 49–54, Cham, 2021. Springer International Publishing.

Maciej Ogrodniczuk and Włodzimierz Gruszczyński. Embedding transcription and transliteration layers in the Digital Library of Polish and Poland-Related News Pamphlets. In Hao-Ren Ke, Chei Sian Lee, and Kazunari Sugiyama, editors, Towards Open and Trustworthy Digital Societies, pages 54–60, Cham, 2021. Springer International Publishing.

Maciej Ogrodniczuk and Łukasz Kobyliński, editors. Proceedings of the PolEval 2021 Workshop, Warsaw, 2021. Institute of Computer Science, Polish Academy of Sciences.

Maciej Ogrodniczuk and Piotr Przybyła. PolEval 2021 Task 4: Question Answering Challenge. In Maciej Ogrodniczuk and Łukasz Kobyliński, editors, Proceedings of the PolEval 2021 Workshop, pages 123–136, Warsaw, 2021. Institute of Computer Science, Polish Academy of Sciences.

Tiago Pimentel, Maria Ryskina, Sabrina J. Mielke, Shijie Wu, Eleanor Chodroff, Brian Leonard, Garrett Nicolai, Yustinus Ghanggo Ate, Salam Khalifa, Nizar Habash, Charbel El-Khaissi, Omer Goldman, Michael Gasser, William Lane, Matt Coler, Arturo Oncevay, Jaime Rafael Montoya Samame, Gema Celeste Silva Villegas, Adam Ek, Jean-Philippe Bernardy, Andrey Shcherbakov, Aziyana Bayyr-ool, Karina Sheifer, Sofya Ganieva, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Andrew Krizhanovsky, Natalia Krizhanovsky, Clara Vania, Sardana Ivanova, Aelita Salchak, Christopher Straughn, Zoey Liu, Jonathan North Washington, Duygu Ataman, Witold Kieraś, Marcin Woliński, Totok Suhardijanto, Niklas Stoehr, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Richard J. Hatcher, Emily Prud'hommeaux, Ritesh Kumar, Mans Hulden, Botond Barta, Dorina Lakatos, Gábor Szolnok, Judit Ács, Mohit Raj, David Yarowsky, Ryan Cotterell, Ben Ambridge, and Ekaterina Vylomova. SIGMORPHON 2021 shared task on morphological reinflection: Generalization across languages. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 229–259. Association for Computational Linguistics, 2021.

2020

Marcin Woliński and Witold Kieraś. Analiza fleksyjna tekstów historycznych i zmienność fleksji polskiej z perspektywy danych korpusowych. Poradnik Językowy, 8:66–80, 2020.

Alina Wróblewska, Katarzyna Krasnowska-Kieraś, and Piotr Rybak. Towards the evaluation of feature embedding models of the fusional languages. In Zygmunt Vetulani, Patrick Paroubek, and Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics, 8th Language and Technology Conference, LTC 2017, Poznań, Poland, November 17–19, 2017, Revised Selected Papers, number 12598 in Lecture Notes in Computer Science, pages 256–270, Cham, 2020. Springer International Publishing.

2019

Jakub Gąsior and Piotr Przybyła. The IPIPAN team participation in the check-worthiness task of the CLEF2019 CheckThat! Lab. In Linda Cappellato, Nicola Ferro, David E. Losada, and Henning Müller, editors, Working Notes of CLEF 2019 – Conference and Labs of the Evaluation Forum, Lugano, Switzerland, 2019. CEUR-WS.org.

Katarzyna Krasnowska-Kieraś and Łukasz Kobyliński. Part of speech tagging for Polish. Poznań Studies in Contemporary Linguistics, 55(2):211–237, 2019.

Maciej Ogrodniczuk, Rafał L. Górski, Marek Łaziński, and Piotr Pęzik. From the National Corpus of Polish to the Polish Corpus Infrastructure. Jazykovedný časopis, 70(2):315–323, 2019.

2018

Witold Kieraś and Marcin Woliński. Manually annotated corpus of Polish texts published between 1830 and 1918. In Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga, editors, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 3854–3859, Paris, France, 2018. European Language Resources Association (ELRA).

Łukasz Kobyliński, Michał Wasiluk, and Grzegorz Wojdyga. Improving part-of-speech tagging by meta-learning. In Petr Sojka, Aleš Horák, Ivan Kopeček, and Karel Pala, editors, Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings, number 11107 in Lecture Notes in Artificial Intelligence, pages 144–152. Springer-Verlag, 2018.

Małgorzata Marciniak, Agnieszka Mykowiecka, and Piotr Rychlik. Recognition of irrelevant phrases in automatically extracted lists of domain terms. Terminology, 24(1):66–90, 2018.

Agnieszka Mykowiecka, Małgorzata Marciniak, and Aleksander Wawer. Literal, metaphorical or both? Detecting metaphoricity in isolated adjective-noun phrases. In Beata Beigman Klebanov, Ekaterina Shutova, Patricia Lichtenstein, Smaranda Muresan, and Chee Wee, editors, Proceedings of the Workshop on Figurative Language Processing, pages 27–33. Association for Computational Linguistics, 2018.

Maciej Ogrodniczuk, Joanna Bilińska, Zbigniew Bronk, and Witold Kieraś. Multisłownik: Linking plWordNet-based lexical data for lexicography and educational purposes. In Francis Bond, Takayuki Kuribayashi, Christiane Fellbaum, and Piek Vossen, editors, Proceedings of the 9th Global WordNet Conference (GWC 2018), pages 368–375, Singapore, 2018. University of Tartu.

Piotr Rybak and Alina Wróblewska. Semi-supervised neural system for tagging, parsing and lemmatization. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 45–54. Association for Computational Linguistics, 2018.

Piotr Rybak and Alina Wróblewska. Semi-supervised neural system for tagging, parsing and lemmatization. Addendum. In Proceedings of the PolEval 2018 Workshop, pages 49–51. Institute of Computer Science, Polish Academy of Sciences, 2018.

Alina Wróblewska. Polish corpus of annotated descriptions of images. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 2141–2146. European Language Resources Association (ELRA), 2018.

Alina Wróblewska. Results of the PolEval 2018 Shared Task 1: Dependency Parsing. In Proceedings of the PolEval 2018 Workshop, pages 11–24. Institute of Computer Science, Polish Academy of Sciences, 2018.

2017

Tomasz Bartosiak. Shared forest representation of predicate-argument structures for shared syntactic forests. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 410–414, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Maciej Ogrodniczuk and Vincent Ng, editors. Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, 2017. Association for Computational Linguistics.

Maciej Ogrodniczuk and Bartłomiej Nitoń. Improving Polish mention detection with valency dictionary. In Maciej Ogrodniczuk and Vincent Ng, editors, Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), pages 17–23, Valencia, Spain, 2017. Association for Computational Linguistics.

Adam Przepiórkowski. Argumenty i modyfikatory w gramatyce i w słowniku. Wydawnictwa Uniwersytetu Warszawskiego, Warsaw, 2017.

Adam Przepiórkowski. On the argument–adjunct distinction in the Polish Semantic Syntax tradition. Cognitive Studies / Études Cognitives, 17:1–10, 2017.

Aleksander Wawer and Agnieszka Mykowiecka. Supervised and unsupervised word sense disambiguation on word embedding vectors of unambiguous synonyms. In Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications, pages 120–125. Association for Computational Linguistics, 2017.

Aleksander Wawer and Maciej Ogrodniczuk. Results of the PolEval 2017 competition: Sentiment Analysis shared task. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 406–409, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Alina Wróblewska, Katarzyna Krasnowska-Kieraś, and Piotr Rybak. Towards the evaluation of feature embedding models of the fusional languages. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 420–424, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

2016

Joanna Bilińska, Magdalena Derwojedowa, Witold Kieraś, and Monika Kwiecień. Mikrokorpus polszczyzny 1830-1918. Komunikacja specjalistyczna, 11:149–161, 2016.

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, and Magdalena Zawisławska. Polish Coreference Corpus. In Zygmunt Vetulani, Hans Uszkoreit, and Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics: 6th Language and Technology Conference, LTC 2013, Poznań, Poland, December 7-9, 2013. Revised Selected Papers, number 9561 in Lecture Notes in Artificial Intelligence, pages 215–226, Switzerland, 2016. Springer International Publishing.

2015

Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors. Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

Katarzyna Krasnowska and Adam Przepiórkowski. Combining various degrees of supervision in PP-attachment disambiguation. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 85–89, Poznań, Poland, 2015.

Katarzyna Krasnowska-Kieraś and Agnieszka Patejuk. Integrating Polish LFG with external morphology. In Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors, Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), pages 134–147, Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

Małgorzata Marciniak and Agnieszka Mykowiecka. Nested term recognition driven by word connection strength. Terminology, 2:180–204, 2015.

2014

Elżbieta Hajnicz. The procedure of lexico-semantic annotation of Składnica treebank. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2290–2297, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Verena Henrich, Erhard Hinrichs, Daniël de Kok, Petya Osenova, and Adam Przepiórkowski, editors. Proceedings of the Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT 13), Tübingen, 2014. Department of Linguistics (SfS), University of Tübingen.

Agnieszka Patejuk and Adam Przepiórkowski. Synergistic development of grammatical resources: A valence dictionary, an LFG grammar, and an LFG structure bank for Polish. In Verena Henrich, Erhard Hinrichs, Daniël de Kok, Petya Osenova, and Adam Przepiórkowski, editors, Proceedings of the Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT 13), pages 113–126, Tübingen, 2014. Department of Linguistics (SfS), University of Tübingen.

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, and Marcin Woliński. Extended phraseological information in a valence dictionary for NLP applications. In Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014), pages 83–91, Dublin, Ireland, 2014. Association for Computational Linguistics and Dublin City University.

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, Marcin Woliński, Filip Skwarski, and Marek Świdziński. Walenty: Towards a comprehensive valence dictionary of Polish. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2785–2792, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Alina Wróblewska. Polish Dependency Parser Trained on an Automatically Induced Dependency Bank. Ph.D. dissertation, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2014.

2013

Elżbieta Hajnicz. Mapping named entities from NKJP corpus to Składnica treebank and Polish WordNet. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 92–105, Berlin, Heidelberg, 2013. Springer-Verlag.

Elżbieta Hajnicz. Actualising lexico-semantic annotation of Składnica Treebank to modified versions of source resources. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 178–182, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors. Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, Berlin, Heidelberg, 2013. Springer-Verlag.

Katarzyna Krasnowska and Adam Przepiórkowski. Detecting syntactic errors in dependency treebanks for morphosyntactically rich languages. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 69–79, Berlin, Heidelberg, 2013. Springer-Verlag.

Barbara Lewandowska-Tomaszczyk, Rafał Górski, Marek Łaziński, and Adam Przepiórkowski. The National Corpus of Polish (NKJP). Language use and data analysis. In Irina Kor Chahine and Charles Zaremba, editors, Travaux de slavistique : Actes du VIe congrès de la Slavic Linguistic Society, pages 309–319. Presses Universitaires de Provence, 2013.

Maciej Ogrodniczuk and Michał Lenart. A multi-purpose online toolset for NLP applications. In Elisabeth Métais, Farid Meziane, Mohamed Saraee, Vijay Sugumaran, and Sunil Vadera, editors, Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems, number 7934 in Lecture Notes in Computer Science, pages 392–395. Springer-Verlag, Berlin, Heidelberg, 2013.

Piotr Przybyła. Question Classification for Polish Question Answering. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Proceedings of the 20th International Conference on Language Processing and Intelligent Information Systems (LP&IIS 2013), pages 50–56. Springer-Verlag, 2013.

2012

Szymon Acedański, Adam Slaski, and Adam Przepiórkowski. Machine learning of syntactic attachment from morphosyntactic and semantic co-occurrence statistics. In Proceedings of the ACL 2012 Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages, pages 42–47, Jeju, Republic of Korea, 2012. Association for Computational Linguistics.

Anna Andrzejczuk. Klasyfikacja onomazjologiczna rzeczowników a ich charakterystyka gramatyczna. Nowy sposób opracowania materiału leksykograficznego. PhD thesis, Instytut Języka Polskiego, Polska Akademia Nauk, Cracow, 2012.

Mateusz Kopeć, Rafał Młodzki, and Adam Przepiórkowski. Word Sense Disambiguation in the National Corpus of Polish. Prace Filologiczne, LXIII:155–165, 2012.

Mateusz Kopeć, Rafał Młodzki, and Adam Przepiórkowski. Automatyczne znakowanie sensami słów. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 209–224. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Mateusz Kopeć and Maciej Ogrodniczuk. Creating a Coreference Resolution System for Polish. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 192–195, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Barbara Lewandowska-Tomaszczyk, Mirosław Bańko, Rafał L. Górski, Marek Łaziński, Piotr Pęzik, and Adam Przepiórkowski. Narodowy Korpus Języka Polskiego: geneza i dzień dzisiejszy. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 3–10. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Maciej Ogrodniczuk. The Polish Sejm Corpus. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 2219–2223, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors. Narodowy Korpus Języka Polskiego. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Zygmunt Saloni, Marcin Woliński, Robert Wołosz, Włodzimierz Gruszczyński, and Danuta Skowrońska. Słownik gramatyczny języka polskiego. Warsaw, 2nd edition, 2012.

Marcin Woliński, Marcin Miłkowski, Maciej Ogrodniczuk, Adam Przepiórkowski, and Łukasz Szałkiewicz. PoliMorf: A (not so) new open morphological dictionary for Polish. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 860–864, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Marcin Woliński and Andrzej Zaborowski. An ambiguity aware treebank search tool. In Petr Sojka, Aleš Horák, Ivan Kopeček, and Karel Pala, editors, Text, Speech and Dialogue: 15th International Conference, TSD 2012, Brno, Czech Republic, number 7499 in Lecture Notes in Artificial Intelligence, pages 88–94. Springer-Verlag, Heidelberg, 2012.

2011

Anna Andrzejczuk. Dwoje urodzin to brzmi dziwnie. Norma językowa dotycząca połączeń rzeczowników PT z liczebnikami a jej realizacja w tekstach Narodowego Korpusu Języka Polskiego i w tekstach internetowych. Język Polski, XCI(4):273–283, 2011.

Maciej Ogrodniczuk and Mateusz Kopeć. Rule-based coreference resolution module for Polish. In Proceedings of the 8th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2011), pages 191–200, Faro, Portugal, 2011.

Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, Barbara Lewandowska-Tomaszczyk, Marek Łaziński, and Piotr Pęzik. National Corpus of Polish. In Zygmunt Vetulani, editor, Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 259–263, Poznań, Poland, 2011.

Marcin Woliński, Katarzyna Głowińska, and Marek Świdziński. A preliminary version of Składnica—a treebank of Polish. In Zygmunt Vetulani, editor, Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 299–303, Poznań, Poland, 2011.

2010

Włodzimierz Gruszczyński and Maciej Ogrodniczuk. Cyfrowa Biblioteka Druków Ulotnych Polskich i Polski dotyczących z XVI, XVII i XVIII w. w nauce i dydaktyce. In Materiały konferencji Polskie Biblioteki Cyfrowe 2010, pages 23–27, Poznań, Poland, 2010.

Małgorzata Marciniak, editor. Anotowany korpus dialogów telefonicznych. Akademicka Oficyna Wydawnicza EXIT, Warsaw, 2010.

Maciej Ogrodniczuk and Adam Przepiórkowski. Linguistic processing chains as Web Services: Initial linguistic considerations, 2010. CLARIN deliverable D5R-3a.

2009

Piotr Bański and Adam Przepiórkowski. Stand-off TEI annotation: the case of the National Corpus of Polish. In Proceedings of the Third Linguistic Annotation Workshop (LAW III) at ACL-IJCNLP 2009, pages 64–67, Singapore, 2009.

Núria Bel, Jonas Beskow, Lou Boves, Gerhard Budin, Nicoletta Calzolari, Khalid Choukri, Erhard Hinrichs, Steven Krauwer, Lothar Lemnitzer, Stelios Piperidis, Adam Przepiórkowski, Laurent Romary, Florian Schiel, Helmut Schmidt, Hans Uszkoreit, and Peter Wittenburg. Standardisation action plan for Clarin, 2009. State: Proposal to CLARIN Community; August 2009.

Łukasz Kobyliński and Krzysztof Walczak. Jumping emerging substrings in image classification. In X. Jiang and N. Petkov, editors, International Conference on Computer Analysis of Images and Patterns, number 5702 in Lecture Notes in Computer Science, pages 732–739. Springer-Verlag, 2009.

2008

Mieczysław A. Kłopotek, Adam Przepiórkowski, Sławomir T. Wierzchoń, and Krzysztof Trojanowski, editors. Intelligent Information Systems. Akademicka Oficyna Wydawnicza EXIT, Warsaw, 2008.

Łukasz Kobyliński and Krzysztof Walczak. Jumping emerging patterns with occurrence count in image classification. In T. Washio, E. Suzuki, K. M. Ting, and A. Inokuchi, editors, Pacific-Asia Conference on Knowledge Discovery and Data Mining, number 5012 in Lecture Notes in Artificial Intelligence, pages 904–909. Springer-Verlag, 2008.

2007

Anna Andrzejczuk. (Nie)tylko w liczbie mnogiej. Rozważania o szeroko rozumianych plurale tantum. LingVaria, 4(2):177–188, 2007. Cracow.

Łukasz Kobyliński and Krzysztof Walczak. Class association rules with occurrence count in image classification. TASK Quarterly, 11(1–2):35–45, 2007.

Agnieszka Mykowiecka and Małgorzata Marciniak. Information extraction from patients' free form documentation. In Proceedings of BioNLP 2007: Biological, translational, and clinical language processing ACL Workshop, 2007.

Adam Przepiórkowski and Aleksander Buczyński. ♠: Shallow Parsing and Disambiguation Engine. In Zygmunt Vetulani, editor, Proceedings of the 3rd Language & Technology Conference, pages 340–344, Poznań, Poland, 2007.

2006

Włodzimierz Gruszczyński, Zygmunt Saloni, Anna Andrzejczuk, Maciej Czupryniak, Laura Polkowska, and Marcin Woliński. Informacja gramatyczna i tablice fleksyjne. In Encyklopedia powszechna: Encyklopedyczny słownik języka polskiego od a do z. Uniwersalna encyklopedia od A do Z. Larousse Polska, Wrocław, 2006.

Adam Przepiórkowski. The potential of the IPI PAN Corpus. Poznań Studies in Contemporary Linguistics, 41:31–48, 2006.

2005

Agnieszka Mykowiecka, Małgorzata Marciniak, and Anna Kupść. Making shallow look deeper: Anaphora and comparisons in medical information extraction. In Zygmunt Vetulani, editor, Proceedings of the 2nd Language & Technology Conference, pages 225–229, Poznań, Poland, 2005.

Dariusz Piechociński and Agnieszka Mykowiecka. Question answering in Polish using shallow parsing. In Radovan Garabík, editor, Computer Treatment of Slavic and East European Languages: Proceedings of the Third International Seminar, Bratislava, Slovakia, 10–12 November 2005, pages 167–173, Bratislava, 2005. VEDA: Vydavateľstvo Slovenskej akadémie vied.

Adam Przepiórkowski. The IPI PAN Corpus in numbers. In Zygmunt Vetulani, editor, Proceedings of the 2nd Language & Technology Conference, pages 27–31, Poznań, Poland, 2005.

2004

Jakub Piskorski, Peter Homola, Małgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, and Marcin Woliński. Information extraction for Polish using the SProUT platform. In Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, and Krzysztof Trojanowski, editors, Intelligent Information Processing and Web Mining, Advances in Soft Computing, pages 227–236. Springer-Verlag, Berlin, 2004.

2003

Maciej Ogrodniczuk. Rozszerzenie opisów morfologicznych w tekstach korpusu „Słownika frekwencyjnego polszczyzny współczesnej”. In Roman Huszcza and Jadwiga Linde-Usiekniewicz, editors, Prace lingwistyczne dedykowane prof. Jadwidze Sambor, pages 164–168. Wydział Polonistyki Uniwersytetu Warszawskiego, Warsaw, 2003.

Adam Przepiórkowski and Marcin Woliński. A flexemic tagset for Polish. In Proceedings of Morphological Processing of Slavic Languages, EACL 2003, pages 33–40, Budapest, 2003.

2001

Adam Przepiórkowski and Piotr Bański, editors. Generative Linguistics in Poland: Syntax and Morphosyntax. Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2001.

2000

Piotr Bański and Adam Przepiórkowski, editors. Proceedings of the First Generative Linguistics in Poland Conference. Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2000.

1999

Adam Przepiórkowski. Case Assignment and the Complement-Adjunct Dichotomy: A Non-Configurational Constraint-Based Approach. Ph.D. dissertation, Universität Tübingen, 1999.

Adam Przepiórkowski. On case assignment and 'adjuncts as complements'. In Gert Webelhuth, Jean-Pierre Koenig, and Andreas Kathol, editors, Lexical and Constructional Aspects of Linguistic Explanation, pages 231–245. CSLI Publications, Stanford, CA, 1999.

1998

Leonard Bolc, Krzysztof Dziewicki, Piotr Rychlik, and Andrzej Szałas. Wnioskowanie w logikach nieklasycznych. Automatyzacja wnioskowania. Akademicka Oficyna Wydawnicza RM, Warsaw, 1998.

1997

Anna Kupść, Małgorzata Marciniak, and Leonard Bolc. Anaphor binding in Polish. An attempt at an HPSG account. IPI PAN Research Report 836, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1997.

Anna Kupść, Małgorzata Marciniak, and Agnieszka Mykowiecka. Komputerowe przetwarzanie języka naturalnego — wybrane zagadnienia. Informatyka, 1997.

Adam Przepiórkowski and Marek Świdziński. Polish verbal negation revisited: A metamorphosis vs. HPSG account. IPI PAN Research Report 829, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1997.

1995

Leonard Bolc, Krzysztof Dziewicki, Piotr Rychlik, and Andrzej Szałas. Wnioskowanie w logikach nieklasycznych. Podstawy teoretyczne. Akademicka Oficyna Wydawnicza RM, Warsaw, 1995.

1994

Adam Przepiórkowski. Critical review of approaches to multiple wh-movement. Research Paper EUCCS/RP-62, Centre for Cognitive Science, University of Edinburgh, 1994.

1989

Elżbieta Hajnicz. Formalizacja systemu wnioskowania o zależnościach czasowych między zdarzeniami. IPI PAN Research Report 658, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1989.

Elżbieta Hajnicz and Andrzej Pilitowski. Reprezentowanie w hierarchii dziedzin informacji zmieniającej się w czasie. IPI PAN Research Report 675, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1989.