The Linguistic Engineering Group

The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (IPI PAN).


Core team

Tomasz Bartosiak, MSc


Diego Feinmann, PhD


Elżbieta Hajnicz, PhD, Assoc. Prof.


Witold Kieraś, PhD


Łukasz Kobyliński, PhD


Dorota Komosińska, MSc


Katarzyna Krasnowska-Kieraś, MSc


Małgorzata Marciniak, PhD, Assoc. Prof.


Agnieszka Mykowiecka, PhD, Assoc. Prof.


Maciej Ogrodniczuk, PhD, Assoc. Prof., Head of the Group


Agnieszka Patejuk, PhD


Adam Przepiórkowski, PhD, Full Prof.


Piotr Przybyła, PhD (on postdoctoral fellowship at UPF)


Michał Rudolf, PhD


Piotr Rychlik, PhD


Aleksandra Tomaszewska, PhD candidate


Aleksander Wawer, PhD


Marcin Woliński, PhD, Assoc. Prof.


Joanna Wołoszyn, PhD


Alina Wróblewska, PhD


Sebastian Zawada, MSc


Natalia Zawadzka, PhD


Bartosz Żuk, MSc



Anna Andrzejczuk, PhD (on leave)


Wiktor Eźlakowski, MSc


Sonia Janicka


Mateusz Klimaszewski, MSc


Jakub Piskorski, PhD


Piotr Rybak, MSc


Karol Saputa, BEng


Jakub Szymanik, PhD


Ryszard Tuora, MSc


Grzegorz Wojdyga, MSc


Beata Wójtowicz, PhD, Assoc. Prof.



The main research areas of the Group

  • (Polish) corpus linguistics (National Corpus of Polish),

  • morphosyntactic tagging and lemmatisation of Polish,
  • syntactic and semantic parsing of Polish,
  • extraction of linguistic knowledge from corpora,
  • information extraction,
  • distributional semantics and compositional distributional semantics,
  • sentiment analysis,
  • credibility assessment of online content,

  • generative linguistic formalisms, esp. HPSG and LFG.

The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.

Current externally funded projects

  • CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)

  • CORMETAN (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)

  • CURLICAT (Curated Multilingual Language Resources for CEF AT)

  • Korpus Dekady (DARIAH-PL: Digital Research Infrastructure for the Arts and Humanities)

  • ELE (European Language Equality)

  • ELG (European Language Grid)

  • ELRC (European Language Resource Coordination)

  • HOMADOS (Hampering Misinformation by Assessing Credibility of Online Sources)

  • KORBA 2 (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")

  • Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)

  • MARCELL (Multilingual Resources for CEF.AT in the legal domain)

  • Nexus Linguarum (European network for Web-centred linguistic data science)

  • Scwad (Compositional distributional modelling of Polish language semantics)

  • SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)

Some of our past projects

Publicly available tools and resources

Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.

Some tools (all open source, under GPL; see also CLIP):

  • Morfeusz 2 – a morphological analyser of Polish,

  • Spejd – a shallow parsing and disambiguation system,

  • Świgra – a DCG parser,

  • COMBO – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling),

  • Concraft – a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,

  • PANTERA – a morphosyntactic tagger for Polish,

  • TaKIPI – a morphosyntactic tagger for Polish,

  • Poliqarp – a corpus indexing and search engine,

  • Poliqarp2 – a new generation corpus indexing and search engine,

  • Dendrarium – a treebank development system (under development),

  • Anotatornia 2 – an annotation tool geared towards historical corpora,

  • WSDDE – a system for designing and performing Word Sense Disambiguation experiments,

  • Multiservice – a web service for several of our tools,

  • TermoPL – a tool for extracting multiword terms from text,

  • DSmodels – a web service for calculating word similarity using Polish word embeddings.

Main resources (many more at CLIP):

Other activities

Links to some other activities of the Group:

Selected publications

List of publications


Adam Przepiórkowski, Julia Łukasiewicz-Pater, Katarzyna Kuś, and Bartosz Maćkiewicz. Heterofunctional coordination in German. To appear in the Journal of Comparative Germanic Linguistics, 2025.


Tomaž Erjavec, Matyáš Kopp, Nikola Ljubešić, Taja Kuzman, Paul Rayson, Petya Osenova, Maciej Ogrodniczuk, Çağrı Çöltekin, Danijel Koržinek, Katja Meden, Jure Skubic, Peter Rupnik, Tommaso Agnoloni, José Aires, Starkaður Barkarson, Roberto Bartolini, Núria Bel, María Calzada Pérez, Roberts Dargis, Sascha Diwersy, Maria Gavriilidou, Ruben van Heusden, Mikel Iruskieta, Neeme Kahusk, Anna Kryvenko, Noémi Ligeti-Nagy, Carmen Magariños, Martin Mölder, Costanza Navarretta, Kiril Simov, Lars Magne Tungland, Jouni Tuominen, John Vidler, Adina Ioana Vladu, Tanja Wissik, Väinö Yrjänäinen, and Darja Fišer. ParlaMint II: Advancing comparable parliamentary corpora across Europe. Language Resources and Evaluation, 2024.

Katarzyna Krasnowska-Kieraś and Marcin Woliński. Parsing headed constituencies. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12633–12643, Torino, Italy, 2024. ELRA and ICCL.

Maciej Ogrodniczuk. Towards including South African Hansard papers in the ParlaMint schema. Journal of the Digital Humanities Association of Southern Africa, 5(1), 2024.

Maciej Ogrodniczuk and Łukasz Kobyliński, editors. Proceedings of the PolEval 2024 Workshop, Warsaw, 2024. Institute of Computer Science, Polish Academy of Sciences.

Maciej Ogrodniczuk, Anna Nedoluzhko, Massimo Poesio, Sameer Pradhan, and Vincent Ng, editors. Proceedings of The Seventh Workshop on Computational Models of Reference, Anaphora and Coreference, Miami, 2024. Association for Computational Linguistics.

Maciej Ogrodniczuk, Aleksandra Tomaszewska, Daniel Ziembicki, Sebastian Żurowski, Ryszard Tuora, and Aleksandra Zwierzchowska. Polish Discourse Corpus (PDC): Corpus design, ISO-compliant annotation, data highlights, and parser development. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12829–12835, Torino, Italy, 2024. ELRA and ICCL.

Maciej Ogrodniczuk, Ryszard Tuora, and Beata Wójtowicz. Polish Round Table Corpus. In Darja Fišer, Maria Eskevich, and David Bordon, editors, Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024, pages 43–47, Torino, Italy, 2024. ELRA and ICCL.

Massimo Poesio, Maciej Ogrodniczuk, Vincent Ng, Sameer Pradhan, Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Amir Zeldes, Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský, and Daniel Zeman. Universal Anaphora: The first three years. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 17087–17100, Torino, Italy, 2024. ELRA and ICCL.

Adam Przepiórkowski. Coordination and binary branching. Syntax, Early View, 2024.

Adam Przepiórkowski, Magdalena Borysiak, and Adam Głowacki. An argument for symmetric coordination from Dependency Length Minimization: A replication study. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 1021–1033, Torino, Italy, 2024. ELRA and ICCL.

Adam Przepiórkowski, Magdalena Borysiak, Adam Okrasiński, Bartosz Pobożniak, Wojciech Stempniak, Kamil Tomaszek, and Adam Głowacki. Symmetric dependency structure of coordination: Crosslinguistic arguments from dependency length minimization. In Daniel Dakota, Sarah Jablotschkin, Sandra Kübler, and Heike Zinsmeister, editors, Proceedings of the 22nd Workshop on Treebanks and Linguistic Theories (TLT 2024), pages 11–22, Hamburg, Germany, 2024. Association for Computational Linguistics.

Adam Przepiórkowski. Case. In Stefan Müller, Anne Abeillé, Robert D. Borsley, and Jean-Pierre Koenig, editors, Head-Driven Phrase Structure Grammar: The Handbook, pages 261–294. Language Science Press, Berlin, 2nd edition, 2024.

Adam Przepiórkowski and Agnieszka Patejuk. Prenominal adverbs, or apparent selectional violations in coordination. Linguistic Inquiry, Just Accepted:1–58, 2024.

Piotr Rybak and Maciej Ogrodniczuk. Silver retriever: Advancing neural passage retrieval for Polish question answering. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 14826–14831, Torino, Italy, 2024. ELRA and ICCL.

Piotr Rybak, Piotr Przybyła, and Maciej Ogrodniczuk. PolQA: Polish question answering dataset. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12846–12855, Torino, Italy, 2024. ELRA and ICCL.

Karol Saputa, Angelika Peljak-Łapińska, and Maciej Ogrodniczuk. Polish Coreference Corpus as an LLM testbed: Evaluating coreference resolution within instruction-following language models by instruction–answer alignment. In Maciej Ogrodniczuk, Anna Nedoluzhko, Massimo Poesio, Sameer Pradhan, and Vincent Ng, editors, Proceedings of The Seventh Workshop on Computational Models of Reference, Anaphora and Coreference, pages 23–32, Miami, 2024. Association for Computational Linguistics.

Agata Savary, Daniel Zeman, Verginica Barbu Mititelu, Anabela Barreiro, Olesea Caftanatov, Marie-Catherine de Marneffe, Kaja Dobrovoljc, Gülsen Eryiğit, Voula Giouli, Bruno Guillaume, Stella Markantonatou, Nurit Melnik, Joakim Nivre, Atul Kr. Ojha, Carlos Ramisch, Abigail Walsh, Beata Wójtowicz, and Alina Wróblewska. UniDive: A COST action on universality, diversity and idiosyncrasy in language technology. In Maite Melero, Sakriani Sakti, and Claudia Soria, editors, Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024, pages 372–382, Torino, Italy, 2024. ELRA and ICCL.

Berke Şenşekerci and Adam Przepiórkowski. Coordination of unlikes in Turkish. In Miriam Butt, Jamie Y. Findlay, and Ida Toivonen, editors, The Proceedings of the LFG'24 Conference, pages 207–225. PubliKon, 2024.

Martyna Wiącek, Piotr Rybak, Łukasz Pszenny, and Alina Wróblewska. NLPre: A revised approach towards language-centric benchmarking of natural language preprocessing systems. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12271–12287, Torino, Italy, 2024. ELRA and ICCL.

Alina Wróblewska. Investigating large language models for their competence in extracting grammatically sound sentences from transcribed noisy utterances. In Libby Barak and Malihe Alikhani, editors, Proceedings of the 28th Conference on Computational Natural Language Learning, pages 10–23, Miami, FL, 2024. Association for Computational Linguistics.


Stanisław Bogdanowicz, Hanna Cwynar, Aleksandra Zwierzchowska, Cezary Klamra, Witold Kieraś, and Łukasz Kobyliński. TwitterEmo: Annotating emotions and sentiment in Polish twitter. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part II, number 14074 in Lecture Notes in Computer Science, pages 212–220, Cham, 2023. Springer Nature Switzerland.

Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana D. de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darģis, Orsolya Ring, Ruben van Heusden, Maarten Marx, and Darja Fišer. The ParlaMint corpora of parliamentary proceedings. Language Resources and Evaluation, 58:415–448, 2023.

Cezary Klamra, Katarzyna Kryńska, and Maciej Ogrodniczuk. Evaluating the use of generative LLMs for intralingual diachronic translation of Middle-Polish texts into contemporary Polish. In Dion H. Goh, Shu-Jiun Chen, and Suppawong Tuarob, editors, Leveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration. ICADL 2023, number 14457 in Lecture Notes in Computer Science, pages 18–27, Singapore, 2023. Springer Nature Singapore.

Łukasz Kobyliński, Maciej Ogrodniczuk, Piotr Rybak, Piotr Przybyła, Piotr Pęzik, Agnieszka Mikołajczyk, Wojciech Janowski, Michał Marcińczuk, and Aleksander Smywiński-Pohl. PolEval 2022/23 challenge tasks and results. In Maria Ganzha, Leszek Maciaszek, Marcin Paprzycki, and Dominik Ślęzak, editors, Proceedings of the 18th Conference on Computer Science and Intelligence Systems, volume 35 of Annals of Computer Science and Information Systems, pages 1237–1244, 2023.

Katarzyna Krasnowska-Kieraś and Marcin Woliński. Constituency parsing with spines and attachments. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part I, number 14073 in Lecture Notes in Computer Science, pages 191–205, Cham, 2023. Springer Nature Switzerland.

Małgorzata Marciniak, Piotr Rychlik, and Agnieszka Mykowiecka. TermoUD – a language-independent terminology extraction tool. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pages 178–186, Dubrovnik, Croatia, 2023. Association for Computational Linguistics.

Maciej Ogrodniczuk, editor. Analiza danych parlamentarnych. Warsztat pokonkursowy, Warsaw, 2023. Institute of Computer Science, Polish Academy of Sciences.

Maciej Ogrodniczuk, Vincent Ng, Sameer Pradhan, and Massimo Poesio, editors. Proceedings of The Sixth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2023), Singapore, 2023. Association for Computational Linguistics.

Maciej Ogrodniczuk, Piotr Pęzik, Marek Łaziński, and Marcin Miłkowski. Language Report Polish. In Georg Rehm and Andy Way, editors, European Language Equality: A Strategic Agenda for Digital Language Equality, pages 191–194. Springer International Publishing, Cham, 2023.

Agnieszka Patejuk. Coordination. In Mary Dalrymple, editor, Handbook of Lexical Functional Grammar, pages 309–374. Language Science Press, Berlin, 2023.

Agnieszka Patejuk and Adam Przepiórkowski. Category mismatches in coordination vindicated. Linguistic Inquiry, 54(2):326–349, 2023.

Piotr Pęzik, Agnieszka Mikołajczyk, Adam Wawrzyński, Filip Żarnecki, Bartłomiej Nitoń, and Maciej Ogrodniczuk. Transferable keyword extraction and generation with text-to-text language models. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part II, number 14074 in Lecture Notes in Computer Science, pages 398–405, Cham, 2023. Springer Nature Switzerland.

Jakub Piskorski, Michał Marcińczuk, Preslav Nakov, Maciej Ogrodniczuk, Senja Pollak, Pavel Přibáň, Piotr Rybak, Josef Steinberger, and Roman Yangarber, editors. Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023), Dubrovnik, Croatia, 2023. Association for Computational Linguistics.

Adam Przepiórkowski. LFG and HPSG. In Mary Dalrymple, editor, Handbook of Lexical Functional Grammar, pages 1861–1918. Language Science Press, Berlin, 2023.

Adam Przepiórkowski and Agnieszka Patejuk. Filling gaps with Glue. In Miriam Butt, Jamie Y. Findlay, and Ida Toivonen, editors, The Proceedings of the LFG'23 Conference, pages 223–240. PubliKon, 2023.

Adam Przepiórkowski and Michał Woźniak. Conjunct lengths in English, Dependency Length Minimization, and dependency structure of coordination. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15494–15512, Toronto, Canada, 2023. Association for Computational Linguistics.

Karol Saputa, Aleksandra Tomaszewska, Natalia Zawadzka-Paluektau, Witold Kieraś, and Łukasz Kobyliński. Korpusomat.eu: A multilingual platform for building and analysing linguistic corpora. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part II, number 14074 in Lecture Notes in Computer Science, pages 230–237, Cham, 2023. Springer Nature Switzerland.

Marcin Woliński, Alina Wróblewska, Małgorzata Marciniak, Katarzyna Krasnowska-Kieraś, and Wiktor Eźlakowski. O konstrukcji …, ale nie… i podobnych w języku polskim. Język Polski, CIII(4):5–21, 2023.

Joanna Wołoszyn, Witold Kieraś, and Marcin Woliński. Sieć powiązań derywacyjnych na materiale Słownika gramatycznego języka polskiego: Propozycja klasyfikacji. LingVaria, 18(2):47–61, 2023.

Zdeněk Žabokrtský, Miloslav Konopik, Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk, Martin Popel, Ondrej Prazak, Jakub Sido, and Daniel Zeman. Findings of the second shared task on multilingual coreference resolution. In Zdeněk Žabokrtský and Maciej Ogrodniczuk, editors, Proceedings of the CRAC 2023 Shared Task on Multilingual Coreference Resolution, pages 1–18, Singapore, 2023. Association for Computational Linguistics.

Zdeněk Žabokrtský and Maciej Ogrodniczuk, editors. Proceedings of the CRAC 2023 Shared Task on Multilingual Coreference Resolution, Singapore, 2023. Association for Computational Linguistics.

Sebastian Żurowski, Daniel Ziembicki, Aleksandra Tomaszewska, Maciej Ogrodniczuk, and Agata Drozd. Adopting ISO 24617-8 for discourse relations annotation in Polish: Challenges and future directions. In Sara Carvalho, Anas Fahad Khan, Ana Ostroski Anić, Blerina Spahiu, Jorge Gracia, John P. McCrae, Dagmar Gromann, Barbara Heinisch, and Ana Castro Salgado, editors, Proceedings of the 4th Conference on Language, Data and Knowledge, pages 482–492, Vienna, Austria, 2023. NOVA CLUNL, Portugal.


Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Abbott Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay, Juan López Bautista, Gema Celeste Silva Villegas, Lucas Torroba Hennigen, Adam Ek, David Guriel, Peter Dirix, Jean-Philippe Bernardy, Andrey Scherbakov, Aziyana Bayyr-ool, Antonios Anastasopoulos, Roberto Zariquiey, Karina Sheifer, Sofya Ganieva, Hilaria Cruz, Ritván Karahóǧa, Stella Markantonatou, George Pavlidis, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Candy Angulo, Jatayu Baxi, Andrew Krizhanovsky, Natalia Krizhanovskaya, Elizabeth Salesky, Clara Vania, Sardana Ivanova, Jennifer White, Rowan Hall Maudslay, Josef Valvoda, Ran Zmigrod, Paula Czarnowska, Irene Nikkarinen, Aelita Salchak, Brijesh Bhatt, Christopher Straughn, Zoey Liu, Jonathan North Washington, Yuval Pinter, Duygu Ataman, Marcin Woliński, Totok Suhardijanto, Anna Yablonskaya, Niklas Stoehr, Hossep Dolatian, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Aryaman Arora, Richard J. Hatcher, Ritesh Kumar, Jeremiah Young, Daria Rodionova, Anastasia Yemelina, Taras Andrushko, Igor Marchenko, Polina Mashkovtseva, Alexandra Serova, Emily Prud'hommeaux, Maria Nepomniashchaya, Fausto Giunchiglia, Eleanor Chodroff, Mans Hulden, Miikka Silfverberg, Arya D. McCarthy, David Yarowsky, Ryan Cotterell, Reut Tsarfaty, and Ekaterina Vylomova. UniMorph 4.0: Universal Morphology. In Proceedings of the Language Resources and Evaluation Conference, pages 840–855, Marseille, France, 2022. European Language Resources Association.

Włodzimierz Gruszczyński, Dorota Adamiec, Renata Bronikowska, Witold Kieraś, Emanuel Modrzejewski, Aleksandra Wieczorek, and Marcin Woliński. The electronic corpus of 17th- and 18th-century Polish texts. Language Resources and Evaluation, 56(1):309–332, 2022.

Elżbieta Hajnicz. Annotation of metaphorical expressions in the Basic Corpus of Polish Metaphors. In Proceedings of the Language Resources and Evaluation Conference, pages 5648–5653, Marseille, France, 2022. European Language Resources Association.

Wojciech Jaworski, Przemysław Biecek, Adam Dobrakowski, Małgorzata Marciniak, Agnieszka Mykowiecka, Agnieszka Morusiewicz, Joanna Przetacka, and Łukasz Kamiński. Supporting doctor’s decisions based on electronic medical documentation in Polish. In MEDINFO 2021: One World, One Health – Global Partnership for Digital Innovation, pages 1076–1077, 2022.

Cezary Klamra, Grzegorz Wojdyga, Sebastian Żurowski, Paulina Rosalska, Matylda Kozłowska, and Maciej Ogrodniczuk. Devulgarization of Polish texts using pre-trained language models. In Derek Groen, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M. A. Sloot, editors, Computational Science – ICCS 2022, number 13351 in Lecture Notes in Computer Science, pages 49–55. Springer International Publishing, Cham, 2022.

Łukasz Kobyliński, Maciej Ogrodniczuk, Jan Kocoń, Michał Marcińczuk, Aleksander Smywiński-Pohl, Krzysztof Wołk, Danijel Koržinek, Michal Ptaszynski, Agata Pieciukiewicz, and Paweł Dybała. Evaluating Natural Language Processing tools for Polish during PolEval 2019. In Human Language Technology. Challenges for Computer Science and Linguistics, pages 303–321, Cham, 2022. Springer International Publishing.

Ewa Kozioł-Chrzanowska, Anna Niepytalska-Osiecka, Justyna Zandberg-Malec, and Maciej Ogrodniczuk. Prosty język jako gra zespołowa: refleksje trenera, językoznawcy, praktyka. Poradnik Językowy, 8:11–21, 2022.

Maciej Ogrodniczuk and Katarzyna Kryńska. Evaluating Machine Translation of Latin Interjections in the Digital Library of Polish and Poland-related News Pamphlets. In Yuen-Hsien Tseng, Marie Katsurai, and Hoa N. Nguyen, editors, From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries. ICADL 2022, number 13636 in Lecture Notes in Computer Science, pages 430–439, Cham, 2022. Springer International Publishing.

Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, and Katja Meden. ParlaMint II: The show must go on. In Proceedings of The Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference, pages 1–6, Marseille, France, 2022. European Language Resources Association.

Maciej Ogrodniczuk, Sameer Pradhan, Anna Nedoluzhko, Vincent Ng, and Massimo Poesio, editors. Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, Gyeongju, Republic of Korea, 2022. Association for Computational Linguistics.

Maciej Ogrodniczuk, Michał Rudolf, Beata Wójtowicz, and Sonia Janicka. Error correction environment for the Polish Parliamentary Corpus. In Proceedings of The Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference, pages 35–38, Marseille, France, 2022. European Language Resources Association.

Maciej Ogrodniczuk. Fine-tuning OCR error detection and correction in a Polish corpus of scientific abstracts. In Edward Szczerbicki, Krystian Wojtkiewicz, Sinh Van Nguyen, Marcin Pietranik, and Marek Krótkiewicz, editors, ACIIDS 2022: Recent Challenges in Intelligent Information and Database Systems, number 1716 in Communications in Computer and Information Science (CCIS), pages 450–461. Springer Nature Singapore, 2022.

Piotr Pęzik, Agnieszka Mikołajczyk, Adam Wawrzyński, Bartłomiej Nitoń, and Maciej Ogrodniczuk. Keyword extraction from short texts with a text-to-text Transfer Transformer. In Edward Szczerbicki, Krystian Wojtkiewicz, Sinh Van Nguyen, Marcin Pietranik, and Marek Krótkiewicz, editors, ACIIDS 2022: Recent Challenges in Intelligent Information and Database Systems, number 1716 in Communications in Computer and Information Science (CCIS), pages 530–542. Springer Nature Singapore, 2022.

Adam Przepiórkowski. Coordination of unlike grammatical cases (and unlike categories). Language, 98(3):592–634, 2022.

Adam Przepiórkowski. Polyadic cover quantification in heterofunctional coordination. In Daniel Gutzmann and Sophie Repp, editors, Proceedings of Sinn und Bedeutung 26, pages 677–696, 2022.

Adam Przepiórkowski. A compositional intersective account of heterofunctional coordination. In John R. Starr, Juhyae Kim, and Burak Öney, editors, Proceedings of Semantics and Linguistic Theory 32, pages 270–293, 2022.

Piotr Rychlik, Małgorzata Marciniak, and Agnieszka Mykowiecka. TermoPL: A tool for extracting and clustering domain related terms. In Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries, pages 1–4, New York, NY, USA, 2022. Association for Computing Machinery.

Jakub Szymanik and Witold Kieraś. The semantically annotated corpus of Polish quantificational expressions. Language Resources and Evaluation, 2022. Forthcoming.

Tamás Váradi, Bence Nyéki, Svetla Koeva, Marko Tadić, Vanja Štefanec, Maciej Ogrodniczuk, Bartłomiej Nitoń, Piotr Pęzik, Verginica Barbu Mititelu, Elena Irimia, Maria Mitrofan, Vasile Păiș, Dan Tufiș, Radovan Garabík, Simon Krek, and Andraž Repar. Introducing the CURLICAT corpora: Seven-language domain specific annotated corpora from curated sources. In Proceedings of the Language Resources and Evaluation Conference, pages 100–108, Marseille, France, 2022. European Language Resources Association.

Tamás Váradi, Marko Tadić, Svetla Koeva, Maciej Ogrodniczuk, Dan Tufiș, Radovan Garabík, Simon Krek, and Andraž Repar. Curated multilingual language resources for CEF AT (CURLICAT): Overall view. In Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, pages 339–340, Ghent, Belgium, 2022. European Association for Machine Translation.

Aleksander Wawer, Małgorzata Marciniak, and Agnieszka Mykowiecka. Neural nets in detecting word level metaphors in Polish. In Zygmunt Vetulani, Patrick Paroubek, and Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics, pages 277–288, Cham, 2022. Springer International Publishing.

Marcin Woliński, Bartłomiej Nitoń, Witold Kieraś, and Jakub Szymanik. HerBERT based language model detects quantifiers and their semantic properties in Polish. In Proceedings of the Language Resources and Evaluation Conference, pages 7140–7146, Marseille, France, 2022. European Language Resources Association.

Zdeněk Žabokrtský, Miloslav Konopík, Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk, Martin Popel, Ondřej Pražák, Jakub Sido, Daniel Zeman, and Yilun Zhu. Findings of the Shared Task on Multilingual Coreference Resolution. In Zdeněk Žabokrtský and Maciej Ogrodniczuk, editors, Proceedings of the CRAC 2022 Shared Task on Multilingual Coreference Resolution, pages 1–17, Gyeongju, Republic of Korea, 2022. Association for Computational Linguistics.

Zdeněk Žabokrtský and Maciej Ogrodniczuk, editors. Proceedings of the CRAC 2022 Shared Task on Multilingual Coreference Resolution, Gyeongju, Republic of Korea, 2022. Association for Computational Linguistics.


Adam Gabriel Dobrakowski, Agnieszka Mykowiecka, Małgorzata Marciniak, Wojciech Jaworski, and Przemysław Biecek. Interpretable segmentation of medical free-text records based on word embeddings. Journal of Intelligent Information Systems, 2021.

Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Andrej Pančur, Nikola Ljubešić, Tommaso Agnoloni, Starkaður Barkarson, María Calzada Pérez, Çağrı Çöltekin, Matthew Coole, Roberts Darģis, Luciana de Macedo, Jesse de Does, Katrien Depuydt, Sascha Diwersy, Dorte Haltrup Hansen, Matyáš Kopp, Tomas Krilavičius, Giancarlo Luxardo, Maarten Marx, Vaidas Morkevičius, Costanza Navarretta, Paul Rayson, Orsolya Ring, Michał Rudolf, Kiril Simov, Steinþór Steingrímsson, István Üveges, Ruben van Heusden, and Giulia Venturi. ParlaMint: Comparable Corpora of European Parliamentary Data. In Monica Monachini and Maria Eskevich, editors, CLARIN Annual Conference 2021: Proceedings, pages 20–25, Utrecht, The Netherlands, 2021. CLARIN ERIC.

Elżbieta Hajnicz, Anna Andrzejczuk, and Tomasz Bartosiak. Słownik walencyjny języka polskiego Walenty. Część druga – semantyka. Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2021.

Wojciech Jaworski, Małgorzata Marciniak, and Agnieszka Mykowiecka. Side effect alerts generation from EHR in Polish. In Maciej Paszynski, Dieter Kranzlmüller, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2021, pages 634–647, Cham, 2021. Springer International Publishing.

Witold Kieraś and Łukasz Kobyliński. Korpusomat – stan obecny i przyszłość projektu. Język Polski, CI(2):49–58, 2021.

Witold Kieraś, Marcin Woliński, and Bartłomiej Nitoń. Nowe wielowarstwowe znakowanie lingwistyczne zrównoważonego Narodowego Korpusu Języka Polskiego. Język Polski, CI(2):59–70, 2021.

Mateusz Klimaszewski and Alina Wróblewska. COMBO: A new module for EUD parsing. In Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies (IWPT 2021), pages 158–166. Association for Computational Linguistics, 2021.

Mateusz Klimaszewski and Alina Wróblewska. COMBO: State-of-the-art morphosyntactic analysis. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 50–62, Online and Punta Cana, Dominican Republic, 2021. Association for Computational Linguistics.

Małgorzata Marciniak, Agnieszka Mykowiecka, and Piotr Rychlik. Terminology/keyphrase extraction for creation of book indexes in Polish. In Gerd Berget, Mark Michael Hall, Daniel Brenn, and Sanna Kumpulainen, editors, Linking Theory and Practice of Digital Libraries, pages 49–54, Cham, 2021. Springer International Publishing.

Robert Mroczkowski, Piotr Rybak, Alina Wróblewska, and Ireneusz Gawlik. HerBERT: Efficiently pretrained transformer-based language model for Polish. In Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing, pages 1–10, Kyiv, Ukraine, 2021. Association for Computational Linguistics.

Maciej Ogrodniczuk and Włodzimierz Gruszczyński. Embedding transcription and transliteration layers in the Digital Library of Polish and Poland-Related News Pamphlets. In Hao-Ren Ke, Chei Sian Lee, and Kazunari Sugiyama, editors, Towards Open and Trustworthy Digital Societies, pages 54–60, Cham, 2021. Springer International Publishing.

Maciej Ogrodniczuk and Łukasz Kobyliński, editors. Proceedings of the PolEval 2021 Workshop, Warsaw, 2021. Institute of Computer Science, Polish Academy of Sciences.

Maciej Ogrodniczuk, Sameer Pradhan, Massimo Poesio, Yulia Grishina, and Vincent Ng, editors. Proceedings of the Fourth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2021), Punta Cana, Dominican Republic, 2021. Association for Computational Linguistics.

Maciej Ogrodniczuk and Piotr Przybyła. PolEval 2021 Task 4: Question Answering Challenge. In Maciej Ogrodniczuk and Łukasz Kobyliński, editors, Proceedings of the PolEval 2021 Workshop, pages 123–136, Warsaw, 2021. Institute of Computer Science, Polish Academy of Sciences.

Agnieszka Patejuk and Adam Przepiórkowski. Predicative adverbs: Evidence from Polish. Linguistic Inquiry, 52(4):835–851, 2021.

Tiago Pimentel, Maria Ryskina, Sabrina J. Mielke, Shijie Wu, Eleanor Chodroff, Brian Leonard, Garrett Nicolai, Yustinus Ghanggo Ate, Salam Khalifa, Nizar Habash, Charbel El-Khaissi, Omer Goldman, Michael Gasser, William Lane, Matt Coler, Arturo Oncevay, Jaime Rafael Montoya Samame, Gema Celeste Silva Villegas, Adam Ek, Jean-Philippe Bernardy, Andrey Shcherbakov, Aziyana Bayyr-ool, Karina Sheifer, Sofya Ganieva, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Andrew Krizhanovsky, Natalia Krizhanovsky, Clara Vania, Sardana Ivanova, Aelita Salchak, Christopher Straughn, Zoey Liu, Jonathan North Washington, Duygu Ataman, Witold Kieraś, Marcin Woliński, Totok Suhardijanto, Niklas Stoehr, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Richard J. Hatcher, Emily Prud'hommeaux, Ritesh Kumar, Mans Hulden, Botond Barta, Dorina Lakatos, Gábor Szolnok, Judit Ács, Mohit Raj, David Yarowsky, Ryan Cotterell, Ben Ambridge, and Ekaterina Vylomova. SIGMORPHON 2021 shared task on morphological reinflection: Generalization across languages. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 229–259. Association for Computational Linguistics, 2021.

Adam Przepiórkowski. Frazemowość narzędnikowych form liczebnikowych na -u. Język Polski, CI(3):5–15, 2021.

Adam Przepiórkowski. Case. In Stefan Müller, Anne Abeillé, Robert D. Borsley, and Jean-Pierre Koenig, editors, Head-Driven Phrase Structure Grammar: The Handbook, pages 245–274. Language Science Press, Berlin, 2021.

Adam Przepiórkowski. Polyadic quantification in hybrid coordination. In Stefan Müller and Nurit Melnik, editors, Proceedings of the HPSG 2021 Conference, pages 144–164. Frankfurt/Main University Library, 2021.

Adam Przepiórkowski. Three improvements to the HPSG model theory. In Stefan Müller and Nurit Melnik, editors, Proceedings of the HPSG 2021 Conference, pages 165–185. Frankfurt/Main University Library, 2021.

Adam Przepiórkowski and Agnieszka Patejuk. Coordinate structures without syntactic categories. In I Wayan Arka, Ash Asudeh, and Tracy Holloway King, editors, Modular Design of Grammar: Linguistics on the Edge, pages 205–220. Oxford University Press, Oxford, 2021.

Adam Przepiórkowski, Julia Pater, and Maciej Pastwa. O dystrybucji synonimicznych przyimków i operatorów adnumeratywnych. Język Polski, CI(1):5–21, 2021.

Ryszard Tuora, Adam Przepiórkowski, and Aleksander Leczkowski. Comparing learnability of two dependency schemes: ‘semantic’ (UD) and ‘syntactic’ (SUD). In Findings of the Association for Computational Linguistics: EMNLP 2021. Association for Computational Linguistics, 2021.

Marcin Woliński and Elżbieta Hajnicz. Składnica: a constituency treebank of Polish harmonised with the Walenty valency dictionary. Language Resources and Evaluation, 55:209–239, 2021.

Magdalena Zawisławska, Maciej Ogrodniczuk, and Michał Szczyszek. Indirect relations and frames: Coreference in context. In Tadeusz Ciecierski and Paweł Grabarczyk, editors, Context Dependence in Language, Action, and Cognition, pages 229–246. De Gruyter, Berlin/Boston, 2021.


Mary Dalrymple, Agnieszka Patejuk, and Mark-Matthias Zymla. XLE+Glue – A new tool for integrating semantic analysis in XLE. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'20 Conference, pages 89–108, Stanford, CA, 2020. CSLI Publications.

Adam Gabriel Dobrakowski, Agnieszka Mykowiecka, Małgorzata Marciniak, Wojciech Jaworski, and Przemysław Biecek. Interpretable segmentation of medical free-text records based on word embeddings. In Denis Helic, Gerhard Leitner, Martin Stettinger, Alexander Felfernig, and Zbigniew W. Raś, editors, Foundations of Intelligent Systems – 25th International Symposium, ISMIS 2020, Graz, Austria, September 23–25, 2020, Proceedings, volume 12117 of Lecture Notes in Computer Science, pages 45–55. Springer, 2020.

Elżbieta Hajnicz. Interannotator agreement for lexico-semantic annotation of a corpus. In Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the 12th Language Resources and Evaluation Conference (LREC-2020), pages 1842–1848. European Language Resources Association, 2020.

Małgorzata Marciniak, Piotr Rychlik, and Agnieszka Mykowiecka. Supporting terminology extraction with dependency parses. In Proceedings of the 6th International Workshop on Computational Terminology, pages 72–79, Marseille, France, May 2020. European Language Resources Association.

Agnieszka Mykowiecka and Małgorzata Marciniak. Are white ravens ever white? – non-literal adjective-noun phrases in Polish. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 5871–5877, Marseille, France, May 2020. European Language Resources Association.

Maciej Ogrodniczuk and Włodzimierz Gruszczyński. Wikipedia-based entity linking for the digital library of Polish and Poland-related news pamphlets. In Emi Ishita, Natalie Lee San Pang, and Lihong Zhou, editors, Digital Libraries at Times of Massive Societal Transition, pages 81–88, Cham, 2020. Springer International Publishing.

Maciej Ogrodniczuk and Łukasz Kobyliński, editors. Proceedings of the PolEval 2020 Workshop, Warsaw, 2020. Institute of Computer Science, Polish Academy of Sciences.

Maciej Ogrodniczuk, Vincent Ng, Yulia Grishina, and Sameer Pradhan, editors. Proceedings of the Third Workshop on Computational Models of Reference, Anaphora and Coreference, Barcelona, Spain (online), 2020. Association for Computational Linguistics.

Maciej Ogrodniczuk and Bartłomiej Nitoń. New developments in the Polish Parliamentary Corpus. In Darja Fišer, Maria Eskevich, and Franciska de Jong, editors, Proceedings of the Second ParlaCLARIN Workshop, pages 1–4, Marseille, France, 2020. European Language Resources Association (ELRA).

Adam Przepiórkowski and Agnieszka Patejuk. From Lexical Functional Grammar to enhanced Universal Dependencies: The UD-LFG treebank of Polish. Language Resources and Evaluation, 54:185–221, 2020.

Adam Przepiórkowski and Agnieszka Patejuk. Predicative adverbs and adjectives with infinitival subjects: A corpus investigation. Studies in Polish Linguistics, 15(3):129–150, 2020.

Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiļjevs, Gerhard Backfried, Christoph Prinz, Jose Manuel Gomez-Perez, Luc Meertens, Paul Lukowicz, Josef van Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim Köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriūtė, Núria Bel, António Branco, Gerhard Budin, Walter Daelemans, Koenraad De Smedt, Radovan Garabík, Maria Gavriilidou, Dagmar Gromann, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Jan Odijk, Maciej Ogrodniczuk, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Pedersen, Inguna Skadina, Marko Tadić, Dan Tufiș, Tamás Váradi, Kadri Vider, Andy Way, and François Yvon. The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe. In Proceedings of The 12th Language Resources and Evaluation Conference, pages 3322–3332, Marseille, France, 2020. European Language Resources Association (ELRA).

Tamás Váradi, Svetla Koeva, Martin Yamalov, Marko Tadić, Bálint Sass, Bartłomiej Nitoń, Maciej Ogrodniczuk, Piotr Pęzik, Verginica Barbu Mititelu, Radu Ion, Elena Irimia, Maria Mitrofan, Vasile Păiș, Dan Tufiș, Radovan Garabík, Simon Krek, Andraž Repar, Matjaž Rihtar, and Janez Brank. The MARCELL legislative corpus. In Proceedings of The 12th Language Resources and Evaluation Conference, pages 3761–3768, Marseille, France, 2020. European Language Resources Association (ELRA).

Marcin Woliński and Witold Kieraś. Analiza fleksyjna tekstów historycznych i zmienność fleksji polskiej z perspektywy danych korpusowych. Poradnik Językowy, 8:66–80, 2020.

Marcin Woliński, Witold Kieraś, Dorota Komosińska, and Włodzimierz Gruszczyński. Results of the PolEval 2020 shared task 2: Morphosyntactic tagging of Middle, New and Modern Polish. In Proceedings of the PolEval 2020 Workshop, pages 39–46, Warsaw, 2020. Institute of Computer Science, Polish Academy of Sciences.

Alina Wróblewska. Towards the Conversion of National Corpus of Polish to Universal Dependencies. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 5308–5315, Marseille, France, 2020. European Language Resources Association (ELRA).

Alina Wróblewska, Katarzyna Krasnowska-Kieraś, and Piotr Rybak. Towards the evaluation of feature embedding models of the fusional languages. In Zygmunt Vetulani, Patrick Paroubek, and Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics, 8th Language and Technology Conference, LTC 2017, Poznań, Poland, November 17–19, 2017, Revised Selected Papers, number 12598 in Lecture Notes in Computer Science, pages 256–270, Cham, 2020. Springer International Publishing.

Deniz Zeyrek, Amália Mendes, Yulia Grishina, Murathan Kurfalı, Samuel Gibbon, and Maciej Ogrodniczuk. TED Multilingual Discourse Bank (TED-MDB): A parallel corpus annotated in the PDTB style. Language Resources and Evaluation, 54(2):587–613, 2020.


Cleo Condoravdi, Mary Dalrymple, Dag Haug, and Adam Przepiórkowski. Modification of DPs by epistemic adverbs. In Katherine Blake, Forrest Davis, Kaelyn Lamp, and Joseph Rhyne, editors, Proceedings of the 29th Semantics and Linguistic Theory Conference (SALT 29), pages 477–495, 2019.

Jakub Gąsior and Piotr Przybyła. The IPIPAN team participation in the check-worthiness task of the CLEF2019 CheckThat! Lab. In Linda Cappellato, Nicola Ferro, David E. Losada, and Henning Müller, editors, Working Notes of CLEF 2019 – Conference and Labs of the Evaluation Forum, Lugano, Switzerland, 2019. CEUR-WS.org.

Elżbieta Hajnicz and Tomasz Bartosiak. Connections between the semantic layer of Walenty valency dictionary and PlWordNet. In Christiane Fellbaum, Piek Vossen, Ewa Rudnicka, Marek Maziarz, and Maciej Piasecki, editors, Proceedings of the 10th Global WordNet Conference (GWC 2019), pages 99–107, Wrocław, 2019. Oficyna Wydawnicza Politechniki Wrocławskiej.

Celina Heliasz and Maciej Ogrodniczuk. Eksplicytność a implicytność w świetle analizy korpusowej (meta)tekstu. Linguistica Copernicana, 16:75–100, 2019.

Łukasz Kobyliński, Maciej Ogrodniczuk, Jan Kocoń, Michał M. Marcińczuk, Aleksander Smywiński-Pohl, Krzysztof Wołk, Danijel Koržinek, Michał Ptaszyński, Agata Pieciukiewicz, and Paweł Dybała. PolEval 2019 — the next chapter in evaluating Natural Language Processing tools for Polish. In Zygmunt Vetulani and Patrick Paroubek, editors, Human Language Technologies as a Challenge for Computer Science and Linguistics – 2019, pages 165–172. Wydawnictwo Nauka i Innowacje, Poznań, Poland, 2019.

Łukasz Kobyliński and Michał Wasiluk. Deep learning in event detection in Polish. In Christiane Fellbaum, Piek Vossen, Ewa Rudnicka, Marek Maziarz, and Maciej Piasecki, editors, Proceedings of the 10th Global WordNet Conference (GWC 2019), pages 216–221, Wrocław, 2019. Oficyna Wydawnicza Politechniki Wrocławskiej.

Katarzyna Krasnowska-Kieraś and Łukasz Kobyliński. Part of speech tagging for Polish. Poznań Studies in Contemporary Linguistics, 55(2):211–237, 2019.

Katarzyna Krasnowska-Kieraś and Alina Wróblewska. Empirical linguistic study of sentence embeddings. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5729–5739, Florence, Italy, 2019. Association for Computational Linguistics.

Magdalena Król, Włodzimierz Gruszczyński, Magdalena Derwojedowa, Rafał L. Górski, Krzysztof Opaliński, Patrycja Potoniec, Marcin Woliński, Witold Kieraś, and Maciej Eder. Narodowy Korpus Diachroniczny Polszczyzny. Projekt. Język Polski, XCIX(1):92–101, 2019.

Agnieszka Mykowiecka and Małgorzata Marciniak. Experiments with ad hoc ambiguous abbreviation expansion. In Eben Holderness, Antonio Jimeno Yepes, Alberto Lavelli, Anne-Lyse Minard, James Pustejovsky, and Fabio Rinaldi, editors, Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019), pages 44–53, Hong Kong, 2019. Association for Computational Linguistics.

Maciej Ogrodniczuk. Automatyczne wykrywanie nominalnych zależności referencyjnych w polskich tekstach współczesnych. Wydawnictwa Uniwersytetu Warszawskiego, Warsaw, 2019.

Maciej Ogrodniczuk. Nominal coreference resolution for Polish. Poznań Studies in Contemporary Linguistics, 55(2):367–396, 2019.

Maciej Ogrodniczuk, Rafał L. Górski, Marek Łaziński, and Piotr Pęzik. From the National Corpus of Polish to the Polish Corpus Infrastructure. Jazykovedný časopis, 70(2):315–323, 2019.

Maciej Ogrodniczuk and Włodzimierz Gruszczyński. Connecting data for digital libraries: The library, the dictionary and the corpus. In Adam Jatowt, Akira Maeda, and Sue Yeon Syn, editors, Digital Libraries at the Crossroads of Digital Information for the Future, pages 125–138, Cham, 2019. Springer International Publishing.

Maciej Ogrodniczuk and Łukasz Kobyliński, editors. Proceedings of the PolEval 2019 Workshop, Warsaw, Poland, 2019. Institute of Computer Science, Polish Academy of Sciences.

Maciej Ogrodniczuk, Sameer Pradhan, Yulia Grishina, and Vincent Ng, editors. Proceedings of the Second Workshop on Computational Models of Reference, Anaphora and Coreference, Minneapolis, USA, 2019. Association for Computational Linguistics.

Agnieszka Patejuk and Adam Przepiórkowski. Coordination of unlike grammatical functions. In Kim Gerdes and Sylvain Kahane, editors, Proceedings of the Fifth International Conference on Dependency Linguistics (DepLing, SyntaxFest 2019), pages 26–37. Association for Computational Linguistics, 2019.

Adam Przepiórkowski. Status gramatyczny predykatywnych szkoda, wstyd, żal raz jeszcze. Polonica, XXXIX:85–110, 2019.

Adam Przepiórkowski and Agnieszka Patejuk. Nested coordination in Universal Dependencies. In Alexandre Rademaker and Francis Tyers, editors, Proceedings of the Third Workshop on Universal Dependencies (UDW, SyntaxFest 2019), pages 58–69. Association for Computational Linguistics, 2019.

Piotr Przybyła. Detecting Bot Accounts on Twitter by Measuring Message Predictability. In Linda Cappellato, Nicola Ferro, David E. Losada, and Henning Müller, editors, Working Notes of CLEF 2019 – Conference and Labs of the Evaluation Forum, Lugano, Switzerland, 2019. CEUR-WS.org.

Szymon Rutkowski, Piotr Rychlik, and Agnieszka Mykowiecka. Estimating senses with sets of lexically related words for Polish word sense disambiguation. In Christiane Fellbaum, Piek Vossen, Ewa Rudnicka, Marek Maziarz, and Maciej Piasecki, editors, Proceedings of the 10th Global WordNet Conference (GWC 2019), pages 118–124, Wrocław, 2019. Oficyna Wydawnicza Politechniki Wrocławskiej.

Ryszard Tuora and Łukasz Kobyliński. Integrating Polish language tools and resources in spaCy. In Proceedings of PP-RAI 2019 Conference, pages 210–214, Wrocław, 2019. Department of Systems and Computer Networks, Faculty of Electronics, Wrocław University of Science and Technology.

Aleksander Wawer, Małgorzata Marciniak, and Agnieszka Mykowiecka. Detecting word level metaphors in Polish. In Zygmunt Vetulani and Patrick Paroubek, editors, Human Language Technologies as a Challenge for Computer Science and Linguistics – 2019, pages 87–91. Wydawnictwo Nauka i Innowacje, Poznań, Poland, 2019.

Marcin Woliński. Automatyczna analiza składnikowa języka polskiego. Wydawnictwa Uniwersytetu Warszawskiego, Warsaw, 2019.

Marcin Woliński. Globally optimal page breaking with column balancing – a case study. In Proceedings of the ACM Symposium on Document Engineering 2019, pages 33:1–33:4, New York, NY, USA, 2019. ACM.

Alina Wróblewska and Piotr Rybak. Dependency parsing of Polish. Poznań Studies in Contemporary Linguistics, 55(2):305–337, 2019.


Witold Kieraś, Łukasz Kobyliński, and Maciej Ogrodniczuk. Korpusomat — a tool for creating searchable morphosyntactically tagged corpora. Computational Methods in Science and Technology, 24(1):21–27, 2018.

Witold Kieraś and Marcin Woliński. Manually annotated corpus of Polish texts published between 1830 and 1918. In Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga, editors, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 3854–3859, Paris, France, 2018. European Language Resources Association (ELRA).

Łukasz Kobyliński, Michał Wasiluk, and Grzegorz Wojdyga. Improving part-of-speech tagging by meta-learning. In Petr Sojka, Aleš Horák, Ivan Kopeček, and Karel Pala, editors, Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings, number 11107 in Lecture Notes in Artificial Intelligence, pages 144–152. Springer-Verlag, 2018.

Małgorzata Marciniak, Agnieszka Mykowiecka, and Piotr Rychlik. Recognition of irrelevant phrases in automatically extracted lists of domain terms. Terminology, 24(1):66–90, 2018.

Agnieszka Mykowiecka, Małgorzata Marciniak, and Piotr Rychlik. SimLex-999 for Polish. In Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga, editors, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Paris, France, 2018. European Language Resources Association (ELRA).

Agnieszka Mykowiecka, Małgorzata Marciniak, and Aleksander Wawer. Literal, metaphorical or both? Detecting metaphoricity in isolated adjective-noun phrases. In Beata Beigman Klebanov, Ekaterina Shutova, Patricia Lichtenstein, Smaranda Muresan, and Chee Wee, editors, Proceedings of the Workshop on Figurative Language Processing, pages 27–33. Association for Computational Linguistics, 2018.

Agnieszka Mykowiecka, Aleksander Wawer, and Małgorzata Marciniak. Detecting figurative word occurrences using recurrent neural networks. In Beata Beigman Klebanov, Ekaterina Shutova, Patricia Lichtenstein, Smaranda Muresan, and Chee Wee, editors, Proceedings of the Workshop on Figurative Language Processing, pages 124–127. Association for Computational Linguistics, 2018.

Anna Nedoluzhko, Michal Novák, and Maciej Ogrodniczuk. Analysis of coreferential expressions in PAWS (English-Czech-Russian-Polish Parallel Treebank with Anaphoric Relations). In Computational Linguistics and Intellectual Technologies: Papers from the Annual International Conference “Dialogue”, pages 512–521, 2018.

Anna Nedoluzhko, Michal Novák, and Maciej Ogrodniczuk. PAWS: A multi-lingual parallel treebank with anaphoric relations. In Massimo Poesio, Vincent Ng, and Maciej Ogrodniczuk, editors, Proceedings of the First Workshop on Computational Models of Reference, Anaphora and Coreference, pages 68–76. Association for Computational Linguistics, 2018.

Bartłomiej Nitoń, Paweł Morawiecki, and Maciej Ogrodniczuk. Deep neural networks for coreference resolution for Polish. In Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga, editors, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 395–400, Paris, France, 2018. European Language Resources Association (ELRA).

Maciej Ogrodniczuk, Joanna Bilińska, Zbigniew Bronk, and Witold Kieraś. Multisłownik: Linking plWordNet-based lexical data for lexicography and educational purposes. In Francis Bond, Takayuki Kuribayashi, Christiane Fellbaum, and Piek Vossen, editors, Proceedings of the 9th Global WordNet Conference (GWC 2018), pages 368–375, Singapore, 2018. University of Tartu.

Maciej Ogrodniczuk. Polish Parliamentary Corpus. In Darja Fišer, Maria Eskevich, and Franciska de Jong, editors, Proceedings of the LREC 2018 Workshop ParlaCLARIN: Creating and Using Parliamentary Corpora, pages 15–19, Paris, France, 2018. European Language Resources Association (ELRA).

Maciej Ogrodniczuk and Łukasz Kobyliński, editors. Proceedings of the PolEval 2018 Workshop, Warsaw, 2018. Institute of Computer Science, Polish Academy of Sciences.

Agnieszka Patejuk. Incorporating conjunctions in Polish. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'18 Conference, pages 283–303, Stanford, CA, 2018. CSLI Publications.

Agnieszka Patejuk and Adam Przepiórkowski. From Lexical Functional Grammar to Enhanced Universal Dependencies: Linguistically Informed Treebanks of Polish. Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2018.

Agnieszka Patejuk and Adam Przepiórkowski. Predicative constructions with infinitival and clausal subjects. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'18 Conference, pages 304–324, Stanford, CA, 2018. CSLI Publications.

Massimo Poesio, Vincent Ng, and Maciej Ogrodniczuk, editors. Proceedings of the First Workshop on Computational Models of Reference, Anaphora and Coreference. Association for Computational Linguistics, 2018.

Adam Przepiórkowski. The origin of the valency metaphor in linguistics. Lingvisticæ Investigationes, 41(1):152–159, 2018.

Adam Przepiórkowski and Agnieszka Patejuk. Arguments and adjuncts in Universal Dependencies. In Proceedings of the 27th International Conference on Computational Linguistics (COLING 2018), pages 3837–3852, Santa Fe, NM, 2018. (Best position paper at COLING 2018).

Piotr Rybak and Alina Wróblewska. Semi-supervised neural system for tagging, parsing and lemmatization. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pages 45–54. Association for Computational Linguistics, 2018.

Piotr Rybak and Alina Wróblewska. Semi-supervised neural system for tagging, parsing and lemmatization. Addendum. In Proceedings of the PolEval 2018 Workshop, pages 49–51. Institute of Computer Science, Polish Academy of Sciences, 2018.

Jakub Waszczuk, Witold Kieraś, and Marcin Woliński. Morphosyntactic disambiguation and segmentation for historical Polish with graph-based conditional random fields. In Petr Sojka, Aleš Horák, Ivan Kopeček, and Karel Pala, editors, Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings, number 11107 in Lecture Notes in Artificial Intelligence, pages 188–196. Springer-Verlag, 2018.

Marcin Woliński, Elżbieta Hajnicz, and Tomasz Bartosiak. A new version of the Składnica treebank of Polish harmonised with the Walenty valency dictionary. In Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, and Takenobu Tokunaga, editors, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 1839–1844, Paris, France, 2018. European Language Resources Association (ELRA).

Alina Wróblewska. Polish corpus of annotated descriptions of images. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 2141–2146. European Language Resources Association (ELRA), 2018.

Alina Wróblewska. Results of the PolEval 2018 Shared Task 1: Dependency Parsing. In Proceedings of the PolEval 2018 Workshop, pages 11–24. Institute of Computer Science, Polish Academy of Sciences, 2018.

Alina Wróblewska. Extended and enhanced Polish dependency bank in Universal Dependencies format. In Marie-Catherine de Marneffe, Teresa Lynn, and Sebastian Schuster, editors, Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), pages 173–182. Association for Computational Linguistics, 2018.

Alina Wróblewska and Aleksandra Wieczorek. Status morfoskładniowy wyrazu jako we współczesnej polszczyźnie. Język Polski, XCVIII(3):16–30, 2018.

Magdalena Zawisławska, Marta Falkowska, and Maciej Ogrodniczuk. Verbal synaesthesia in the Polish corpus of synaesthetic metaphors. LaMiCuS, 2:226–253, 2018.


Tomasz Bartosiak. Shared forest representation of predicate-argument structures for shared syntactic forests. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 410–414, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Markus Dickinson, Jan Hajič, Sandra Kübler, and Adam Przepiórkowski, editors. Proceedings of the Fifteenth International Workshop on Treebanks and Linguistic Theories (TLT 15). CEUR Workshop Proceedings, 2017.

Witold Kieraś. Co jest zgodne z duchem kraftu? Próba korpusowego badania słownictwa związanego z piwem. Język Polski, XCVII(2):105–112, 2017.

Witold Kieraś, Dorota Komosińska, Emanuel Modrzejewski, and Marcin Woliński. Morphosyntactic annotation of historical texts. The making of the baroque corpus of Polish. In Kamil Ekštein and Václav Matoušek, editors, Text, Speech, and Dialogue: 20th International Conference, TSD 2017, Prague, Czech Republic, August 27-31, 2017, Proceedings, number 10415 in Lecture Notes in Computer Science, pages 308–316. Springer International Publishing, 2017.

Witold Kieraś and Marcin Woliński. Morfeusz 2 – analizator i generator fleksyjny dla języka polskiego. Język Polski, XCVII(1):75–83, 2017.

Witold Kieraś and Marcin Woliński. Słownik gramatyczny języka polskiego – wersja internetowa. Język Polski, XCVII(1):84–93, 2017.

Łukasz Kobyliński and Maciej Ogrodniczuk. Results of the PolEval 2017 competition: Part-of-speech tagging shared task. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 362–366, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Katarzyna Krasnowska-Kieraś. Morphosyntactic disambiguation for Polish with bi-LSTM neural networks. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 367–371, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Ekaterina Lapshinova-Koltunski and Maciej Ogrodniczuk. Šárka Zikánová – Eva Hajičová – Barbora Hladká – Pavlína Jínová – Jiří Mírovský – Anna Nedoluzhko – Lucie Poláková – Kateřina Rysová – Magdaléna Rysová – Jan Václ: Discourse and Coherence: From the Sentence Structure to Textual Relations. Slovo a Slovesnost, 78:343–349, 2017.

Małgorzata Marciniak, Agnieszka Mykowiecka, and Piotr Rychlik. Automatyczne wydobywanie terminologii dziedzinowej z korpusów tekstowych. Język Polski, XCVII(1):64–74, 2017.

Agnieszka Mykowiecka, Małgorzata Marciniak, and Piotr Rychlik. Testing word embeddings for Polish. Cognitive Studies / Études Cognitives, 17:1–19, 2017.

Bartłomiej Nitoń and Maciej Ogrodniczuk. Multi-pass sieve coreference resolution system for Polish. In Jorge Gracia, Francis Bond, John P. McCrae, Paul Buitelaar, Christian Chiarcos, and Sebastian Hellmann, editors, Proceedings of the 1st Conference on Language, Data and Knowledge (LDK 2017), number 10318 in Lecture Notes in Artificial Intelligence, pages 222–236. Springer International Publishing, Berlin, 2017.

Maciej Ogrodniczuk. Lingwistyka komputerowa dla języka polskiego: dziś i jutro. Język Polski, XCVII(1):18–28, 2017.

Maciej Ogrodniczuk, Magdalena Derwojedowa, Marek Łaziński, and Piotr Pęzik. Narodowy Korpus Języka Polskiego – co dalej? Prace Filologiczne, LXXI:237–245, 2017.

Maciej Ogrodniczuk and Mateusz Kopeć. Lexical correction of Polish Twitter political data. In Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 115–125, Vancouver, Canada, 2017. Association for Computational Linguistics.

Maciej Ogrodniczuk and Vincent Ng, editors. Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), Valencia, Spain, 2017. Association for Computational Linguistics.

Maciej Ogrodniczuk and Bartłomiej Nitoń. Improving Polish mention detection with valency dictionary. In Maciej Ogrodniczuk and Vincent Ng, editors, Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), pages 17–23, Valencia, Spain, 2017. Association for Computational Linguistics.

Agnieszka Patejuk. A gapping analysis of lexicalised comparative constructions. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'17 Conference, pages 306–326, Stanford, CA, 2017. CSLI Publications.

Agnieszka Patejuk and Adam Przepiórkowski. POLFIE: współczesna gramatyka formalna języka polskiego. Język Polski, XCVII(1):48–64, 2017.

Agnieszka Patejuk and Adam Przepiórkowski. Filling the gap. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'17 Conference, pages 327–347, Stanford, CA, 2017. CSLI Publications.

Adam Przepiórkowski. Argumenty i modyfikatory w gramatyce i w słowniku. Wydawnictwa Uniwersytetu Warszawskiego, Warsaw, 2017.

Adam Przepiórkowski. On the argument–adjunct distinction in the Polish Semantic Syntax tradition. Cognitive Studies / Études Cognitives, 17:1–10, 2017.

Adam Przepiórkowski. Hierarchical lexicon and the argument/adjunct distinction. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'17 Conference, pages 348–367, Stanford, CA, 2017. CSLI Publications.

Adam Przepiórkowski, Jan Hajič, Elżbieta Hajnicz, and Zdeňka Urešová. Phraseology in two Slavic valency dictionaries: Limitations and perspectives. International Journal of Lexicography, 30(1):1–38, 2017.

Adam Przepiórkowski, Elżbieta Hajnicz, Anna Andrzejczuk, Agnieszka Patejuk, and Marcin Woliński. Walenty: gruntowny składniowo-semantyczny słownik walencyjny języka polskiego. Język Polski, XCVII(1):30–47, 2017.

Adam Przepiórkowski. A full-fledged hierarchical lexicon in LFG: The FrameNet approach. In Victoria Rosén and Koenraad De Smedt, editors, The Very Model of a Modern Linguist, volume 8 of Bergen Language and Linguistics Studies, pages 202–219. University of Bergen Library, Bergen, 2017.

Aleksander Wawer and Agnieszka Mykowiecka. Detecting metaphorical phrases in the Polish language. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 772–777, Varna, Bulgaria, 2017. INCOMA Ltd.

Aleksander Wawer and Agnieszka Mykowiecka. Supervised and unsupervised word sense disambiguation on word embedding vectors of unambiguous synonyms. In Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications, pages 120–125. Association for Computational Linguistics, 2017.

Aleksander Wawer and Maciej Ogrodniczuk. Results of the PolEval 2017 competition: Sentiment Analysis shared task. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 406–409, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Marcin Woliński, Witold Kieraś, and Dorota Komosińska. Anotatornia 2 — an annotation tool geared towards historical corpora. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 158–162, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Alina Wróblewska and Katarzyna Krasnowska-Kieraś. Polish evaluation dataset for compositional distributional semantics models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 784–792, Vancouver, Canada, 2017. Association for Computational Linguistics.

Alina Wróblewska, Katarzyna Krasnowska-Kieraś, and Piotr Rybak. Towards the evaluation of feature embedding models of the fusional languages. In Zygmunt Vetulani and Patrick Paroubek, editors, Proceedings of the 8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 420–424, Poznań, Poland, 2017. Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu.

Magdalena Zawisławska, Marta Falkowska, and Maciej Ogrodniczuk. Metaphor annotation in the corpus of Polish. In Proceedings of the 2nd International Workshop on Language Sense on Computer, pages 16–22, Melbourne, Australia, 2017.


Joanna Bilińska, Magdalena Derwojedowa, Witold Kieraś, and Monika Kwiecień. Mikrokorpus polszczyzny 1830-1918. Komunikacja specjalistyczna, 11:149–161, 2016.

Renata Bronikowska, Włodzimierz Gruszczyński, Maciej Ogrodniczuk, and Marcin Woliński. The use of electronic historical dictionary data in corpus design. Studies in Polish Linguistics, 11(2):47–56, 2016.

Magdalena Derwojedowa, Witold Kieraś, Joanna Bilińska, and Monika Kwiecień. Dynamika zmian fleksyjnych i ortograficznych między reformami 1830-1918. Język Polski, XCVI(1):24–35, 2016.

Elżbieta Hajnicz, Anna Andrzejczuk, and Tomasz Bartosiak. Semantic layer of the valence dictionary of Polish Walenty. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Tenth International Conference on Language Resources and Evaluation, LREC 2016, pages 2625–2632, Portorož, Slovenia, 2016. European Language Resources Association (ELRA).

Elżbieta Hajnicz, Agnieszka Patejuk, Adam Przepiórkowski, and Marcin Woliński. Walenty: słownik walencyjny języka polskiego z bogatym komponentem frazeologicznym. In Karolina Skwarska and Elżbieta Kaczmarska, editors, Výzkum slovesné valence ve slovanských zemích, pages 71–102. Slovanský ústav AV ČR, Prague, 2016.

Łukasz Kobyliński and Witold Kieraś. Part of speech tagging for Polish: State of the art and future perspectives. In Proceedings of the 17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2016), Konya, Turkey, 2016.

Małgorzata Marciniak, Agnieszka Mykowiecka, and Piotr Rychlik. TermoPL — a flexible tool for terminology extraction. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Tenth International Conference on Language Resources and Evaluation, LREC 2016, pages 2278–2284, Portorož, Slovenia, 2016. European Language Resources Association (ELRA).

Agnieszka Mykowiecka, Małgorzata Marciniak, and Piotr Rychlik. Recognition of non-domain phrases in automatically extracted lists of terms. In Proceedings of the 5th International Workshop on Computational Terminology (CompuTerm2016), pages 12–20, Osaka, Japan, 2016. The COLING 2016 Organizing Committee.

Bartłomiej Nitoń, Tomasz Bartosiak, and Elżbieta Hajnicz. Accessing and elaborating Walenty—a valence dictionary of Polish—via internet browser. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Tenth International Conference on Language Resources and Evaluation, LREC 2016, pages 1352–1359, Portorož, Slovenia, 2016. European Language Resources Association (ELRA).

Bartłomiej Nitoń. Evaluation of Uryupina’s coreference resolution features for Polish. In Zygmunt Vetulani, Hans Uszkoreit, and Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics: 6th Language and Technology Conference, LTC 2013, Poznań, Poland, December 7-9, 2013. Revised Selected Papers, number 9561 in Lecture Notes in Artificial Intelligence, pages 354–367, Switzerland, 2016. Springer International Publishing.

Maciej Ogrodniczuk. Web services and data mining: combining linguistic tools for Polish with an analytical platform. In Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH), pages 187–195, Osaka, Japan, 2016. The COLING 2016 Organizing Committee.

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, and Magdalena Zawisławska. Polish Coreference Corpus. In Zygmunt Vetulani, Hans Uszkoreit, and Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics: 6th Language and Technology Conference, LTC 2013, Poznań, Poland, December 7-9, 2013. Revised Selected Papers, number 9561 in Lecture Notes in Artificial Intelligence, pages 215–226, Switzerland, 2016. Springer International Publishing.

Maciej Ogrodniczuk and Vincent Ng, editors. Proceedings of the Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2016), San Diego, USA, June 2016. Association for Computational Linguistics.

Maciej Ogrodniczuk and Magdalena Zawisławska. Bridging relations in Polish: Adaptation of existing typologies. In Maciej Ogrodniczuk and Vincent Ng, editors, Proceedings of the Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2016), pages 16–22, San Diego, USA, June 2016. Association for Computational Linguistics.

Agnieszka Patejuk. Integrating a rich external valency dictionary with an implemented XLE/LFG grammar. In Doug Arnold, Miriam Butt, Berthold Crysmann, Tracy Holloway King, and Stefan Müller, editors, The Proceedings of the Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, pages 520–540, Stanford, CA, 2016. CSLI Publications.

Agnieszka Patejuk and Adam Przepiórkowski. Reducing grammatical functions in Lexical Functional Grammar. In Doug Arnold, Miriam Butt, Berthold Crysmann, Tracy Holloway King, and Stefan Müller, editors, The Proceedings of the Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, pages 541–559, Stanford, CA, 2016. CSLI Publications.

Adam Przepiórkowski. Against the argument–adjunct distinction in Functional Generative Description. The Prague Bulletin of Mathematical Linguistics, 106:5–20, 2016.

Adam Przepiórkowski. How not to distinguish arguments from adjuncts in LFG. In Doug Arnold, Miriam Butt, Berthold Crysmann, Tracy Holloway King, and Stefan Müller, editors, The Proceedings of the Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, pages 560–580, Stanford, CA, 2016. CSLI Publications.

Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen García-Mateo, Josef van Genabith, Jan Hajič, Inma Hernáez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunción Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr Pęzik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiș, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider, and Jolanta Zabarskaitė. The strategic impact of META-NET on the regional, national and international level. Language Resources and Evaluation, 50:351–374, 2016.

Marcin Woliński and Witold Kieraś. The on-line version of Grammatical Dictionary of Polish. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Tenth International Conference on Language Resources and Evaluation, LREC 2016, pages 2589–2594, Portorož, Slovenia, 2016. European Language Resources Association (ELRA).

Marcin Woliński and Dominika Rogozińska. Experiments in PCFG-like disambiguation of constituency parse forests for Polish. In Zygmunt Vetulani, Hans Uszkoreit, and Marek Kubis, editors, Human Language Technology. Challenges for Computer Science and Linguistics: 6th Language and Technology Conference, LTC 2013, Poznań, Poland, December 7-9, 2013. Revised Selected Papers, number 9561 in Lecture Notes in Artificial Intelligence, pages 146–158. Springer International Publishing, Switzerland, 2016.


Tomasz Bartosiak and Marcin Woliński. On genitive clusters, Kleene star, and an exploding parser. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 509–513, Poznań, Poland, 2015.

Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors. Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

Włodzimierz Gruszczyński, Bartosz Broda, Łukasz Dębowski, Milena Hadryan, Bartłomiej Nitoń, and Maciej Ogrodniczuk. Measuring readability of Polish texts. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 445–449, Poznań, Poland, 2015.

Włodzimierz Gruszczyński, Bartosz Broda, Bartłomiej Nitoń, and Maciej Ogrodniczuk. W poszukiwaniu metody automatycznego mierzenia zrozumiałości tekstów informacyjnych. Poradnik Językowy, 2:9–22, 2015.

Włodzimierz Gruszczyński and Maciej Ogrodniczuk, editors. Jasnopis, czyli mierzenie zrozumiałości polskich tekstów użytkowych. Wydawnictwo ASPRA-JR, Warsaw, 2015.

Elżbieta Hajnicz, Bartłomiej Nitoń, Agnieszka Patejuk, Adam Przepiórkowski, and Marcin Woliński. Internetowy słownik walencyjny języka polskiego oparty na danych korpusowych. Prace Filologiczne, LXV:95–110, 2015.

Łukasz Kobyliński. Combining linguistic knowledge with machine learning for domain-specific named entity recognition. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 267–269, Poznań, Poland, 2015.

Katarzyna Krasnowska and Adam Przepiórkowski. Combining various degrees of supervision in PP-attachment disambiguation. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 85–89, Poznań, Poland, 2015.

Katarzyna Krasnowska-Kieraś and Agnieszka Patejuk. Integrating Polish LFG with external morphology. In Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors, Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), pages 134–147, Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

Małgorzata Marciniak. Domain corpora as a source of information, volume 4 of Monograph Series. Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2015.

Małgorzata Marciniak and Agnieszka Mykowiecka. Nested term recognition driven by word connection strength. Terminology, 2:180–204, 2015.

Agnieszka Mykowiecka and Małgorzata Marciniak. Introducing a structure into a set of similar concepts. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 130–134, Poznań, Poland, 2015.

Bartłomiej Nitoń, Tomasz Bartosiak, Elżbieta Hajnicz, Agnieszka Patejuk, Adam Przepiórkowski, and Marcin Woliński. DEMO: Access to a valence dictionary of Polish Walenty via Internet browser. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 270–272, Poznań, Poland, 2015.

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, and Magdalena Zawisławska. Coreference in Polish: Annotation, Resolution and Evaluation. Walter De Gruyter, Berlin, München, Boston, 2015.

Agnieszka Patejuk. Unlike Coordination in Polish: An LFG Account. Ph.D. dissertation, Institute of Polish Language, Polish Academy of Sciences, Cracow, 2015.

Agnieszka Patejuk and Adam Przepiórkowski. Parallel development of linguistic resources: Towards a structure bank of Polish. Prace Filologiczne, LXV:255–270, 2015.

Agnieszka Patejuk and Adam Przepiórkowski. An LFG analysis of the so-called reflexive marker in Polish. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'15 Conference, pages 270–288, Stanford, CA, 2015. CSLI Publications.

Agnieszka Patejuk. Coordinated Wh-words in Polish: Monoclausal or multiclausal? In Małgorzata Szajbel-Keck, Roslyn Burns, and Darya Kavitskaya, editors, Annual Workshop on Formal Approaches to Slavic Linguistics: The First Berkeley Meeting 2014, pages 222–241, Ann Arbor, MI, 2015.

Adam Przepiórkowski. Inżynieria lingwistyczna a obecna sytuacja językoznawstwa polskiego. LingVaria, X(2):135–145, 2015.

Adam Przepiórkowski. Towards a linguistically-oriented textual entailment test-suite for Polish based on the semantic syntax approach. Cognitive Studies / Études Cognitives, 15:177–191, 2015.

Adam Przepiórkowski, Jakub Kozakoszczak, Jan Winkowski, Daniel Ziembicki, and Tadeusz Teleżyński. Towards a taxonomy of textual entailments. In Proceedings of the 20th Amsterdam Colloquium, pages 333–342, 2015.

Adam Przepiórkowski and Agnieszka Patejuk. Two representations of negation in LFG: Evidence from Polish. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'15 Conference, pages 322–336, Stanford, CA, 2015. CSLI Publications.

Adam Przepiórkowski. A weakly compositional analysis of distance distributivity in Polish. In Małgorzata Szajbel-Keck, Roslyn Burns, and Darya Kavitskaya, editors, Annual Workshop on Formal Approaches to Slavic Linguistics: The First Berkeley Meeting 2014, pages 262–281, Ann Arbor, MI, 2015.

Adam Przepiórkowski and Alina Wróblewska. Supporting LFG parsing with dependency parsing. In Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors, Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), pages 168–178, Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

Piotr Przybyła. Gathering Knowledge for Question Answering Beyond Named Entities. In Chris Biemann, Siegfried Handschuh, André Freitas, Farid Meziane, and Elisabeth Métais, editors, Proceedings of the 20th International Conference on Applications of Natural Language to Information Systems (NLDB 2015), pages 412–417, Passau, Germany, 2015. Springer-Verlag.

Piotr Przybyła and Paweł Teisseyre. What do your look-alikes say about you? Exploiting strong and weak similarities for author profiling - Notebook for PAN at CLEF 2015. In Linda Cappellato, Nicola Ferro, Gareth Jones, and Eric San Juan, editors, CLEF 2015 Labs and Workshops, Notebook Papers, Toulouse, France, 2015. CEUR-WS.org.

Victoria Rosén, Gyri Smørdal Losnegaard, Koenraad De Smedt, Eduard Bejček, Agata Savary, Adam Przepiórkowski, Petya Osenova, and Verginica Barbu Mititelu. A survey of multiword expressions in treebanks. In Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors, Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), pages 179–193, Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.

Zygmunt Saloni, Marcin Woliński, Robert Wołosz, Włodzimierz Gruszczyński, and Danuta Skowrońska. Słownik gramatyczny języka polskiego. Warsaw, 3rd edition, 2015.

Agata Savary, Manfred Sailer, Yannick Parmentier, Michael Rosner, Victoria Rosén, Adam Przepiórkowski, Cvetana Krstev, Veronika Vincze, Beata Wójtowicz, Miriam Butt, Gyri Smørdal Losnegaard, Carla Parra Escartín, Jakub Waszczuk, Matthieu Constant, Petya Osenova, and Federico Sangati. PARSEME – PARSing and Multiword Expressions within a European multilingual network. In Zygmunt Vetulani and Joseph Mariani, editors, Proceedings of the 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 438–444, Poznań, Poland, 2015.

Marcin Woliński. Deploying the new valency dictionary Walenty in a DCG parser of Polish. In Markus Dickinson, Erhard Hinrichs, Agnieszka Patejuk, and Adam Przepiórkowski, editors, Proceedings of the Fourteenth International Workshop on Treebanks and Linguistic Theories (TLT 14), pages 221–229, Warsaw, 2015. Institute of Computer Science, Polish Academy of Sciences.


Bartosz Broda, Bartłomiej Nitoń, Włodzimierz Gruszczyński, and Maciej Ogrodniczuk. Measuring readability of Polish texts: Baseline experiments. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 573–580, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Magdalena Derwojedowa, Witold Kieraś, Danuta Skowrońska, and Robert Wołosz. Korpus polszczyzny XIX wieku – od mikrokorpusu do korpusu średniej wielkości. Prace Filologiczne, LXV:251–256, 2014.

Magdalena Derwojedowa, Witold Kieraś, Danuta Skowrońska, and Robert Wołosz. Współczesne narzędzia leksykograficzne a analiza tekstów dawniejszych. Polonica, XXXIV:21–27, 2014.

Elżbieta Hajnicz. The procedure of lexico-semantic annotation of Składnica treebank. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2290–2297, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Elżbieta Hajnicz. Lexico-semantic annotation of Składnica treebank by means of PLWN lexical units. In Heili Orav, Christiane Fellbaum, and Piek Vossen, editors, Proceedings of the 7th International WordNet Conference (GWC 2014), pages 23–31, Tartu, Estonia, 2014. University of Tartu.

Verena Henrich, Erhard Hinrichs, Daniël de Kok, Petya Osenova, and Adam Przepiórkowski, editors. Proceedings of the Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT 13), Tübingen, 2014. Department of Linguistics (SfS), University of Tübingen.

Wojciech Jaworski and Adam Przepiórkowski. Semantic roles in grammar engineering. In Proceedings of the Third Joint Conference on Lexical and Computational Semantics (*SEM 2014), pages 81–86, Dublin, Ireland, 2014. Association for Computational Linguistics and Dublin City University.

Wojciech Jaworski and Adam Przepiórkowski. Syntactic approximation of semantic roles. In Adam Przepiórkowski and Maciej Ogrodniczuk, editors, Advances in Natural Language Processing: Proceedings of the 9th International Conference on NLP, PolTAL 2014, Warsaw, Poland, September 17–19, 2014, number 8686 in Lecture Notes in Artificial Intelligence, pages 193–201. Springer International Publishing, Heidelberg, 2014.

Witold Kieraś. Na tysiąc żołnierza ledwie pięciu rosłych chłopa. O pewnej nietypowej konstrukcji z liczebnikiem. In Piotr Żmigrodzki and Sylwia Przęczka-Kisielak, editors, Bogactwo współczesnej polszczyzny. Towarzystwo Miłośników Języka Polskiego, Cracow, 2014.

Łukasz Kobyliński. PoliTa: A multitagger for Polish. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2949–2954, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Mateusz Kopeć and Maciej Ogrodniczuk. Inter-annotator agreement in coreference annotation of Polish. In Janusz Sobecki, Veera Boonjing, and Suphamit Chittayasothorn, editors, Advanced Approaches to Intelligent Information and Database Systems, volume 551 of Studies in Computational Intelligence, pages 149–158. Springer International Publishing, Switzerland, 2014.

Katarzyna Krasnowska. Different approaches to the PP-attachment problem in Polish. In Verena Henrich, Erhard Hinrichs, Daniël de Kok, Petya Osenova, and Adam Przepiórkowski, editors, Proceedings of the Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT 13), pages 88–102, Tübingen, 2014. Department of Linguistics (SfS), University of Tübingen.

Małgorzata Marciniak and Agnieszka Mykowiecka. Terminology extraction from medical texts in Polish. Journal of Biomedical Semantics, 5, 2014.

Małgorzata Marciniak and Agnieszka Mykowiecka. NPMI Driven Recognition of Nested Terms. pages 33–41, Dublin, Ireland, 2014. Association for Computational Linguistics and Dublin City University.

Agnieszka Mykowiecka and Małgorzata Marciniak. Attribute value acquisition through clustering of adjectives. In Adam Przepiórkowski and Maciej Ogrodniczuk, editors, Advances in Natural Language Processing: Proceedings of the 9th International Conference on NLP, PolTAL 2014, Warsaw, Poland, September 17–19, 2014, number 8686 in Lecture Notes in Artificial Intelligence, pages 92–104. Springer International Publishing, Heidelberg, 2014.

Agnieszka Mykowiecka, Piotr Rychlik, and Jakub Waszczuk. Definicja struktury oraz narzędzia wspomagające budowę słownika polszczyzny niewspółczesnej. Polonica, XXXIV:29–52, 2014.

Maciej Ogrodniczuk and Włodzimierz Gruszczyński. Digital Library 2.0 — source of knowledge and research collaboration platform. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 1649–1653, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Maciej Ogrodniczuk and Mateusz Kopeć. The Polish Summaries Corpus. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 3712–3715, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Maciej Ogrodniczuk, Mateusz Kopeć, and Agata Savary. Polish Coreference Corpus in numbers. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 3234–3238, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Maciej Ogrodniczuk, Alicja Wójcicka, Katarzyna Głowińska, and Mateusz Kopeć. Detection of nested mentions for coreference resolution in Polish. In Adam Przepiórkowski and Maciej Ogrodniczuk, editors, Advances in Natural Language Processing: Proceedings of the 9th International Conference on NLP, PolTAL 2014, Warsaw, Poland, September 17–19, 2014, number 8686 in Lecture Notes in Artificial Intelligence, pages 270–277. Springer International Publishing, Heidelberg, 2014.

Agnieszka Patejuk and Adam Przepiórkowski. Control into selected conjuncts. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'14 Conference, pages 448–460, Stanford, CA, 2014. CSLI Publications.

Agnieszka Patejuk and Adam Przepiórkowski. In favour of the raising analysis of passivisation. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'14 Conference, pages 461–481, Stanford, CA, 2014. CSLI Publications.

Agnieszka Patejuk and Adam Przepiórkowski. Structural case assignment to objects in Polish. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'14 Conference, pages 429–447, Stanford, CA, 2014. CSLI Publications.

Agnieszka Patejuk and Adam Przepiórkowski. Lexico-semantic coordination in Polish: A critical review of tests for determining representation. In Małgorzata Gębka-Wolak, Joanna Kamper-Warejko, and Andrzej Moroz, editors, Leksyka języków słowiańskich w badaniach synchronicznych i diachronicznych, pages 119–134. Wydawnictwo Naukowe Uniwersytetu Mikołaja Kopernika, Toruń, 2014.

Agnieszka Patejuk and Adam Przepiórkowski. Synergistic development of grammatical resources: A valence dictionary, an LFG grammar, and an LFG structure bank for Polish. In Verena Henrich, Erhard Hinrichs, Daniël de Kok, Petya Osenova, and Adam Przepiórkowski, editors, Proceedings of the Thirteenth International Workshop on Treebanks and Linguistic Theories (TLT 13), pages 113–126, Tübingen, 2014. Department of Linguistics (SfS), University of Tübingen.

Adam Przepiórkowski. Locality constraints in distance distributivity: A Propositional Glue approach. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'14 Conference, pages 482–502, Stanford, CA, 2014. CSLI Publications.

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, and Marcin Woliński. Extended phraseological information in a valence dictionary for NLP applications. In Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014), pages 83–91, Dublin, Ireland, 2014. Association for Computational Linguistics and Dublin City University.

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, Marcin Woliński, Filip Skwarski, and Marek Świdziński. Walenty: Towards a comprehensive valence dictionary of Polish. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2785–2792, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Adam Przepiórkowski and Maciej Ogrodniczuk, editors. Advances in Natural Language Processing: Proceedings of the 9th International Conference on NLP, PolTAL 2014, Warsaw, Poland, September 17–19, 2014. Number 8686 in Lecture Notes in Artificial Intelligence. Springer International Publishing, Heidelberg, 2014.

Adam Przepiórkowski and Agnieszka Patejuk. Koordynacja leksykalno-semantyczna w systemie współczesnej polszczyzny (na materiale Narodowego Korpusu Języka Polskiego). Język Polski, XCIV(2):104–115, 2014.

Adam Przepiórkowski. Distance distributivity in Polish: Towards a Glue Semantics approach. In Christopher Piñón, editor, Empirical Issues in Syntax and Semantics 10 (CSSP 2013 Proceedings), pages 107–124, 2014.

Adam Przepiórkowski, Filip Skwarski, Elżbieta Hajnicz, Agnieszka Patejuk, Marek Świdziński, and Marcin Woliński. Modelowanie własności składniowych czasowników w nowym słowniku walencyjnym języka polskiego. Polonica, XXXIII:159–178, 2014.

Piotr Przybyła and Paweł Teisseyre. Analysing Utterances in Polish Parliament to Predict Speaker's Background. Journal of Quantitative Linguistics, 21(4):350–376, 2014.

Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen García-Mateo, Josef van Genabith, Jan Hajič, Inma Hernáez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunción Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr Pęzik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiș, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider, and Jolanta Zabarskaitė. The strategic impact of META-NET on the regional, national and international level. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 1517–1524, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Marko Tadić, Tamás Váradi, Radovan Garabík, Svetla Koeva, Maciej Ogrodniczuk, and Duško Vitas. Detecting gaps in language resources and tools in the project CESAR. In Zygmunt Vetulani and Joseph Mariani, editors, Human Language Technology. Challenges for Computer Science and Linguistics: 5th Language and Technology Conference, LTC 2011, Poznań, Poland, November 25–27, 2011, Revised Selected Papers, number 8387 in Lecture Notes in Artificial Intelligence, pages 27–41, Berlin, 2014. Springer-Verlag.

Marcin Woliński. Morfeusz reloaded. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 1106–1111, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Alina Wróblewska. Polish Dependency Parser Trained on an Automatically Induced Dependency Bank. Ph.D. dissertation, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2014.

Alina Wróblewska and Adam Przepiórkowski. Projection-based annotation of a Polish dependency treebank. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2306–2312, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Alina Wróblewska and Adam Przepiórkowski. Towards a weighted induction method of dependency annotation. In Adam Przepiórkowski and Maciej Ogrodniczuk, editors, Advances in Natural Language Processing: Proceedings of the 9th International Conference on NLP, PolTAL 2014, Warsaw, Poland, September 17–19, 2014, number 8686 in Lecture Notes in Artificial Intelligence, pages 164–176. Springer International Publishing, Heidelberg, 2014.

Magdalena Zawisławska and Maciej Ogrodniczuk. The same or just much the same? Problems with coreference from the reader's perspective. In Marek Kuźniak, Agnieszka Libura, and Michał Szawerna, editors, From Conceptual Metaphor Theory to Cognitive Ethnolinguistics. Patterns of Imagery in Language, volume 3 of Studies in Language, Culture and Society, pages 173–184. Peter Lang, Frankfurt am Main, 2014.


Włodzimierz Gruszczyński, Dorota Adamiec, and Maciej Ogrodniczuk. Elektroniczny korpus tekstów polskich z XVII i XVIII w. (do 1772 r.) — prezentacja projektu badawczego. Polonica, XXXIII:309–316, 2013.

Włodzimierz Gruszczyński, Bartosz Broda, Bartłomiej Nitoń, and Maciej Ogrodniczuk. Jasnopis: a new application for measuring readability of Polish texts. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, page 581, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Elżbieta Hajnicz. Mapping named entities from NKJP corpus to Składnica treebank and Polish WordNet. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 92–105, Berlin, Heidelberg, 2013. Springer-Verlag.

Elżbieta Hajnicz. Actualising lexico-semantic annotation of Składnica Treebank to modified versions of source resources. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 178–182, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors. Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, Berlin, Heidelberg, 2013. Springer-Verlag.

Łukasz Kobyliński. Automatic detection of annotation errors in Polish-language corpora. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 106–111, Berlin, Heidelberg, 2013. Springer-Verlag.

Łukasz Kobyliński. Improving the accuracy of Polish POS tagging by using voting ensembles. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 453–456, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Katarzyna Krasnowska and Witold Kieraś. Polish LFG treebank on a shoestring. In Sandra Kübler, Petya Osenova, and Martin Volk, editors, Proceedings of The Twelfth Workshop on Treebanks and Linguistic Theories (TLT12), pages 109–120, Sofia, Bulgaria, 2013. The Institute of Information and Communication Technologies, Bulgarian Academy of Sciences.

Katarzyna Krasnowska. Towards a Polish LTAG grammar. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 16–21, Berlin, Heidelberg, 2013. Springer-Verlag.

Katarzyna Krasnowska and Adam Przepiórkowski. Detecting syntactic errors in dependency treebanks for morphosyntactically rich languages. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 69–79, Berlin, Heidelberg, 2013. Springer-Verlag.

Barbara Lewandowska-Tomaszczyk, Rafał Górski, Marek Łaziński, and Adam Przepiórkowski. The National Corpus of Polish (NKJP). Language use and data analysis. In Irina Kor Chahine and Charles Zaremba, editors, Travaux de slavistique : Actes du VIe congrès de la Slavic Linguistic Society, pages 309–319. Presses Universitaires de Provence, 2013.

Małgorzata Marciniak and Agnieszka Mykowiecka. Terminology extraction from domain texts in Polish. In R. Bembenik, L. Skonieczny, H. Rybinski, M. Kryszkiewicz, and M. Niezgodka, editors, Intelligent Tools for Building a Scientific Information Platform. Advanced Architectures and Solutions, volume 467 of Studies in Computational Intelligence, pages 171–185. Springer-Verlag, 2013.

Bartłomiej Nitoń. Evaluation of Uryupina's coreference resolution features for Polish. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 122–126, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Maciej Ogrodniczuk. Cyfrowi mówcy uczą się szybko. Academia, 4 (36):26–29, 2013.

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, and Magdalena Zawisławska. Interesting Linguistic Features in Coreference Annotation of an Inflectional Language. In Maosong Sun, Min Zhang, Dekang Lin, and Haifeng Wang, editors, 12th China National Conference on Computational Linguistics (12th CCL) and the 1st International Symposium on Natural Language Processing based on Naturally Annotated Big Data (1st NLP-NABD), number 8202 in Lecture Notes in Computer Science, pages 97–108. Springer-Verlag, Berlin, Heidelberg, 2013.

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, and Magdalena Zawisławska. Polish Coreference Corpus. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 494–498, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Maciej Ogrodniczuk. Translation- and projection-based unsupervised coreference resolution for Polish. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 125–130. Springer-Verlag, Berlin, Heidelberg, 2013.

Maciej Ogrodniczuk and Michał Lenart. A multi-purpose online toolset for NLP applications. In Elisabeth Métais, Farid Meziane, Mohamed Saraee, Vijay Sugumaran, and Sunil Vadera, editors, Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems, number 7934 in Lecture Notes in Computer Science, pages 392–395. Springer-Verlag, Berlin, Heidelberg, 2013.

Maciej Ogrodniczuk. Discovery of common nominal facts for coreference resolution: Proof of concept. In R. Prasath and T. Kathirvalavakumar, editors, Mining Intelligence and Knowledge Exploration (MIKE 2013), number 8284 in Lecture Notes in Artificial Intelligence, pages 709–716. Springer-Verlag, Berlin, Heidelberg, 2013.

Maciej Ogrodniczuk, Magdalena Zawisławska, Katarzyna Głowińska, and Agata Savary. Coreference annotation schema for an inflectional language. In Alexander Gelbukh, editor, Computational Linguistics and Intelligent Text Processing (CICLing 2013), number 7816 in Lecture Notes in Computer Science, pages 394–407, Heidelberg, 2013. Springer-Verlag.

Adam Przepiórkowski. The syntax of distance distributivity in Polish: Preserving generalisations with weak heads. In Stefan Müller, editor, Proceedings of the HPSG 2013 Conference, pages 161–181, Stanford, CA, 2013. CSLI Publications.

Adam Przepiórkowski and Agnieszka Patejuk. The syntax of distance distributivity in Polish: Weak heads in LFG via restriction. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'13 Conference, pages 482–502, Stanford, CA, 2013. CSLI Publications.

Adam Przepiórkowski, Maciej Piasecki, Krzysztof Jassem, and Piotr W. Fuglewicz, editors. Computational Linguistics: Applications. Springer-Verlag, Berlin, 2013.

Piotr Przybyła. Question Analysis for Polish Question Answering. In 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Student Research Workshop, pages 96–102, Sofia, Bulgaria, 2013. Association for Computational Linguistics.

Piotr Przybyła. Question Classification for Polish Question Answering. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Proceedings of the 20th International Conference on Language Processing and Intelligent Information Systems (LP&IIS 2013), pages 50–56. Springer-Verlag, 2013.

Djamé Seddah, Reut Tsarfaty, Sandra Kübler, Marie Candito, Jinho D. Choi, Richárd Farkas, Jennifer Foster, Iakes Goenaga, Koldo Gojenola Galletebeitia, Yoav Goldberg, Spence Green, Nizar Habash, Marco Kuhlmann, Wolfgang Maier, Yuval Marton, Joakim Nivre, Adam Przepiórkowski, Ryan Roth, Wolfgang Seeker, Yannick Versley, Veronika Vincze, Marcin Woliński, Alina Wróblewska, and Eric Villemonte de la Clergerie. Overview of the SPMRL 2013 shared task: A cross-framework evaluation of parsing morphologically rich languages. In Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages, pages 146–182, Seattle, WA, 2013. Association for Computational Linguistics.

Sebastian Sulger, Miriam Butt, Tracy Holloway King, Paul Meurer, Tibor Laczkó, György Rákosi, Cheikh Bamba Dione, Helge Dyvik, Victoria Rosén, Koenraad De Smedt, Agnieszka Patejuk, Özlem Çetinoğlu, I Wayan Arka, and Meladel Mistica. ParGramBank: The ParGram parallel treebank. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 550–560, Sofia, Bulgaria, 2013. Association for Computational Linguistics.

Jakub Waszczuk, Katarzyna Głowińska, Agata Savary, Adam Przepiórkowski, and Michał Lenart. Annotation tools for syntax and named entities in the National Corpus of Polish. International Journal of Data Mining, Modelling and Management, 5(2):103–122, 2013.

Marcin Woliński and Dominika Rogozińska. First experiments in PCFG-like disambiguation of constituency parse forests for Polish. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 343–347, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.

Alina Wróblewska and Piotr Sikora. Online service for Polish dependency parsing and results visualisation. In Mieczysław A. Kłopotek, Jacek Koronacki, Małgorzata Marciniak, Agnieszka Mykowiecka, and Sławomir T. Wierzchoń, editors, Language Processing and Intelligent Information Systems – 20th International Conference, IIS 2013, Warsaw, Poland, June 17-18, 2013. Proceedings, number 7912 in Lecture Notes in Computer Science, pages 39–44, Berlin, Heidelberg, 2013. Springer-Verlag.

Marcin Zając and Adam Przepiórkowski. Distant supervision learning of DBPedia relations. In Ivan Habernal and Václav Matoušek, editors, Text, Speech and Dialogue: 16th International Conference, TSD 2013, Pilsen, Czech Republic, number 8082 in Lecture Notes in Artificial Intelligence, pages 193–200. Springer-Verlag, Heidelberg, 2013.


Szymon Acedański, Adam Slaski, and Adam Przepiórkowski. Machine learning of syntactic attachment from morphosyntactic and semantic co-occurrence statistics. In Proceedings of the ACL 2012 Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages, pages 42–47, Jeju, Republic of Korea, 2012. Association for Computational Linguistics.

Anna Andrzejczuk. Klasyfikacja onomazjologiczna rzeczowników a ich charakterystyka gramatyczna. Nowy sposób opracowania materiału leksykograficznego. PhD thesis, Instytut Języka Polskiego, Polska Akademia Nauk, Cracow, 2012.

Anelia Belogay, Damir Ćavar, Dan Cristea, Diman Karagiozov, Svetla Koeva, Roumen Nikolov, Maciej Ogrodniczuk, Adam Przepiórkowski, Polivios Raxis, and Cristina Vertan. i-Publisher, i-Librarian and EUDocLib – linguistic services for the Web. In Piotr Pęzik, editor, Corpus Data across Languages and Disciplines, volume 28 of Łódź Studies in Language, pages 203–212. Peter Lang, 2012.

Anelia Belogay, Dan Cristea, Eugen Ignat, Diman Karagyozov, Svetla Koeva, Maciej Ogrodniczuk, Adam Przepiórkowski, Polivios Raxis, and Cristina Vertan. Merging heterogeneous resources and tools in a digital library. In Proceedings of the Merging LR Workshop at the Eighth International Conference on Language Resources and Evaluation (LREC 2012), pages 41–44, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Anelia Belogay, Diman Karagyozov, Svetla Koeva, Cristina Vertan, Adam Przepiórkowski, Dan Cristea, and Polivios Raxis. Harnessing NLP techniques in the processes of multilingual content management. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 6–10, Avignon, France, 2012. Association for Computational Linguistics.

Pascal Bouvry, Mieczysław A. Kłopotek, Franck Leprevost, Małgorzata Marciniak, Agnieszka Mykowiecka, and Henryk Rybiński, editors. Security and Intelligent Information Systems: International Joint Conference, SIIS 2011, Warsaw, Poland, June 13-14, 2011, Revised Selected Papers. Number 7053 in Lecture Notes in Computer Science. Springer-Verlag, 2012.

Łukasz Degórski and Adam Przepiórkowski. Ręcznie znakowany milionowy podkorpus NKJP. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 51–58. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Konrad Gołuchowski and Adam Przepiórkowski. Semantic role labelling without deep syntactic parsing. In Hitoshi Isahara and Kyoko Kanzaki, editors, Advances in Natural Language Processing: Proceedings of the 8th International Conference on NLP, JapTAL 2012, Kanazawa, Japan, October 22-24, 2012, number 7614 in Lecture Notes in Artificial Intelligence, pages 192–197. Springer-Verlag, Heidelberg, 2012.

Rafał L. Górski, Barbara Lewandowska-Tomaszczyk, Mirosław Bańko, Piotr Pęzik, Marek Łaziński, and Adam Przepiórkowski. Practical applications of the National Corpus of Polish. Prace Filologiczne, LXIII:231–240, 2012.

Elżbieta Hajnicz. Znakowanie semantyczne Składnicy frazowej. Założenia ogólne, nazwy własne, aktualizacja. Technical Report 1025, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2012.

Elżbieta Hajnicz. Similarity-based method of detecting diathesis alternations in semantic valence dictionary of Polish verbs. In Pascal Bouvry, Mieczysław A. Kłopotek, Franck Leprevost, Małgorzata Marciniak, Agnieszka Mykowiecka, and Henryk Rybiński, editors, Security and Intelligent Information Systems: International Joint Conference, SIIS 2011, Warsaw, Poland, June 13-14, 2011, Revised Selected Papers, number 7053 in Lecture Notes in Computer Science, pages 345–358. Springer-Verlag, 2012.

Przemysław Jarzębowski and Adam Przepiórkowski. Temporal information extraction with cross-language projected data. In Hitoshi Isahara and Kyoko Kanzaki, editors, Advances in Natural Language Processing: Proceedings of the 8th International Conference on NLP, JapTAL 2012, Kanazawa, Japan, October 22-24, 2012, number 7614 in Lecture Notes in Artificial Intelligence, pages 198–209. Springer-Verlag, Heidelberg, 2012.

Diman Karagiozov, Anelia Belogay, Dan Cristea, Svetla Koeva, Maciej Ogrodniczuk, Polivios Raxis, Emil Stoyanov, and Cristina Vertan. i-Librarian — Free online library for European citizens. INFOtheca: Journal of Informatics and Librarianship, 13(1):27–42, 2012.

Witold Kieraś. Atrakcje wyjazdowe, czyli w obydwie strony bez wahadła. O słownictwie kibiców piłkarskich. Socjolingwistyka, 24–25:119–134, 2012.

Łukasz Kobyliński. Mining class association rules for word sense disambiguation. In Pascal Bouvry, Mieczysław A. Kłopotek, Franck Leprevost, Małgorzata Marciniak, Agnieszka Mykowiecka, and Henryk Rybiński, editors, Security and Intelligent Information Systems: International Joint Conference, SIIS 2011, Warsaw, Poland, June 13-14, 2011, Revised Selected Papers, number 7053 in Lecture Notes in Computer Science, pages 307–318. Springer-Verlag, 2012.

Łukasz Kobyliński and Mateusz Kopeć. Semantic similarity functions in word sense disambiguation. In Petr Sojka, Aleš Horák, Ivan Kopeček, and Karel Pala, editors, Text, Speech and Dialogue: 15th International Conference, TSD 2012, Brno, Czech Republic, number 7499 in Lecture Notes in Artificial Intelligence, pages 31–38, Heidelberg, 2012. Springer-Verlag.

Łukasz Kobyliński and Krzysztof Walczak. Emerging patterns and classification for spatial and image data. In Guozhu Dong and James Bailey, editors, Contrast Data Mining: Concepts, Algorithms and Applications, Data Mining and Knowledge Discovery, pages 285–302. Chapman & Hall/CRC, 2012.

Mateusz Kopeć, Rafał Młodzki, and Adam Przepiórkowski. Word Sense Disambiguation in the National Corpus of Polish. Prace Filologiczne, LXIII:155–165, 2012.

Mateusz Kopeć, Rafał Młodzki, and Adam Przepiórkowski. Automatyczne znakowanie sensami słów. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 209–224. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Mateusz Kopeć and Maciej Ogrodniczuk. Creating a Coreference Resolution System for Polish. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 192–195, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Katarzyna Krasnowska, Witold Kieraś, Marcin Woliński, and Adam Przepiórkowski. Using tree transducers for detecting errors in a treebank of Polish. In Petr Sojka, Aleš Horák, Ivan Kopeček, and Karel Pala, editors, Text, Speech and Dialogue: 15th International Conference, TSD 2012, Brno, Czech Republic, number 7499 in Lecture Notes in Artificial Intelligence, pages 119–126. Springer-Verlag, Heidelberg, 2012.

Barbara Lewandowska-Tomaszczyk, Mirosław Bańko, Rafał L. Górski, Marek Łaziński, Piotr Pęzik, and Adam Przepiórkowski. Narodowy Korpus Języka Polskiego: geneza i dzień dzisiejszy. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 3–10. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Małgorzata Marciniak and Agnieszka Mykowiecka. Terminology extraction from medical texts in Polish. In Sophia Ananiadou, Sampo Pyysalo, Dietrich Rebholz-Schuhmann, Fabio Rinaldi, and Tapio Salakoski, editors, Proceedings of the 5th International Symposium on Semantic Mining in Biomedicine. University of Zurich, 2012.

Agnieszka Mykowiecka and Małgorzata Marciniak. Clustering of medical terms based on morpho-syntactic features. In Proceedings of International Conference on Knowledge Engineering and Ontology Development (KEOD 2012), pages 214–219. SciTePress, 2012.

Agnieszka Mykowiecka and Małgorzata Marciniak. Combining wordnet and morphosyntactic information in terminology clustering. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, 2012.

Agnieszka Mykowiecka, Piotr Rychlik, and Jakub Waszczuk. Building an electronic dictionary of Old Polish on the base of the paper resource. In Proceedings of the Workshop on Adaptation of Language Resources and Tools for Processing Cultural Heritage at LREC 2012, pages 16–21. European Language Resources Association (ELRA), 2012.

Maciej Ogrodniczuk. The Polish Sejm Corpus. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 2219–2223, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Maciej Ogrodniczuk, Radovan Garabík, Svetla Koeva, Cvetana Krstev, Piotr Pęzik, Tibor Pintér, Adam Przepiórkowski, György Szaszák, Marko Tadić, Tamás Váradi, and Duško Vitas. Central and South-European language resources in META-SHARE. INFOtheca: Journal of Informatics and Librarianship, 13(1):3–26, 2012.

Maciej Ogrodniczuk and Michał Lenart. Web Service integration platform for Polish linguistic resources. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 1164–1168, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Maciej Ogrodniczuk, Piotr Pęzik, and Adam Przepiórkowski. Towards a comprehensive open repository of Polish language resources. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 3593–3597, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Maciej Ogrodniczuk and Adam Przepiórkowski. Polish language processing chains for multilingual information systems. In Gosse Bouma, Ashwin Ittoo, Elisabeth Métais, and Hans Wortmann, editors, Natural Language Processing and Information Systems, number 7337 in Lecture Notes in Computer Science, pages 152–157. Springer-Verlag, Heidelberg, 2012.

Maciej Ogrodniczuk and Magdalena Zawisławska. Semantic approach to identity in coreference resolution task. In Birte Glimm and Antonio Krüger, editors, KI 2012: Advances in Artificial Intelligence, number 7526 in Lecture Notes in Artificial Intelligence, pages 241–244. Springer-Verlag, Heidelberg, 2012.

Agnieszka Patejuk and Adam Przepiórkowski. A comprehensive analysis of constituent coordination for grammar engineering. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), pages 2191–2207, Mumbai, India, 2012.

Agnieszka Patejuk and Adam Przepiórkowski. Towards an LFG parser for Polish: An exercise in parasitic grammar development. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 3849–3852, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Agnieszka Patejuk and Adam Przepiórkowski. Lexico-semantic coordination in Polish. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'12 Conference, pages 461–478, Stanford, CA, 2012. CSLI Publications.

Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors. Narodowy Korpus Języka Polskiego. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Adam Przepiórkowski and Michał Lenart. Simultaneous error detection at two levels of syntactic annotation. In Proceedings of the Sixth Linguistic Annotation Workshop, pages 118–123, Jeju, Republic of Korea, 2012. Association for Computational Linguistics.

Adam Przepiórkowski and Agnieszka Patejuk. On case assignment and the coordination of unlikes: The limits of distributive features. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'12 Conference, pages 479–489, Stanford, CA, 2012. CSLI Publications.

Adam Przepiórkowski and Agnieszka Patejuk. The puzzle of case agreement between numeral phrases and predicative adjectives in Polish. In Miriam Butt and Tracy Holloway King, editors, The Proceedings of the LFG'12 Conference, pages 490–502, Stanford, CA, 2012. CSLI Publications.

Adam Przepiórkowski. Znakowanie XML. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 169–193. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Piotr Przybyła. Issues of Polish Question Answering. In Olgierd Hryniewicz, Jan Mielniczuk, Wojciech Penczek, and Jacek Waniewski, editors, Proceedings of the first conference 'Information Technologies: Research and their Interdisciplinary Applications' (ITRIA 2012), pages 122–139. Institute of Computer Science, Polish Academy of Sciences, 2012.

Zygmunt Saloni, Marcin Woliński, Robert Wołosz, Włodzimierz Gruszczyński, and Danuta Skowrońska. Słownik gramatyczny języka polskiego. Warsaw, 2nd edition, 2012.

Łukasz Szałkiewicz and Adam Przepiórkowski. Anotacja morfoskładniowa. In Adam Przepiórkowski, Mirosław Bańko, Rafał L. Górski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus Języka Polskiego, pages 59–96. Wydawnictwo Naukowe PWN, Warsaw, 2012.

Marcin Woliński, Marcin Miłkowski, Maciej Ogrodniczuk, Adam Przepiórkowski, and Łukasz Szałkiewicz. PoliMorf: A (not so) new open morphological dictionary for Polish. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 860–864, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

Marcin Woliński and Andrzej Zaborowski. An ambiguity aware treebank search tool. In Petr Sojka, Aleš Horák, Ivan Kopeček, and Karel Pala, editors, Text, Speech and Dialogue: 15th International Conference, TSD 2012, Brno, Czech Republic, number 7499 in Lecture Notes in Artificial Intelligence, pages 88–94. Springer-Verlag, Heidelberg, 2012.

Alina Wróblewska. Polish dependency bank. Linguistic Issues in Language Technology, 7(1), 2012.

Alina Wróblewska and Adam Przepiórkowski. Induction of dependency structures based on weighted projection. In Proceedings of the 4th International Conference on Computational Collective Intelligence Technologies and Applications (ICCCI 2012), Part I, number 7653 in Lecture Notes in Artificial Intelligence, pages 364–374, Berlin, 2012. Springer-Verlag.

Alina Wróblewska and Marcin Sydow. DEBORA: Dependency-based method for extracting entity-relationship triples from open-domain texts in Polish. In Li Chen, Alexander Felfernig, Jiming Liu, and Zbigniew W. Raś, editors, Foundations of Intelligent Systems. Proceedings of the 20th International Symposium, ISMIS 2012, Macau, China, number 7661 in Lecture Notes in Computer Science, pages 155–161, Berlin, Heidelberg, 2012. Springer-Verlag.

Alina Wróblewska and Marcin Woliński. Preliminary experiments in Polish dependency parsing. In Pascal Bouvry, Mieczysław A. Kłopotek, Franck Leprevost, Małgorzata Marciniak, Agnieszka Mykowiecka, and Henryk Rybiński, editors, Security and Intelligent Information Systems: International Joint Conference, SIIS 2011, Warsaw, Poland, June 13-14, 2011, Revised Selected Papers, number 7053 in Lecture Notes in Computer Science, pages 279–292. Springer-Verlag, 2012.

Bartosz Zaborowski and Adam Przepiórkowski. Tagset conversion with decision trees. In Hitoshi Isahara and Kyoko Kanzaki, editors, Advances in Natural Language Processing: Proceedings of the 8th International Conference on NLP, JapTAL 2012, Kanazawa, Japan, October 22-24, 2012, number 7614 in Lecture Notes in Artificial Intelligence, pages 144–155. Springer-Verlag, Heidelberg, 2012.


Anna Andrzejczuk. Dwoje urodzin to brzmi dziwnie. Norma językowa dotycząca połączeń rzeczowników PT z liczebnikami a jej realizacja w tekstach Narodowego Korpusu Języka Polskiego i w tekstach internetowych. Język Polski, XCI(4):273–283, 2011.

Anelia Belogay, Dan Cristea, Eugen Ignat, Diman Karagiozov, Svetla Koeva, Maciej Ogrodniczuk, Adam Przepiórkowski, Polivios Raxis, and Cristina Vertan. Language processing chains in ATLAS. In Zygmunt Vetulani, editor, Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, page 577, Poznań, Poland, 2011.

Radovan Garabík, Svetla Koeva, Cvetana Krstev, Maciej Ogrodniczuk, Adam Przepiórkowski, Mladen Stanojević, Marko Tadić, and Tamás Váradi. CESAR resources in META-SHARE repository. In Zygmunt Vetulani, editor, Proceedings of the 5th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, page 583, Poznań, Poland, 2011.