


Contact information


Institute of Computer Science, Polish Academy of Sciences

Jana Kazimierza 5

01-248 Warszawa



<jakub DOT piskorski AT SPAMFREE ipipan DOT waw DOT pl>

Selected publications

List of publications


Jakub Piskorski, Michał Marcińczuk, Preslav Nakov, Maciej Ogrodniczuk, Senja Pollak, Pavel Přibáň, Piotr Rybak, Josef Steinberger, and Roman Yangarber, editors. Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023), Dubrovnik, Croatia, 2023. Association for Computational Linguistics.


Jakub Piskorski, Karol Wieloch, and Marcin Sydow. On knowledge-poor methods for person name matching and lemmatization for highly inflectional languages. Information Retrieval, 12:275–299, 2009.


Sandra Kübler, Jakub Piskorski, and Adam Przepiórkowski, editors. Proceedings of the LREC 2008 Workshop on Partial Parsing: Between Chunking and Deep Parsing, Marrakech, 2008. European Language Resources Association (ELRA).


Agnieszka Mykowiecka, Anna Kupść, Małgorzata Marciniak, and Jakub Piskorski. Resources for Information Extraction from Polish texts. In Zygmunt Vetulani, editor, Proceedings of the 3rd Language & Technology Conference, Poznań, Poland, 2007.

Jakub Piskorski. ExPRESS – extraction pattern recognition engine and specification suite. In Proceedings of the International Workshop Finite-State Methods and Natural Language Processing 2007 (FSMNLP'2007), Potsdam, 2007.

Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger, and Hristo Tanev, editors. Proceedings of the Workshop on Balto-Slavonic Natural Language Processing at ACL 2007, Prague, 2007.

Jakub Piskorski, Marcin Sydow, and Anna Kupść. Lemmatization of Polish person names. In Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger, and Hristo Tanev, editors, Proceedings of the Workshop on Balto-Slavonic Natural Language Processing at ACL 2007, pages 27–34, Prague, 2007.


Małgorzata Marciniak, Agnieszka Mykowiecka, Anna Kupść, and Jakub Piskorski. Intelligent content extraction from Polish medical texts. In Leonard Bolc, Zbigniew Michalewicz, and Toyoaki Nishida, editors, Intelligent Media Technology for Communicative Intelligence, Second International Workshop, IMTCI 2004, Warsaw, Poland, September 13–14, 2004, Revised Selected Papers, number 3490 in Lecture Notes in Computer Science, pages 68–78. Springer-Verlag, 2005.

Jakub Piskorski. Named-entity recognition for Polish with SProUT. In Leonard Bolc, Zbigniew Michalewicz, and Toyoaki Nishida, editors, Intelligent Media Technology for Communicative Intelligence, Second International Workshop, IMTCI 2004, Warsaw, Poland, September 13–14, 2004, Revised Selected Papers, number 3490 in Lecture Notes in Computer Science. Springer-Verlag, 2005.

Jakub Piskorski and Marcin Sydow. Exploring deployment of linguistic features in classification of Polish. In Zygmunt Vetulani, editor, Proceedings of the 2nd Language & Technology Conference, Poznań, Poland, 2005.


Witold Drożdżyński, Hans-Ulrich Krieger, Jakub Piskorski, Ulrich Schäfer, and Feiyu Xu. Shallow processing with unification and typed feature structures — foundations and applications. Künstliche Intelligenz, 1:17–23, 2004.

Anna Kupść, Małgorzata Marciniak, Agnieszka Mykowiecka, Jakub Piskorski, and Teresa Podsiadły-Marczykowska. Information Extraction from mammographic reports. In KONVENS 2004, pages 113–116, Vienna, 2004.

Małgorzata Marciniak, Agnieszka Mykowiecka, Anna Kupść, and Jakub Piskorski. Intelligent content extraction from Polish medical texts. In Proceedings of International Workshop on Intelligent Media Technology for Communicative Intelligence, pages 96–99, Warsaw, 2004.

Jakub Piskorski. Extraction of Polish named-entities. In Proceedings of the Fourth International Conference on Language Resources and Evaluation, LREC 2004, pages 313–316, Lisbon, 2004. European Language Resources Association (ELRA).

Jakub Piskorski. Rule-based named-entity recognition for Polish. In Proceedings of the Workshop on Named-Entity Recognition for NLP Applications held in conjunction with the 1st International Joint Conference on NLP, March 2004, Sanya, Hainan Island, China, 2004.

Jakub Piskorski, Petr Homola, Małgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, and Marcin Woliński. Information extraction for Polish using the SProUT platform. In Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, and Krzysztof Trojanowski, editors, Intelligent Information Processing and Web Mining, Advances in Soft Computing, pages 227–236. Springer-Verlag, Berlin, 2004.


Witold Drożdżyński, Petr Homola, Jakub Piskorski, and Vytautas Zinkevičius. Adapting SProUT to processing Baltic and Slavonic languages. In Hamish Cunningham, Elena Paskaleva, Kalina Bontcheva, and G. Angelova, editors, Information Extraction for Slavonic and Other Central and Eastern European Languages, pages 18–25, Borovets, Bulgaria, 2003.


Markus Becker, Witold Drożdżyński, Hans-Ulrich Krieger, Jakub Piskorski, Ulrich Schäfer, and Feiyu Xu. SProUT — shallow processing with typed feature structures and unification. In Proceedings of the International Conference on NLP (ICON 2002), Mumbai, India, 2002.

Berthold Crysmann, Anette Frank, Bernd Kiefer, Hans-Ulrich Krieger, Stefan Müller, Günter Neumann, Jakub Piskorski, Ulrich Schäfer, Melanie Siegel, Hans Uszkoreit, and Feiyu Xu. An integrated architecture for shallow and deep processing. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, PA, 2002.

Günter Neumann and Jakub Piskorski. A shallow text processing core engine. Journal of Computational Intelligence, 18(3):451–476, 2002.


Günter Neumann, Christian Braun, and Jakub Piskorski. A divide-and-conquer strategy for shallow parsing of German free texts. In Proceedings of the 6th Applied Natural Language Processing Conference, pages 239–246, Seattle, WA, 2000. ACL.

Jakub Piskorski and Günter Neumann. An intelligent text extraction and navigation system. In Proceedings of 6th International Conference on Computer-Assisted Information Retrieval (RIAO-2000), Paris, 2000.