JakubPiskorski

Contact information

Address:

Institute of Computer Science, Polish Academy of Sciences

Jana Kazimierza 5

01-248 Warszawa

Poland

E-mail:

<jakub DOT piskorski AT SPAMFREE ipipan DOT waw DOT pl>

Selected publications

List of publications

2013

Martin Atkinson, Jakub Piskorski, Hristo Tanev, Roman Yangarber, and Vanni Zavarella. Techniques for Multilingual Security-related Event Extraction from Online News. Springer-Verlag, Berlin, 2013.

Jakub Piskorski and Maud Ehrmann. On Named Entity Recognition in Targeted Twitter Streams in Polish. In Proceedings of the 4th Biennial Workshop on Balto-Slavic Natural Language Processing (BSNLP), collocated with ACL 2013, 2013.

Jakub Piskorski, Lidia Pivovarova, Hristo Tanev, and Roman Yangarber, editors. Proceedings of the 4th Biennial Workshop on Balto-Slavic Natural Language Processing (BSNLP 2013). Held at ACL 2013. Sofia, Bulgaria, 8-9 August 2013. Association for Computational Linguistics, 2013.

Jakub Piskorski, Hristo Tanev, and Alexandra Balahur. Exploiting Twitter for Border Security-Related Intelligence Gathering. In IEEE Proceedings of the 3rd European Intelligence and Security Informatics Conference (EISIC 2013), Uppsala, Sweden, 2013.

Jakub Piskorski and Roman Yangarber. Information Extraction: Past, Present and Future. In T. Poibeau, H. Saggion, J. Piskorski, and R. Yangarber, editors, Multi-source, Multilingual Information Extraction and Summarization. Volume in the Series: Theory and Applications of Natural Language Processing. Springer-Verlag, Berlin & New York, 2013.

Thierry Poibeau, Horacio Saggion, Jakub Piskorski, and Roman Yangarber, editors. Multi-source, Multilingual Information Extraction and Summarization. Springer-Verlag, Berlin & New York, 2013.

2012

Hristo Tanev, Maud Ehrmann, Jakub Piskorski, and Vanni Zavarella. Enhancing Event Descriptions through Twitter Mining. In Proceedings of the 6th International AAAI Conference on Weblogs and Social Media, Dublin, Ireland, 2012.

Vanni Zavarella, Jakub Piskorski, Ana Esteves, and Stefano Bucci. Refining Border Security News Event Geotagging through Deployment of Lexico-Semantic Patterns. In IEEE Proceedings of the European Intelligence and Security Informatics Conference (EISIC 2012), Odense, Denmark, 2012.

2011

Martin Atkinson, Jakub Piskorski, Roman Yangarber, and E. van der Goot. Multilingual Real-Time Event Extraction for Border Security Intelligence Gathering. In Uffe Kock Wiil, editor, Open Source Intelligence and Counter-terrorism. Springer, LNCS, Vol. 2, 2011.

Jakub Piskorski, Jenya Belyaeva, and Martin Atkinson. Exploring the Usefulness of Cross-lingual Information Fusion for Refining Real-time News Event Extraction: A Preliminary Study. In Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP 2011), Hissar, Bulgaria, 2011.

Jakub Piskorski, Hristo Tanev, Martin Atkinson, Erik van der Goot, and Vanni Zavarella. Online News Event Extraction for Global Crisis Surveillance. Transactions on Computational Collective Intelligence, 6910(5):182–212, 2011.

Agata Savary and Jakub Piskorski. Language Resources for Named Entity Annotation in the National Corpus of Polish. Control and Cybernetics, 40(2):361–391, 2011.

2010

Jan Daciuk, Jakub Piskorski, and Strahil Ristov. NLP Dictionaries Implemented as Finite-State Automata. In Carlos Martín-Vide, editor, Mathematics, Computing, Language, and Life: Frontiers in Mathematical Linguistics and Language Theory - Vol. 2 SCIENTIFIC APPLICATIONS OF LANGUAGE METHODS, pages 133–204. World Scientific & Imperial College Press, 2010.

Jakub Piskorski, Martin Atkinson, Jenya Belyaeva, Vanni Zavarella, Silja Huttunen, and Roman Yangarber. Real-Time Text Mining in Multilingual News for the Creation of a Pre-frontier Intelligence Picture. In Proceedings of 16th Conference on Knowledge Discovery and Data Mining (KDD 2010) ACM SIGKDD Workshop on Intelligence and Security Informatics, Washington, DC, USA, 2010.

2009

Ivan Budiscak, Jakub Piskorski, and Strahil Ristov. Compressing Gazetteers Revisited. In Pre-proceedings of the 8th International Workshop on Finite-State Methods and Natural Language Processing 2009 (FSMNLP 2009), Pretoria, South Africa, 2009.

Jakub Piskorski. Exploring Curvature-based Topic Development Analysis for Detecting Event Reporting Boundaries. In Małgorzata Marciniak and Agnieszka Mykowiecka, editors, Aspects of Natural Language Processing. Essays dedicated to Leonard Bolc on the Occasion of His 75th Birthday, volume 5070 of Lecture Notes in Computer Science, pages 311–331. Springer-Verlag, Berlin, 2009.

Jakub Piskorski, Marcin Sydow, and Karol Wieloch. Comparison of String Distance Metrics for Lemmatisation of Named Entities in Polish. In Zygmunt Vetulani and Hans Uszkoreit, editors, Human Language Technology: Challenges of the Information Society, volume 5603 of Lecture Notes in Artificial Intelligence. Springer-Verlag, Berlin, 2009.

Jakub Piskorski, Bruce Watson, and Anssi Yli-Jyrä, editors. Post-proceedings of the Workshop on Finite-State Methods and Natural Language Processing 2008 (FSMNLP 2008), “Frontiers in Artificial Intelligence and Applications”, Volume 191, IOS Press, Amsterdam, The Netherlands. IOS Press / Association for Computational Linguistics, 2009.

Jakub Piskorski, Karol Wieloch, and Marcin Sydow. On knowledge-poor methods for person name matching and lemmatization for highly inflectional languages. Information Retrieval, 12:275–299, 2009.

Hristo Tanev, Vanni Zavarella, Jens Linge, Mihail Kabadjov, Jakub Piskorski, Martin Atkinson, and Ralf Steinberger. Exploiting Machine Learning Techniques to Build an Event Extraction System for Portuguese and Spanish. Linguamática: Revista para o Processamento Automático das Línguas Ibéricas, 2:550–566, 2009.

2008

Sandra Kübler, Jakub Piskorski, and Adam Przepiórkowski, editors. Proceedings of the LREC 2008 Workshop on Partial Parsing: Between Chunking and Deep Parsing, Marrakech, 2008. ELRA.

Jakub Piskorski, Marcin Sydow, and Dawid Weiss. Exploring Linguistic Features for Web Spam Detection: A Preliminary Study. In Proceedings of the 4th Workshop on Adversarial Information Retrieval on the Web (AIRWEB 2008), 16th World Wide Web Conference 2008, Beijing, China, 2008.

Marcin Sydow, Jakub Piskorski, Dawid Weiss, and Carlos Castillo. Fighting Web Spam. In D. Perrotta, J. Piskorski, F. Soulié-Fogelman, and R. Steinberger, editors, Mining Massive Data Sets for Security, Volume 19 of the NATO Science for Peace and Security Series. IOS Press, Amsterdam, The Netherlands, 2008.

Hristo Tanev, Jakub Piskorski, and Martin Atkinson. Real-Time News Event Extraction for Global Crisis Monitoring. In Proceedings of NLDB 2008, pages 207–218, 2008.

Vanni Zavarella, Hristo Tanev, and Jakub Piskorski. Event Extraction for Italian using a Cascade of Finite-State Grammars. In Proceedings of the International Workshop on Finite-State Methods and Natural Language Processing (FSMNLP 2008), 2008.

2007

Agnieszka Mykowiecka, Anna Kupść, Małgorzata Marciniak, and Jakub Piskorski. Resources for Information Extraction from Polish texts. In Zygmunt Vetulani, editor, Proceedings of the 3rd Language & Technology Conference, Poznań, Poland, 2007.

Jakub Piskorski. ExPRESS – extraction pattern recognition engine and specification suite. In Proceedings of the International Workshop Finite-State Methods and Natural Language Processing 2007 (FSMNLP'2007), Potsdam, 2007.

Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger, and Hristo Tanev, editors. Proceedings of the Workshop on Balto-Slavonic Natural Language Processing at ACL 2007, Prague, 2007.

Jakub Piskorski, Marcin Sydow, and Anna Kupść. Lemmatization of Polish person names. In Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger, and Hristo Tanev, editors, Proceedings of the Workshop on Balto-Slavonic Natural Language Processing at ACL 2007, pages 27–34, Prague, 2007.

2005

Małgorzata Marciniak, Agnieszka Mykowiecka, Anna Kupść, and Jakub Piskorski. Intelligent content extraction from Polish medical texts. In Leonard Bolc, Zbigniew Michalewicz, and Toyoaki Nishida, editors, Intelligent Media Technology for Communicative Intelligence, Second International Workshop, IMTCI 2004, Warsaw, Poland, September 13–14, 2004, Revised Selected Papers, volume 3490 of Lecture Notes in Computer Science, pages 68–78. Springer-Verlag, 2005.

Jakub Piskorski. Named-entity recognition for Polish with SProUT. In Leonard Bolc, Zbigniew Michalewicz, and Toyoaki Nishida, editors, Intelligent Media Technology for Communicative Intelligence, Second International Workshop, IMTCI 2004, Warsaw, Poland, September 13–14, 2004, Revised Selected Papers, volume 3490 of Lecture Notes in Computer Science. Springer-Verlag, 2005.

Jakub Piskorski and Marcin Sydow. Exploring deployment of linguistic features in classification of Polish. In Zygmunt Vetulani, editor, Proceedings of the 2nd Language & Technology Conference, Poznań, Poland, 2005.

2004

Witold Drożdżyński, Hans-Ulrich Krieger, Jakub Piskorski, Ulrich Schäfer, and Feiyu Xu. Shallow processing with unification and typed feature structures — foundations and applications. Künstliche Intelligenz, 1:17–23, 2004.

Anna Kupść, Małgorzata Marciniak, Agnieszka Mykowiecka, Jakub Piskorski, and Teresa Podsiadły-Marczykowska. Information Extraction from mammographic reports. In KONVENS 2004, pages 113–116, Vienna, 2004.

Małgorzata Marciniak, Agnieszka Mykowiecka, Anna Kupść, and Jakub Piskorski. Intelligent content extraction from Polish medical texts. In Proceedings of International Workshop on Intelligent Media Technology for Communicative Intelligence, pages 96–99, Warsaw, 2004.

Jakub Piskorski. Extraction of Polish named-entities. In Proceedings of the Fourth International Conference on Language Resources and Evaluation, LREC 2004, pages 313–316, Lisbon, 2004. ELRA.

Jakub Piskorski. Rule-based named-entity recognition for Polish. In Proceedings of the Workshop on Named-Entity Recognition for NLP Applications held in conjunction with the 1st International Joint Conference on NLP, March 2004, Sanya, Hainan Island, China, 2004.

Jakub Piskorski, Petr Homola, Małgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, and Marcin Woliński. Information extraction for Polish using the SProUT platform. In Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, and Krzysztof Trojanowski, editors, Intelligent Information Processing and Web Mining, Advances in Soft Computing, pages 227–236. Springer-Verlag, Berlin, 2004.

2003

Witold Drożdżyński, Petr Homola, Jakub Piskorski, and Vytautas Zinkevičius. Adapting SProUT to processing Baltic and Slavonic languages. In Hamish Cunningham, Elena Paskaleva, Kalina Bontcheva, and G. Angelova, editors, Information Extraction for Slavonic and Other Central and Eastern European Languages, pages 18–25, Borovets, Bulgaria, 2003.

2002

Markus Becker, Witold Drożdżyński, Hans-Ulrich Krieger, Jakub Piskorski, Ulrich Schäfer, and Feiyu Xu. SProUT — shallow processing with typed feature structures and unification. In Proceedings of the International Conference on NLP (ICON 2002), Mumbai, India, 2002.

Berthold Crysmann, Anette Frank, Bernd Kiefer, Hans-Ulrich Krieger, Stefan Müller, Günter Neumann, Jakub Piskorski, Ulrich Schäfer, Melanie Siegel, Hans Uszkoreit, and Feiyu Xu. An integrated architecture for shallow and deep processing. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, PA, 2002.

Günter Neumann and Jakub Piskorski. A shallow text processing core engine. Journal of Computational Intelligence, 18(3):451–476, 2002.

2000

Günter Neumann, Christian Braun, and Jakub Piskorski. A divide-and-conquer strategy for shallow parsing of German free texts. In Proceedings of the 6th Applied Natural Language Processing Conference, pages 239–246, Seattle, WA, 2000. ACL.

Jakub Piskorski and Günter Neumann. An intelligent text extraction and navigation system. In Proceedings of 6th International Conference on Computer-Assisted Information Retrieval (RIAO-2000), Paris, 2000.