Locked History Actions

Diff for "ZILStart"

Differences between revisions 37 and 245 (spanning 208 versions)
Revision 37 as of 2013-03-01 10:08:05
Size: 9768
Comment:
Revision 245 as of 2024-06-07 09:36:59
Size: 17772
Comment:
Deletions are marked like this. Additions are marked like this.
Line 4: Line 4:
The Linguistic Engineering (LE) Group is part of the [[http://www.ipipan.waw.pl/en/dept/dept-ai.html|Department of Artificial Intelligence]] at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[http://www.english.pan.pl/|Polish Academy of Sciences]] (ICS PAS). The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[https://institution.pan.pl/|Polish Academy of Sciences]] (IPI PAN).
Line 8: Line 8:
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/KacperChwialkowski|Kacper Chwiałkowski]] (part time) || [[mailto:kacper.chwialkowski@ipipan.waw.pl|kacper.chwialkowski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/LukaszDegorski|Łukasz Degórski]], MSc || [[mailto:ldegorski@ipipan.waw.pl|ldegorski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] ||
|| [[http://ak243.user.srcf.net/annakibort.html|Anna Kibort]], PhD (part time) || [[mailto:ak243@cam.ac.uk|ak243@cam.ac.uk]] ||
|| [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MateuszKopec|Mateusz Kopeć]], MSc (part time) || [[mailto:mateusz.kopec@ipipan.waw.pl|mateusz.kopec@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]], MSc (part time) || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AnnaKupsc|Anna Kupść]], PhD (on leave) || [[mailto:anna.kupsc@ipipan.waw.pl|anna.kupsc@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]], MSc || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/JoannaOgrodniczuk|Joanna Ogrodniczuk]], MSc (part time) || [[mailto:joanna.ogrodniczuk@ipipan.waw.pl|joanna.ogrodniczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/JakubPiskorski|Jakub Piskorski]], PhD, Associate || [[mailto:jakub.piskorski@ipipan.waw.pl|jakub.piskorski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Head of the Group || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD || [[mailto:rychlik@ipipan.waw.pl|rychlik@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/ZygmuntSaloni|Zygmunt Saloni]], Professor (part time) || [[mailto:zygmunt.saloni@ipipan.waw.pl|zygmunt.saloni@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/FilipSkwarski|Filip Skwarski]], MSc (part time) || [[mailto:filip.skwarski@ipipan.waw.pl|filip.skwarski@ipipan.waw.pl]] ||
|| [[http://www.cs.albany.edu/~tomek/|Tomek Strzałkowski]], PhD, Foreign Associate || [[mailto:tomek@cs.albany.edu|tomek@cs.albany.edu]] ||
|| [[http://zil.ipipan.waw.pl/LukaszSzalkiewicz|Łukasz Szałkiewicz]], MSc || [[mailto:lukasz.szalkiewicz@ipipan.waw.pl|lukasz.szalkiewicz@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/JanSzejko|Jan Szejko]] (part time) || [[mailto:jan.szejko@ipipan.waw.pl|jan.szejko@ipipan.waw.pl]] ||
|| [[http://www.site.uottawa.ca/~szpak/|Stan Szpakowicz]], PhD, Foreign Associate || [[mailto:szpak@site.uottawa.ca|szpak@site.uottawa.ca]] ||
|| [[http://zil.ipipan.waw.pl/MarekSwidzinski|Marek Świdziński]], Professor (part time) || [[mailto:marek.swidzinski@ipipan.waw.pl|marek.swidzinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/JakubWaszczuk|Jakub Waszczuk]], MSc (part time) || [[mailto:jakub.waszczuk@ipipan.waw.pl|jakub.waszczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], MSc || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AleksandraWieczorek|Aleksandra Wieczorek]], PhD (part time) || [[mailto:aleksandra.wieczorek@ipipan.waw.pl|aleksandra.wieczorek@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD (part time) || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], MSc || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BartoszZaborowski|Bartosz Zaborowski]], MSc (part time) || [[mailto:bartosz.zaborowski@ipipan.waw.pl|bartosz.zaborowski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/SebastianZurowski|Sebastian Żurowski]], PhD (part time) || [[mailto:sebastian.zurowski@ipipan.waw.pl|sebastian.zurowski@ipipan.waw.pl]] ||
=== Core team ===

|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD (on leave) || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/TomaszBartosiak|Tomasz Bartosiak]], MSc || [[mailto:tomasz.bartosiak@gmail.com|tomasz.bartosiak@gmail.com]] ||
|| [[https://www.diegofeinmann.com/|Diego Feinmann]], PhD || [[mailto:diego.feinmann@ipipan.waw.pl|diego.feinmann@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD, Assoc. Prof. || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/WitoldKieras|Witold Kieraś]], PhD || [[mailto:witold.kieras@ipipan.waw.pl|witold.kieras@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD || [[mailto:lkobylinski@ipipan.waw.pl|lukasz.kobylinski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/DorotaKomosi%C5%84ska|Dorota Komosińska]], MSc || [[mailto:dorota.komosinska@gmail.com|dorota.komosinska@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska-Kieraś]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD, Assoc. Prof. || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD, Assoc. Prof. || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Assoc. Prof., Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AgnieszkaPatejuk|Agnieszka Patejuk]], PhD || [[mailto:aep@ipipan.waw.pl|agnieszka.patejuk@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Full Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/PiotrPrzybyla|Piotr Przybyła]], PhD (on postdoctoral fellowship at [[https://www.upf.edu/web/erinia|UPF]]) || [[mailto:piotr.przybyla@ipipan.waw.pl|piotr.przybyla@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MichałRudolf|Michał Rudolf]], PhD || [[mailto:michal@rudolf.waw.pl|michal@rudolf.waw.pl]] ||
|| Piotr Rybak, MSc || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD || [[mailto:rychlik@ipipan.waw.pl|piotr.rychlik@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AleksandraTomaszewska|Aleksandra Tomaszewska]], PhD candidate || [[mailto:aleksandra.tomaszewska@hotmail.com|aleksandra.tomaszewska@hotmail.com]] ||
|| [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], PhD || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD, Assoc. Prof. || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] ||
|| Joanna Wołoszyn, PhD || [[mailto:joanna.woloszyn@ipipan.waw.pl|joanna.woloszyn@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], PhD || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/SebastianZawada|Sebastian Zawada]], MSc || [[mailto:sebastian.zawada@ipipan.waw.pl|sebastian.zawada@ipipan.waw.pl]] ||


=== Associates ===

|| Wiktor Eźlakowski, MSc || [[mailto:wiktor.ezlakowski@ipipan.waw.pl|wiktor.ezlakowski@ipipan.waw.pl]] ||
|| Sonia Janicka || [[mailto:sonia.janicka@gmail.com|sonia.janicka@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/MateuszKlimaszewski|Mateusz Klimaszewski]], MSc || [[mailto:mk.klimaszewski@gmail.com|mk.klimaszewski@gmail.com]] ||
|| Jakub Piskorski, PhD || [[mailto:jpiskorski@gmail.com|jpiskorski@gmail.com]] ||
|| Karol Saputa, BEng || [[mailto:karolsaputa@gmail.com|karolsaputa@gmail.com]] ||
|| Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] ||
|| [[http://zil.ipipan.waw.pl/RyszardTuora|Ryszard Tuora]], MSc || [[mailto:ryszardtuora@gmail.com|ryszardtuora@gmail.com]] ||
|| Grzegorz Wojdyga, MSc || [[mailto:g.wojdyga@ipipan.waw.pl|g.wojdyga@ipipan.waw.pl]] ||
|| [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD, Assoc. Prof. || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] ||
|| Natalia Zawadzka, PhD candidate || [[mailto:natalia.zawadzka-paluektau@ipipan.waw.pl|natalia.zawadzka-paluektau@ipipan.waw.pl]] ||
Line 46: Line 53:
 * (Polish) corpus linguistics; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]],
 * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]],
 * (Polish) corpus linguistics ([[http://nkjp.pl/|National Corpus of Polish]]), /* ; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]], */
 * morphosyntactic tagging and lemmatisation of Polish,
 * syntactic an
d semantic parsing of Polish,
Line 50: Line 58:
 * distributional semantics and compositional distributional semantics,
Line 51: Line 60:
 * morphosyntactic system of Polish,  * credibility assessment of online content,
 /*
* morphosyntactic system of Polish, */
Line 54: Line 64:
The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]]. The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://dariah.pl/|DARIAH-PL]], [[http://clip.ipipan.waw.pl/ELRC|ELRC]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]].
Line 58: Line 68:
 * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts),
 * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet),
 * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society).
 * [[http://clip.ipipan.waw.pl/CLARIN-PL-3|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]])
 * [[CORMETAN]] (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)
 * [[http://clip.ipipan.waw.pl/CURLICAT|CURLICAT]] (Curated Multilingual Language Resources for CEF AT)
 * [[http://korpus-dekady.ipipan.waw.pl|Korpus Dekady]] ([[http://dariah.pl/|DARIAH-PL]] — Digital Research Infrastructure for the Arts and Humanities)
 * [[http://clip.ipipan.waw.pl/ELE|ELE]] (European Language Equality)
 * [[http://clip.ipipan.waw.pl/ELG|ELG]] (European Language Grid)
 * [[http://clip.ipipan.waw.pl/ELRC|ELRC]] (European Language Resource Coordination)
 * [[HOMADOS|HOMADOS]] (Hampering Misinformation by Assessing Credibility of Online Sources)
 * [[http://clip.ipipan.waw.pl/KORBA-2|KORBA 2]] (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")
 * [[http://zil.ipipan.waw.pl/Quantifiers|Kwantyfikatory w języku: użycie i znaczenie]] (Quantifiers in Language: Use and Meaning)
 * [[http://clip.ipipan.waw.pl/MARCELL|MARCELL]] (Multilingual Resources for CEF.AT in the legal domain)
 * [[http://clip.ipipan.waw.pl/Nexus|Nexus Linguarum]] (European network for Web-centred linguistic data science)
 * [[http://zil.ipipan.waw.pl/Scwad|Scwad]] (Compositional distributional modelling of Polish language semantics)
 * [[http://synamet.uw.edu.pl/|SYNAMET]] (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)
Line 63: Line 84:

 * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS),
 * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources),
 * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]],
 * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure),
 * [[NKJP|NKJP]] (National Corpus of Polish),
 * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]],
 * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
 * [[LT4eL|LT4eL]] (Language Technology for eLearning),
 * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]],
 * [[Information Extraction from Polish free text|Information Extraction from Polish free text]],
 * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]],
 * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]],
 * [[HPSG Grammar of Polish|HPSG Grammar of Polish]].
 * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS)
 * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]]
 * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]]
 * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]]
 * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources)
 * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation)
 * [[CLARIN|CLARIN]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]], see also [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL 2]])
 * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
 * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]]
 * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts)
 * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification)
 * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]
 * [[Information Extraction from Polish free text|Information Extraction from Polish free text]]
 * [[IPI PAN Corpus|IPI PAN Corpus of Polish]]
 * [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts)
 * [[LT4eL|LT4eL]] (Language Technology for eLearning)
 * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support
 * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet)
 * [[NKJP|NKJP]] (National Corpus of Polish)
 * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim)
 * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
 * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies)
 * [[http://clip.ipipan.waw.pl/Readability|Readability]] (Measuring the degree of readability of nonliterary Polish texts)
 * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society)
 * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]]
 * [[http://clip.ipipan.waw.pl/TextLink|TextLink]] (Structuring Discourse in Multilingual Europe)
 * [[http://clip.ipipan.waw.pl/TrendMiner|TrendMiner]] (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams)
Line 80: Line 114:
Here are some of the tools and resources created within our projects. See [[http://clip.ipipan.waw.pl/|CLIP]] pages for a more exhaustive list of Polish tools and resources. Here are some of the tools and resources created within our projects. See [[http://clip.ipipan.waw.pl/|CLIP]] pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.
Line 82: Line 116:
Tools (all open source, under [[http://www.gnu.org/copyleft/gpl.html|GPL]]): Some '''tools''' (all open source, under [[http://www.gnu.org/copyleft/gpl.html|GPL]]; see also [[http://clip.ipipan.waw.pl/|CLIP]]):
Line 84: Line 118:
 * [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]] – a DCG parser,  * [[http://morfeusz.sgjp.pl/|Morfeusz 2]] – a morphological analyser of Polish,
Line 86: Line 120:
 * [[http://zil.ipipan.waw.pl/%C5%9Awigra|Świgra]] – a DCG parser,
 * [[https://github.com/360er0/COMBO|COMBO]] – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling,
 * [[http://zil.ipipan.waw.pl/Concraft|Concraft]] — a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,
 * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish,
Line 87: Line 125:
 * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish,
Line 89: Line 126:
 * [[https://sourceforge.net/projects/poliqarp2/|Poliqarp2]] – a new generation corpus indexing and search engine,
Line 90: Line 128:
 * [[http://zil.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming),
 * [[http://zil.ipipan.waw.pl/WSDDE|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming),
 * [[http://nlp.ipipan.waw.pl/PPJP/|etc.]]
 * [[http://zil.ipipan.waw.pl/Anotatornia2/|Anotatornia 2]] – an annotation tool geared towards historical corpora,
 * [[http://zil.ipipan.waw.pl/WSDDE|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments,
 * [[http://multiservice.nlp.ipipan.waw.pl/|Multiservice]] – web service for various of our tools,
 * [[http://zil.ipipan.waw.pl/TermoPL|TermoPL]] - multiword terms extraction from text
 * [[http://dsmodels.nlp.ipipan.waw.pl/sim1.html|DSmodels]] - web service for calculating word similarity using Polish word embeddings
Line 95: Line 135:
Resources:
Line 97: Line 136:
Main '''resources''' (many more at [[http://clip.ipipan.waw.pl/|CLIP]]):

 * [[http://walenty.ipipan.waw.pl/|Walenty]] – a valence dictionary of Polish (described [[http://zil.ipipan.waw.pl/Walenty|here]]),
Line 98: Line 140:
 * [[http://zil.ipipan.waw.pl/DistrNKJP/|DistrNKJP]] – a distributable (IPR-free) subcorpus of National Corpus of Polish,
 * [[http://korpus.pl/|IPI PAN Corpus of Polish]] (obsolete).
 * [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]],
 * Polish dependency banks: [[http://zil.ipipan.waw.pl/PDB|PDB]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PDB-UD_current|PDB-UD]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PUD-PL_current|PUD-PL]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/NKJP1M-UD_current|NKJP1M-UD]],
 * [[http://zil.ipipan.waw.pl/PDB/PDBparser|Dependency parsing models for Polish]].
Line 107: Line 150:
 * [[http://nlp.ipipan.waw.pl/seminar-e.html|NLP Seminar at IPI PAN]];
 * [[http://iis.ipipan.waw.pl/|Intelligent Information Systems]] series of conferences.
 * [[http://jlm.ipipan.waw.pl/|Journal of Language Modelling]]
 * [[http://zil.ipipan.waw.pl/seminar|NLP Seminar at IPI PAN]]
 * [[http://poleval.pl/|PolEval]], the evaluation campaign for natural language processing tools for Polish
 * conferences organised by the Group:
  * [[http://iis.ipipan.waw.pl/|Intelligent Information Systems]] series of conferences
  * [[http://poltal.ipipan.waw.pl/|9th International Conference on Natural Language Processing]] (PolTAL 2014), 17–19 September 2014, Warsaw, Poland
  * [[http://tlt14.ipipan.waw.pl/|14th International Workshop on Treebanks and Linguistic Theories]] (TLT14), 11–12 December 2015, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/2016/|Coreference Resolution Beyond OntoNotes]] (CORBON 2016) workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US
  * [[http://headlex16.ipipan.waw.pl/|Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar]] (!HeadLex16), 24–29 July 2016, Warsaw, Poland
  * [[http://corbon.nlp.ipipan.waw.pl/|2nd Workshop on Coreference Resolution Beyond OntoNotes]] (CORBON 2017) at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain
  * [[http://anawiki.essex.ac.uk/dali/crac18/|Computational Models of Reference, Anaphora, and Coreference]] workshop (CRAC) at [[http://naacl2018.org/|NAACL 2018]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA
  * [[https://nlpday.pl/|AI & NLP Workshop Day]], 19 October 2018, Warsaw
  * [[https://sites.google.com/view/crac2019/|Second Workshop on Computational Models of Reference, Anaphora and Coreference]] (CRAC 2019), 6 ot 7 June 2019, Minneapolis
  * [[http://www.dynamicsoflanguage.edu.au/lfg-2019/|The 24th International Lexical-Functional Grammar Conference]] (LFG19), 8–10 July 2019, Canberra
  * [[https://lfg20.w.uib.no/|The 25th International Lexical-Functional Grammar Conference]] (LFG20), 23–25 June 2020, online
  * [[https://typo.uni-konstanz.de/lfg2021/|The 26th International Lexical-Functional Grammar Conference]] (LFG21), 13–15 July 2021, online


== Selected publications ==

<<BibMate(author, "Andrzejczuk", "Bartosiak", "Gawłowicz", "Hajnicz", "Kaczyński", "Kieraś", "Klimaszewski", "Kobyliński", "Krasnowska", "Marciniak", "Mykowiecka", "Nitoń", "Ogrodniczuk", "Patejuk", "Przepiórkowski", "Przybyła", "Rychlik, "Wawer", "Wojdyga", "Wołoszyn", "Woliński", "Wójtowicz", "Wróblewska", "Bolc")>>

The Linguistic Engineering Group

The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (IPI PAN).

People

Core team

Anna Andrzejczuk, PhD (on leave)

anna.andrzejczuk@ipipan.waw.pl

Tomasz Bartosiak, MSc

tomasz.bartosiak@gmail.com

Diego Feinmann, PhD

diego.feinmann@ipipan.waw.pl

Elżbieta Hajnicz, PhD, Assoc. Prof.

elzbieta.hajnicz@ipipan.waw.pl

Witold Kieraś, PhD

witold.kieras@ipipan.waw.pl

Łukasz Kobyliński, PhD

lukasz.kobylinski@ipipan.waw.pl

Dorota Komosińska, MSc

dorota.komosinska@gmail.com

Katarzyna Krasnowska-Kieraś, MSc

katarzyna.krasnowska@ipipan.waw.pl

Małgorzata Marciniak, PhD, Assoc. Prof.

malgorzata.marciniak@ipipan.waw.pl

Agnieszka Mykowiecka, PhD, Assoc. Prof.

agnieszka.mykowiecka@ipipan.waw.pl

Maciej Ogrodniczuk, PhD, Assoc. Prof., Head of the Group

maciej.ogrodniczuk@ipipan.waw.pl

Agnieszka Patejuk, PhD

agnieszka.patejuk@ipipan.waw.pl

Adam Przepiórkowski, PhD, Full Prof.

adam.przepiorkowski@ipipan.waw.pl

Piotr Przybyła, PhD (on postdoctoral fellowship at UPF)

piotr.przybyla@ipipan.waw.pl

Michał Rudolf, PhD

michal@rudolf.waw.pl

Piotr Rybak, MSc

piotr.cezary.rybak@gmail.com

Piotr Rychlik, PhD

piotr.rychlik@ipipan.waw.pl

Aleksandra Tomaszewska, PhD candidate

aleksandra.tomaszewska@hotmail.com

Aleksander Wawer, PhD

aleksander.wawer@ipipan.waw.pl

Marcin Woliński, PhD, Assoc. Prof.

marcin.wolinski@ipipan.waw.pl

Joanna Wołoszyn, PhD

joanna.woloszyn@ipipan.waw.pl

Alina Wróblewska, PhD

alina.wroblewska@ipipan.waw.pl

Sebastian Zawada, MSc

sebastian.zawada@ipipan.waw.pl

Associates

Wiktor Eźlakowski, MSc

wiktor.ezlakowski@ipipan.waw.pl

Sonia Janicka

sonia.janicka@gmail.com

Mateusz Klimaszewski, MSc

mk.klimaszewski@gmail.com

Jakub Piskorski, PhD

jpiskorski@gmail.com

Karol Saputa, BEng

karolsaputa@gmail.com

Jakub Szymanik, PhD

jakub.szymanik@gmail.com

Ryszard Tuora, MSc

ryszardtuora@gmail.com

Grzegorz Wojdyga, MSc

g.wojdyga@ipipan.waw.pl

Beata Wójtowicz, PhD, Assoc. Prof.

beata.wojtowicz@ipipan.waw.pl

Natalia Zawadzka, PhD candidate

natalia.zawadzka-paluektau@ipipan.waw.pl

Research

The main research areas of the Group

  • (Polish) corpus linguistics (National Corpus of Polish),

  • morphosyntactic tagging and lemmatisation of Polish,
  • syntactic and semantic parsing of Polish,
  • extraction of linguistic knowledge from corpora,
  • information extraction,
  • distributional semantics and compositional distributional semantics,
  • sentiment analysis,
  • credibility assessment of online content,

  • generative linguistic formalisms, esp., HPSG and LFG.

The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.

Current externally funded projects

  • CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)

  • CORMETAN (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)

  • CURLICAT (Curated Multilingual Language Resources for CEF AT)

  • Korpus Dekady (DARIAH-PL — Digital Research Infrastructure for the Arts and Humanities)

  • ELE (European Language Equality)

  • ELG (European Language Grid)

  • ELRC (European Language Resource Coordination)

  • HOMADOS (Hampering Misinformation by Assessing Credibility of Online Sources)

  • KORBA 2 (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")

  • Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)

  • MARCELL (Multilingual Resources for CEF.AT in the legal domain)

  • Nexus Linguarum (European network for Web-centred linguistic data science)

  • Scwad (Compositional distributional modelling of Polish language semantics)

  • SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)

Some of our past projects

Publicly available tools and resources

Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.

Some tools (all open source, under GPL; see also CLIP):

  • Morfeusz 2 – a morphological analyser of Polish,

  • Spejd – a shallow parsing and disambiguation system,

  • Świgra – a DCG parser,

  • COMBO – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling,

  • Concraft — a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,

  • PANTERA – a morphosyntactic tagger for Polish,

  • TaKIPI – a morphosyntactic tagger for Polish,

  • Poliqarp – a corpus indexing and search engine,

  • Poliqarp2 – a new generation corpus indexing and search engine,

  • Dendrarium – a treebank development system (under development),

  • Anotatornia 2 – an annotation tool geared towards historical corpora,

  • WSDDE – a system for designing and performing Word Sense Disambiguation experiments,

  • Multiservice – web service for various of our tools,

  • TermoPL - multiword terms extraction from text

  • DSmodels - web service for calculating word similarity using Polish word embeddings

Main resources (many more at CLIP):

Other activities

Links to some other activities of the Group:

Selected publications

List of publications

2025

Aleksandra Tomaszewska, Dariusz Czerski, Bartosz Żuk, and Maciej Ogrodniczuk. NeoN: A tool for automated detection, linguistic and LLM-driven aalysis of neologisms in Polish. In Michael H. Lees, Wentong Cai, Siew Ann Cheong, Yi Su, David Abramson, Jack J. Dongarra, and Peter M. A. Sloot, editors, Computational Science – ICCS 2025, pages 318–326, Cham, 2025. Springer Nature Switzerland.

2024

Katarzyna Krasnowska-Kieraś and Marcin Woliński. Parsing headed constituencies. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, editors, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 12633–12643, Torino, Italy, 2024. ELRA and ICCL.

Adam Przepiórkowski, Katarzyna Kuś, Agnieszka Patejuk, and Berke Şenşekerci. You can depend on the symmetry of coordination and that NPs and CPs can be conjoined. Presentation delivered on 5 July 2024 at the “Form and Meaning of Coordination” workshop in Göttingen, Germany (https://www.uni-goettingen.de/de/685553.html), 2024.

2023

Maciej Ogrodniczuk, editor. Analiza danych parlamentarnych. Warsztat pokonkursowy, Warsaw, 2023. Institute of Computer Science, Polish Academy of Sciences.

Adam Przepiórkowski and Michał Woźniak. Conjunct lengths in English, Dependency Length Minimization, and dependency structure of coordination. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15494–15512, Toronto, Canada, 2023. Association for Computational Linguistics.

Karol Saputa, Aleksandra Tomaszewska, Natalia Zawadzka-Paluektau, Witold Kieraś, and Łukasz Kobyliński. Korpusomat.eu: A multilingual platform for building and analysing linguistic corpora. In Jiří Mikyška, Clélia de Mulatier, Maciej Paszynski, Valeria V. Krzhizhanovskaya, Jack J. Dongarra, and Peter M.A. Sloot, editors, Computational Science – ICCS 2023. 23rd International Conference, Prague, Czech Republic, July 3–5, 2023, Proceedings, Part II, number 14074 in Lecture Notes in Computer Science, pages 230–237, Cham, 2023. Springer Nature Switzerland.

2022

Maciej Ogrodniczuk, Sameer Pradhan, Anna Nedoluzhko, Vincent Ng, and Massimo Poesio, editors. Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, Gyeongju, Republic of Korea, 2022. Association for Computational Linguistics.

Adam Przepiórkowski. Polyadic cover quantification in heterofunctional coordination. In Daniel Gutzmann and Sophie Repp, editors, Proceedings of Sinn und Bedeutung 26, pages 677–696, 2022.

2018

Alina Wróblewska. Polish corpus of annotated descriptions of images. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages 2141–2146. European Language Resources Association (ELRA), 2018.

Alina Wróblewska. Results of the PolEval 2018 Shared Task 1: Dependency Parsing. In Proceedings of the PolEval 2018 Workshop, pages 11–24. Institute of Computer Science, Polish Academy of Sciences, 2018.

2017

Adam Przepiórkowski. Argumenty i modyfikatory w gramatyce i w słowniku. Wydawnictwa Uniwersytetu Warszawskiego, Warsaw, 2017.

Adam Przepiórkowski. On the argument–adjunct distinction in the Polish Semantic Syntax tradition. Cognitive Studies / Études Cognitives, 17:1–10, 2017.

Aleksander Wawer and Agnieszka Mykowiecka. Supervised and unsupervised word sense disambiguation on word embedding vectors of unambigous synonyms. In Proceedings of the 1st Workshop on Sense, Concept and Entity Representations and their Applications, pages 120–125. Association for Computational Linguistics, 2017.

2016

Joanna Bilińska, Magdalena Derwojedowa, Witold Kieraś, and Monika Kwiecień. Mikrokorpus polszczyzny 1830-1918. Komunikacja specjalistyczna, 11:149–161, 2016.

2014

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, and Marcin Woliński. Extended phraseological information in a valence dictionary for NLP applications. In Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014), pages 83–91, Dublin, Ireland, 2014. Association for Computational Linguistics and Dublin City University.

Adam Przepiórkowski, Elżbieta Hajnicz, Agnieszka Patejuk, Marcin Woliński, Filip Skwarski, and Marek Świdziński. Walenty: Towards a comprehensive valence dictionary of Polish. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2785–2792, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

Alina Wróblewska. Polish Dependency Parser Trained on an Automatically Induced Dependency Bank. Ph.D. dissertation, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2014.

2013

Maciej Ogrodniczuk and Michał Lenart. A multi-purpose online toolset for NLP applications. In Elisabeth Métais, Farid Meziane, Mohamed Saraee, Vijay Sugumaran, and Sunil Vadera, editors, Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems, number 7934 in Lecture Notes in Computer Science, pages 392–395. Springer-Verlag, Berlin, Heidelberg, 2013.

2012

Szymon Acedański, Adam Slaski, and Adam Przepiórkowski. Machine learning of syntactic attachment from morphosyntactic and semantic co-occurrence statistics. In Proceedings of the ACL 2012 Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages, pages 42–47, Jeju, Republic of Korea, 2012. Association for Computational Linguistics.

2010

Małgorzata Marciniak, editor. Anotowany korpus dialogów telefonicznych. Akademicka Oficyna Wydawnicza EXIT, Warsaw, 2010.

2005

Agnieszka Mykowiecka, Małgorzata Marciniak, and Anna Kupść. Making shallow look deeper: Anaphora and comparisons in medical information extraction. In Zygmunt Vetulani, editor, Proceedings of the 2nd Language & Technology Conference, pages 225–229, Poznań, Poland, 2005.

Dariusz Piechociński and Agnieszka Mykowiecka. Question answering in Polish using shallow parsing. In Radovan Garabík, editor, Computer Treatment of Slavic and East European Languages: Proceedings of the Third International Seminar, Bratislava, Slovakia, 10–12 November 2005, pages 167–173, Bratislava, 2005. VEDA: Vydavatel'stvo Slovenskej akadéme vied.

2003

Maciej Ogrodniczuk. Rozszerzenie opisów morfologicznych w tekstach korpusu „Słownika frekwencyjnego polszczyzny współczesnej”. In Roman Huszcza and Jadwiga Linde-Usiekniewicz, editors, Prace lingwistyczne dedykowane prof. Jadwidze Sambor, pages 164–168. Wydział Polonistyki Uniwersytetu Warszawskiego, Warsaw, 2003.

1997

Anna Kupść, Małgorzata Marciniak, and Leonard Bolc. Anaphor binding in Polish. An attempt at an HPSG account. IPI PAN Research Report 836, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1997.

None

1989

Elżbieta Hajnicz. Formalizacja systemu wnioskowania o zależnościach czasowych między zdarzeniami. IPI PAN Research Report 658, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 1989.