Size: 13192
Comment:
|
Size: 17967
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 4: | Line 4: |
The Linguistic Engineering (LE) Group is part of the [[http://www.ipipan.waw.pl/en/dept/dept-ai.html|Department of Artificial Intelligence]] at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[http://www.english.pan.pl/|Polish Academy of Sciences]] (ICS PAS). | The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the [[http://www.ipipan.waw.pl/en/|Institute of Computer Science]], [[https://institution.pan.pl/|Polish Academy of Sciences]] (IPI PAN). |
Line 10: | Line 10: |
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || | |
Line 12: | Line 11: |
|| Zbigniew Gawłowicz || [[mailto:zbigniew.gawlowicz@ipipan.waw.pl|zbigniew.gawlowicz@ipipan.waw.pl]] || | || [[https://www.diegofeinmann.com/|Diego Feinmann]], PhD || [[mailto:diego.feinmann@ipipan.waw.pl|diego.feinmann@ipipan.waw.pl]] || |
Line 17: | Line 16: |
|| [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || | || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska-Kieraś]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || |
Line 20: | Line 19: |
|| [[http://zil.ipipan.waw.pl/BartlomiejNiton|Bartłomiej Nitoń]], MSc || [[mailto:bartek.niton@gmail.com|bartek.niton@gmail.com]] || || [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] || |
|| [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Assoc. Prof., Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] || |
Line 23: | Line 21: |
|| [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Assoc. Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] || | || [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Full Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/PiotrPrzybyla|Piotr Przybyła]], PhD (on postdoctoral fellowship at [[https://www.upf.edu/web/erinia|UPF]]) || [[mailto:piotr.przybyla@ipipan.waw.pl|piotr.przybyla@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MichałRudolf|Michał Rudolf]], PhD || [[mailto:michal@rudolf.waw.pl|michal@rudolf.waw.pl]] || |
Line 25: | Line 25: |
||<style="border: 3px solid black"> [[https://zil.ipipan.waw.pl/KarolinaSaputa|Karolina Saputa]], BEng || [[mailto:karolsaputa@gmail.com|karolsaputa@gmail.com]] || || [[http://zil.ipipan.waw.pl/AleksandraTomaszewska|Aleksandra Tomaszewska]], PhD candidate || [[mailto:aleksandra.tomaszewska@ipipan.waw.pl|aleksandra.tomaszewska@ipipan.waw.pl]] || |
|
Line 26: | Line 28: |
|| [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] || | || [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD, Assoc. Prof. || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] || || Joanna Wołoszyn, PhD || [[mailto:joanna.woloszyn@ipipan.waw.pl|joanna.woloszyn@ipipan.waw.pl]] || |
Line 28: | Line 31: |
|| [[http://zil.ipipan.waw.pl/SebastianZawada|Sebastian Zawada]], MSc || [[mailto:sebastian.zawada@ipipan.waw.pl|sebastian.zawada@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/NataliaZawadzka|Natalia Zawadzka-Paluektau]], PhD || [[mailto:natalia.zawadzka-paluektau@ipipan.waw.pl|natalia.zawadzka-paluektau@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/BartoszŻuk|Bartosz Żuk]], PhD candidate || [[mailto:bartoszzuk.poczta@gmail.com|bartoszzuk.poczta@gmail.com]] || |
|
Line 32: | Line 38: |
|| Piotr Rybak || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] || || Filip Stefaniuk || [[mailto:filip.stefaniuk@gmail.com|filip.stefaniuk@gmail.com]] || || Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] || || Grzegorz Wojdyga, MSc || [[mailto:g.wojdyga@gmail.com|g.wojdyga@gmail.com]] || || [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] || |
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD (on leave) || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || || Wiktor Eźlakowski, MSc || [[mailto:wiktor.ezlakowski@ipipan.waw.pl|wiktor.ezlakowski@ipipan.waw.pl]] || || Sonia Janicka || [[mailto:sonia.janicka@gmail.com|sonia.janicka@gmail.com]] || || [[http://zil.ipipan.waw.pl/MateuszKlimaszewski|Mateusz Klimaszewski]], MSc || [[mailto:mk.klimaszewski@gmail.com|mk.klimaszewski@gmail.com]] || || Jakub Piskorski, PhD || [[mailto:jpiskorski@gmail.com|jpiskorski@gmail.com]] || || Piotr Rybak, MSc || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] || || Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] || || [[http://zil.ipipan.waw.pl/RyszardTuora|Ryszard Tuora]], MSc || [[mailto:ryszardtuora@gmail.com|ryszardtuora@gmail.com]] || || Grzegorz Wojdyga, MSc || [[mailto:g.wojdyga@ipipan.waw.pl|g.wojdyga@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD, Assoc. Prof. || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] || |
Line 43: | Line 54: |
* (Polish) corpus linguistics; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]], * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], * extraction of linguistic knowledge from corpora, * information extraction, * sentiment analysis, * morphosyntactic system of Polish, |
* (Polish) corpus linguistics ([[http://nkjp.pl/|National Corpus of Polish]]) /* ; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]], */ * morphosyntactic tagging and lemmatisation of Polish * syntactic and semantic parsing of Polish * extraction of linguistic knowledge from corpora * information extraction * distributional semantics and compositional distributional semantics * sentiment analysis * credibility assessment of online content * reference and discourse relations |
Line 55: | Line 69: |
* [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]]) * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification) * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation) * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts) * [[http://dariah.pl/|DARIAH-PL]] (Digital Research Infrastructure for the Arts and Humanities) |
* [[http://clip.ipipan.waw.pl/CLARIN-PL-3|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]]) * [[CORMETAN]] (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts) * [[http://clip.ipipan.waw.pl/CURLICAT|CURLICAT]] (Curated Multilingual Language Resources for CEF AT) * [[http://korpus-dekady.ipipan.waw.pl|Korpus Dekady]] ([[http://dariah.pl/|DARIAH-PL]] — Digital Research Infrastructure for the Arts and Humanities) * [[http://clip.ipipan.waw.pl/ELE|ELE]] (European Language Equality) * [[http://clip.ipipan.waw.pl/ELG|ELG]] (European Language Grid) |
Line 61: | Line 76: |
* [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts) | * [[HOMADOS|HOMADOS]] (Hampering Misinformation by Assessing Credibility of Online Sources) * [[http://clip.ipipan.waw.pl/KORBA-2|KORBA 2]] (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish") |
Line 63: | Line 79: |
* [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies) | * [[http://clip.ipipan.waw.pl/MARCELL|MARCELL]] (Multilingual Resources for CEF.AT in the legal domain) * [[http://clip.ipipan.waw.pl/Nexus|Nexus Linguarum]] (European network for Web-centred linguistic data science) |
Line 68: | Line 85: |
* [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS) * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]] * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]] * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]] * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources) * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation) * [[CLARIN|CLARIN]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]], see also [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL 2]]) * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts) * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]] * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts) * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification) * [[HPSG Grammar of Polish|HPSG Grammar of Polish]] * [[Information Extraction from Polish free text|Information Extraction from Polish free text]] * [[IPI PAN Corpus|IPI PAN Corpus of Polish]] * [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts) * [[LT4eL|LT4eL]] (Language Technology for eLearning) * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet) * [[NKJP|NKJP]] (National Corpus of Polish) * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim) * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies) * [[http://clip.ipipan.waw.pl/Readability|Readability]] (Measuring the degree of readability of nonliterary Polish texts) * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society) * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]] |
|
Line 70: | Line 111: |
* [[http://clip.ipipan.waw.pl/Readability|Readability]] (Measuring the degree of readability of nonliterary Polish texts) * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim) |
|
Line 74: | Line 112: |
* [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society), * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]], * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS), * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources), * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]], * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure), * [[NKJP|NKJP]] (National Corpus of Polish), * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[LT4eL|LT4eL]] (Language Technology for eLearning), * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[Information Extraction from Polish free text|Information Extraction from Polish free text]], * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]], * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]], * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]. |
|
Line 98: | Line 119: |
* [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]] – a DCG parser, | * [[http://morfeusz.sgjp.pl/|Morfeusz 2]] – a morphological analyser of Polish, |
Line 100: | Line 121: |
* [[http://zil.ipipan.waw.pl/%C5%9Awigra|Świgra]] – a DCG parser, * [[https://github.com/360er0/COMBO|COMBO]] – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling, * [[http://zil.ipipan.waw.pl/Concraft|Concraft]] — a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP, * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish, |
|
Line 101: | Line 126: |
* [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish, | |
Line 105: | Line 129: |
* [[http://zil.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), | * [[http://zil.ipipan.waw.pl/Anotatornia2/|Anotatornia 2]] – an annotation tool geared towards historical corpora, |
Line 110: | Line 134: |
* [[http://nlp.ipipan.waw.pl/PPJP/|etc.]] | |
Line 117: | Line 141: |
* [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]]. | * [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]], * Polish dependency banks: [[http://zil.ipipan.waw.pl/PDB|PDB]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PDB-UD_current|PDB-UD]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PUD-PL_current|PUD-PL]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/NKJP1M-UD_current|NKJP1M-UD]], * [[http://zil.ipipan.waw.pl/PDB/PDBparser|Dependency parsing models for Polish]]. |
Line 127: | Line 153: |
* [[http://poleval.pl/|PolEval]], the evaluation campaign for natural language processing tools for Polish | |
Line 129: | Line 156: |
* [[http://poltal.ipipan.waw.pl/|PolTAL 2014]] – 9th International Conference on Natural Language Processing, 17–19 September 2014, Warsaw, Poland * [[http://tlt14.ipipan.waw.pl/|TLT14]] – 14th International Workshop on Treebanks and Linguistic Theories, 11–12 December 2015, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/2016/|CORBON 2016]] – Coreference Resolution Beyond !OntoNotes workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US * [[http://headlex16.ipipan.waw.pl/|HeadLex16]] – Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, 24–29 July 2016, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/|CORBON 2017]] – 2nd Workshop on Coreference Resolution Beyond !OntoNotes at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain * [[http://anawiki.essex.ac.uk/dali/crac18/|CRAC: Computational Models of Reference, Anaphora, and Coreference]] at [[http://naacl2018.org/|NAACL 2017]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA * [[http://waw2018.argdiap.pl/argdiap-conference/|16th ArgDiaP conference]], part of the [[http://waw2018.argdiap.pl/|WAW 2018]] (Warsaw Argumentation Week), 15–16 September 2018, Warsaw |
* [[http://poltal.ipipan.waw.pl/|9th International Conference on Natural Language Processing]] (PolTAL 2014), 17–19 September 2014, Warsaw, Poland * [[http://tlt14.ipipan.waw.pl/|14th International Workshop on Treebanks and Linguistic Theories]] (TLT14), 11–12 December 2015, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/2016/|Coreference Resolution Beyond OntoNotes]] (CORBON 2016) workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US * [[http://headlex16.ipipan.waw.pl/|Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar]] (!HeadLex16), 24–29 July 2016, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/|2nd Workshop on Coreference Resolution Beyond OntoNotes]] (CORBON 2017) at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain * [[http://anawiki.essex.ac.uk/dali/crac18/|Computational Models of Reference, Anaphora, and Coreference]] workshop (CRAC) at [[http://naacl2018.org/|NAACL 2018]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA * [[https://nlpday.pl/|AI & NLP Workshop Day]], 19 October 2018, Warsaw * [[https://sites.google.com/view/crac2019/|Second Workshop on Computational Models of Reference, Anaphora and Coreference]] (CRAC 2019), 6 ot 7 June 2019, Minneapolis * [[http://www.dynamicsoflanguage.edu.au/lfg-2019/|The 24th International Lexical-Functional Grammar Conference]] (LFG19), 8–10 July 2019, Canberra * [[https://lfg20.w.uib.no/|The 25th International Lexical-Functional Grammar Conference]] (LFG20), 23–25 June 2020, online * [[https://typo.uni-konstanz.de/lfg2021/|The 26th International Lexical-Functional Grammar Conference]] (LFG21), 13–15 July 2021, online == Selected publications == <<BibMate(author, "Andrzejczuk", "Bartosiak", "Gawłowicz", "Hajnicz", "Kaczyński", "Kieraś", "Klimaszewski", "Kobyliński", "Krasnowska", "Marciniak", "Mykowiecka", "Nitoń", "Ogrodniczuk", "Patejuk", "Przepiórkowski", "Przybyła", "Rychlik, "Wawer", "Wojdyga", "Wołoszyn", "Woliński", "Wójtowicz", "Wróblewska", "Bolc")>> |
The Linguistic Engineering Group
The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (IPI PAN).
People
Core team
Tomasz Bartosiak, MSc |
|
Diego Feinmann, PhD |
|
Elżbieta Hajnicz, PhD, Assoc. Prof. |
|
Witold Kieraś, PhD |
|
Łukasz Kobyliński, PhD |
|
Dorota Komosińska, MSc |
|
Małgorzata Marciniak, PhD, Assoc. Prof. |
|
Agnieszka Mykowiecka, PhD, Assoc. Prof. |
|
Maciej Ogrodniczuk, PhD, Assoc. Prof., Head of the Group |
|
Agnieszka Patejuk, PhD |
|
Adam Przepiórkowski, PhD, Full Prof. |
|
Piotr Przybyła, PhD (on postdoctoral fellowship at UPF) |
|
Michał Rudolf, PhD |
|
Piotr Rychlik, PhD |
|
Karolina Saputa, BEng |
|
Aleksandra Tomaszewska, PhD candidate |
|
Aleksander Wawer, PhD |
|
Marcin Woliński, PhD, Assoc. Prof. |
|
Joanna Wołoszyn, PhD |
|
Alina Wróblewska, PhD |
|
Sebastian Zawada, MSc |
|
Bartosz Żuk, PhD candidate |
Associates
Anna Andrzejczuk, PhD (on leave) |
|
Wiktor Eźlakowski, MSc |
|
Sonia Janicka |
|
Mateusz Klimaszewski, MSc |
|
Jakub Piskorski, PhD |
|
Piotr Rybak, MSc |
|
Jakub Szymanik, PhD |
|
Ryszard Tuora, MSc |
|
Grzegorz Wojdyga, MSc |
|
Beata Wójtowicz, PhD, Assoc. Prof. |
Research
The main research areas of the Group
(Polish) corpus linguistics (National Corpus of Polish)
- morphosyntactic tagging and lemmatisation of Polish
- syntactic and semantic parsing of Polish
- extraction of linguistic knowledge from corpora
- information extraction
- distributional semantics and compositional distributional semantics
- sentiment analysis
- credibility assessment of online content
- reference and discourse relations
- generative linguistic formalisms, esp., HPSG and LFG.
The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.
Current externally funded projects
CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)
CORMETAN (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts)
CURLICAT (Curated Multilingual Language Resources for CEF AT)
Korpus Dekady (DARIAH-PL — Digital Research Infrastructure for the Arts and Humanities)
ELE (European Language Equality)
ELG (European Language Grid)
ELRC (European Language Resource Coordination)
HOMADOS (Hampering Misinformation by Assessing Credibility of Online Sources)
KORBA 2 (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish")
Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)
MARCELL (Multilingual Resources for CEF.AT in the legal domain)
Nexus Linguarum (European network for Web-centred linguistic data science)
Scwad (Compositional distributional modelling of Polish language semantics)
SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)
Some of our past projects
ATLAS (Applied Technology for Language-Aided CMS)
Automatic detection and correction of annotation errors in Polish language corpora
Automatic detection of semantic dependencies within verb argument structures in large treebanks
Automatic extraction of linguistic knowledge from a large corpus of Polish
CESAR (CEntral and South-east europeAn Resources)
Chronofleks (A diachronic formal model of Polish inflection and its implementation)
CLARIN (Polish chapter of Common Language Resources and Technology Infrastructure, see also CLARIN-PL 2)
CoDeS (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
Construction of a treebank for Polish using automatic syntactic analysis
CORE (Computer-based methods for coreference resolution in Polish texts)
COTHEC (Unified theory of coreference in Polish and its corpus-based verification)
KORBA (Electronic corpus of 17th and 18th century Polish texts)
LT4eL (Language Technology for eLearning)
LUNA (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support
NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet)
NKJP (National Corpus of Polish)
OPTA (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim)
PARSEME (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
Parthenos (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies)
Readability (Measuring the degree of readability of nonliterary Polish texts)
SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society)
TextLink (Structuring Discourse in Multilingual Europe)
TrendMiner (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams)
Publicly available tools and resources
Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.
Some tools (all open source, under GPL; see also CLIP):
Morfeusz 2 – a morphological analyser of Polish,
Spejd – a shallow parsing and disambiguation system,
Świgra – a DCG parser,
COMBO – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling,
Concraft — a CRF morphosyntactic tagger of Polish compatible with Morfeusz SGJP,
PANTERA – a morphosyntactic tagger for Polish,
TaKIPI – a morphosyntactic tagger for Polish,
Poliqarp – a corpus indexing and search engine,
Poliqarp2 – a new generation corpus indexing and search engine,
Dendrarium – a treebank development system (under development),
Anotatornia 2 – an annotation tool geared towards historical corpora,
WSDDE – a system for designing and performing Word Sense Disambiguation experiments,
Multiservice – web service for various of our tools,
TermoPL - multiword terms extraction from text
DSmodels - web service for calculating word similarity using Polish word embeddings
Main resources (many more at CLIP):
Other activities
Links to some other activities of the Group:
PolEval, the evaluation campaign for natural language processing tools for Polish
- conferences organised by the Group:
Intelligent Information Systems series of conferences
9th International Conference on Natural Language Processing (PolTAL 2014), 17–19 September 2014, Warsaw, Poland
14th International Workshop on Treebanks and Linguistic Theories (TLT14), 11–12 December 2015, Warsaw, Poland
Coreference Resolution Beyond OntoNotes (CORBON 2016) workshop at NAACL 2016 (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US
Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar (HeadLex16), 24–29 July 2016, Warsaw, Poland
2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017) at EACL 2017 (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain
Computational Models of Reference, Anaphora, and Coreference workshop (CRAC) at NAACL 2018 (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA
AI & NLP Workshop Day, 19 October 2018, Warsaw
Second Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2019), 6 ot 7 June 2019, Minneapolis
The 24th International Lexical-Functional Grammar Conference (LFG19), 8–10 July 2019, Canberra
The 25th International Lexical-Functional Grammar Conference (LFG20), 23–25 June 2020, online
The 26th International Lexical-Functional Grammar Conference (LFG21), 13–15 July 2021, online
Selected publications
2025
![]() |
2024
![]() |
Adam Przepiórkowski,
Katarzyna Kuś, Agnieszka Patejuk, and
Berke Şenşekerci.
You can depend on the symmetry of coordination
and that NPs and CPs can be conjoined.
Presentation delivered on 5 July 2024 at the “Form and Meaning of
Coordination” workshop in Göttingen, Germany
(https://www.uni-goettingen.de/de/685553.html), 2024.
|
2023
![]() |
![]() |
![]() |
2022
![]() |
![]() |
![]() |
2021
![]() |
![]() |
2020
![]() |
2019
![]() |
![]() |
2018
![]() |
![]() |
![]() |
![]() |
![]() |
2017
![]() |
![]() |
![]() |
2016
![]() |
2015
![]() |
![]() |
2014
![]() |
![]() |
![]() |
2013
2012
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
2011
![]() |
![]() |
2010
![]() |
![]() |
![]() |
2009
![]() |
![]() |
2008
![]() |
2007
![]() |
![]() |
![]() |
![]() |
2006
![]() |
2005
![]() |
![]() |
2004
2003
![]() |
2001
![]() |
2000
![]() |
1999
![]() |
![]() |
1998
![]() |
1997
![]() |
![]() |
![]() |
1995
![]() |
1994
![]() |
1989
![]() |