| Size: 9001 Comment: added Joanna Ogrodniczu, Zygmunt Saloni | Size: 15154 Comment:  | 
| Deletions are marked like this. | Additions are marked like this. | 
| Line 1: | Line 1: | 
| #acl ZILGroup:read,write All:read | #acl +All:read Default | 
| Line 8: | Line 8: | 
| || [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], MSc (on leave)      || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KacperChwialkowski|Kacper Chwiałkowski]] (part time) || [[mailto:kacper.chwialkowski@ipipan.waw.pl|kacper.chwialkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszDegorski|Łukasz Degórski]], MSc || [[mailto:ldegorski@ipipan.waw.pl|ldegorski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], MSc || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MateuszKopec|Mateusz Kopeć]] || [[mailto:mateusz.kopec@ipipan.waw.pl|mateusz.kopec@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]] (part time) || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AnnaKupsc|Anna Kupść]], PhD (on leave) || [[mailto:anna.kupsc@ipipan.waw.pl|anna.kupsc@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]] || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinMilkowski|Marcin Miłkowski]], PhD (part time) || [[mailto:marcin.milkowski@ifispan.waw.pl|marcin.milkowski@ifispan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/JoannaOgrodniczuk|Joanna Ogrodniczuk]], PhD || [[mailto:joanna.ogrodniczuk@ipipan.waw.pl|joanna.ogrodniczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/JakubPiskorski|Jakub Piskorski]], PhD, Associate || [[mailto:jakub.piskorski@ipipan.waw.pl|jakub.piskorski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Head of the Group || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD || [[mailto:rychlik@ipipan.waw.pl|rychlik@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/ZygmuntSaloni|Zygmunt Saloni]], Professor || [[mailto:zygmunt.saloni@ipipan.waw.pl|zygmunt.saloni@ipipan.waw.pl]] || || [[http://www.cs.albany.edu/~tomek/|Tomek Strzałkowski]], PhD, Foreign Associate || [[mailto:tomek@cs.albany.edu|tomek@cs.albany.edu]] || || [[http://zil.ipipan.waw.pl/DanutaSkowronska|Danuta Skowrońska]], MSc || [[mailto:danuta.skowronska@ipipan.waw.pl|danuta.skowronska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/JanSzejko|Jan Szejko]] (part time) || [[mailto:jan.szejko@ipipan.waw.pl|jan.szejko@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszSzalkiewicz|Łukasz Szałkiewicz]], MSc || [[mailto:lukasz.szalkiewicz@ipipan.waw.pl|lukasz.szalkiewicz@ipipan.waw.pl]] || || [[http://www.site.uottawa.ca/~szpak/|Stan Szpakowicz]], PhD, Foreign Associate || [[mailto:szpak@site.uottawa.ca|szpak@site.uottawa.ca]] || || [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], MSc || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksandraWieczorek|Aleksandra Wieczorek]], MSc (part time) || [[mailto:aleksandra.wieczorek@ipipan.waw.pl|aleksandra.wieczorek@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], MSc || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/SebastianZurowski|Sebastian Żurowski]], PhD (part time) || [[mailto:sebastian.zurowski@ipipan.waw.pl|sebastian.zurowski@ipipan.waw.pl]] || | === Core team === || [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/TomaszBartosiak|Tomasz Bartosiak]], MSc || [[mailto:tomasz.bartosiak@gmail.com|tomasz.bartosiak@gmail.com]] || || [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD, Assoc. Prof. || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/WitoldKieras|Witold Kieraś]], PhD || [[mailto:witold.kieras@ipipan.waw.pl|witold.kieras@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD || [[mailto:lkobylinski@ipipan.waw.pl|lukasz.kobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/DorotaKomosi%C5%84ska|Dorota Komosińska]], MSc || [[mailto:dorota.komosinska@gmail.com|dorota.komosinska@gmail.com]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska-Kieraś]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MalgorzataMaciejewska|Małgorzata Maciejewska]], PhD || [[mailto:m.maciejewska@yahoo.co.uk|m.maciejewska@yahoo.co.uk]] || || [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD, Assoc. Prof. || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD, Assoc. Prof. || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/BartlomiejNiton|Bartłomiej Nitoń]], MSc || [[mailto:bartek.niton@gmail.com|bartek.niton@gmail.com]] || || [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Assoc. Prof., Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AgnieszkaPatejuk|Agnieszka Patejuk]], PhD || [[mailto:aep@ipipan.waw.pl|agnieszka.patejuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Full Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/PiotrPrzybyla|Piotr Przybyła]], PhD || [[mailto:piotr.przybyla@ipipan.waw.pl|piotr.przybyla@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD || [[mailto:rychlik@ipipan.waw.pl|piotr.rychlik@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], PhD || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || || Grzegorz Wojdyga, MSc || [[mailto:g.wojdyga@ipipan.waw.pl|g.wojdyga@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD, Assoc. Prof. || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], PhD || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] || === Associates === || Jakub Piskorski, PhD || [[mailto:jpiskorski@gmail.com|jpiskorski@gmail.com]] || || Piotr Rybak || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] || || Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] || || [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD, Assoc. Prof. || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] || | 
| Line 42: | Line 44: | 
| * (Polish) corpus linguistics; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]], * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], | * (Polish) corpus linguistics ([[http://nkjp.pl/|National Corpus of Polish]]), /* ; cf. the [[http://korpus.pl/en/|IPI PAN Corpus of Polish]] and the [[http://nkjp.pl/|National Corpus of Polish]], */ * morphosyntactic tagging and lemmatisation of Polish, * syntactic and semantic parsing of Polish, | 
| Line 46: | Line 49: | 
| * distributional semantics and compositional distributional semantics, | |
| Line 47: | Line 51: | 
| * morphosyntactic system of Polish, | * credibility assessment of online content, /* * morphosyntactic system of Polish, */ | 
| Line 50: | Line 55: | 
| The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]]. | The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://dariah.pl/|DARIAH-PL]], [[http://clip.ipipan.waw.pl/ELRC|ELRC]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]]. | 
| Line 54: | Line 59: | 
| * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society), * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS), * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources). | * [[http://clip.ipipan.waw.pl/CLARIN-PL-3|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]]) * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts) * [[CORMETAN]] (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts) * [[http://dariah.pl/|DARIAH-PL]] (Digital Research Infrastructure for the Arts and Humanities) * [[http://clip.ipipan.waw.pl/ELG|ELG]] (European Language Grid) * [[http://clip.ipipan.waw.pl/ELRC|ELRC]] (European Language Resource Coordination) * [[http://clip.ipipan.waw.pl/KORBA-2|KORBA 2]] (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish") * [[HOMADOS|HOMADOS]] (Hampering Misinformation by Assessing Credibility of Online Sources) * [[http://clip.ipipan.waw.pl/MARCELL|MARCELL]] (Multilingual Resources for CEF.AT in the legal domain) * [[http://clip.ipipan.waw.pl/Nexus|Nexus Linguarum]] (European network for Web-centred linguistic data science) * [[http://zil.ipipan.waw.pl/Quantifiers|Kwantyfikatory w języku: użycie i znaczenie]] (Quantifiers in Language: Use and Meaning) * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies) * [[http://zil.ipipan.waw.pl/Scwad|Scwad]] (Compositional distributional modelling of Polish language semantics) * [[http://synamet.uw.edu.pl/|SYNAMET]] (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse) | 
| Line 62: | Line 76: | 
| * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]], * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure), * [[NKJP|NKJP]] (National Corpus of Polish), * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[LT4eL|LT4eL]] (Language Technology for eLearning), * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[Information Extraction from Polish free text|Information Extraction from Polish free text]], * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]], * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]], * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]. | * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS) * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]] * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]] * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]] * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources) * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation) * [[CLARIN|CLARIN]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]], see also [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL 2]]) * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]] * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts) * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification) * [[HPSG Grammar of Polish|HPSG Grammar of Polish]] * [[Information Extraction from Polish free text|Information Extraction from Polish free text]] * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]] * [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts) * [[LT4eL|LT4eL]] (Language Technology for eLearning) * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet) * [[NKJP|NKJP]] (National Corpus of Polish) * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim) * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) * [[http://clip.ipipan.waw.pl/Readability|Readability]] (Measuring the degree of readability of nonliterary Polish texts) * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society) * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]] * [[http://clip.ipipan.waw.pl/TextLink|TextLink]] (Structuring Discourse in Multilingual Europe) * [[http://clip.ipipan.waw.pl/TrendMiner|TrendMiner]] (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams) | 
| Line 76: | Line 104: | 
| Here are some of the tools and resources created within our projects. See [[|CLIP]] pages for a more exhaustive list of Polish tools and resources. | Here are some of the tools and resources created within our projects. See [[http://clip.ipipan.waw.pl/|CLIP]] pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN. | 
| Line 78: | Line 106: | 
| Tools (all open source, under [[http://www.gnu.org/copyleft/gpl.html|GPL]]): | Some '''tools''' (all open source, under [[http://www.gnu.org/copyleft/gpl.html|GPL]]; see also [[http://clip.ipipan.waw.pl/|CLIP]]): | 
| Line 80: | Line 108: | 
| * [[http://morfeusz.sgjp.pl/|Morfeusz]] – a morphological analyser of Polish, * [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, | |
| Line 81: | Line 111: | 
| * [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, | * [[https://github.com/360er0/COMBO|COMBO]] – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling, | 
| Line 83: | Line 113: | 
| * [[http://code.google.com/p/pantera-tagger/|PANTERA]] – a morphosyntactic tagger for Polish, | * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish, | 
| Line 85: | Line 115: | 
| * [[https://sourceforge.net/projects/poliqarp2/|Poliqarp2]] – a new generation corpus indexing and search engine, | |
| Line 86: | Line 117: | 
| * [[http://nlp.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), * [[http://nlp.ipipan.waw.pl/WSDDE/|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming), * [[http://nlp.ipipan.waw.pl/PPJP/|etc.]] | * [[http://zil.ipipan.waw.pl/Anotatornia2/|Anotatornia 2]] – an annotation tool geared towards historical corpora, * [[http://zil.ipipan.waw.pl/WSDDE|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments, * [[http://multiservice.nlp.ipipan.waw.pl/|Multiservice]] – web service for various of our tools, * [[http://zil.ipipan.waw.pl/TermoPL|TermoPL]] - multiword terms extraction from text * [[http://dsmodels.nlp.ipipan.waw.pl/sim1.html|DSmodels]] - web service for calculating word similarity using Polish word embeddings | 
| Line 91: | Line 124: | 
| Resources: | |
| Line 93: | Line 125: | 
| Main '''resources''' (many more at [[http://clip.ipipan.waw.pl/|CLIP]]): * [[http://walenty.ipipan.waw.pl/|Walenty]] – a valence dictionary of Polish (described [[http://zil.ipipan.waw.pl/Walenty|here]]), | |
| Line 94: | Line 129: | 
| * [[http://korpus.pl/|IPI PAN Corpus of Polish]] (obsolete). | * [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]], * Polish dependency banks: [[http://zil.ipipan.waw.pl/PDB|PDB]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PDB-UD_current|PDB-UD]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/PUD-PL_current|PUD-PL]], [[http://git.nlp.ipipan.waw.pl/alina/PDBUD/tree/master/NKJP1M-UD_current|NKJP1M-UD]], * [[http://zil.ipipan.waw.pl/PDB/PDBparser|Dependency parsing models for Polish]]. | 
| Line 102: | Line 139: | 
| * [[http://nlp.ipipan.waw.pl/seminar-e.html|NLP Seminar at IPI PAN]]; * [[http://iis.ipipan.waw.pl/|Intelligent Information Systems]] series of conferences. | * [[http://jlm.ipipan.waw.pl/|Journal of Language Modelling]] * [[http://zil.ipipan.waw.pl/seminar|NLP Seminar at IPI PAN]] * conferences organised by the Group: * [[http://iis.ipipan.waw.pl/|Intelligent Information Systems]] series of conferences * [[http://poltal.ipipan.waw.pl/|9th International Conference on Natural Language Processing]] (PolTAL 2014), 17–19 September 2014, Warsaw, Poland * [[http://tlt14.ipipan.waw.pl/|14th International Workshop on Treebanks and Linguistic Theories]] (TLT14), 11–12 December 2015, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/2016/|Coreference Resolution Beyond OntoNotes]] (CORBON 2016) workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US * [[http://headlex16.ipipan.waw.pl/|Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar]] (!HeadLex16), 24–29 July 2016, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/|2nd Workshop on Coreference Resolution Beyond OntoNotes]] (CORBON 2017) at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain * [[http://anawiki.essex.ac.uk/dali/crac18/|Computational Models of Reference, Anaphora, and Coreference]] workshop (CRAC) at [[http://naacl2018.org/|NAACL 2018]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA * [[https://nlpday.pl/|AI & NLP Workshop Day]], 19 October 2018, Warsaw * [[https://sites.google.com/view/crac2019/|Second Workshop on Computational Models of Reference, Anaphora and Coreference]] (CRAC 2019), 6 ot 7 June 2019, Minneapolis * [[http://www.dynamicsoflanguage.edu.au/lfg-2019/|The 24th International Lexical-Functional Grammar Conference]] (LFG 2019), 8–10 July 2019, Canberra | 
The Linguistic Engineering Group
The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS).
People
Core team
| Anna Andrzejczuk, PhD | |
| Tomasz Bartosiak, MSc | |
| Elżbieta Hajnicz, PhD, Assoc. Prof. | |
| Witold Kieraś, PhD | |
| Łukasz Kobyliński, PhD | |
| Dorota Komosińska, MSc | |
| Małgorzata Marciniak, PhD, Assoc. Prof. | |
| Agnieszka Mykowiecka, PhD, Assoc. Prof. | |
| Bartłomiej Nitoń, MSc | |
| Maciej Ogrodniczuk, PhD, Assoc. Prof., Head of the Group | |
| Agnieszka Patejuk, PhD | |
| Adam Przepiórkowski, PhD, Full Prof. | |
| Piotr Przybyła, PhD | |
| Piotr Rychlik, PhD | |
| Aleksander Wawer, PhD | |
| Grzegorz Wojdyga, MSc | |
| Marcin Woliński, PhD, Assoc. Prof. | |
| Alina Wróblewska, PhD | 
Associates
| Jakub Piskorski, PhD | |
| Piotr Rybak | |
| Jakub Szymanik, PhD | |
| Beata Wójtowicz, PhD, Assoc. Prof. | 
Research
The main research areas of the Group
- (Polish) corpus linguistics (National Corpus of Polish), 
- morphosyntactic tagging and lemmatisation of Polish,
- syntactic and semantic parsing of Polish,
- extraction of linguistic knowledge from corpora,
- information extraction,
- distributional semantics and compositional distributional semantics,
- sentiment analysis,
- credibility assessment of online content, 
- generative linguistic formalisms, esp., HPSG and LFG.
The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.
Current externally funded projects
- CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure) 
- CoDeS (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts) 
- CORMETAN (Cognitive and sociocultural analysis of metaphoric expressions in Polish texts) 
- DARIAH-PL (Digital Research Infrastructure for the Arts and Humanities) 
- ELG (European Language Grid) 
- ELRC (European Language Resource Coordination) 
- KORBA 2 (Extension of the "Electronic corpus of 17th and 18th century Polish texts" and its integration with the "Electronic Dictionary of the 17th–18th Century Polish") 
- HOMADOS (Hampering Misinformation by Assessing Credibility of Online Sources) 
- MARCELL (Multilingual Resources for CEF.AT in the legal domain) 
- Nexus Linguarum (European network for Web-centred linguistic data science) 
- Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning) 
- Parthenos (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies) 
- Scwad (Compositional distributional modelling of Polish language semantics) 
- SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse) 
Some of our past projects
- ATLAS (Applied Technology for Language-Aided CMS) 
- Automatic detection and correction of annotation errors in Polish language corpora 
- Automatic detection of semantic dependencies within verb argument structures in large treebanks 
- Automatic extraction of linguistic knowledge from a large corpus of Polish 
- CESAR (CEntral and South-east europeAn Resources) 
- Chronofleks (A diachronic formal model of Polish inflection and its implementation) 
- CLARIN (Polish chapter of Common Language Resources and Technology Infrastructure, see also CLARIN-PL 2) 
- Construction of a treebank for Polish using automatic syntactic analysis 
- CORE (Computer-based methods for coreference resolution in Polish texts) 
- COTHEC (Unified theory of coreference in Polish and its corpus-based verification) 
- KORBA (Electronic corpus of 17th and 18th century Polish texts) 
- LT4eL (Language Technology for eLearning) 
- LUNA (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support 
- NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet) 
- NKJP (National Corpus of Polish) 
- OPTA (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim) 
- PARSEME (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) 
- Readability (Measuring the degree of readability of nonliterary Polish texts) 
- SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society) 
- TextLink (Structuring Discourse in Multilingual Europe) 
- TrendMiner (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams) 
Publicly available tools and resources
Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.
Some tools (all open source, under GPL; see also CLIP):
- Morfeusz – a morphological analyser of Polish, 
- Spejd – a shallow parsing and disambiguation system, 
- Świgra – a DCG parser, 
- COMBO – a language-independent system for natural language preprocessing (i.e. morphosyntactic tagging, lemmatisation, dependency parsing and thematic role labelling, 
- TaKIPI – a morphosyntactic tagger for Polish, 
- PANTERA – a morphosyntactic tagger for Polish, 
- Poliqarp – a corpus indexing and search engine, 
- Poliqarp2 – a new generation corpus indexing and search engine, 
- Dendrarium – a treebank development system (under development), 
- Anotatornia 2 – an annotation tool geared towards historical corpora, 
- WSDDE – a system for designing and performing Word Sense Disambiguation experiments, 
- Multiservice – web service for various of our tools, 
- TermoPL - multiword terms extraction from text 
- DSmodels - web service for calculating word similarity using Polish word embeddings 
Main resources (many more at CLIP):
Other activities
Links to some other activities of the Group:
- conferences organised by the Group: - Intelligent Information Systems series of conferences 
- 9th International Conference on Natural Language Processing (PolTAL 2014), 17–19 September 2014, Warsaw, Poland 
- 14th International Workshop on Treebanks and Linguistic Theories (TLT14), 11–12 December 2015, Warsaw, Poland 
- Coreference Resolution Beyond OntoNotes (CORBON 2016) workshop at NAACL 2016 (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US 
- Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar (HeadLex16), 24–29 July 2016, Warsaw, Poland 
- 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017) at EACL 2017 (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain 
- Computational Models of Reference, Anaphora, and Coreference workshop (CRAC) at NAACL 2018 (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA 
- AI & NLP Workshop Day, 19 October 2018, Warsaw 
- Second Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2019), 6 ot 7 June 2019, Minneapolis 
- The 24th International Lexical-Functional Grammar Conference (LFG 2019), 8–10 July 2019, Canberra 
 
