Size: 8340
Comment:
|
Size: 13503
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
#acl CLIPWarszawaGroup:read,write All:read | #acl +All:read Default |
Line 8: | Line 8: |
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], MSc (on leave) || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KacperChwialkowski|Kacper Chwiałkowski]] (part time) || [[mailto:kacper.chwialkowski@ipipan.waw.pl|kacper.chwialkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszDegorski|Łukasz Degórski]], MSc || [[mailto:ldegorski@ipipan.waw.pl|ldegorski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], MSc || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MateuszKopec|Mateusz Kopeć]] || [[mailto:mateusz.kopec@ipipan.waw.pl|mateusz.kopec@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]] || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]] (part time) || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AnnaKupsc|Anna Kupść]], PhD (on leave) || [[mailto:anna.kupsc@ipipan.waw.pl|anna.kupsc@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinMilkowski|Marcin Miłkowski]], PhD (part time) || [[mailto:marcin.milkowski@ifispan.waw.pl|marcin.milkowski@ifispan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/JakubPiskorski|Jakub Piskorski]], PhD, Associate || [[mailto:jakub.piskorski@ipipan.waw.pl|jakub.piskorski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Head of the Group || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD || [[mailto:rychlik@ipipan.waw.pl|rychlik@ipipan.waw.pl]] || || [[http://www.cs.albany.edu/~tomek/|Tomek Strzałkowski]], PhD, Foreign Associate || [[mailto:tomek@cs.albany.edu|tomek@cs.albany.edu]] || || [[http://zil.ipipan.waw.pl/DanutaSkowronska|Danuta Skowrońska]], MSc || [[mailto:danuta.skowronska@ipipan.waw.pl|danuta.skowronska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/JanSzejko|Jan Szejko]] (part time) || [[mailto:jan.szejko@ipipan.waw.pl|jan.szejko@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszSzalkiewicz|Łukasz Szałkiewicz]], MSc || [[mailto:lukasz.szalkiewicz@ipipan.waw.pl|lukasz.szalkiewicz@ipipan.waw.pl]] || || [[http://www.site.uottawa.ca/~szpak/|Stan Szpakowicz]], PhD, Foreign Associate || [[mailto:szpak@site.uottawa.ca|szpak@site.uottawa.ca]] || || [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], MSc || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksandraWieczorek|Aleksandra Wieczorek]], MSc (part time) || [[mailto:aleksandra.wieczorek@ipipan.waw.pl|aleksandra.wieczorek@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], MSc || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/SebastianZurowski|Sebastian Żurowski]], PhD (part time) || [[mailto:sebastian.zurowski@ipipan.waw.pl|sebastian.zurowski@ipipan.waw.pl]] || |
=== Core team === || [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/TomaszBartosiak|Tomasz Bartosiak]], MSc || [[mailto:tomasz.bartosiak@gmail.com|tomasz.bartosiak@gmail.com]] || || Zbigniew Gawłowicz || [[mailto:zbigniew.gawlowicz@ipipan.waw.pl|zbigniew.gawlowicz@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/ElzbietaHajnicz|Elżbieta Hajnicz]], PhD, Assoc. Prof. || [[mailto:elzbieta.hajnicz@ipipan.waw.pl|elzbieta.hajnicz@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/WitoldKieras|Witold Kieraś]], PhD || [[mailto:witold.kieras@ipipan.waw.pl|witold.kieras@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD || [[mailto:lkobylinski@ipipan.waw.pl|lukasz.kobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/DorotaKomosi%C5%84ska|Dorota Komosińska]], MSc || [[mailto:dorota.komosinska@gmail.com|dorota.komosinska@gmail.com]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD, Assoc. Prof. || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AgnieszkaMykowiecka|Agnieszka Mykowiecka]], PhD, Assoc. Prof. || [[mailto:agnieszka.mykowiecka@ipipan.waw.pl|agnieszka.mykowiecka@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/BartlomiejNiton|Bartłomiej Nitoń]], MSc || [[mailto:bartek.niton@gmail.com|bartek.niton@gmail.com]] || || [[http://zil.ipipan.waw.pl/MaciejOgrodniczuk|Maciej Ogrodniczuk]], PhD, Head of the Group || [[mailto:maciej.ogrodniczuk@ipipan.waw.pl|maciej.ogrodniczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AgnieszkaPatejuk|Agnieszka Patejuk]], PhD || [[mailto:aep@ipipan.waw.pl|agnieszka.patejuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AdamPrzepiorkowski|Adam Przepiórkowski]], PhD, Assoc. Prof. || [[mailto:adam.przepiorkowski@ipipan.waw.pl|adam.przepiorkowski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/PiotrRychlik|Piotr Rychlik]], PhD || [[mailto:rychlik@ipipan.waw.pl|piotr.rychlik@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], PhD || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinWolinski|Marcin Woliński]], PhD || [[mailto:marcin.wolinski@ipipan.waw.pl|marcin.wolinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AlinaWroblewska|Alina Wróblewska]], PhD || [[mailto:alina.wroblewska@ipipan.waw.pl|alina.wroblewska@ipipan.waw.pl]] || === Associates === || [[http://zil.ipipan.waw.pl/KonradGoluchowski|Konrad Gołuchowski]], MSc || [[mailto:kodieg@gmail.com|kodieg@gmail.com]] || || [[http://www.mimuw.edu.pl/~wjaworski/|Wojciech Jaworski]], PhD || [[mailto:wjaworski@mimuw.edu.pl|wjaworski@mimuw.edu.pl]] || || [[http://zil.ipipan.waw.pl/JakubPiskorski|Jakub Piskorski]], PhD || [[mailto:jakub.piskorski@ipipan.waw.pl|jakub.piskorski@ipipan.waw.pl]] || || Piotr Rybak || [[mailto:piotr.cezary.rybak@gmail.com|piotr.cezary.rybak@gmail.com]] || || Filip Stefaniuk || [[mailto:filip.stefaniuk@gmail.com|filip.stefaniuk@gmail.com]] || || Jakub Szymanik, PhD || [[mailto:jakub.szymanik@gmail.com|jakub.szymanik@gmail.com]] || || [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] || |
Line 41: | Line 46: |
* syntactic and semantic parsing of Polish; cf. [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], | * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], |
Line 48: | Line 53: |
The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]]. | The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://dariah.pl/|DARIAH-PL]], [[http://clip.ipipan.waw.pl/ELRC|ELRC]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]]. |
Line 52: | Line 57: |
* [[CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society), * [[ATLAS]] (Applied Technology for Language-Aided CMS), * [[CESAR]] (CEntral and South-east europeAn Resources), * [[Construction of a treebank for Polish using automatic syntactic analysis]]. |
* [[http://clip.ipipan.waw.pl/CLARIN-PL-2|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]]) * [[http://clip.ipipan.waw.pl/COTHEC|COTHEC]] (Unified theory of coreference in Polish and its corpus-based verification) * [[http://zil.ipipan.waw.pl/Chronofleks|Chronofleks]] (A diachronic formal model of Polish inflection and its implementation) * [[http://zil.ipipan.waw.pl/CoDeS|CoDeS]] (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts) * [[http://dariah.pl/|DARIAH-PL]] (Digital Research Infrastructure for the Arts and Humanities) * [[http://clip.ipipan.waw.pl/ELRC|ELRC]] (European Language Resource Coordination) * [[http://clip.ipipan.waw.pl/KORBA|KORBA]] (Electronic corpus of 17th and 18th century Polish texts) * [[http://zil.ipipan.waw.pl/Quantifiers|Kwantyfikatory w języku: użycie i znaczenie]] (Quantifiers in Language: Use and Meaning) * [[http://clip.ipipan.waw.pl/Parthenos|Parthenos]] (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies) * [[http://zil.ipipan.waw.pl/Scwad|Scwad]] (Compositional distributional modelling of Polish language semantics) * [[http://synamet.uw.edu.pl/|SYNAMET]] (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse) |
Line 61: | Line 71: |
* [[CLARIN]] (Common Language Resources and Technology Infrastructure), * [[NKJP]] (National Corpus of Polish), * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[LT4eL]] (Language Technology for eLearning), * [[Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[Information Extraction from Polish free text]], |
* [[http://clip.ipipan.waw.pl/TextLink|TextLink]] (Structuring Discourse in Multilingual Europe) * [[http://clip.ipipan.waw.pl/Readability|Readability]] (Measuring the degree of readability of nonliterary Polish texts) * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim) * [[http://clip.ipipan.waw.pl/TrendMiner|TrendMiner]] (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams) * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society), * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]], * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS), * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources), * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]], * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure), * [[NKJP|NKJP]] (National Corpus of Polish), * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[LT4eL|LT4eL]] (Language Technology for eLearning), * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[Information Extraction from Polish free text|Information Extraction from Polish free text]], |
Line 70: | Line 92: |
* [[HPSG Grammar of Polish]]. | * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]. |
Line 74: | Line 96: |
Here are some of the tools and resources created within our projects. See [[http://clip.ipipan.waw.pl/|CLIP]] pages for a more exhaustive list of Polish tools and resources. | Here are some of the tools and resources created within our projects. See [[http://clip.ipipan.waw.pl/|CLIP]] pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN. |
Line 76: | Line 98: |
Tools (all open source, under [[http://www.gnu.org/copyleft/gpl.html|GPL]]): | Some '''tools''' (all open source, under [[http://www.gnu.org/copyleft/gpl.html|GPL]]; see also [[http://clip.ipipan.waw.pl/|CLIP]]): |
Line 79: | Line 101: |
* [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, | * [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, |
Line 81: | Line 103: |
* [[http://code.google.com/p/pantera-tagger/|PANTERA]] – a morphosyntactic tagger for Polish, | * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish, |
Line 83: | Line 105: |
* [[https://sourceforge.net/projects/poliqarp2/|Poliqarp2]] – a new generation corpus indexing and search engine, | |
Line 84: | Line 107: |
* [[http://nlp.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), * [[http://nlp.ipipan.waw.pl/WSDDE/|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming), |
* [[http://zil.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), * [[http://zil.ipipan.waw.pl/WSDDE|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments, * [[http://multiservice.nlp.ipipan.waw.pl/|Multiservice]] – web service for various of our tools, * [[http://zil.ipipan.waw.pl/TermoPL|TermoPL]] - multiword terms extraction from text * [[http://dsmodels.nlp.ipipan.waw.pl/sim1.html|DSmodels]] - web service for calculating word similarity using Polish word embeddings |
Line 89: | Line 115: |
Resources: | Main '''resources''' (many more at [[http://clip.ipipan.waw.pl/|CLIP]]): |
Line 91: | Line 117: |
* [[http://walenty.ipipan.waw.pl/|Walenty]] – a valence dictionary of Polish (described [[http://zil.ipipan.waw.pl/Walenty|here]]), | |
Line 92: | Line 119: |
* [[http://korpus.pl/|IPI PAN Corpus of Polish]] (obsolete). | * [[http://zil.ipipan.waw.pl/CoDeS|Polish word embeddings based on NKJP and Wikipedia]]. |
Line 100: | Line 127: |
* [[http://nlp.ipipan.waw.pl/seminar-e.html|NLP Seminar at IPI PAN]]; * [[http://iis.ipipan.waw.pl/|Intelligent Information Systems]] series of conferences. |
* [[http://jlm.ipipan.waw.pl/|Journal of Language Modelling]] * [[http://zil.ipipan.waw.pl/seminar|NLP Seminar at IPI PAN]] * conferences organised by the Group: * [[http://iis.ipipan.waw.pl/|Intelligent Information Systems]] series of conferences * [[http://poltal.ipipan.waw.pl/|PolTAL 2014]] – 9th International Conference on Natural Language Processing, 17–19 September 2014, Warsaw, Poland * [[http://tlt14.ipipan.waw.pl/|TLT14]] – 14th International Workshop on Treebanks and Linguistic Theories, 11–12 December 2015, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/2016/|CORBON 2016]] – Coreference Resolution Beyond !OntoNotes workshop at [[http://naacl.org/naacl-hlt-2016/|NAACL 2016]] (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US * [[http://headlex16.ipipan.waw.pl/|HeadLex16]] – Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, 24–29 July 2016, Warsaw, Poland * [[http://corbon.nlp.ipipan.waw.pl/|CORBON 2017]] – 2nd Workshop on Coreference Resolution Beyond !OntoNotes at [[http://eacl2017.org/|EACL 2017]] (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain * [[http://anawiki.essex.ac.uk/dali/crac18/|CRAC: Computational Models of Reference, Anaphora, and Coreference]] at [[http://naacl2018.org/|NAACL 2017]] (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA * [[http://waw2018.argdiap.pl/argdiap-conference/|16th ArgDiaP conference]], part of the [[http://waw2018.argdiap.pl/|WAW 2018]] (Warsaw Argumentation Week), 15–16 September 2018, Warsaw |
The Linguistic Engineering Group
The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS).
People
Core team
Anna Andrzejczuk, PhD |
|
Tomasz Bartosiak, MSc |
|
Zbigniew Gawłowicz |
|
Elżbieta Hajnicz, PhD, Assoc. Prof. |
|
Witold Kieraś, PhD |
|
Łukasz Kobyliński, PhD |
|
Dorota Komosińska, MSc |
|
Katarzyna Krasnowska, MSc |
|
Małgorzata Marciniak, PhD, Assoc. Prof. |
|
Agnieszka Mykowiecka, PhD, Assoc. Prof. |
|
Bartłomiej Nitoń, MSc |
|
Maciej Ogrodniczuk, PhD, Head of the Group |
|
Agnieszka Patejuk, PhD |
|
Adam Przepiórkowski, PhD, Assoc. Prof. |
|
Piotr Rychlik, PhD |
|
Aleksander Wawer, PhD |
|
Marcin Woliński, PhD |
|
Alina Wróblewska, PhD |
Associates
Konrad Gołuchowski, MSc |
|
Wojciech Jaworski, PhD |
|
Jakub Piskorski, PhD |
|
Piotr Rybak |
|
Filip Stefaniuk |
|
Jakub Szymanik, PhD |
|
Beata Wójtowicz, PhD |
Research
The main research areas of the Group
(Polish) corpus linguistics; cf. the IPI PAN Corpus of Polish and the National Corpus of Polish,
syntactic and semantic parsing of Polish; cf. Spejd and Świgra,
- extraction of linguistic knowledge from corpora,
- information extraction,
- sentiment analysis,
- morphosyntactic system of Polish,
- generative linguistic formalisms, esp., HPSG and LFG.
The Group is a member of CLARIN, DARIAH-PL, ELRC, FLaReNet and META-NET.
Current externally funded projects
CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)
COTHEC (Unified theory of coreference in Polish and its corpus-based verification)
Chronofleks (A diachronic formal model of Polish inflection and its implementation)
CoDeS (Compositional distributional semantic models for identification, discrimination and disambiguation of senses in Polish texts)
DARIAH-PL (Digital Research Infrastructure for the Arts and Humanities)
ELRC (European Language Resource Coordination)
KORBA (Electronic corpus of 17th and 18th century Polish texts)
Kwantyfikatory w języku: użycie i znaczenie (Quantifiers in Language: Use and Meaning)
Parthenos (Pooling Activities, Resources and Tools for Heritage, E-research Networking, Optimization and Synergies)
Scwad (Compositional distributional modelling of Polish language semantics)
SYNAMET (Microcorpus of Synaesthetic Metaphors. Towards a Formal Description and Efficient Methods of Analysis of Metaphors in Discourse)
Some of our past projects
TextLink (Structuring Discourse in Multilingual Europe)
Readability (Measuring the degree of readability of nonliterary Polish texts)
PARSEME (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
OPTA (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim)
TrendMiner (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams)
NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet),
SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
Automatic detection and correction of annotation errors in Polish language corpora,
ATLAS (Applied Technology for Language-Aided CMS),
CESAR (CEntral and South-east europeAn Resources),
Construction of a treebank for Polish using automatic syntactic analysis,
CORE (Computer-based methods for coreference resolution in Polish texts),
CLARIN (Common Language Resources and Technology Infrastructure),
NKJP (National Corpus of Polish),
Automatic detection of semantic dependencies within verb argument structures in large treebanks,
LUNA (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
LT4eL (Language Technology for eLearning),
Automatic extraction of linguistic knowledge from a large corpus of Polish,
Publicly available tools and resources
Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources, including more tools and resources developed at ZIL IPI PAN.
Some tools (all open source, under GPL; see also CLIP):
Świgra – a DCG parser,
Spejd – a shallow parsing and disambiguation system,
TaKIPI – a morphosyntactic tagger for Polish,
PANTERA – a morphosyntactic tagger for Polish,
Poliqarp – a corpus indexing and search engine,
Poliqarp2 – a new generation corpus indexing and search engine,
Dendrarium – a treebank development system (under development),
Anotatornia – a system for multi-level manual annotation of corpora (forthcoming),
WSDDE – a system for designing and performing Word Sense Disambiguation experiments,
Multiservice – web service for various of our tools,
TermoPL - multiword terms extraction from text
DSmodels - web service for calculating word similarity using Polish word embeddings
Main resources (many more at CLIP):
Other activities
Links to some other activities of the Group:
- conferences organised by the Group:
Intelligent Information Systems series of conferences
PolTAL 2014 – 9th International Conference on Natural Language Processing, 17–19 September 2014, Warsaw, Poland
TLT14 – 14th International Workshop on Treebanks and Linguistic Theories, 11–12 December 2015, Warsaw, Poland
CORBON 2016 – Coreference Resolution Beyond OntoNotes workshop at NAACL 2016 (The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 16 June 2016, San Diego, US
HeadLex16 – Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, 24–29 July 2016, Warsaw, Poland
CORBON 2017 – 2nd Workshop on Coreference Resolution Beyond OntoNotes at EACL 2017 (The 15th Conference of the European Chapter of the Association for Computational Linguistics), 4 April 2017, Valencia, Spain
CRAC: Computational Models of Reference, Anaphora, and Coreference at NAACL 2017 (The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies), 6 June 2018, New Orleans, USA
16th ArgDiaP conference, part of the WAW 2018 (Warsaw Argumentation Week), 15–16 September 2018, Warsaw