| Size: 8756 Comment:  | Size: 9193 Comment:  | 
| Deletions are marked like this. | Additions are marked like this. | 
| Line 1: | Line 1: | 
| #acl CLIPWarszawaGroup:read,write All:read | #acl +All:read Default | 
| Line 8: | Line 8: | 
| || [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], MSc (on leave)      || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KacperChwialkowski|Kacper Chwiałkowski]] (part time) || [[mailto:kacper.chwialkowski@ipipan.waw.pl|kacper.chwialkowski@ipipan.waw.pl]] || | || [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || | 
| Line 12: | Line 11: | 
| || [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], MSc               || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MateuszKopec|Mateusz Kopeć]] || [[mailto:mateusz.kopec@ipipan.waw.pl|mateusz.kopec@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]] || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]] (part time) || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || | || [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD               || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MateuszKopec|Mateusz Kopeć]], MSc || [[mailto:mateusz.kopec@ipipan.waw.pl|mateusz.kopec@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || | 
| Line 18: | Line 16: | 
| || [[http://zil.ipipan.waw.pl/MarcinMilkowski|Marcin Miłkowski]], PhD (part time) || [[mailto:marcin.milkowski@ifispan.waw.pl|marcin.milkowski@ifispan.waw.pl]] || | |
| Line 21: | Line 18: | 
| || [[http://zil.ipipan.waw.pl/AgnieszkaPatejuk|Agnieszka Patejuk]], MSc || [[mailto:aep@ipipan.waw.pl|aep@ipipan.waw.pl]] || | |
| Line 23: | Line 21: | 
| || [[http://zil.ipipan.waw.pl/DominikaRogozinska|Dominika Rogozińska]] || [[mailto:dominika.rogozinska@students.mimuw.edu.pl|dominika.rogozinska@students.mimuw.edu.pl]] || | |
| Line 24: | Line 23: | 
| || [[http://zil.ipipan.waw.pl/PiotrSikora|Piotr Sikora]] || [[mailto:piotr.sikora@ipipan.waw.pl|piotr.sikora@ipipan.waw.pl]] || | |
| Line 25: | Line 25: | 
| || [[http://zil.ipipan.waw.pl/DanutaSkowronska|Danuta Skowrońska]], MSc               || [[mailto:danuta.skowronska@ipipan.waw.pl|danuta.skowronska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/JanSzejko|Jan Szejko]] (part time) || [[mailto:jan.szejko@ipipan.waw.pl|jan.szejko@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszSzalkiewicz|Łukasz Szałkiewicz]], MSc || [[mailto:lukasz.szalkiewicz@ipipan.waw.pl|lukasz.szalkiewicz@ipipan.waw.pl]] || | || [[http://zil.ipipan.waw.pl/JanSzejko|Jan Szejko]] || [[mailto:jan.szejko@ipipan.waw.pl|jan.szejko@ipipan.waw.pl]] || | 
| Line 29: | Line 27: | 
| || [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], MSc                 || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksandraWieczorek|Aleksandra Wieczorek]], MSc (part time) || [[mailto:aleksandra.wieczorek@ipipan.waw.pl|aleksandra.wieczorek@ipipan.waw.pl]] || | || [[http://zil.ipipan.waw.pl/JakubWaszczuk|Jakub Waszczuk]], MSc         || [[mailto:jakub.waszczuk@ipipan.waw.pl|jakub.waszczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], PhD || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || | 
| Line 32: | Line 30: | 
| || [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD (part time) || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] || | |
| Line 33: | Line 32: | 
| || [[http://zil.ipipan.waw.pl/BartoszZaborowski|Bartosz Zaborowski]], MSc || [[mailto:bartosz.zaborowski@ipipan.waw.pl|bartosz.zaborowski@ipipan.waw.pl]] || | |
| Line 41: | Line 41: | 
| * syntactic and semantic parsing of Polish; cf. [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], | * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], | 
| Line 52: | Line 52: | 
| * [[http://clip.ipipan.waw.pl/CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[http://clip.ipipan.waw.pl/NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[http://clip.ipipan.waw.pl/SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society), * [[http://clip.ipipan.waw.pl/ATLAS]] (Applied Technology for Language-Aided CMS), * [[http://clip.ipipan.waw.pl/CESAR]] (CEntral and South-east europeAn Resources), * [[http://clip.ipipan.waw.pl/Construction of a treebank for Polish using automatic syntactic analysis]]. | * [[http://www.clarin-pl.eu/|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]]) * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society). * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]] | 
| Line 61: | Line 61: | 
| * [[http://clip.ipipan.waw.pl/CLARIN]] (Common Language Resources and Technology Infrastructure), * [[http://clip.ipipan.waw.pl/NKJP]] (National Corpus of Polish), * [[http://clip.ipipan.waw.pl/Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[http://clip.ipipan.waw.pl/LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[http://clip.ipipan.waw.pl/LT4eL]] (Language Technology for eLearning), * [[http://clip.ipipan.waw.pl/Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[http://clip.ipipan.waw.pl/Information Extraction from Polish free text]], * [[http://clip.ipipan.waw.pl/IPI PAN Corpus|The IPI PAN Corpus of Polish]], * [[http://clip.ipipan.waw.pl/Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]], * [[http://clip.ipipan.waw.pl/HPSG Grammar of Polish]]. | * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS), * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources), * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]], * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure), * [[NKJP|NKJP]] (National Corpus of Polish), * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[LT4eL|LT4eL]] (Language Technology for eLearning), * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[Information Extraction from Polish free text|Information Extraction from Polish free text]], * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]], * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]], * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]. | 
| Line 79: | Line 82: | 
| * [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, | * [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, | 
| Line 81: | Line 84: | 
| * [[http://code.google.com/p/pantera-tagger/|PANTERA]] – a morphosyntactic tagger for Polish, | * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish, | 
| Line 84: | Line 87: | 
| * [[http://nlp.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), * [[http://nlp.ipipan.waw.pl/WSDDE/|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming), | * [[http://zil.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), * [[http://zil.ipipan.waw.pl/WSDDE|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming), | 
| Line 92: | Line 95: | 
| * [[http://zil.ipipan.waw.pl/DistrNKJP/|DistrNKJP]] – a distributable (IPR-free) subcorpus of National Corpus of Polish, | 
The Linguistic Engineering Group
The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS).
People
| Anna Andrzejczuk, PhD | |
| Łukasz Degórski, MSc | |
| Elżbieta Hajnicz, PhD | |
| Łukasz Kobyliński, PhD | |
| Mateusz Kopeć, MSc | |
| Katarzyna Krasnowska, MSc | |
| Anna Kupść, PhD (on leave) | |
| Małgorzata Marciniak, PhD | |
| Agnieszka Mykowiecka, PhD | |
| Maciej Ogrodniczuk, PhD | |
| Agnieszka Patejuk, MSc | |
| Jakub Piskorski, PhD, Associate | |
| Adam Przepiórkowski, PhD, Head of the Group | |
| Piotr Rychlik, PhD | |
| Tomek Strzałkowski, PhD, Foreign Associate | |
| Stan Szpakowicz, PhD, Foreign Associate | |
| Jakub Waszczuk, MSc | |
| Aleksander Wawer, PhD | |
| Marcin Woliński, PhD | |
| Beata Wójtowicz, PhD (part time) | |
| Alina Wróblewska, MSc | |
| Bartosz Zaborowski, MSc | |
| Sebastian Żurowski, PhD (part time) | 
Research
The main research areas of the Group
- (Polish) corpus linguistics; cf. the IPI PAN Corpus of Polish and the National Corpus of Polish, 
- syntactic and semantic parsing of Polish; cf. Spejd and Świgra, 
- extraction of linguistic knowledge from corpora,
- information extraction,
- sentiment analysis,
- morphosyntactic system of Polish,
- generative linguistic formalisms, esp., HPSG and LFG.
The Group is a member of CLARIN, FLaReNet and META-NET.
Current externally funded projects
- CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure) 
- PARSEME (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) 
- CORE (Computer-based methods for coreference resolution in Polish texts), 
- NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet), 
- SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society). 
- Automatic detection and correction of annotation errors in Polish language corpora 
Some of our past projects
- ATLAS (Applied Technology for Language-Aided CMS), 
- CESAR (CEntral and South-east europeAn Resources), 
- Construction of a treebank for Polish using automatic syntactic analysis, 
- CLARIN (Common Language Resources and Technology Infrastructure), 
- NKJP (National Corpus of Polish), 
- Automatic detection of semantic dependencies within verb argument structures in large treebanks, 
- LUNA (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, 
- LT4eL (Language Technology for eLearning), 
- Automatic extraction of linguistic knowledge from a large corpus of Polish, 
Publicly available tools and resources
Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources.
Tools (all open source, under GPL):
- Świgra – a DCG parser, 
- Spejd – a shallow parsing and disambiguation system, 
- TaKIPI – a morphosyntactic tagger for Polish, 
- PANTERA – a morphosyntactic tagger for Polish, 
- Poliqarp – a corpus indexing and search engine, 
- Dendrarium – a treebank development system (under development), 
- Anotatornia – a system for multi-level manual annotation of corpora (forthcoming), 
- WSDDE – a system for designing and performing Word Sense Disambiguation experiments (forthcoming), 
Resources:
- DistrNKJP – a distributable (IPR-free) subcorpus of National Corpus of Polish, 
- IPI PAN Corpus of Polish (obsolete). 
Other activities
Links to some other activities of the Group:
- Intelligent Information Systems series of conferences. 
