Size: 8340
Comment:
|
Size: 9494
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
#acl CLIPWarszawaGroup:read,write All:read | #acl +All:read Default |
Line 8: | Line 8: |
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], MSc (on leave) || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KacperChwialkowski|Kacper Chwiałkowski]] (part time) || [[mailto:kacper.chwialkowski@ipipan.waw.pl|kacper.chwialkowski@ipipan.waw.pl]] || |
|| [[http://zil.ipipan.waw.pl/AnnaAndrzejczuk|Anna Andrzejczuk]], PhD || [[mailto:anna.andrzejczuk@ipipan.waw.pl|anna.andrzejczuk@ipipan.waw.pl]] || |
Line 11: | Line 10: |
|| [[http://zil.ipipan.waw.pl/KonradGoluchowski|Konrad Gołuchowski]], MSc || [[mailto:kodieg@gmail.com|kodieg@gmail.com]] || | |
Line 12: | Line 12: |
|| [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], MSc || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MateuszKopec|Mateusz Kopeć]] || [[mailto:mateusz.kopec@ipipan.waw.pl|mateusz.kopec@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]] || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]] (part time) || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || |
|| [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]], PhD || [[mailto:lkobylinski@ipipan.waw.pl|lkobylinski@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MateuszKopec|Mateusz Kopeć]], MSc || [[mailto:mateusz.kopec@ipipan.waw.pl|mateusz.kopec@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/KatarzynaKrasnowska|Katarzyna Krasnowska]], MSc || [[mailto:katarzyna.krasnowska@ipipan.waw.pl|katarzyna.krasnowska@ipipan.waw.pl]] || |
Line 17: | Line 16: |
|| [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MarcinMilkowski|Marcin Miłkowski]], PhD (part time) || [[mailto:marcin.milkowski@ifispan.waw.pl|marcin.milkowski@ifispan.waw.pl]] || |
|| [[http://zil.ipipan.waw.pl/MalgorzataMarciniak|Małgorzata Marciniak]], PhD || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]], MSc || [[mailto:malgorzata.marciniak@ipipan.waw.pl|malgorzata.marciniak@ipipan.waw.pl]] || |
Line 21: | Line 20: |
|| [[http://zil.ipipan.waw.pl/AgnieszkaPatejuk|Agnieszka Patejuk]], MSc || [[mailto:aep@ipipan.waw.pl|aep@ipipan.waw.pl]] || | |
Line 23: | Line 23: |
|| [[http://zil.ipipan.waw.pl/DominikaRogozinska|Dominika Rogozińska]] || [[mailto:dominika.rogozinska@students.mimuw.edu.pl|dominika.rogozinska@students.mimuw.edu.pl]] || | |
Line 24: | Line 25: |
|| [[http://zil.ipipan.waw.pl/PiotrSikora|Piotr Sikora]], MSc || [[mailto:piotr.sikora@ipipan.waw.pl|piotr.sikora@ipipan.waw.pl]] || | |
Line 25: | Line 27: |
|| [[http://zil.ipipan.waw.pl/DanutaSkowronska|Danuta Skowrońska]], MSc || [[mailto:danuta.skowronska@ipipan.waw.pl|danuta.skowronska@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/JanSzejko|Jan Szejko]] (part time) || [[mailto:jan.szejko@ipipan.waw.pl|jan.szejko@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/LukaszSzalkiewicz|Łukasz Szałkiewicz]], MSc || [[mailto:lukasz.szalkiewicz@ipipan.waw.pl|lukasz.szalkiewicz@ipipan.waw.pl]] || |
|| [[http://zil.ipipan.waw.pl/JanSzejko|Jan Szejko]] || [[mailto:jan.szejko@ipipan.waw.pl|jan.szejko@ipipan.waw.pl]] || |
Line 29: | Line 29: |
|| [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], MSc || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksandraWieczorek|Aleksandra Wieczorek]], MSc (part time) || [[mailto:aleksandra.wieczorek@ipipan.waw.pl|aleksandra.wieczorek@ipipan.waw.pl]] || |
|| [[http://zil.ipipan.waw.pl/JakubWaszczuk|Jakub Waszczuk]], MSc || [[mailto:jakub.waszczuk@ipipan.waw.pl|jakub.waszczuk@ipipan.waw.pl]] || || [[http://zil.ipipan.waw.pl/AleksanderWawer|Aleksander Wawer]], PhD || [[mailto:aleksander.wawer@ipipan.waw.pl|aleksander.wawer@ipipan.waw.pl]] || |
Line 32: | Line 32: |
|| [[http://zil.ipipan.waw.pl/BeataWojtowicz|Beata Wójtowicz]], PhD (part time) || [[mailto:beata.wojtowicz@ipipan.waw.pl|beata.wojtowicz@ipipan.waw.pl]] || | |
Line 33: | Line 34: |
|| [[http://zil.ipipan.waw.pl/BartoszZaborowski|Bartosz Zaborowski]], MSc || [[mailto:bartosz.zaborowski@ipipan.waw.pl|bartosz.zaborowski@ipipan.waw.pl]] || | |
Line 41: | Line 43: |
* syntactic and semantic parsing of Polish; cf. [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], | * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]], |
Line 52: | Line 54: |
* [[CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society), * [[ATLAS]] (Applied Technology for Language-Aided CMS), * [[CESAR]] (CEntral and South-east europeAn Resources), * [[Construction of a treebank for Polish using automatic syntactic analysis]]. |
* [[http://www.clarin-pl.eu/|CLARIN-PL]] (Polish chapter of [[http://www.clarin.eu/|Common Language Resources and Technology Infrastructure]]) * [[PARSEME|PARSEME]] (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing) * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts), * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet), * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society). * [[http://zil.ipipan.waw.pl/Automatic%20detection%20and%20correction%20of%20annotation%20errors%20in%20Polish%20language%20corpora|Automatic detection and correction of annotation errors in Polish language corpora]] |
Line 61: | Line 63: |
* [[CLARIN]] (Common Language Resources and Technology Infrastructure), * [[NKJP]] (National Corpus of Polish), * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[LT4eL]] (Language Technology for eLearning), * [[Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[Information Extraction from Polish free text]], |
* [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS), * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources), * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]], * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure), * [[NKJP|NKJP]] (National Corpus of Polish), * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]], * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support, * [[LT4eL|LT4eL]] (Language Technology for eLearning), * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]], * [[Information Extraction from Polish free text|Information Extraction from Polish free text]], |
Line 70: | Line 75: |
* [[HPSG Grammar of Polish]]. | * [[HPSG Grammar of Polish|HPSG Grammar of Polish]]. |
Line 79: | Line 84: |
* [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, | * [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system, |
Line 81: | Line 86: |
* [[http://code.google.com/p/pantera-tagger/|PANTERA]] – a morphosyntactic tagger for Polish, | * [[http://zil.ipipan.waw.pl/PANTERA|PANTERA]] – a morphosyntactic tagger for Polish, |
Line 84: | Line 89: |
* [[http://nlp.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), * [[http://nlp.ipipan.waw.pl/WSDDE/|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming), |
* [[http://zil.ipipan.waw.pl/Anotatornia/|Anotatornia]] – a system for multi-level manual annotation of corpora (forthcoming), * [[http://zil.ipipan.waw.pl/WSDDE|WSDDE]] – a system for designing and performing Word Sense Disambiguation experiments (forthcoming), |
Line 92: | Line 97: |
* [[http://zil.ipipan.waw.pl/DistrNKJP/|DistrNKJP]] – a distributable (IPR-free) subcorpus of National Corpus of Polish, |
The Linguistic Engineering Group
The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS).
People
Anna Andrzejczuk, PhD |
|
Łukasz Degórski, MSc |
|
Konrad Gołuchowski, MSc |
|
Elżbieta Hajnicz, PhD |
|
Łukasz Kobyliński, PhD |
|
Mateusz Kopeć, MSc |
|
Katarzyna Krasnowska, MSc |
|
Anna Kupść, PhD (on leave) |
|
Małgorzata Marciniak, PhD |
|
Michał Lenart, MSc |
|
Agnieszka Mykowiecka, PhD |
|
Maciej Ogrodniczuk, PhD |
|
Agnieszka Patejuk, MSc |
|
Jakub Piskorski, PhD, Associate |
|
Adam Przepiórkowski, PhD, Head of the Group |
|
Piotr Rychlik, PhD |
|
Piotr Sikora, MSc |
|
Tomek Strzałkowski, PhD, Foreign Associate |
|
Stan Szpakowicz, PhD, Foreign Associate |
|
Jakub Waszczuk, MSc |
|
Aleksander Wawer, PhD |
|
Marcin Woliński, PhD |
|
Beata Wójtowicz, PhD (part time) |
|
Alina Wróblewska, MSc |
|
Bartosz Zaborowski, MSc |
|
Sebastian Żurowski, PhD (part time) |
Research
The main research areas of the Group
(Polish) corpus linguistics; cf. the IPI PAN Corpus of Polish and the National Corpus of Polish,
syntactic and semantic parsing of Polish; cf. Spejd and Świgra,
- extraction of linguistic knowledge from corpora,
- information extraction,
- sentiment analysis,
- morphosyntactic system of Polish,
- generative linguistic formalisms, esp., HPSG and LFG.
The Group is a member of CLARIN, FLaReNet and META-NET.
Current externally funded projects
CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)
PARSEME (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
CORE (Computer-based methods for coreference resolution in Polish texts),
NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet),
SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society).
Automatic detection and correction of annotation errors in Polish language corpora
Some of our past projects
ATLAS (Applied Technology for Language-Aided CMS),
CESAR (CEntral and South-east europeAn Resources),
Construction of a treebank for Polish using automatic syntactic analysis,
CLARIN (Common Language Resources and Technology Infrastructure),
NKJP (National Corpus of Polish),
Automatic detection of semantic dependencies within verb argument structures in large treebanks,
LUNA (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
LT4eL (Language Technology for eLearning),
Automatic extraction of linguistic knowledge from a large corpus of Polish,
Publicly available tools and resources
Here are some of the tools and resources created within our projects. See CLIP pages for a more exhaustive list of Polish tools and resources.
Tools (all open source, under GPL):
Świgra – a DCG parser,
Spejd – a shallow parsing and disambiguation system,
TaKIPI – a morphosyntactic tagger for Polish,
PANTERA – a morphosyntactic tagger for Polish,
Poliqarp – a corpus indexing and search engine,
Dendrarium – a treebank development system (under development),
Anotatornia – a system for multi-level manual annotation of corpora (forthcoming),
WSDDE – a system for designing and performing Word Sense Disambiguation experiments (forthcoming),
Resources:
DistrNKJP – a distributable (IPR-free) subcorpus of National Corpus of Polish,
IPI PAN Corpus of Polish (obsolete).
Other activities
Links to some other activities of the Group:
Intelligent Information Systems series of conferences.