Differences between revisions 2 and 9 (spanning 7 versions)

The Linguistic Engineering Group

The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS).

People

Anna Andrzejczuk, MSc (on leave)	anna.andrzejczuk@ipipan.waw.pl
Kacper Chwiałkowski (part time)	kacper.chwialkowski@ipipan.waw.pl
Łukasz Degórski, MSc	ldegorski@ipipan.waw.pl
Elżbieta Hajnicz, PhD	elzbieta.hajnicz@ipipan.waw.pl
Łukasz Kobyliński, MSc	lkobylinski@ipipan.waw.pl
Mateusz Kopeć	mateusz.kopec@ipipan.waw.pl
Katarzyna Krasnowska (part time)	katarzyna.krasnowska@ipipan.waw.pl
Anna Kupść, PhD (on leave)	anna.kupsc@ipipan.waw.pl
Michał Lenart	michal.lenart@ipipan.waw.pl
Małgorzata Marciniak, PhD	malgorzata.marciniak@ipipan.waw.pl
Marcin Miłkowski, PhD (part time)	marcin.milkowski@ifispan.waw.pl
Agnieszka Mykowiecka, PhD	agnieszka.mykowiecka@ipipan.waw.pl
Maciej Ogrodniczuk, PhD	maciej.ogrodniczuk@ipipan.waw.pl
Jakub Piskorski, PhD, Associate	jakub.piskorski@ipipan.waw.pl
Adam Przepiórkowski, PhD, Head of the Group	adam.przepiorkowski@ipipan.waw.pl
Piotr Rychlik, PhD	rychlik@ipipan.waw.pl
Tomek Strzałkowski, PhD, Foreign Associate	tomek@cs.albany.edu
Danuta Skowrońska, MSc	danuta.skowronska@ipipan.waw.pl
Jan Szejko (part time)	jan.szejko@ipipan.waw.pl
Łukasz Szałkiewicz, MSc	lukasz.szalkiewicz@ipipan.waw.pl
Stan Szpakowicz, PhD, Foreign Associate	szpak@site.uottawa.ca
Aleksander Wawer, MSc	aleksander.wawer@ipipan.waw.pl
Aleksandra Wieczorek, MSc (part time)	aleksandra.wieczorek@ipipan.waw.pl
Marcin Woliński, PhD	marcin.wolinski@ipipan.waw.pl
Alina Wróblewska, MSc	alina.wroblewska@ipipan.waw.pl
Sebastian Żurowski, PhD (part time)	sebastian.zurowski@ipipan.waw.pl

Research

The main research areas of the Group

(Polish) corpus linguistics; cf. the IPI PAN Corpus of Polish and the National Corpus of Polish,
syntactic and semantic parsing of Polish; cf. Spejd and Świgra,
extraction of linguistic knowledge from corpora,
information extraction,
sentiment analysis,
morphosyntactic system of Polish,
generative linguistic formalisms, esp., HPSG and LFG.

The Group is a member of CLARIN, FLaReNet and META-NET.

Current externally funded projects

CORE (Computer-based methods for coreference resolution in Polish texts),
NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet),
SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
ATLAS (Applied Technology for Language-Aided CMS),
CESAR (CEntral and South-east europeAn Resources).

Some of our past projects

Construction of a treebank for Polish using automatic syntactic analysis,
CLARIN (Common Language Resources and Technology Infrastructure),
NKJP (National Corpus of Polish),
Automatic detection of semantic dependencies within verb argument structures in large treebanks,
LUNA (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
LT4eL (Language Technology for eLearning),
Automatic extraction of linguistic knowledge from a large corpus of Polish,
Information Extraction from Polish free text,
The IPI PAN Corpus of Polish,
Treebank / Test Suite of Polish Utterances,
HPSG Grammar of Polish.

Publicly available tools and resources

Here are some of the tools and resources created within our projects. See [[|CLIP]] pages for a more exhaustive list of Polish tools and resources.

Tools (all open source, under GPL):

Świgra – a DCG parser,
Spejd – a shallow parsing and disambiguation system,
TaKIPI – a morphosyntactic tagger for Polish,
PANTERA – a morphosyntactic tagger for Polish,
Poliqarp – a corpus indexing and search engine,
Dendrarium – a treebank development system (under development),
Anotatornia – a system for multi-level manual annotation of corpora (forthcoming),
WSDDE – a system for designing and performing Word Sense Disambiguation experiments (forthcoming),
etc.

Resources:

National Corpus of Polish,
IPI PAN Corpus of Polish (obsolete).

Other activities

Links to some other activities of the Group:

NLP Seminar at IPI PAN;
Intelligent Information Systems series of conferences.

-  ⇤ ← Revision 2 as of 2011-11-14 14:59:51 → 
  Size: 8756
  Editor: MichalLenart
  Comment:
+   ← Revision 9 as of 2011-12-02 19:10:26 → ⇥
  Size: 8669
  Editor: AdamPrzepiorkowski
  Comment:
-Deletions are marked like this.
+Additions are marked like this.
 Line 1:
-#acl CLIPWarszawaGroup:read,write All:read
+#acl ZILGroup:read,write All:read
 Line 14:
-|| [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]]                            || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] ||
-Line 17:
+Line 16:
+|| [[http://zil.ipipan.waw.pl/MichalLenart|Michał Lenart]]                            || [[mailto:michal.lenart@ipipan.waw.pl|michal.lenart@ipipan.waw.pl]] ||
 Line 41:
- * syntactic and semantic parsing of Polish; cf. [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]],
+ * syntactic and semantic parsing of Polish; cf. [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] and [[http://nlp.ipipan.waw.pl/~wolinski/swigra/|Świgra]],
 Line 52:
- * [[http://clip.ipipan.waw.pl/CORE]] (Computer-based methods for coreference resolution in Polish texts),
 * [[http://clip.ipipan.waw.pl/NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet),
 * [[http://clip.ipipan.waw.pl/SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
 * [[http://clip.ipipan.waw.pl/ATLAS]] (Applied Technology for Language-Aided CMS),
 * [[http://clip.ipipan.waw.pl/CESAR]] (CEntral and South-east europeAn Resources),
 * [[http://clip.ipipan.waw.pl/Construction of a treebank for Polish using automatic syntactic analysis]].
+ * [[CORE|CORE]] (Computer-based methods for coreference resolution in Polish texts),
 * [[NEKST|NEKST]] (An adaptive system to support problem-solving on the basis of document collections in the Internet),
 * [[SYNAT|SYNAT]] (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
 * [[ATLAS|ATLAS]] (Applied Technology for Language-Aided CMS),
 * [[CESAR|CESAR]] (CEntral and South-east europeAn Resources).
-Line 61:
+Line 60:
- * [[http://clip.ipipan.waw.pl/CLARIN]] (Common Language Resources and Technology Infrastructure),
 * [[http://clip.ipipan.waw.pl/NKJP]] (National Corpus of Polish),
 * [[http://clip.ipipan.waw.pl/Automatic detection of semantic dependencies within verb argument structures in large treebanks]],
 * [[http://clip.ipipan.waw.pl/LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
 * [[http://clip.ipipan.waw.pl/LT4eL]] (Language Technology for eLearning),
 * [[http://clip.ipipan.waw.pl/Automatic extraction of linguistic knowledge from a large corpus of Polish]],
 * [[http://clip.ipipan.waw.pl/Information Extraction from Polish free text]],
 * [[http://clip.ipipan.waw.pl/IPI PAN Corpus|The IPI PAN Corpus of Polish]],
 * [[http://clip.ipipan.waw.pl/Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]],
 * [[http://clip.ipipan.waw.pl/HPSG Grammar of Polish]].
+ * [[Construction of a treebank for Polish using automatic syntactic analysis|Construction of a treebank for Polish using automatic syntactic analysis]],
 * [[CLARIN|CLARIN]] (Common Language Resources and Technology Infrastructure),
 * [[NKJP|NKJP]] (National Corpus of Polish),
 * [[Automatic detection of semantic dependencies within verb argument structures in large treebanks|Automatic detection of semantic dependencies within verb argument structures in large treebanks]],
 * [[LUNA|LUNA]] (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
 * [[LT4eL|LT4eL]] (Language Technology for eLearning),
 * [[Automatic extraction of linguistic knowledge from a large corpus of Polish|Automatic extraction of linguistic knowledge from a large corpus of Polish]],
 * [[Information Extraction from Polish free text|Information Extraction from Polish free text]],
 * [[IPI PAN Corpus|The IPI PAN Corpus of Polish]],
 * [[Test Suite of Polish Utterances|Treebank / Test Suite of Polish Utterances]],
 * [[HPSG Grammar of Polish|HPSG Grammar of Polish]].
 Line 74:
-Here are some of the tools and resources created within our projects.  See [[http://clip.ipipan.waw.pl/|CLIP]] pages for a more exhaustive list of Polish tools and resources.
+Here are some of the tools and resources created within our projects.  See [[|CLIP]] pages for a more exhaustive list of Polish tools and resources.
 Line 79:
- * [[http://nlp.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system,
+ * [[http://zil.ipipan.waw.pl/Spejd/|Spejd]] – a shallow parsing and disambiguation system,

Diff for "ZILStart"

Menu