Differences between revisions 90 and 94 (spanning 4 versions)

The Linguistic Engineering Group

The Linguistic Engineering (LE) Group is part of the Department of Artificial Intelligence at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS).

People

Anna Andrzejczuk, PhD	anna.andrzejczuk@ipipan.waw.pl
Tomasz Bartosiak, MSc	tomasz.bartosiak@gmail.com
Konrad Gołuchowski, MSc (part time)	kodieg@gmail.com
Elżbieta Hajnicz, PhD, Assoc. Prof.	elzbieta.hajnicz@ipipan.waw.pl
Wojciech Jaworski, PhD (part time)	wjaworski@mimuw.edu.pl
Łukasz Kobyliński, PhD	lkobylinski@ipipan.waw.pl
Katarzyna Krasnowska, MSc (part time)	katarzyna.krasnowska@ipipan.waw.pl
Małgorzata Marciniak, PhD	malgorzata.marciniak@ipipan.waw.pl
Agnieszka Mykowiecka, PhD, Assoc. Prof.	agnieszka.mykowiecka@ipipan.waw.pl
Bartłomiej Nitoń, MSc	bartek.niton@gmail.com
Maciej Ogrodniczuk, PhD, Head of the Group	maciej.ogrodniczuk@ipipan.waw.pl
Agnieszka Patejuk, PhD	aep@ipipan.waw.pl
Jakub Piskorski, PhD (associate)	jakub.piskorski@ipipan.waw.pl
Adam Przepiórkowski, PhD, Assoc. Prof.	adam.przepiorkowski@ipipan.waw.pl
Piotr Rychlik, PhD	rychlik@ipipan.waw.pl
Piotr Sikora, MSc (part time)	piotr.sikora@ipipan.waw.pl
Jan Szejko, MSc (part time)	jan.szejko@ipipan.waw.pl
Aleksander Wawer, PhD	aleksander.wawer@ipipan.waw.pl
Marcin Woliński, PhD	marcin.wolinski@ipipan.waw.pl
Beata Wójtowicz, PhD (part time)	beata.wojtowicz@ipipan.waw.pl
Alina Wróblewska, PhD	alina.wroblewska@ipipan.waw.pl
Aleksander Zabłocki, MSc (part time)	olekz@mimuw.edu.pl
Bartosz Zaborowski, MSc	bartosz.zaborowski@ipipan.waw.pl

Research

The main research areas of the Group

(Polish) corpus linguistics; cf. the IPI PAN Corpus of Polish and the National Corpus of Polish,
syntactic and semantic parsing of Polish; cf. Spejd and Świgra,
extraction of linguistic knowledge from corpora,
information extraction,
sentiment analysis,
morphosyntactic system of Polish,
generative linguistic formalisms, esp. HPSG and LFG.

The Group is a member of CLARIN, DARIAH-PL, FLaReNet and META-NET.

Current externally funded projects

CLARIN-PL (Polish chapter of Common Language Resources and Technology Infrastructure)
COTHEC
Chronofleks
KORBA
PARSEME (PARSing and Multi-word Expressions. Towards linguistic precision and computational efficiency in natural language processing)
Parthenos
Scwad
SYNAMET
TextLink

Some of our past projects

Readability (Measuring the degree of readability of nonliterary Polish texts),
OPTA (Automatic methods for recognising opinion targets and opinion expressions in Polish),
TrendMiner (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams),
NEKST (An adaptive system to support problem-solving on the basis of document collections in the Internet),
SYNAT (Creation of a universal, open repository platform for hosting and communication of networked resources of knowledge for science, education and open knowledge-based society),
Automatic detection and correction of annotation errors in Polish language corpora,
ATLAS (Applied Technology for Language-Aided CMS),
CESAR (CEntral and South-east europeAn Resources),
Construction of a treebank for Polish using automatic syntactic analysis,
CORE (Computer-based methods for coreference resolution in Polish texts),
CLARIN (Common Language Resources and Technology Infrastructure),
NKJP (National Corpus of Polish),
Automatic detection of semantic dependencies within verb argument structures in large treebanks,
LUNA (spoken Language UNderstanding in multilinguAl communication systems) with the Polish support,
LT4eL (Language Technology for eLearning),
Automatic extraction of linguistic knowledge from a large corpus of Polish,
Information Extraction from Polish free text,
The IPI PAN Corpus of Polish,
Treebank / Test Suite of Polish Utterances,
HPSG Grammar of Polish.

Publicly available tools and resources

Here are some of the tools and resources created within our projects. See the CLIP pages for a more exhaustive list of Polish tools and resources, including others developed by the Group (ZIL IPI PAN).

Some tools (all open source, under GPL; see also CLIP):

Świgra – a DCG parser,
Spejd – a shallow parsing and disambiguation system,
TaKIPI – a morphosyntactic tagger for Polish,
PANTERA – a morphosyntactic tagger for Polish,
Poliqarp – a corpus indexing and search engine,
Dendrarium – a treebank development system (under development),
Anotatornia – a system for multi-level manual annotation of corpora (forthcoming),
WSDDE – a system for designing and performing Word Sense Disambiguation experiments,
Multiservice – a web service providing access to several of our tools,
etc.

Main resources (many more at CLIP):

Walenty – a valence dictionary of Polish (described here),
National Corpus of Polish.

Other activities

Links to some other activities of the Group:

Journal of Language Modelling,
NLP Seminar at IPI PAN,
conferences organised by the Group:
- Intelligent Information Systems series of conferences,
- PolTAL 2014 – 9th International Conference on Natural Language Processing, 17–19 September 2014, Warsaw, Poland,
- TLT14 – 14th International Workshop on Treebanks and Linguistic Theories, 11–12 December 2015, Warsaw, Poland,
- HeadLex16 – Joint 2016 Conference on Head-driven Phrase Structure Grammar and Lexical Functional Grammar, 24–29 July 2016, Warsaw, Poland

- Revision 90 as of 2016-02-11 17:40:18
  Size: 10079
  Editor: MaciejOgrodniczuk
  Comment:
+ Revision 94 as of 2016-04-20 12:32:21
  Size: 10352
  Editor: MaciejOgrodniczuk
  Comment:
 Line 45:
-The Group is a member of [[http://www.clarin.eu/|CLARIN]], [http://dariah.pl/|DARIAH-PL]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]].
+The Group is a member of [[http://www.clarin.eu/|CLARIN]], [[http://dariah.pl/|DARIAH-PL]], [[http://www.flarenet.eu/|FLaReNet]] and [[http://www.meta-net.eu/|META-NET]].
 Line 62:
+ * [[http://zil.ipipan.waw.pl/OPTA|OPTA]] (Automatyczne metody rozpoznawania przedmiotów i wyrażeń opinii w języku polskim)
 * [[http://clip.ipipan.waw.pl/TrendMiner|TrendMiner]] (Large-scale, Cross-lingual Trend Mining and Summarisation of Real-time Media Streams)

Diff for "ZILStart"
