Differences between revisions 1 and 8 (spanning 7 versions)

Information Extraction from Polish free text

Project factsheet

Polish name:	Opracowanie narzędzi do ekstrakcji informacji z tekstów w języku polskim
Project type:	A national Ministry of Science and Higher Education research grant (number 3T11C00727)
Duration:	20 October 2004 ‒ 19 October 2007
Principal investigator:	Agnieszka Mykowiecka
Institution:	Linguistic Engineering Group, Institute of Computer Science, Polish Academy of Sciences

Project description

Motivations:

not many efforts on IE on Polish texts in contrast to many existing applications for many languages,
existing IE tools could not be directly used for processing Polish.

Goals:

adapting chosen IE tools for processing Polish,
collecting some linguistic resources for IE.

Activities:

adapting IE platforms SProUT and (recently) GATE for tokenization and morphological analysis of Polish texts,
collecting resourses and IE grammars for named entities recognition (NER) in Polish texts,
ruled based IE experiments in a selected domain (medical texts),
testing methods of terminology extraction on Polish data.

-  ⇤ ← Revision 1 as of 2011-03-25 15:19:00 → 
  Size: 563
  Editor: MaciejOgrodniczuk
  Comment:
+   ← Revision 8 as of 2019-10-08 15:36:06 → ⇥
  Size: 1303
  Editor: MaciejOgrodniczuk
  Comment:
-Deletions are marked like this.
+Additions are marked like this.
 Line 1:
+#acl +All:read Default
-Line 2:
+Line 3:
-= ATLAS project =
-Line 8:
+Line 7:
-|| Project type:         || A national [[|http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] research grant (number 3T11C00727) ||
+|| Project type:         || A national [[http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] research grant (number 3T11C00727) ||
-Line 10:
+Line 9:
-|| Project Web page:     || ??? ||
-Line 12:
+Line 10:
+|| Institution:          || Linguistic Engineering Group, Institute of Computer Science, Polish Academy of Sciences ||
 Line 15:
-???
+Motivations:
 * not many efforts on IE on Polish texts in contrast to many existing applications for many languages,
 * existing IE tools could not be directly used for processing Polish.

Goals:
 * adapting chosen IE tools for processing Polish,
 * collecting some linguistic resources for IE.

Activities:
 * adapting IE platforms SProUT and (recently) GATE for tokenization and morphological analysis of Polish texts,
 * collecting resourses and IE grammars for named entities recognition (NER) in Polish texts,
 * ruled based IE experiments in a selected domain (medical texts),
 * testing methods of terminology extraction on Polish data.

Diff for "Information Extraction from Polish free text"

Menu

Information Extraction from Polish free text

Project factsheet

Project description