Locked History Actions

Diff for "Information Extraction from Polish free text"

Differences between revisions 1 and 8 (spanning 7 versions)
Revision 1 as of 2011-03-25 15:19:00
Size: 563
Comment:
Revision 8 as of 2019-10-08 15:36:06
Size: 1303
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
#acl +All:read Default
Line 2: Line 3:

= ATLAS project =
Line 8: Line 7:
|| Project type: || A national [[|http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] research grant (number 3T11C00727) || || Project type: || A national [[http://www.eng.nauka.gov.pl/meinen/|Ministry of Science and Higher Education]] research grant (number 3T11C00727) ||
Line 10: Line 9:
|| Project Web page: || ??? ||
Line 12: Line 10:
|| Institution: || Linguistic Engineering Group, Institute of Computer Science, Polish Academy of Sciences ||
Line 15: Line 15:
??? Motivations:
 * not many efforts on IE on Polish texts in contrast to many existing applications for many languages,
 * existing IE tools could not be directly used for processing Polish.

Goals:
 * adapting chosen IE tools for processing Polish,
 * collecting some linguistic resources for IE.

Activities:
 * adapting IE platforms SProUT and (recently) GATE for tokenization and morphological analysis of Polish texts,
 * collecting resourses and IE grammars for named entities recognition (NER) in Polish texts,
 * ruled based IE experiments in a selected domain (medical texts),
 * testing methods of terminology extraction on Polish data.

Information Extraction from Polish free text

Project factsheet

Polish name:

Opracowanie narzędzi do ekstrakcji informacji z tekstów w języku polskim

Project type:

A national Ministry of Science and Higher Education research grant (number 3T11C00727)

Duration:

20 October 2004 ‒ 19 October 2007

Principal investigator:

Agnieszka Mykowiecka

Institution:

Linguistic Engineering Group, Institute of Computer Science, Polish Academy of Sciences

Project description

Motivations:

  • not many efforts on IE on Polish texts in contrast to many existing applications for many languages,
  • existing IE tools could not be directly used for processing Polish.

Goals:

  • adapting chosen IE tools for processing Polish,
  • collecting some linguistic resources for IE.

Activities:

  • adapting IE platforms SProUT and (recently) GATE for tokenization and morphological analysis of Polish texts,
  • collecting resourses and IE grammars for named entities recognition (NER) in Polish texts,
  • ruled based IE experiments in a selected domain (medical texts),
  • testing methods of terminology extraction on Polish data.