Locked History Actions

Diff for "SEJFEK"

Differences between revisions 4 and 10 (spanning 6 versions)
Revision 4 as of 2012-07-20 17:21:57
Size: 2575
Editor: AgataSavary
Comment:
Revision 10 as of 2012-10-16 17:03:09
Size: 2781
Editor: AgataSavary
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
= Grammatical Lexicon of Polish Economical Phraseology = = Grammatical Lexicon of Polish Economic Phraseology =
Line 4: Line 4:
The Grammatical Lexicon of Polish Economical Phraseology (SEJFEK – Słownik Elektroniczny Języka polskiego dla wyrażeń Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economical and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project. The Grammatical Lexicon of Polish Economic Phraseology ('''SEJFEK''''''Słownik Elektroniczny Jednostek Frazeologicznych z EKonomii''') is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project.  
Line 14: Line 14:

See also [[http://zil.ipipan.waw.pl/SEJFEK4Spejd|SEJFEK4Spejd]] - a shallow grammar for [[http://zil.ipipan.waw.pl/Spejd|Spejd]] with fully lexicalized rules automatically generated from SEJFEK lexicon entries.
Line 31: Line 33:
 * [[attachment:Slownik.zip|Slownik]] -- the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]]-compatible [[attachment:SEJFEK.zip|archive]] containing:
 * [[attachment:Slownik.tar.gz|Slownik]] -- the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]]-compatible [[attachment:SEJFEK.tar.gz|archive]] containing:

Grammatical Lexicon of Polish Economic Phraseology

The Grammatical Lexicon of Polish Economic Phraseology (SEJFEKSłownik Elektroniczny Jednostek Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF Nekst project.

Some aspects of its construction, contents and use have been described in:

The lexicon contains:

  • 11,212 multi-word nominal lexemes (e.g. aktywne ryzyko płynności),

  • 146,861 corresponding inflected forms (e.g. aktywnego ryzyka płynności),

  • 305 graph-based inflection paradigms.

See also SEJFEK4Spejd - a shallow grammar for Spejd with fully lexicalized rules automatically generated from SEJFEK lexicon entries.

Authors

  • Filip Makowiecki - lexicography
  • Agata Savary - automatic inflection and validation

Tools

The lexicon has been created within Toposław, tool for developping and managing inflectional dictionaries of multi-word units. Toposław integrates:

  • Morfeusz SGJP -- a morphological analyser and generator of Polish,

  • Multiflex -- a morpho-syntactic generator of multi-word units,

  • graph editor stemming from Unitex.

License

The data are available under the CC BY-SA license.

Available resources

  • Slownik -- the binary source file in Toposław format

  • Multiflex-compatible archive containing:

    • the list of morphologically annotated lexemes,
    • the list of corresponding inflected forms and variants,
    • inflection graphs compatible with Unitex graph editor,

    • list of known problems.

Future work

Defining an LMF format for the lexicon.