Locked History Actions

Diff for "SEJFEK"

Differences between revisions 3 and 4
Revision 3 as of 2012-06-25 23:50:50
Size: 109
Editor: MichalLenart
Comment:
Revision 4 as of 2012-07-20 17:21:57
Size: 2575
Editor: AgataSavary
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
## page was renamed from SWTE
Line 3: Line 2:
= Słownik wielowyrazowych terminów ekonomicznych = = Grammatical Lexicon of Polish Economical Phraseology =

The Grammatical Lexicon of Polish Economical Phraseology (SEJFEK – Słownik Elektroniczny Języka polskiego dla wyrażeń Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economical and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project.
      
Some aspects of its construction, contents and use have been described in:

 * GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): ''[[http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf|Computational Lexicography of Multi-Word Units: How Efficient Can It Be?]]'', in Proceedings of Multiword Expressions: from Theory to Applications (MWE 2010), Workshop at COLING 2010, Beijing, China, August 28.

The lexicon contains:
 * 11,212 multi-word nominal lexemes (e.g. ''aktywne ryzyko płynności''),
 * 146,861 corresponding inflected forms (e.g. ''aktywnego ryzyka płynności''),
 * 305 graph-based inflection paradigms.

== Authors ==
 * Filip Makowiecki - lexicography
 * [[http://www.info.univ-tours.fr/~savary/English/indexgb.html|Agata Savary]] - automatic inflection and validation
 
== Tools ==
The lexicon has been created within [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]], tool for developping and managing inflectional dictionaries of multi-word units. Toposław integrates:
 * [[http://sgjp.pl/morfeusz/|Morfeusz SGJP]] -- a morphological analyser and generator of Polish,
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]] -- a morpho-syntactic generator of multi-word units,
 * graph editor stemming from [[http://igm.univ-mlv.fr/~unitex/|Unitex]].

== License ==

The data are available under the [[http://creativecommons.org/licenses/by-sa/3.0/|CC BY-SA license]].

== Available resources ==

 * [[attachment:Slownik.zip|Slownik]] -- the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]]-compatible [[attachment:SEJFEK.zip|archive]] containing:
   * the list of morphologically annotated lexemes,
   * the list of corresponding inflected forms and variants,
   * inflection graphs compatible with [[http://igm.univ-mlv.fr/~unitex/|Unitex]] graph editor,
   * list of known problems.

== Future work ==

Defining an [[http://www.lexicalmarkupframework.org/|LMF]] format for the lexicon.

Grammatical Lexicon of Polish Economical Phraseology

The Grammatical Lexicon of Polish Economical Phraseology (SEJFEK – Słownik Elektroniczny Języka polskiego dla wyrażeń Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economical and financial terminology. It has been created within the ERDF Nekst project.

Some aspects of its construction, contents and use have been described in:

The lexicon contains:

  • 11,212 multi-word nominal lexemes (e.g. aktywne ryzyko płynności),

  • 146,861 corresponding inflected forms (e.g. aktywnego ryzyka płynności),

  • 305 graph-based inflection paradigms.

Authors

  • Filip Makowiecki - lexicography
  • Agata Savary - automatic inflection and validation

Tools

The lexicon has been created within Toposław, tool for developping and managing inflectional dictionaries of multi-word units. Toposław integrates:

  • Morfeusz SGJP -- a morphological analyser and generator of Polish,

  • Multiflex -- a morpho-syntactic generator of multi-word units,

  • graph editor stemming from Unitex.

License

The data are available under the CC BY-SA license.

Available resources

  • Slownik -- the binary source file in Toposław format

  • Multiflex-compatible archive containing:

    • the list of morphologically annotated lexemes,
    • the list of corresponding inflected forms and variants,
    • inflection graphs compatible with Unitex graph editor,

    • list of known problems.

Future work

Defining an LMF format for the lexicon.