Locked History Actions

Diff for "SEJFEK"

Differences between revisions 6 and 22 (spanning 16 versions)
Revision 6 as of 2012-07-20 17:27:35
Size: 2581
Editor: AgataSavary
Comment:
Revision 22 as of 2013-01-23 10:40:59
Size: 3321
Editor: AgataSavary
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
= Grammatical Lexicon of Polish Economical Phraseology = = Grammatical Lexicon of Polish Economic Phraseology =
Line 4: Line 4:
The Grammatical Lexicon of Polish Economical Phraseology (SEJFEK – Słownik Elektroniczny Języka polskiego dla wyrażeń Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economical and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project. The Grammatical Lexicon of Polish Economic Phraseology ('''SEJFEK''''''Słownik Elektroniczny Jednostek Frazeologicznych z EKonomii''') is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project.  
Line 8: Line 8:
 * GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): ''[[http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf|Computational Lexicography of Multi-Word Units: How Efficient Can It Be?]]'', in Proceedings of Multiword Expressions: from Theory to Applications (MWE 2010), Workshop at COLING 2010, Beijing, China, August 28.  * GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): ''[[http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf|Computational Lexicography of Multi-Word Units: How Efficient Can It Be?]]'', in Proceedings of Multiword Expressions: from Theory to Applications ([[http://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_30_MWE_2010___lb__COLING__rb__|MWE 2010]]), Workshop at COLING 2010, Beijing, China, August 28.
 * SAVARY, A., ZABOROWSKI, B., KRAWCZYK-WIECZOREK, A., MAKOWIECKI, F. (2012): ''[[attachment:http://www.info.univ-tours.fr/~savary/Papers/sav-wie-zab-mak-cogalex-2012.zip|SEJFEK — a Lexicon and a Shallow Grammar of Polish Economic Multi-Word Units]]'', in Proceedings of Cognitive Aspects of the Lexicon ([[http://pageperso.lif.univ-mrs.fr/~michael.zock/cogalex-3.html|COGALEX-III]]), a Workshop at COLING 2012, Mumbai, India.
Line 15: Line 16:
See also [[http://zil.ipipan.waw.pl/SEJFEK4Spejd|SEJFEK4Spejd]] – a shallow grammar for [[http://zil.ipipan.waw.pl/Spejd|Spejd]] with fully lexicalized rules automatically generated from SEJFEK lexicon entries.
Line 16: Line 19:
 * Filip Makowiecki - lexicography
 * [[http://www.info.univ-tours.fr/~savary/English/indexgb.html|Agata Savary]] - automatic inflection and validation
 * Filip Makowiecki lexicography
 * [[http://www.info.univ-tours.fr/~savary/English/indexgb.html|Agata Savary]] automatic inflection and validation
Line 21: Line 24:
 * [[http://sgjp.pl/morfeusz/|Morfeusz SGJP]] -- a morphological analyser and generator of Polish,
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]] -- a morpho-syntactic generator of multi-word units,
 * [[http://sgjp.pl/morfeusz/|Morfeusz SGJP]] a morphological analyser and generator of Polish,
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]] – a morpho-syntactic generator of multi-word units,
Line 31: Line 34:
 * [[attachment:Slownik.tar.gz|Slownik]] -- the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format  * [[attachment:Slownik.tar.gz|Slownik]] the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format

Grammatical Lexicon of Polish Economic Phraseology

The Grammatical Lexicon of Polish Economic Phraseology (SEJFEKSłownik Elektroniczny Jednostek Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF Nekst project.

Some aspects of its construction, contents and use have been described in:

The lexicon contains:

  • 11,212 multi-word nominal lexemes (e.g. aktywne ryzyko płynności),

  • 146,861 corresponding inflected forms (e.g. aktywnego ryzyka płynności),

  • 305 graph-based inflection paradigms.

See also SEJFEK4Spejd – a shallow grammar for Spejd with fully lexicalized rules automatically generated from SEJFEK lexicon entries.

Authors

  • Filip Makowiecki – lexicography
  • Agata Savary – automatic inflection and validation

Tools

The lexicon has been created within Toposław, tool for developping and managing inflectional dictionaries of multi-word units. Toposław integrates:

  • Morfeusz SGJP – a morphological analyser and generator of Polish,

  • Multiflex – a morpho-syntactic generator of multi-word units,

  • graph editor stemming from Unitex.

License

The data are available under the CC BY-SA license.

Available resources

  • Slownik – the binary source file in Toposław format

  • Multiflex-compatible archive containing:

    • the list of morphologically annotated lexemes,
    • the list of corresponding inflected forms and variants,
    • inflection graphs compatible with Unitex graph editor,

    • list of known problems.

Future work

Defining an LMF format for the lexicon.