Diff for "SEJFEK"

Differences between revisions 2 and 26 (spanning 24 versions)
Revision 2 as of 2012-06-19 22:57:14
Size: 78
Editor: MichalLenart
Comment:
Revision 26 as of 2021-04-28 10:09:24
Size: 3497
Editor: AgataSavary
Comment:
= Grammatical Lexicon of Polish Economic Phraseology =

The Grammatical Lexicon of Polish Economic Phraseology ('''SEJFEK''' – '''Słownik Elektroniczny Jednostek Frazeologicznych z EKonomii''', i.e. "Electronic Dictionary of Phraseological Units from Economics") is an electronic lexicon of multi-word nominal terms from Polish economic and financial terminology. It was created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project.
      
Some aspects of its construction, contents and use have been described in:

 * GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): ''[[http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf|Computational Lexicography of Multi-Word Units: How Efficient Can It Be?]]'', in Proceedings of Multiword Expressions: from Theory to Applications ([[http://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_30_MWE_2010___lb__COLING__rb__|MWE 2010]]), Workshop at COLING 2010, Beijing, China, August 28.
 * SAVARY, A., ZABOROWSKI, B., KRAWCZYK-WIECZOREK, A., MAKOWIECKI, F. (2012): ''[[http://aclweb.org/anthology//W/W12/W12-5116.pdf|SEJFEK — a Lexicon and a Shallow Grammar of Polish Economic Multi-Word Units]]'', in Proceedings of Cognitive Aspects of the Lexicon ([[http://pageperso.lif.univ-mrs.fr/~michael.zock/cogalex-3.html|COGALEX-III]]), a Workshop at COLING 2012, Mumbai, India.

The lexicon contains:
 * 11,212 multi-word nominal lexemes (e.g. ''aktywne ryzyko płynności''),
 * 146,861 corresponding inflected forms (e.g. ''aktywnego ryzyka płynności''),
 * 305 graph-based inflection paradigms.
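
To put the counts above in proportion, the short Python sketch below computes the average number of inflected forms per lexeme. The two constants are copied from the list above; the function name is only illustrative and is not part of any SEJFEK tooling.

```python
# Counts as reported in the list above.
LEXEMES = 11_212
INFLECTED_FORMS = 146_861


def average_forms_per_lexeme(forms: int, lexemes: int) -> float:
    """Return the mean number of inflected forms per lexeme."""
    if lexemes <= 0:
        raise ValueError("lexemes must be positive")
    return forms / lexemes


ratio = average_forms_per_lexeme(INFLECTED_FORMS, LEXEMES)
print(f"{ratio:.1f} inflected forms per lexeme on average")
```

The result, roughly 13.1, is close to the 14 case-number combinations (7 cases × 2 numbers) of a Polish noun.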

See also [[http://zil.ipipan.waw.pl/SEJFEK4Spejd|SEJFEK4Spejd]] – a shallow grammar for [[http://zil.ipipan.waw.pl/Spejd|Spejd]] with fully lexicalized rules automatically generated from SEJFEK lexicon entries.

== Authors ==
 * Filip Makowiecki – lexicography
 * [[https://ijp.pan.pl/pracownicy/aleksandra-wieczorek/|Aleksandra Krawczyk-Wieczorek]] – lexicon-grammar conversion
 * [[http://www.info.univ-tours.fr/~savary/English/indexgb.html|Agata Savary]] – automatic inflection and validation
 * [[http://zil.ipipan.waw.pl/BartoszZaborowski|Bartosz Zaborowski]] – lexicon-grammar conversion
 
== Tools ==
The lexicon was created within [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]], a tool for developing and managing inflectional dictionaries of multi-word units. Toposław integrates:
 * [[http://sgjp.pl/morfeusz/|Morfeusz SGJP]] – a morphological analyser and generator of Polish,
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]] – a morpho-syntactic generator of multi-word units,
 * a graph editor derived from [[http://igm.univ-mlv.fr/~unitex/|Unitex]].

== License ==

The data are available under the [[http://creativecommons.org/licenses/by-sa/3.0/|CC BY-SA license]].

== Available resources ==

 * [[attachment:Slownik.tar.gz|Slownik]] – the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format
 * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]]-compatible [[attachment:SEJFEK.tar.gz|archive]] containing:
   * the list of morphologically annotated lexemes,
   * the list of corresponding inflected forms and variants,
   * inflection graphs compatible with the [[http://igm.univ-mlv.fr/~unitex/|Unitex]] graph editor,
   * a list of known problems.
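
As a minimal sketch of how a downloaded archive might be inspected before unpacking, the Python function below lists the members of a gzip-compressed tar file using only the standard library. It assumes the attachment has been saved locally as `SEJFEK.tar.gz`; the function name is hypothetical, and the internal layout of the archive is not documented here, so the code only enumerates whatever member names it finds.

```python
import tarfile


def list_archive_members(archive_path: str) -> list[str]:
    """Return the names of all members in a gzip-compressed tar archive."""
    with tarfile.open(archive_path, mode="r:gz") as archive:
        return archive.getnames()


# Example use (assumes the archive was saved in the current directory):
#     for name in list_archive_members("SEJFEK.tar.gz"):
#         print(name)
```

Listing the members first makes it easier to tell which files hold the lexemes, the inflected forms and the inflection graphs before extracting anything.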

== Future work ==

Defining an [[http://www.lexicalmarkupframework.org/|LMF]] format for the lexicon.
