Size: 54
Comment:
|
Size: 3310
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
= Słownik wielowyrazowych terminów ekonomicznych = | #acl +All:read Default = Grammatical Lexicon of Polish Economic Phraseology = The Grammatical Lexicon of Polish Economic Phraseology ('''SEJFEK''' – '''Słownik Elektroniczny Jednostek Frazeologicznych z EKonomii''') is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project. Some aspects of its construction, contents and use have been described in: * GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): ''[[http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf|Computational Lexicography of Multi-Word Units: How Efficient Can It Be?]]'', in Proceedings of Multiword Expressions: from Theory to Applications ([[http://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_30_MWE_2010___lb__COLING__rb__|MWE 2010]]), Workshop at COLING 2010, Beijing, China, August 28. * SAVARY, A., ZABOROWSKI, B., KRAWCZYK-WIECZOREK, A., MAKOWIECKI, F. (2012): ''[[http://www.info.univ-tours.fr/~savary/Papers/sav-wie-zab-mak-cogalex-2012.zip|SEJFEK — a Lexicon and a Shallow Grammar of Polish Economic Multi-Word Units]]'', in Proceedings of Cognitive Aspects of the Lexicon ([[http://pageperso.lif.univ-mrs.fr/~michael.zock/cogalex-3.html|COGALEX-III]]), a Workshop at COLING 2012, Mumbai, India. The lexicon contains: * 11,212 multi-word nominal lexemes (e.g. ''aktywne ryzyko płynności''), * 146,861 corresponding inflected forms (e.g. ''aktywnego ryzyka płynności''), * 305 graph-based inflection paradigms. See also [[http://zil.ipipan.waw.pl/SEJFEK4Spejd|SEJFEK4Spejd]] – a shallow grammar for [[http://zil.ipipan.waw.pl/Spejd|Spejd]] with fully lexicalized rules automatically generated from SEJFEK lexicon entries. == Authors == * Filip Makowiecki – lexicography * [[http://www.info.univ-tours.fr/~savary/English/indexgb.html|Agata Savary]] – automatic inflection and validation == Tools == The lexicon has been created within [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]], tool for developping and managing inflectional dictionaries of multi-word units. Toposław integrates: * [[http://sgjp.pl/morfeusz/|Morfeusz SGJP]] – a morphological analyser and generator of Polish, * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]] – a morpho-syntactic generator of multi-word units, * graph editor stemming from [[http://igm.univ-mlv.fr/~unitex/|Unitex]]. == License == The data are available under the [[http://creativecommons.org/licenses/by-sa/3.0/|CC BY-SA license]]. == Available resources == * [[attachment:Slownik.tar.gz|Slownik]] – the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]]-compatible [[attachment:SEJFEK.tar.gz|archive]] containing: * the list of morphologically annotated lexemes, * the list of corresponding inflected forms and variants, * inflection graphs compatible with [[http://igm.univ-mlv.fr/~unitex/|Unitex]] graph editor, * list of known problems. == Future work == Defining an [[http://www.lexicalmarkupframework.org/|LMF]] format for the lexicon. |
Grammatical Lexicon of Polish Economic Phraseology
The Grammatical Lexicon of Polish Economic Phraseology (SEJFEK – Słownik Elektroniczny Jednostek Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF Nekst project.
Some aspects of its construction, contents and use have been described in:
GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): Computational Lexicography of Multi-Word Units: How Efficient Can It Be?, in Proceedings of Multiword Expressions: from Theory to Applications (MWE 2010), Workshop at COLING 2010, Beijing, China, August 28.
SAVARY, A., ZABOROWSKI, B., KRAWCZYK-WIECZOREK, A., MAKOWIECKI, F. (2012): SEJFEK — a Lexicon and a Shallow Grammar of Polish Economic Multi-Word Units, in Proceedings of Cognitive Aspects of the Lexicon (COGALEX-III), a Workshop at COLING 2012, Mumbai, India.
The lexicon contains:
11,212 multi-word nominal lexemes (e.g. aktywne ryzyko płynności),
146,861 corresponding inflected forms (e.g. aktywnego ryzyka płynności),
- 305 graph-based inflection paradigms.
See also SEJFEK4Spejd – a shallow grammar for Spejd with fully lexicalized rules automatically generated from SEJFEK lexicon entries.
Authors
- Filip Makowiecki – lexicography
Agata Savary – automatic inflection and validation
Tools
The lexicon has been created within Toposław, tool for developping and managing inflectional dictionaries of multi-word units. Toposław integrates:
Morfeusz SGJP – a morphological analyser and generator of Polish,
Multiflex – a morpho-syntactic generator of multi-word units,
graph editor stemming from Unitex.
License
The data are available under the CC BY-SA license.
Available resources
Multiflex-compatible archive containing:
- the list of morphologically annotated lexemes,
- the list of corresponding inflected forms and variants,
inflection graphs compatible with Unitex graph editor,
- list of known problems.
Future work
Defining an LMF format for the lexicon.