Size: 2578
Comment:
|
← Revision 26 as of 2021-04-28 10:09:24 ⇥
Size: 3497
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 2: | Line 2: |
= Grammatical Lexicon of Polish Economical Phraseology = | = Grammatical Lexicon of Polish Economic Phraseology = |
Line 4: | Line 4: |
The Grammatical Lexicon of Polish Economical Phraseology (SEJFEK – Słownik Elektroniczny Języka polskiego dla wyrażeń Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economical and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project. | The Grammatical Lexicon of Polish Economic Phraseology ('''SEJFEK''' – '''Słownik Elektroniczny Jednostek Frazeologicznych z EKonomii''') is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF [[http://zil.ipipan.waw.pl/NEKST|Nekst]] project. |
Line 8: | Line 8: |
* GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): ''[[http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf|Computational Lexicography of Multi-Word Units: How Efficient Can It Be?]]'', in Proceedings of Multiword Expressions: from Theory to Applications (MWE 2010), Workshop at COLING 2010, Beijing, China, August 28. | * GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): ''[[http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf|Computational Lexicography of Multi-Word Units: How Efficient Can It Be?]]'', in Proceedings of Multiword Expressions: from Theory to Applications ([[http://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_30_MWE_2010___lb__COLING__rb__|MWE 2010]]), Workshop at COLING 2010, Beijing, China, August 28. * SAVARY, A., ZABOROWSKI, B., KRAWCZYK-WIECZOREK, A., MAKOWIECKI, F. (2012): ''[[http://aclweb.org/anthology//W/W12/W12-5116.pdf|SEJFEK — a Lexicon and a Shallow Grammar of Polish Economic Multi-Word Units]]'', in Proceedings of Cognitive Aspects of the Lexicon ([[http://pageperso.lif.univ-mrs.fr/~michael.zock/cogalex-3.html|COGALEX-III]]), a Workshop at COLING 2012, Mumbai, India. |
Line 15: | Line 16: |
See also [[http://zil.ipipan.waw.pl/SEJFEK4Spejd|SEJFEK4Spejd]] – a shallow grammar for [[http://zil.ipipan.waw.pl/Spejd|Spejd]] with fully lexicalized rules automatically generated from SEJFEK lexicon entries. |
|
Line 16: | Line 19: |
* Filip Makowiecki - lexicography * [[http://www.info.univ-tours.fr/~savary/English/indexgb.html|Agata Savary]] - automatic inflection and validation |
* Filip Makowiecki – lexicography * [[https://ijp.pan.pl/pracownicy/aleksandra-wieczorek/|Aleksandra Krawczyk-Wieczorek]] - lexicon-grammar conversion * [[http://www.info.univ-tours.fr/~savary/English/indexgb.html|Agata Savary]] – automatic inflection and validation * [[http://zil.ipipan.waw.pl/BartoszZaborowski|Bartosz Zaborowski]] - lexicon-grammar conversion |
Line 20: | Line 25: |
The lexicon has been created within [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]], tool for developping and managing inflectional dictionaries of multi-word units. Toposław integrates: * [[http://sgjp.pl/morfeusz/|Morfeusz SGJP]] -- a morphological analyser and generator of Polish, * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]] -- a morpho-syntactic generator of multi-word units, |
The lexicon has been created within [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]], tool for developing and managing inflectional dictionaries of multi-word units. Toposław integrates: * [[http://sgjp.pl/morfeusz/|Morfeusz SGJP]] – a morphological analyser and generator of Polish, * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]] – a morpho-syntactic generator of multi-word units, |
Line 31: | Line 36: |
* [[attachment:Slownik.tar.gz|Slownik]] -- the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]]-compatible [[attachment:SEJFEK.zip|archive]] containing: |
* [[attachment:Slownik.tar.gz|Slownik]] – the binary source file in [[http://zil.ipipan.waw.pl/Toposlaw|Toposław]] format * [[http://www.springerlink.com/content/n265j22n73084433/|Multiflex]]-compatible [[attachment:SEJFEK.tar.gz|archive]] containing: |
Grammatical Lexicon of Polish Economic Phraseology
The Grammatical Lexicon of Polish Economic Phraseology (SEJFEK – Słownik Elektroniczny Jednostek Frazeologicznych z EKonomii) is an electronic lexicon containing multi-word nominal terms of Polish economic and financial terminology. It has been created within the ERDF Nekst project.
Some aspects of its construction, contents and use have been described in:
GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): Computational Lexicography of Multi-Word Units: How Efficient Can It Be?, in Proceedings of Multiword Expressions: from Theory to Applications (MWE 2010), Workshop at COLING 2010, Beijing, China, August 28.
SAVARY, A., ZABOROWSKI, B., KRAWCZYK-WIECZOREK, A., MAKOWIECKI, F. (2012): SEJFEK — a Lexicon and a Shallow Grammar of Polish Economic Multi-Word Units, in Proceedings of Cognitive Aspects of the Lexicon (COGALEX-III), a Workshop at COLING 2012, Mumbai, India.
The lexicon contains:
11,212 multi-word nominal lexemes (e.g. aktywne ryzyko płynności),
146,861 corresponding inflected forms (e.g. aktywnego ryzyka płynności),
- 305 graph-based inflection paradigms.
See also SEJFEK4Spejd – a shallow grammar for Spejd with fully lexicalized rules automatically generated from SEJFEK lexicon entries.
Authors
- Filip Makowiecki – lexicography
Aleksandra Krawczyk-Wieczorek - lexicon-grammar conversion
Agata Savary – automatic inflection and validation
Bartosz Zaborowski - lexicon-grammar conversion
Tools
The lexicon has been created within Toposław, tool for developing and managing inflectional dictionaries of multi-word units. Toposław integrates:
Morfeusz SGJP – a morphological analyser and generator of Polish,
Multiflex – a morpho-syntactic generator of multi-word units,
graph editor stemming from Unitex.
License
The data are available under the CC BY-SA license.
Available resources
Multiflex-compatible archive containing:
- the list of morphologically annotated lexemes,
- the list of corresponding inflected forms and variants,
inflection graphs compatible with Unitex graph editor,
- list of known problems.
Future work
Defining an LMF format for the lexicon.