<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>SEJF</title><revhistory><revision><revnumber>36</revnumber><date>2021-04-28 09:58:19</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>35</revnumber><date>2021-04-28 09:57:34</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>34</revnumber><date>2015-08-25 20:15:22</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>33</revnumber><date>2015-08-25 20:13:27</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>32</revnumber><date>2015-08-25 20:12:34</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>31</revnumber><date>2015-08-25 20:09:32</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>30</revnumber><date>2015-08-25 20:04:41</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>29</revnumber><date>2015-08-25 14:27:30</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>28</revnumber><date>2015-08-25 14:22:01</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>27</revnumber><date>2015-08-25 14:21:03</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>26</revnumber><date>2015-08-25 14:19:23</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>25</revnumber><date>2014-06-26 15:18:28</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>24</revnumber><date>2014-06-26 15:17:48</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>23</revnumber><date>2014-06-26 15:11:51</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>22</revnumber><date>2014-06-26 
15:09:54</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>21</revnumber><date>2012-09-05 09:49:46</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>20</revnumber><date>2012-07-23 11:31:53</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>19</revnumber><date>2012-07-23 11:26:31</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>18</revnumber><date>2012-07-23 11:23:28</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>17</revnumber><date>2012-07-23 11:20:19</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>16</revnumber><date>2012-07-23 11:19:46</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>15</revnumber><date>2012-07-20 11:36:51</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>14</revnumber><date>2012-07-20 10:28:31</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>13</revnumber><date>2012-07-19 16:41:54</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>12</revnumber><date>2012-07-19 16:27:58</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>11</revnumber><date>2012-07-19 16:25:58</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>10</revnumber><date>2012-07-19 16:25:11</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>9</revnumber><date>2012-07-19 16:24:43</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>8</revnumber><date>2012-07-19 16:23:45</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>7</revnumber><date>2012-07-19 16:23:29</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>6</revnumber><date>2012-07-19 
16:22:38</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>5</revnumber><date>2012-07-19 16:22:07</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>4</revnumber><date>2012-07-19 16:13:46</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>3</revnumber><date>2012-07-19 15:45:06</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>2</revnumber><date>2012-06-19 22:57:24</date><authorinitials>MichalLenart</authorinitials></revision><revision><revnumber>1</revnumber><date>2012-06-19 14:26:38</date><authorinitials>MichalLenart</authorinitials></revision></revhistory></articleinfo><section><title>Grammatical Lexicon of Polish Phraseology</title><para>The Grammatical Lexicon of Polish Phraseology (SEJF = Słownik Elektroniczny Jednostek Frazeologicznych) is an electronic lexicon of multi-word units (mainly nominal, adjectival and adverbial compounds) of general (non-terminological) Polish. It was created within the ERDF <ulink url="http://zil.ipipan.waw.pl/NEKST">Nekst</ulink> project and the IC1207 COST action <ulink url="http://www.parseme.eu">PARSEME</ulink>. </para><para>Some aspects of its construction, contents and use are described in: </para><itemizedlist><listitem><para>GRALIŃSKI, F., SAVARY, A., CZEREPOWICKA, M., MAKOWIECKI, F. (2010): <emphasis><ulink url="http://multiword.sourceforge.net/CONF_30_MWE_2010___lb__COLING__rb__/CONF_50_Online_Proceedings/pdf/MWE01.pdf">Computational Lexicography of Multi-Word Units: How Efficient Can It Be?</ulink></emphasis>, in Proceedings of Multiword Expressions: from Theory to Applications (MWE 2010), Workshop at COLING 2010, Beijing, China, August 28. </para></listitem><listitem><para>CZEREPOWICKA, M., KOSEK, I. 
(2011): <emphasis>Problemy opisu związków frazeologicznych w formalizmie „Multifleks” (na przykładzie rodzaju wyrażeń frazeologicznych)</emphasis>, in Kopcińska, D., Bańko, M. (eds.) &quot;Różne formy, różne treści&quot;, pp. 117–126, Warszawa 2011. </para></listitem><listitem><para>CZEREPOWICKA, M. (2011): <emphasis>„Toposław” jako narzędzie znakowania jednostek wieloczłonowych</emphasis>, in Matusiak-Kempa, I., Przybyszewski, S. (eds.) Nowe zjawiska w języku, tekście, komunikacji. Kontekst a komunikacja, Olsztyn, pp. 28–35. </para></listitem><listitem><para>CZEREPOWICKA, M. (2014): <emphasis>Jednostki obce w słowniku języka polskiego na przykładzie „Słownika elektronicznego jednostek frazeologicznych” (SEJF)</emphasis>, in <ulink url="http://www.akademicka.pl/index.php?detale=1&amp;e=1&amp;a=1&amp;id=100000087">LingVaria (IX), vol. 1 (17)</ulink>, pp. 59–68 [doi: 10.12797/LV.09.2014.17.04]. </para></listitem><listitem><para>CZEREPOWICKA, M. (2014): <emphasis>SEJF - Słownik elektroniczny jednostek frazeologicznych</emphasis>, in <ulink url="http://www.jezyk-polski.pl">Język Polski</ulink> (XCIV), vol. 2, pp. 116–129. </para></listitem><listitem><para>CZEREPOWICKA, M., SAVARY, A. (2018): <emphasis><ulink url="https://link.springer.com/chapter/10.1007/978-3-319-93782-3_5">SEJF - A Grammatical Lexicon of Polish Multiword Expressions</ulink></emphasis>, in Vetulani, Z., Mariani, J., Kubis, M. (eds.) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2015. Lecture Notes in Computer Science, vol. 10930. Springer, Cham. </para></listitem></itemizedlist><para>The lexicon contains about 3,200 multi-word lexemes, 68,000 corresponding inflected forms, and 160 graph-based inflection paradigms, with the following distribution: </para><itemizedlist><listitem><para>3,705 nominal compounds (e.g. <emphasis>bajońskie sumy</emphasis>), </para></listitem><listitem><para>422 adjectival compounds (e.g. 
<emphasis>prosty jak strzała</emphasis>, <emphasis>wprost proporcjonalny</emphasis>), </para></listitem><listitem><para>609 adverbial compounds (e.g. <emphasis>chcąc nie chcąc</emphasis>), </para></listitem><listitem><para>40 others (e.g. <emphasis>ni z gruszki, ni z pietruszki</emphasis>). </para></listitem></itemizedlist><section><title>Authors</title><itemizedlist><listitem><para><ulink url="http://www.uwm.edu.pl/polonistyka/index.php?option=com_content&amp;view=article&amp;id=95&amp;catid=50&amp;Itemid=9">Monika Czerepowicka</ulink> - lexicography </para></listitem><listitem><para><ulink url="http://www.info.univ-tours.fr/~savary/English/indexgb.html">Agata Savary</ulink> - automatic inflection and validation </para></listitem></itemizedlist></section><section><title>Tools</title><para>The lexicon was created with <ulink url="http://zil.ipipan.waw.pl/Toposlaw">Toposław</ulink>, a tool for developing and managing inflectional dictionaries of multi-word units. Toposław integrates: </para><itemizedlist><listitem><para><ulink url="http://sgjp.pl/morfeusz/">Morfeusz SGJP</ulink> -- a morphological analyser and generator for Polish, </para></listitem><listitem><para><ulink url="http://www.springerlink.com/content/n265j22n73084433/">Multiflex</ulink> -- a morpho-syntactic generator of multi-word units, </para></listitem><listitem><para>a graph editor derived from <ulink url="http://igm.univ-mlv.fr/~unitex/">Unitex</ulink>. </para></listitem></itemizedlist></section><section><title>License</title><para>The data are available under the <ulink url="http://creativecommons.org/licenses/by-sa/3.0/">CC BY-SA 3.0 license</ulink>. 
</para></section><section><title>Available resources</title><itemizedlist><listitem><para>SEJF version 1.1 </para><itemizedlist><listitem><para><ulink url="http://zil.ipipan.waw.pl/SEJF/SEJF?action=AttachFile&amp;do=get&amp;target=SEJF-1.1-Slownik.tar.gz">Slownik</ulink> -- the binary source file in <ulink url="http://zil.ipipan.waw.pl/Toposlaw">Toposław</ulink> format </para></listitem><listitem><para><ulink url="http://www.springerlink.com/content/n265j22n73084433/">Multiflex</ulink>-compatible <ulink url="http://zil.ipipan.waw.pl/SEJF/SEJF?action=AttachFile&amp;do=get&amp;target=SEJF-1.1.tar.gz">archive</ulink> including: </para><itemizedlist><listitem><para>the list of morphologically annotated lexemes, </para></listitem><listitem><para>the list of corresponding inflected forms and variants, </para></listitem><listitem><para>inflection graphs compatible with the <ulink url="http://igm.univ-mlv.fr/~unitex/">Unitex</ulink> graph editor, </para></listitem><listitem><para>a list of known problems, </para></listitem><listitem><para>a README.txt file. 
</para></listitem></itemizedlist></listitem></itemizedlist></listitem><listitem><para>SEJF version 1.0: about 3,200 multi-word lexemes (2,121 nominal, 446 adjectival, 604 adverbial, 43 others), 68,000 corresponding inflected forms, and 160 graph-based inflection paradigms </para><itemizedlist><listitem><para><ulink url="http://zil.ipipan.waw.pl/SEJF/SEJF?action=AttachFile&amp;do=get&amp;target=Slownik.tar.gz">Slownik</ulink> -- the binary source file in <ulink url="http://zil.ipipan.waw.pl/Toposlaw">Toposław</ulink> format </para></listitem><listitem><para><ulink url="http://www.springerlink.com/content/n265j22n73084433/">Multiflex</ulink>-compatible <ulink url="http://zil.ipipan.waw.pl/SEJF/SEJF?action=AttachFile&amp;do=get&amp;target=SEJF.tar.gz">archive</ulink> including: </para><itemizedlist><listitem><para>the list of morphologically annotated lexemes, </para></listitem><listitem><para>the list of corresponding inflected forms and variants, </para></listitem><listitem><para>inflection graphs compatible with the <ulink url="http://igm.univ-mlv.fr/~unitex/">Unitex</ulink> graph editor, </para></listitem><listitem><para>a list of known problems, </para></listitem><listitem><para>a README.txt file. </para></listitem></itemizedlist></listitem></itemizedlist></listitem></itemizedlist></section><section><title>Future work</title><para>Define an <ulink url="http://www.lexicalmarkupframework.org/">LMF</ulink> (Lexical Markup Framework) format for the lexicon. </para></section></section></article>