<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>Prolexbase</title><revhistory><revision><revnumber>56</revnumber><date>2015-02-11 19:37:25</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>55</revnumber><date>2015-02-11 19:31:49</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>54</revnumber><date>2015-02-11 19:30:15</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>53</revnumber><date>2015-02-11 19:27:54</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>52</revnumber><date>2015-02-11 19:27:02</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>51</revnumber><date>2015-02-11 19:25:39</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>50</revnumber><date>2015-02-11 19:22:22</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>49</revnumber><date>2013-11-26 14:24:41</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>48</revnumber><date>2013-11-26 14:20:34</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>47</revnumber><date>2013-08-01 07:48:09</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>46</revnumber><date>2013-08-01 07:47:17</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>45</revnumber><date>2013-08-01 07:46:03</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>44</revnumber><date>2013-08-01 07:44:33</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>43</revnumber><date>2013-04-15 10:43:11</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>42</revnumber><date>2013-04-12 11:58:34</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>41</revnumber><date>2013-03-18 22:45:40</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>40</revnumber><date>2013-03-17 11:16:03</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>39</revnumber><date>2013-02-04 19:12:46</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>38</revnumber><date>2013-02-04 19:11:03</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>37</revnumber><date>2013-01-23 13:43:48</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>36</revnumber><date>2013-01-23 12:21:00</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>35</revnumber><date>2013-01-23 12:02:13</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>34</revnumber><date>2013-01-23 12:01:54</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>33</revnumber><date>2013-01-23 12:00:41</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>32</revnumber><date>2013-01-23 11:57:48</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>31</revnumber><date>2013-01-23 11:57:23</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>30</revnumber><date>2013-01-23 11:57:09</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>29</revnumber><date>2013-01-23 11:20:47</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>28</revnumber><date>2013-01-23 11:14:49</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>27</revnumber><date>2013-01-23 11:14:31</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>26</revnumber><date>2013-01-23 11:11:32</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>25</revnumber><date>2013-01-23 09:36:56</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>24</revnumber><date>2013-01-23 09:34:38</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>23</revnumber><date>2013-01-23 09:22:31</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>22</revnumber><date>2013-01-23 09:21:43</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>21</revnumber><date>2013-01-23 00:36:37</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>20</revnumber><date>2013-01-23 00:35:50</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>19</revnumber><date>2013-01-23 00:12:32</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>18</revnumber><date>2013-01-22 23:44:41</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>17</revnumber><date>2013-01-22 23:12:12</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>16</revnumber><date>2013-01-22 23:10:00</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>15</revnumber><date>2013-01-22 23:09:15</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>14</revnumber><date>2013-01-22 23:08:42</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>13</revnumber><date>2013-01-22 23:07:25</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>12</revnumber><date>2013-01-22 23:06:48</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>11</revnumber><date>2013-01-22 23:05:23</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>10</revnumber><date>2013-01-22 23:02:16</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>9</revnumber><date>2013-01-22 22:50:40</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>8</revnumber><date>2013-01-22 22:49:38</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>7</revnumber><date>2013-01-22 22:44:00</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>6</revnumber><date>2013-01-22 22:24:43</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>5</revnumber><date>2013-01-22 22:19:09</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>4</revnumber><date>2013-01-22 22:18:50</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>3</revnumber><date>2013-01-22 22:16:34</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>2</revnumber><date>2013-01-22 21:54:54</date><authorinitials>AgataSavary</authorinitials></revision><revision><revnumber>1</revnumber><date>2013-01-22 21:51:33</date><authorinitials>AgataSavary</authorinitials></revision></revhistory></articleinfo><section><title>Prolexbase 2.0 and 2.2</title><para>Prolexbase 2.0 and 2.2 is a multilingual relational dictionary of proper names, conceived initially at the University of Tours, France and at the University of Belgrade, Serbia, and further developed at the Polish Academy of Sciences (IPIPAN). It contains a language-independent typology of proper names with 4 supertypes and 34 types, as well as various language-independent or language-specific relations (synonymy, meronymy accessibility, variation etc.). A pivot-oriented design of concepts yields alignment of proper names in a language with their counterparts if other languages. A large majority of the data have been extracted from <ulink url="http://pl.wikipedia.org">Wikipedia</ulink> and <ulink url="http://www.geonames.org/">GeoNames</ulink>. <emphasis>All data have been manually validated</emphasis>. </para><para>Prolexbase creation has been supported by the following projects: </para><itemizedlist><listitem><para><ulink url="http://tln.li.univ-tours.fr/Tln_Technolangue.html">Technolangue</ulink> programme from the French Ministry of Industry (2003-2005), </para></listitem><listitem><para>Egide <ulink url="http://tln.li.univ-tours.fr/Tln_Pavle_Savic.html">Pavle-Savic</ulink> programme from the French Ministry of Foreign Affairs, the French Ministry of Research, and the Serbian Ministry of Science (2004-2005),  </para></listitem><listitem><para>ERDF <ulink url="http://zil.ipipan.waw.pl/NEKST">Nekst</ulink> project (2009-2014), </para></listitem><listitem><para>European (CIP ICT-PSP) <ulink url="http://clip.ipipan.waw.pl/CESAR">CESAR</ulink> project, part of <ulink url="http://www.meta-net.eu/">META-NET</ulink> (2011-2013).  </para></listitem></itemizedlist><para>The construction and contents of Prolexbase have been described in: </para><itemizedlist><listitem><para>Savary, A., Manicki, L., Baron, M. (2013): <ulink url="http://jlm.ipipan.waw.pl/index.php/JLM/article/view/63">Populating a Multilingual Ontology of Proper Names from Open Sources</ulink>. In Journal of Language Modelling, Vol 2, No. 2, pp. 189-225. </para></listitem><listitem><para>Savary, A., Manicki, L., Baron, M. (2013): <ulink url="http://www.info.univ-tours.fr/~savary/Papers/sav-man-bar-rapport-306.pdf">ProlexFeeder - Populating a Multilingual Ontology of Proper Names from Open Sources</ulink>. Technical Report 306, Laboratoire d'Informatique, Université François Rabelais Tours, France. </para></listitem><listitem><para>Maurel, D., Bouchou, B. (2013): <emphasis>Prolmf, a Multilingual Dictionary of Proper Names and their Relations</emphasis>. In Gil Francopoulo (ed.), LMF: Lexical Markup Framework, theory and practice, Iste-Wiley, pp. 67-81. </para></listitem><listitem><para>Spędzia, M., Maurel, D., Savary, A. (2011): <emphasis><ulink url="http://zil.ipipan.waw.pl/Prolexbase/Prolexbase?action=AttachFile&amp;do=get&amp;target=prolexbase.documentation.pdf">Multilingual Relational Database of Proper Names: Prolexbase Documentation</ulink></emphasis>. Technical report #297, Laboratoire d'informatique, Université François Rabelais Tours. </para></listitem><listitem><para>Bouchou, B., Maurel, D. (2008): <emphasis><ulink url="http://www.atala.org/IMG/pdf/TAL-2008-49-1-04-Bouchou.pdf">Prolexbase et LMF : vers un standard pour les ressources lexicales sur les noms propres</ulink></emphasis>. In Traitement Automatique des Langues, 49(1). </para></listitem><listitem><para>Maurel, D. (2008): <emphasis><ulink url="http://www.lrec-conf.org/proceedings/lrec2008/summaries/91.html">Prolexbase: a Multilingual Relational Lexical Database of Proper Names</ulink></emphasis>. In proceedings of LREC 2008, Marrakech, Morocco. </para></listitem><listitem><para>Tran, M., Maurel, D. (2006): <emphasis><ulink url="http://www.atala.org/IMG/pdf/TAL-2006-47-3-06-Tran.pdf">Prolexbase. Un dictionnaire relationnel multilingue de noms propres</ulink></emphasis>. In Traitement Automatique des Langues, 47(3). </para></listitem><listitem><para>Krstev S., Vitas D., Maurel D., Tran M. (2005): <emphasis>Multilingual Ontology of Proper Names</emphasis>. In Second Language &amp; Technology Conference (LTC'05), Poznań, Poland. </para></listitem></itemizedlist><para>Prolexbase 2.0 and 2.2 contains the following interlinked data: </para><itemizedlist><listitem><para>67,000 language-independent pivots, </para></listitem><listitem><para>40,000 Polish proper names and their corresponding 165,000 inflected forms, </para></listitem><listitem><para>33,000 English proper names and their corresponding 18,000 inflected forms, </para></listitem><listitem><para>100,000 French proper names and their corresponding 142,000 inflected forms, </para></listitem><listitem><para>65,500 relations. </para></listitem></itemizedlist><para>See also <ulink url="http://www.cnrtl.fr/lexiques/prolex/">Prolexbase on CNRTL</ulink> for a proofread version (2.2), serialized in an LMF standard format. </para><section><title>Authors</title><itemizedlist><listitem><para>Małgorzata Baron - lexicography, </para></listitem><listitem><para><ulink url="http://www.info.univ-tours.fr/~bouchou/index_a.html">Béatrice Bouchou Markhoff</ulink> - LMF format design, </para></listitem><listitem><para>Leszek Manicki - design and implementation of Prolexbase population from Wikipedia,  </para></listitem><listitem><para><ulink url="http://www.univ-tours.fr/acces-rapide/m-maurel-denis-84407.kjsp">Denis Maurel</ulink> - design and dissemination, project management, </para></listitem><listitem><para><ulink url="http://www.info.univ-tours.fr/~savary/English/indexgb.html">Agata Savary</ulink> – project management, </para></listitem><listitem><para>Mickaël Tran - database design and implementation. </para></listitem><listitem><para><ulink url="http://poincare.matf.bg.ac.rs/~vitas//index-en.html">Duško Vitas</ulink> - design and management of Serbian data. </para></listitem></itemizedlist></section><section><title>Tools</title><itemizedlist><listitem><para><ulink url="http://jlm.ipipan.waw.pl/index.php/JLM/article/view/63">ProlexFeeder</ulink>, a tool for semi-automatic population of Prolexbase from open sources, notably Wikipedia and Geonames, </para></listitem><listitem><para><ulink url="http://www.translatica.pl/">Translatica</ulink>'s automatic inflection tool for multi-word units. </para></listitem></itemizedlist></section><section><title>License</title><para>All Prolexbase 2.0 data are available under the <ulink url="http://creativecommons.org/licenses/by-sa/3.0/">CC BY-SA license</ulink>, i.e. the same as for <ulink url="http://en.wikipedia.org/wiki/Wikipedia:Text_of_Creative_Commons_Attribution-ShareAlike_3.0_Unported_License">Wikipedia</ulink> and <ulink url="http://www.geonames.org/">GeoNames</ulink>. The Prolexbase 2.2 data are available under the <ulink url="http://infolingu.univ-mlv.fr/DonneesLinguistiques/Lexiques-Grammaires/lgpllr.html">LGPL-LR license</ulink>. </para></section><section><title>Available resources</title><para>For version 2.2 </para><itemizedlist><listitem><para>XML serialisation in an LMF standard format, and its documentation in French -- see <ulink url="http://www.cnrtl.fr/lexiques/prolex/">Prolexbase on CNRTL</ulink>. </para></listitem></itemizedlist><para>For version 2.0  </para><itemizedlist><listitem><para>Prolexbase <ulink url="http://zil.ipipan.waw.pl/Prolexbase/Prolexbase?action=AttachFile&amp;do=get&amp;target=prolexbase.documentation.pdf">documentation</ulink>. </para></listitem><listitem><para>Prolexbase <ulink url="http://zil.ipipan.waw.pl/Prolexbase/Prolexbase?action=AttachFile&amp;do=get&amp;target=prolexbase-schema.tar.gz">schema</ulink> description. </para></listitem><listitem><para><ulink url="http://zil.ipipan.waw.pl/Prolexbase/Prolexbase?action=AttachFile&amp;do=get&amp;target=prolexbase_en_fr_pl_20130204.sql.tar.gz">MySQL dump</ulink> file, </para></listitem><listitem><para>List of all inflected forms of <ulink url="http://zil.ipipan.waw.pl/Prolexbase/Prolexbase?action=AttachFile&amp;do=get&amp;target=prolexbase-polish-inflected.tar.gz">Polish names</ulink> together with their semantic and grammatical tags. </para></listitem></itemizedlist></section><section><title>Future work</title><itemizedlist><listitem><para>Opening a web interface for Prolexbase 2.0/2.2 navigation. </para></listitem><listitem><para>Releasing <ulink url="http://jlm.ipipan.waw.pl/index.php/JLM/article/view/63">ProlexFeeder</ulink> under an open license. </para></listitem></itemizedlist></section></section></article>