<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>OpenCYCPL</title><revhistory><revision><revnumber>5</revnumber><date>2013-01-22 12:29:28</date><authorinitials>MichalLenart</authorinitials></revision><revision><revnumber>4</revnumber><date>2013-01-22 12:28:11</date><authorinitials>MichalLenart</authorinitials></revision><revision><revnumber>3</revnumber><date>2013-01-22 12:28:00</date><authorinitials>MichalLenart</authorinitials></revision><revision><revnumber>2</revnumber><date>2013-01-22 12:27:41</date><authorinitials>AleksanderPohl</authorinitials><revremark>creation</revremark></revision><revision><revnumber>1</revnumber><date>2013-01-18 16:11:55</date><authorinitials>MichalLenart</authorinitials></revision></revhistory></articleinfo><section><title>Polish OpenCyc lexicon</title><para>The Polish OpenCyc lexicon is a set of mappings between OpenCyc symbols and their Polish counterparts.  It might be though of as a English-Polish dictionary, but since Cyc contains many abstract concepts, it is better viewed as a set of Polish names for the concepts that are present in Cyc. Due to the fact, that the latest OpenCyc version contains more than 200 thousands of concepts, the mapping covers only part of it -- the concepts that are also present in <ulink url="http://umbel.org/">Umbel</ulink>, an ontology that was developed on the basis of Cyc, but is devoid of many application-specific Cyc concepts. Still many concepts that are present in Umbel lack mapping, since they are very specific and doesn't translate into Polish (e.g. names of species found only in North America). As a result the current mapping contains approx. 16 thousands of translations for approx. 14 thousands of concepts. </para><para>The wiki of the project, an issue tracker and its  latest version can be found on <ulink url="https://github.com/apohllo/polish-cyc">the project's page on github</ulink>. </para><section><title>Downloads</title><para>The lexicon is distributed in three versions: </para><itemizedlist><listitem><para><ulink url="https://github.com/apohllo/polish-cyc/blob/master/PolishLexicon.txt">Polish OpenCyc lexicon 0.2</ulink> -- the lexicon data sorted by OpenCyc symbol name, each symbol has only one translation </para></listitem><listitem><para><ulink url="https://github.com/apohllo/polish-cyc/blob/master/PolishLexicon_multiple.txt">Polish OpenCyc lexicon 0.2 with multiple translations</ulink> --  the lexicon data sorted by Polish expressions, each symbol might have one or more translations (e.g. a name of an occupation which has different gender forms in Polish: Actor -- aktor, aktorka) </para></listitem><listitem><para><ulink url="https://github.com/apohllo/polish-cyc/blob/master/PolishLexicon.yml">Polish OpenCyc lexicon 0.2 with morphological data</ulink> -- the lexicon data sorted byt OpenCyc symbol name, with morphological data attached to the Polish translation </para></listitem></itemizedlist></section><section><title>OpenCyc</title><para><ulink url="http://opencyc.org">OpenCyc</ulink> is an open version of the <ulink url="http://cyc.com">Cyc</ulink> formal ontology.  Its <ulink url="http://www.opencyc.org/Version%204.0%20Release%20notes">current release</ulink> contains more than 2 millions of facts about more than 200 thousands concepts, including: </para><itemizedlist><listitem><para>19 thousands of places </para></listitem><listitem><para>26 thousands of organizations </para></listitem><listitem><para>22 thousands of predicates </para></listitem><listitem><para>28 thousands of business related things  </para></listitem><listitem><para>12 thousands of people </para></listitem></itemizedlist><para>Although there are larger databases like DBpedia, Freebase and YAGO, the most important feature of Cyc is the  taxonomical part of the ontology, that has been developed during the last 20 years. It covers very abstract concepts like <emphasis>relation</emphasis>, <emphasis>mathematical thing</emphasis> and <emphasis>process</emphasis>, as well as quite concrete concepts, like <emphasis>mobile phone</emphasis>,  <emphasis>prime minister</emphasis> and even <emphasis>iPhone 4</emphasis>. It also covers many factual data about individual things, such as <emphasis>Barak Obama</emphasis> and  <emphasis>Poland</emphasis>. </para></section></section></article>