<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>LemmaPL</title><revhistory><revision><revnumber>4</revnumber><date>2014-12-18 13:08:50</date><authorinitials>LukaszKobylinski</authorinitials></revision><revision><revnumber>3</revnumber><date>2014-12-18 12:49:46</date><authorinitials>LukaszKobylinski</authorinitials></revision><revision><revnumber>2</revnumber><date>2014-12-18 10:35:21</date><authorinitials>LukaszKobylinski</authorinitials></revision><revision><revnumber>1</revnumber><date>2014-12-18 10:32:55</date><authorinitials>LukaszKobylinski</authorinitials></revision></revhistory></articleinfo><section><title>LemmaPL</title><para>LemmaPL is a lemmatization tool, which uses several existing tools and resources to provide higher than state-of-the-art lemmatization performance for Polish. Specifically, the following tools are used: </para><itemizedlist><listitem><para><ulink url="http://sgjp.pl/">Morfeusz analyzer</ulink> (version 1 and 2), by Marcin Woliński, </para></listitem><listitem><para><ulink url="http://nlp.pwr.wroc.pl/redmine/projects/wcrft/wiki">WCRFT tagger</ulink>, by Adam Radziszewski, </para></listitem><listitem><para><ulink url="http://zil.ipipan.waw.pl/Spejd">Spejd parser</ulink>, by Bartosz Zaborowski and Adam Przepiórkowski, </para></listitem><listitem><para>Spejd grammar, by Katarzyna Głowińska, Łukasz Degórski and Piotr Przybyła, </para></listitem><listitem><para>abbreviations dictionary, </para></listitem><listitem><para>frequency data from National Corpus of Polish. </para></listitem></itemizedlist><para><emphasis role="strong">Author:</emphasis> <ulink url="http://zil.ipipan.waw.pl/LukaszKobylinski">Łukasz Kobyliński</ulink> </para><para> <emphasis role="strong">License:</emphasis> GPL </para><section><title>Usage</title><para>LemmaPL is available in a form of a web service (SOON). </para><para>Currently, LemmaPL can be used from a <ulink url="https://www.docker.com/">Docker</ulink> container: ipipan/langtools-all or ipipan/langtools-taggers (with your own WCRFT model attached to the container). </para><para>Instructions for ipipan/langtools-all image: </para><itemizedlist><listitem><para>docker pull ipipan/langtools-all </para></listitem><listitem><para>docker run -v /home/username/my_tests:/root/my_tests -it ipipan/langtools-all /bin/bash </para></listitem></itemizedlist><para><emphasis>inside container</emphasis>: </para><itemizedlist><listitem><para>cd /root/lemmapl </para></listitem><listitem><para>python lemmapl.py ../my_tests/test.txt </para></listitem></itemizedlist></section></section></article>