#acl +All:read Default = LemmaPL = LemmaPL is a lemmatization tool, which uses several existing tools and resources to provide higher than state-of-the-art lemmatization performance for Polish. Specifically, the following tools are used: * [[http://sgjp.pl/|Morfeusz analyzer]] (version 1 and 2), by Marcin Woliński, * [[http://nlp.pwr.wroc.pl/redmine/projects/wcrft/wiki|WCRFT tagger]], by Adam Radziszewski, * [[http://zil.ipipan.waw.pl/Spejd|Spejd parser]], by Bartosz Zaborowski and Adam Przepiórkowski, * Spejd grammar, by Katarzyna Głowińska, Łukasz Degórski and Piotr Przybyła, * abbreviations dictionary, * frequency data from National Corpus of Polish. '''Author:''' [[http://zil.ipipan.waw.pl/LukaszKobylinski|Łukasz Kobyliński]] <
> '''License:''' GPL == Usage == LemmaPL is available in a form of a web service (SOON). Currently, LemmaPL can be used from a [[https://www.docker.com/|Docker]] container: ipipan/langtools-all or ipipan/langtools-taggers (with your own WCRFT model attached to the container). Instructions for ipipan/langtools-all image: * docker pull ipipan/langtools-all * docker run -v /home/username/my_tests:/root/my_tests -it ipipan/langtools-all /bin/bash ''inside container'': * cd /root/lemmapl * python lemmapl.py ../my_tests/test.txt