Multiservice

This page hosts the official release of Multiservice, a robust linguistic Web service for Polish that combines several mature offline linguistic tools in a common online platform. The package is distributed under the Creative Commons Attribution 3.0 Unported License; by downloading it you accept the conditions of that licence.

The service uses a packaged TEI P5-based annotation as its representation format, and Apache Thrift for data exchange between the underlying tools. In contrast to the most common approaches, the architecture supports asynchronous handling of requests, which makes it possible to process large amounts of text.
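As an illustration of what asynchronous request handling means for a client, the following minimal Python sketch shows a generic submit-then-poll pattern. It is not the Multiservice API: the class `FakeAsyncService` and the names `submit`, `is_done`, `result` and `process_async` are hypothetical, and the "processing" is a whitespace split standing in for real linguistic annotation.

```python
import itertools
import time
from concurrent.futures import ThreadPoolExecutor


class FakeAsyncService:
    """In-memory stand-in for an asynchronous text service (hypothetical)."""

    def __init__(self):
        self._pool = ThreadPoolExecutor(max_workers=2)
        self._jobs = {}
        self._ids = itertools.count(1)

    def submit(self, text):
        """Queue the text and return a token at once, without waiting."""
        token = next(self._ids)
        self._jobs[token] = self._pool.submit(self._process, text)
        return token

    def is_done(self, token):
        """Report whether the request identified by token has finished."""
        return self._jobs[token].done()

    def result(self, token):
        """Return the finished result (blocks if it is not ready yet)."""
        return self._jobs[token].result()

    @staticmethod
    def _process(text):
        time.sleep(0.05)     # simulate slow processing of a long text
        return text.split()  # placeholder for real annotation


def process_async(service, text, poll_interval=0.01):
    """Submit a request, poll until it is finished, then fetch the result."""
    token = service.submit(text)
    while not service.is_done(token):
        time.sleep(poll_interval)
    return service.result(token)


if __name__ == "__main__":
    print(process_async(FakeAsyncService(), "Ala ma kota"))
```

Because `submit` returns immediately, a client can queue a large text and keep no connection open while it is processed, which is the property the asynchronous design is meant to provide.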

Contact person: Maciej Ogrodniczuk
Authors: Michał Lenart, Maciej Ogrodniczuk, Mateusz Kopeć
License: CC BY v.3

Download

If you want to set up your own instance of the web service, you may download the source code from the web service git repository. The Multiservice web demo is also hosted in this repository.

If you want to use an existing instance, you may check out one of the clients available in the clients git repository. Currently, example Java and Python clients are available.

Configurations and models of processing tools used in Multiservice are available below:

You may download them all at once if you use the install scripts from the git repository.

Documentation

Preliminary documentation

Demo

The Web demo of the service: http://multiservice.nlp.ipipan.waw.pl/

Bug reports are welcome; please use the form available at http://multiservice.nlp.ipipan.waw.pl/ ("Report a bug" tab).

SOAP and Thrift webservices

Multiservice can also be accessed programmatically via:

  • Thrift service (host: multiservice.nlp.ipipan.waw.pl, port: 20000)
  • SOAP webservice (WSDL file)

Both access methods are implemented in the sample clients, available in the clients git repository.
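Before configuring one of the sample clients, it can be useful to check that the Thrift endpoint listed above accepts TCP connections. The following minimal Python sketch uses only the standard library; the host and port are taken from this page, while the helper name `is_reachable` is an assumption made for illustration. It only opens and closes a connection and does not speak the Thrift protocol.

```python
import socket

# Host and port as listed on this page for the Thrift service.
THRIFT_HOST = "multiservice.nlp.ipipan.waw.pl"
THRIFT_PORT = 20000


def is_reachable(host, port, timeout=5.0):
    """Return True if a TCP connection to (host, port) can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS failures.
        return False


if __name__ == "__main__":
    state = "reachable" if is_reachable(THRIFT_HOST, THRIFT_PORT) else "not reachable"
    print(f"{THRIFT_HOST}:{THRIFT_PORT} is {state}")
```

A positive result only shows that the port is open; actual requests should go through the Java or Python clients from the clients git repository.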

Reports and publications

Please cite the LREC paper:

Maciej Ogrodniczuk and Michał Lenart. Web Service integration platform for Polish linguistic resources. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, LREC 2012, pages 1164–1168, Istanbul, Turkey, 2012. European Language Resources Association (ELRA).

but you may also read:

Maciej Ogrodniczuk and Michał Lenart. A multi-purpose online toolset for NLP applications. In Elisabeth Métais, Farid Meziane, Mohamed Saraee, Vijay Sugumaran, and Sunil Vadera, editors, Proceedings of the 18th International Conference on Applications of Natural Language to Information Systems, number 7934 in Lecture Notes in Computer Science, pages 392–395. Springer-Verlag, Berlin, Heidelberg, 2013.

Maciej Ogrodniczuk and Michał Lenart. Multipurpose Linguistic Web Service for Polish. In Proceedings of the Language Technology for a Multilingual Europe workshop at the German Society for Computational Linguistics and Language Technology Conference (GSCL 2011), Hamburg, 2011.

Maciej Ogrodniczuk. The Packaged TEI P5-based Stand-off Annotation Format. Internal description of the Multiservice format, 2011.