Locked History Actions

Diff for "PDB/PDBparser"

Differences between revisions 48 and 49
Revision 48 as of 2019-04-24 11:59:36
Size: 9962
Comment:
Revision 49 as of 2019-04-25 09:03:27
Size: 9962
Comment:
Deletions are marked like this. Additions are marked like this.
Line 6: Line 6:
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190304_COMBO_PDB_nosem_parseonly.pkl|COMBO model]] for dependency parsing only  * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190423_COMBO_PDB_nosem_parseonly.pkl|COMBO model]] for dependency parsing only

PDB-trained dependency parsing models for Polish

The PDB-based models are trained on the current version of Polish Depedency Bank with the publicly available parsing systems – COMBO, MateParser and MaltParser.

  • COMBO model for dependency parsing only

  • COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing

  • COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling

  • MATE model for dependency parsing

  • MaltParser model for dependency parsing

PDB-UD-trained dependency parsing models for Polish

The PDB-UD-based models are trained on the current version of Polish Depedency Bank in Universal Dependencies format with the publicly available parsing systems – UDPipe and COMBO.

  • COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing

  • COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling

  • UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing

  • UDPipe model for tokenisation

Parsing performance

See Dependency parsing section.

PDB-based MaltParser in Multiservice

  • The performance of MaltParser model for Polish may be tested in Multiservice NLP – http://multiservice.nlp.ipipan.waw.pl.

  • To parse a Polish text in Multiservice "Select predefined chain of actions": 5: Concraft, DependencyParser, input your text, and press the button "Run".

  • To download the parser's output in CoNLL format, "Select output format:":

Publications

List of publications

Alina Wróblewska. Polish Dependency Parser Trained on an Automatically Induced Dependency Bank. Ph.D. dissertation, Institute of Computer Science, Polish Academy of Sciences, Warsaw, 2014.

List of publications

Alina Wróblewska and Adam Przepiórkowski. Projection-based annotation of a Polish dependency treebank. In Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pages 2306–2312, Reykjavík, Iceland, 2014. European Language Resources Association (ELRA).

List of publications

Alina Wróblewska. Polish dependency bank. Linguistic Issues in Language Technology, 7(1), 2012.

List of publications

Licensing

The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading it you accept the conditions of that licence.

Founding

The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure.

Contact

Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.