Locked History Actions

Diff for "PDB/PDBparser"

Differences between revisions 68 and 75 (spanning 7 versions)
Revision 68 as of 2020-01-20 10:18:22
Size: 10402
Comment:
Revision 75 as of 2020-09-30 14:40:54
Size: 10440
Comment:
Deletions are marked like this. Additions are marked like this.
Line 6: Line 6:
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200118_COMBO_PDB_nosem_parseonly.pkl|COMBO model]] for dependency parsing only
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/191107_COMBO_PDB_semlab_parseonly.pkl|COMBO model]] for (semantic) dependency parsing only
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200128_COMBO_PDB_nosem_parseonly.pkl|COMBO model]] for dependency parsing only
Line 10: Line 9:
{{{#!wiki comment
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/191107_COMBO_PDB_semlab_parseonly.pkl|COMBO model]] for (semantic) dependency parsing only}}}
Line 18: Line 19:
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/190423_COMBO_PDBUD_nosem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/190423_COMBO_PDBUD_sem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200118_COMBO_PDBUD_nosem_full.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200118_COMBO_PDBUD_sem_full.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
Line 21: Line 22:
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/UDPIPE/190423_PDBUD_tokeniser.udpipe|UDPipe model]] for tokenisation  * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/UDPIPE/20200930_PDBUD_tokeniser.udpipe|UDPipe model]] for tokenisation
Line 93: Line 94:
 * To download the parser's output in CoNLL format, "Select output format:":  * To download the parser's output in CoNLL format, "Select output format:".
Line 108: Line 109:
== Founding == == Acknowledgment ==

PDB-trained dependency parsing models for Polish

The PDB-based models are trained on the current version of Polish Dependency Bank with the publicly available parsing systems – COMBO, MateParser and MaltParser.

  • COMBO model for dependency parsing only

  • COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing

  • COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling

PDB-UD-trained dependency parsing models for Polish

The PDB-UD-based models are trained on the current version of Polish Dependency Bank in Universal Dependencies format with the publicly available parsing systems – UDPipe and COMBO.

  • COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing

  • COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling

  • UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing

  • UDPipe model for tokenisation

Parsing performance

See Dependency parsing section.

PDB-based MaltParser in Multiservice

  • The performance of MaltParser model for Polish may be tested in Multiservice NLP – http://multiservice.nlp.ipipan.waw.pl.

  • To parse a Polish text in Multiservice "Select predefined chain of actions": 5: Concraft, DependencyParser, input your text, and press the button "Run".

  • To download the parser's output in CoNLL format, "Select output format:".

Publications

List of publications

Alina Wróblewska and Piotr Rybak. Dependency parsing of Polish. Poznań Studies in Contemporary Linguistics, 55(2):305–337, 2019.

(Note: Please contact the first author to get a copy of this article.) List of publications
List of publications
List of publications
List of publications

Licensing

The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading them you accept the conditions of that licence.

Acknowledgment

The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. The computing was performed at Poznań Supercomputing and Networking Center.

Contact

Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.