Line 6: | Line 6: |
* COMBO | * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190115_COMBO_PDB_nosem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190115_COMBO_PDB_sem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling {{{#!wiki comment |
Line 10: | Line 13: |
{{{#!wiki comment | |
Line 16: | Line 18: |
* PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]]}}} | * PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]] |
Line 19: | Line 21: |
* [[attachment:]] – PDB-based MateParser model for dependency parsing | * [[attachment:190125_MATE_PDB.model]] – PDB-based MateParser model for dependency parsing |
Line 21: | Line 23: |
* [[attachment:190105_MALT_PDB.mco]] – PDB-based MaltParser model for dependency parsing | * [[attachment:190125_MALT_PDB.mco]] – PDB-based MaltParser model for dependency parsing |
Line 23: | Line 25: |
{{{#!wiki comment | |
Line 26: | Line 28: |
* [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190125_MATE_PDB.model|MATE model]] for dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190125_MALT_PDB.mco|MaltParser model]] for dependency parsing |
|
Line 31: | Line 36: |
* COMBO * [[attachment: 190115_COMBO_PDBUD_nosem.pkl]] – PDBUD-based model COMBO for part-of-speech tagging, lemmatisation, and dependency parsing * UDPipe * tba |
* [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190115_COMBO_PDBUD_nosem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190115_COMBO_PDBUD_sem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling * UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190125_PDBUD_tokeniser.udpipe|UDPipe model]] for tokenisation |
PDB-based dependency parsing models for Polish
The PDB-based models are trained on the current version of the Polish Dependency Bank using publicly available parsing systems: COMBO, MateParser, and MaltParser.
COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
MATE model for dependency parsing
MaltParser model for dependency parsing
PDBUD-based dependency parsing models for Polish
The PDBUD-based models are trained on the current version of the Polish Dependency Bank in Universal Dependencies format using publicly available parsing systems: UDPipe and COMBO.
COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
- UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing
UDPipe model for tokenisation
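The models can also be fetched from a script rather than through the browser. The following is a minimal sketch, assuming the base URL and file names used by the download links for the models listed on this page; the dictionary keys and the function names `model_url` and `download_model` are illustrative and not part of any released tool.

```python
from pathlib import Path
from urllib.parse import urljoin
from urllib.request import urlretrieve

# Base URL of the download links on this page.
BASE_URL = "http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/"

# File names as they appear in the download links; the keys are illustrative.
MODEL_FILES = {
    "combo_pdb": "190115_COMBO_PDB_nosem.pkl",
    "combo_pdb_sem": "190115_COMBO_PDB_sem.pkl",
    "mate_pdb": "190125_MATE_PDB.model",
    "malt_pdb": "190125_MALT_PDB.mco",
    "combo_pdbud": "190115_COMBO_PDBUD_nosem.pkl",
    "combo_pdbud_sem": "190115_COMBO_PDBUD_sem.pkl",
    "udpipe_tokeniser": "190125_PDBUD_tokeniser.udpipe",
}


def model_url(key: str) -> str:
    """Build the full download URL for one of the listed models."""
    return urljoin(BASE_URL, MODEL_FILES[key])


def download_model(key: str, target_dir: str = ".") -> Path:
    """Download a model into target_dir unless the file is already there."""
    target = Path(target_dir) / MODEL_FILES[key]
    if not target.exists():
        urlretrieve(model_url(key), str(target))
    return target


if __name__ == "__main__":
    # Requires network access; the licence terms stated on this page apply.
    print(download_model("malt_pdb"))
```

Skipping the download when the file already exists avoids re-fetching a large model on every run.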
Parsing performance
See the Dependency parsing section.
PDB-based MaltParser in Multiservice
The performance of the MaltParser model for Polish can be tested in Multiservice NLP: http://multiservice.nlp.ipipan.waw.pl.
To parse a Polish text in Multiservice, choose "5: Concraft, DependencyParser" under "Select predefined chain of actions", input your text, and press the "Run" button.
- To download the parser's output in CoNLL format, "Select output format:":
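A parse saved in CoNLL format can be inspected with a few lines of Python. The sketch below assumes the ten tab-separated columns of the CoNLL-X layout (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL) and blank lines between sentences; the exact layout produced by Multiservice may differ. The `Token` type, the `read_conll` function, and the sample sentence are illustrative and hand-written, not actual parser output.

```python
from typing import Iterable, Iterator, List, NamedTuple


class Token(NamedTuple):
    id: int
    form: str
    lemma: str
    postag: str
    head: int      # ID of the governing token, 0 for the root
    deprel: str


def read_conll(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield sentences as lists of tokens from CoNLL-X style lines."""
    sentence: List[Token] = []
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        sentence.append(
            Token(int(cols[0]), cols[1], cols[2], cols[4], int(cols[6]), cols[7])
        )
    if sentence:
        yield sentence


# Hand-written sample for illustration only; tags and labels are placeholders.
SAMPLE = (
    "1\tAla\tAla\tsubst\tsubst\t_\t2\tsubj\t_\t_\n"
    "2\tma\tmieć\tfin\tfin\t_\t0\troot\t_\t_\n"
    "3\tkota\tkot\tsubst\tsubst\t_\t2\tobj\t_\t_\n"
)

if __name__ == "__main__":
    for sent in read_conll(SAMPLE.splitlines()):
        for tok in sent:
            print(tok.form, "->", tok.head, tok.deprel)
```

To read a downloaded file instead of the sample, pass an open file object (opened with `encoding="utf-8"`) to `read_conll`.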
Publications
Licensing
The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence; by downloading them you accept the conditions of that licence.
Funding
The research was funded by SONATA 8 grant no. 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure.
Contact
Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.