Size: 7593
Comment:
|
Size: 8288
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 7: | Line 7: |
* tba | * PDB-based COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing: [[attachment:190115_COMBO_PDB_nosem.pkl]] * PDB-based COMBO model for part-of-speech tagging, lemmatisation, dependency parsing and semantic role labelling: [[attachment: 190115_COMBO_PDB_sem.pkl]] |
Line 10: | Line 11: |
* '''NEW!''' PDB-based COMBO model compatible with the tagset of Morfeusz 2: [[attachment:180912_PDBCOMBO.pkl]]}}} | * '''NEW!''' PDB-based COMBO model compatible with the tagset of Morfeusz 2: [[attachment:180912_PDBCOMBO.pkl]] |
Line 15: | Line 16: |
* PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]] | * PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]]}}} |
Line 18: | Line 19: |
* PDB-based MaltParser model: [[attachment:]] | |
Line 19: | Line 21: |
{{{#!wiki comment | |
Line 20: | Line 23: |
* PDB-basd MaltParser model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMalt.mco]] | * PDB-basd MaltParser model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMalt.mco]]}}} |
Line 26: | Line 29: |
* [[http://mozart.ipipan.waw.pl/~prybak/model_poleval2018/model_A_semi.pkl|COMBO]] model for Polish | * [[http://mozart.ipipan.waw.pl/~prybak/model_poleval2018/model_A_semi.pkl|COMBO]] model for Polish (the model estimated for the [[http://poleval.pl/tasks#task1|PolEval 2018]] competition) |
Line 78: | Line 81: |
=== Publications === | == Publications == |
Line 85: | Line 88: |
=== Licensing === | == Licensing == |
Line 89: | Line 92: |
=== Contact === | == Founding == The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. == Contact == |
PDB-based dependency parsing models for Polish
The PDB-based models are trained on the current version of Polish Depedency Bank with the publicly available parsing systems – COMBO, MateParser and MaltParser.
- COMBO
PDB-based COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing: 190115_COMBO_PDB_nosem.pkl
PDB-based COMBO model for part-of-speech tagging, lemmatisation, dependency parsing and semantic role labelling: 190115_COMBO_PDB_sem.pkl
PDB-based MaltParser model:
PDBUD-based dependency parsing models for Polish
The PDBUD-based models are trained on the current version of Polish Depedency Bank in Universal Dependencies format with the publicly available parsing systems – UDPipe and COMBO.
COMBO model for Polish (the model estimated for the PolEval 2018 competition)
UDPipe model for Polish
Parsing performance
See Dependency parsing section.
PDB-based MaltParser in Multiservice
The performance of MaltParser model for Polish may be tested in Multiservice NLP – http://multiservice.nlp.ipipan.waw.pl.
To parse a Polish text in Multiservice "Select predefined chain of actions": 5: Concraft, DependencyParser, input your text, and press the button "Run".
- To download the parser's output in CoNLL format, "Select output format:":
Publications
Licensing
The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading it you accept the conditions of that licence.
Founding
The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure.
Contact
Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.