| Size: 7104 Comment:  | Size: 8650 Comment:  | 
| Deletions are marked like this. | Additions are marked like this. | 
| Line 2: | Line 2: | 
| = PDB-based dependency parsing models for Polish = | == PDB-based dependency parsing models for Polish == | 
| Line 6: | Line 6: | 
| === COMBO === | * COMBO * [[attachment:190115_COMBO_PDB_nosem.pkl]] – PDB-based COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing * [[attachment: 190115_COMBO_PDB_sem.pkl]] – PDB-based COMBO model for part-of-speech tagging, lemmatisation, dependency parsing and semantic role labelling {{{#!wiki comment | 
| Line 9: | Line 13: | 
| === MateParser === | * MateParser | 
| Line 11: | Line 15: | 
| * '''NEW!''' PDB-based Mate model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMate.mdl]] * PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]] | * '''NEW!''' PDB-based Mate model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMate.mdl]] * PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]]}}} | 
| Line 14: | Line 18: | 
| === MaltParser === | * MateParser * [[attachment:190125_MATE_PDB.model]] – PDB-based MateParser model for dependency parsing * MaltParser * [[attachment:190125_MALT_PDB.mco]] – PDB-based MaltParser model for dependency parsing | 
| Line 16: | Line 23: | 
| * '''NEW!''' PDB-based MaltParser model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMalt.mco]] * PDB-basd MaltParser model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMalt.mco]] | {{{#!wiki comment * '''NEW!''' PDB-based MaltParser model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMalt.mco]] * PDB-basd MaltParser model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMalt.mco]]}}} | 
| Line 20: | Line 28: | 
| = PDBUD-based dependency parsing models for Polish= | == PDBUD-based dependency parsing models for Polish == The PDBUD-based models are trained on the current version of [[http://git.nlp.ipipan.waw.pl/alina/PDBUD|Polish Depedency Bank in Universal Dependencies format]] with the publicly available parsing systems – [[http://ufal.mff.cuni.cz/udpipe|UDPipe]] and [[https://github.com/360er0/COMBO|COMBO]]. | 
| Line 22: | Line 31: | 
| === UDPipe === * UDPipe model for Polish: [[attachment:180606_PDBUDPipe.udpipe]] | * COMBO * [[attachment: 190115_COMBO_PDBUD_nosem.pkl]] – PDBUD-based model COMBO for part-of-speech tagging, lemmatisation, and dependency parsing * UDPipe * tba {{{#!wiki comment * [[http://mozart.ipipan.waw.pl/~prybak/model_poleval2018/model_A_semi.pkl|COMBO]] model for Polish (the model estimated for the [[http://poleval.pl/tasks#task1|PolEval 2018]] competition) * [[attachment:180606_PDBUDPipe.udpipe|UDPipe]] model for Polish}}} | 
| Line 26: | Line 41: | 
| See [[http://clip.ipipan.waw.pl/benchmarks|Dependency parsing]] section. | |
| Line 72: | Line 89: | 
| === Publications === | == Publications == | 
| Line 79: | Line 96: | 
| === Licensing === | == Licensing == | 
| Line 83: | Line 100: | 
| === Contact === | == Founding == The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. == Contact == | 
PDB-based dependency parsing models for Polish
The PDB-based models are trained on the current version of Polish Depedency Bank with the publicly available parsing systems – COMBO, MateParser and MaltParser.
- COMBO - 190115_COMBO_PDB_nosem.pkl – PDB-based COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing 
- 190115_COMBO_PDB_sem.pkl – PDB-based COMBO model for part-of-speech tagging, lemmatisation, dependency parsing and semantic role labelling 
 
- 190125_MATE_PDB.model – PDB-based MateParser model for dependency parsing 
 
- 190125_MALT_PDB.mco – PDB-based MaltParser model for dependency parsing 
 
PDBUD-based dependency parsing models for Polish
The PDBUD-based models are trained on the current version of Polish Depedency Bank in Universal Dependencies format with the publicly available parsing systems – UDPipe and COMBO.
- COMBO - 190115_COMBO_PDBUD_nosem.pkl – PDBUD-based model COMBO for part-of-speech tagging, lemmatisation, and dependency parsing 
 
- UDPipe - tba
 
Parsing performance
See Dependency parsing section.
PDB-based MaltParser in Multiservice
- The performance of MaltParser model for Polish may be tested in Multiservice NLP – http://multiservice.nlp.ipipan.waw.pl. 
- To parse a Polish text in Multiservice "Select predefined chain of actions": 5: Concraft, DependencyParser, input your text, and press the button "Run". 
- To download the parser's output in CoNLL format, "Select output format:":
Publications
Licensing
The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading it you accept the conditions of that licence.
Founding
The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure.
Contact
Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.

 
 
                            

