| Size: 10313 Comment:  | Size: 10440 Comment:  | 
| Line 6: | Line 6: | 
| * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/191107_COMBO_PDB_semlab_parseonly.pkl|COMBO model]] for dependency parsing only * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/190423_COMBO_PDB_nosem_parseonly.pkl|COMBO model]] for (semantic) dependency parsing only * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/190423_COMBO_PDB_nosem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/190423_COMBO_PDB_sem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling | * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200128_COMBO_PDB_nosem_parseonly.pkl|COMBO model]] for dependency parsing only * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200118_COMBO_PDB_nosem_full.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200118_COMBO_PDB_sem_full.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling {{{#!wiki comment * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/191107_COMBO_PDB_semlab_parseonly.pkl|COMBO model]] for (semantic) dependency parsing only}}} | 
| Line 18: | Line 19: | 
| * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/190423_COMBO_PDBUD_nosem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/190423_COMBO_PDBUD_sem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling | * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200118_COMBO_PDBUD_nosem_full.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/COMBO/200118_COMBO_PDBUD_sem_full.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling | 
| Line 21: | Line 22: | 
| * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/UDPIPE/190423_PDBUD_tokeniser.udpipe|UDPipe model]] for tokenisation | * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/UDPIPE/20200930_PDBUD_tokeniser.udpipe|UDPipe model]] for tokenisation | 
| Line 93: | Line 94: | 
| * To download the parser's output in CoNLL format, "Select output format:": | * To download the parser's output in CoNLL format, "Select output format:". | 
| Line 106: | Line 107: | 
| The dependency parsing models for Polish are released under the [[https://creativecommons.org/licenses/by-nc-sa/4.0/|CC BY-NC-SA 4.0]] licence and by downloading it you accept the conditions of that licence. | The dependency parsing models for Polish are released under the [[https://creativecommons.org/licenses/by-nc-sa/4.0/|CC BY-NC-SA 4.0]] licence and by downloading them you accept the conditions of that licence. | 
| Line 108: | Line 109: | 
| == Founding == The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. | == Acknowledgment == The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. The computing was performed at Poznań Supercomputing and Networking Center. | 
PDB-trained dependency parsing models for Polish
The PDB-based models are trained on the current version of the Polish Dependency Bank using publicly available parsing systems: COMBO, MateParser, and MaltParser.
- COMBO model for dependency parsing only 
- COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing 
- COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling 
- MATE model for dependency parsing 
- MaltParser model for dependency parsing 
PDB-UD-trained dependency parsing models for Polish
The PDB-UD-based models are trained on the current version of the Polish Dependency Bank in Universal Dependencies format using publicly available parsing systems: UDPipe and COMBO.
- COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing 
- COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling 
- UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing 
- UDPipe model for tokenisation 
Parsing performance
See the Dependency parsing section.
PDB-based MaltParser in Multiservice
- The performance of the MaltParser model for Polish can be tested in Multiservice NLP – http://multiservice.nlp.ipipan.waw.pl. 
- To parse a Polish text in Multiservice, choose "5: Concraft, DependencyParser" under "Select predefined chain of actions", enter your text, and press the "Run" button. 
- To download the parser's output in CoNLL format, use the "Select output format:" option.
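
Once the output has been downloaded, it can be read with a few lines of Python. The following is only a minimal sketch: it assumes the file uses the common ten-column, tab-separated CoNLL layout (column names borrowed from CoNLL-U), which this page does not confirm for Multiservice. The names `FIELDS`, `read_conll`, `dependency_edges` and the `SAMPLE` sentence are hypothetical; the sample annotations were written by hand for illustration and are not parser output.

```python
from typing import Dict, Iterator, List, Tuple

# Assumed column layout (CoNLL-U naming); adjust if the downloaded file differs.
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]

# Hand-written illustrative sentence, NOT real output of any parser.
SAMPLE = (
    "1\tAla\tAla\tPROPN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tma\tmieć\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\tkota\tkot\tNOUN\t_\t_\t2\tobj\t_\t_\n"
    "\n"
)


def read_conll(text: str) -> Iterator[List[Dict[str, str]]]:
    """Yield each sentence as a list of token dictionaries."""
    sentence: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Skip comment lines (used in CoNLL-U).
            continue
        sentence.append(dict(zip(FIELDS, line.split("\t"))))
    if sentence:
        # The last sentence may not be followed by a blank line.
        yield sentence


def dependency_edges(sentence: List[Dict[str, str]]) -> List[Tuple[str, str, str]]:
    """Return (form, relation, head id) triples for one sentence."""
    return [(tok["form"], tok["deprel"], tok["head"]) for tok in sentence]


if __name__ == "__main__":
    for sent in read_conll(SAMPLE):
        for form, relation, head in dependency_edges(sent):
            print(f"{form} --{relation}--> {head}")
```

To process a downloaded file instead of the sample, read it as UTF-8 text and pass its contents to `read_conll`.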
Publications
Licensing
The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading them you accept the conditions of that licence.
Acknowledgment
The research was funded by SONATA 8 grant no. 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. The computing was performed at the Poznań Supercomputing and Networking Center.
Contact
Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.

 
 
                            


