Polish COMBO models
The COMBO models for Polish are trained on the current version of Polish Dependency Bank. The models use the HerBERT language model.
PDB-trained models
model for dependency parsing only
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing (dependency relation types without semantic extensions, e.g. adjunct instead of adjunct_temp)
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing (dependency relation types with semantic extensions, e.g. adjunct_temp)
PDB-UD-trained model
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing
Parsing performance (outdated)
See Dependency parsing section.
PDB-based dependency parsing demos
COMBO demo (only in Polish)
MaltParser demo in Multiservice NLP
To parse a Polish text in Multiservice "Select predefined chain of actions": 5: Concraft, DependencyParser, input your text, and press the button "Run".
- To download the parser's output in CoNLL format, "Select output format:".
Publications
Licensing
The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading them you accept the conditions of that licence.
Acknowledgment
The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. The computing was performed at Poznań Supercomputing and Networking Center.
Contact
Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.
COMBO, MateParser and MaltParser.
COMBO-pytorch model for dependency parsing only (with HerBERT-base embeddings),
COMBO model for dependency parsing only
COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
COMBO model for (semantic) dependency parsing only
MATE model for dependency parsing
MaltParser model for dependency parsing
PDB-UD-trained dependency parsing models for Polish
The PDB-UD-based models are trained on the current version of Polish Dependency Bank in Universal Dependencies format with the publicly available parsing systems – COMBO-pytorch, COMBO, UDPipe.
COMBO-pytorch model for for part-of-speech tagging, lemmatisation, and dependency parsing (with HerBERT-base embeddings),
COMBO-pytorch model for for part-of-speech tagging, lemmatisation, and dependency parsing (with HerBERT-large embeddings),
COMBO-pytorch model for for part-of-speech tagging, lemmatisation, and dependency parsing (with fastText embeddings),
COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing
UDPipe model for tokenisation