COMBO's models for Polish
COMBO's models for Polish are trained on the current version of the Polish Dependency Bank using the HerBERT language model.
PDB-trained models
model for dependency parsing only
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing (dependency relation types without semantic extensions, e.g. adjunct instead of adjunct_temp)
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing (dependency relation types with semantic extensions, e.g. adjunct_temp)
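The difference between the last two models lies only in the label inventory: one predicts plain relation types (adjunct), the other predicts semantically extended ones (adjunct_temp). As a minimal sketch, output from the extended model could be mapped onto the plain inventory as shown below. The function name and the `base_labels` parameter are hypothetical, and the sketch assumes that an extension is attached to its base label with an underscore, as in the adjunct_temp example; it collapses only the base labels you list explicitly, because other underscore-containing labels may not be semantic extensions.

```python
from typing import Iterable


def strip_semantic_extension(deprel: str, base_labels: Iterable[str] = ("adjunct",)) -> str:
    """Collapse a semantically extended relation label to its base label.

    Hypothetical helper, not part of COMBO. Only labels of the form
    '<base>_<extension>' whose base is listed in `base_labels` are collapsed;
    every other label is returned unchanged.
    """
    for base in base_labels:
        if deprel.startswith(base + "_"):
            return base
    return deprel


def strip_all(deprels: Iterable[str]) -> list:
    """Apply strip_semantic_extension to a sequence of labels."""
    return [strip_semantic_extension(label) for label in deprels]
```

Keeping the list of collapsible base labels explicit is a deliberately conservative choice: the mapping never rewrites a label it was not told about.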
PDB-UD-trained model
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing
COMBO
COMBO's source code
Beginner's tutorial (Colab notebook)
COMBO's performance on test sets for multiple languages from Universal Dependencies
Web demos
Publications
Licensing
The Polish NLP models are released under the CC BY-NC-SA 4.0 licence, and by downloading them you accept the conditions of that licence.
Acknowledgment
The research was funded by SONATA 8 grant no. 2014/15/D/HS2/03486 from the National Science Centre Poland, by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure, and by the Digital Research Infrastructure for the Arts and Humanities DARIAH-PL. The computing was performed at the Poznań Supercomputing and Networking Center.
Contact
Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.
COMBO, MateParser and MaltParser.
COMBO-pytorch model for dependency parsing only (with HerBERT-base embeddings)
COMBO model for dependency parsing only
COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
COMBO model for (semantic) dependency parsing only
MATE model for dependency parsing
MaltParser model for dependency parsing
PDB-UD-trained dependency parsing models for Polish
The PDB-UD-based models are trained on the current version of the Polish Dependency Bank in Universal Dependencies format with the publicly available parsing systems: COMBO-pytorch, COMBO, and UDPipe.
COMBO-pytorch model for part-of-speech tagging, lemmatisation, and dependency parsing (with HerBERT-base embeddings)
COMBO-pytorch model for part-of-speech tagging, lemmatisation, and dependency parsing (with HerBERT-large embeddings)
COMBO-pytorch model for part-of-speech tagging, lemmatisation, and dependency parsing (with fastText embeddings)
COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing
UDPipe model for tokenisation