Size: 13133
Comment:
|
Size: 13235
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 46: | Line 46: |
* [[https://colab.research.google.com/drive/1D1P4AiE40Cc_4SF3HY-Mz06JY0XMiEFs?hl=en|Tutorial]] (collab notebook) * [[https://gitlab.clarin-pl.eu/syntactic-tools/combo/-/blob/master/docs/performance.md|Performance]] |
* Beginner's [[https://colab.research.google.com/drive/1D1P4AiE40Cc_4SF3HY-Mz06JY0XMiEFs?hl=en|tutorial]] (collab notebook) * COMBO's [[https://gitlab.clarin-pl.eu/syntactic-tools/combo/-/blob/master/docs/performance.md|performance]] on test sets for multiple languages from [[https://universaldependencies.org|UD]]. |
Polish COMBO models
The COMBO models for Polish are trained on the current version of Polish Dependency Bank. The models use the HerBERT language model.
PDB-trained COMBO models
model for dependency parsing only
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing (dependency relation types without semantic extensions, e.g. adjunct instead of adjunct_temp)
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing (dependency relation types with semantic extensions, e.g. adjunct_temp)
PDB-UD-trained COMBO model
model for part-of-speech tagging, morphological analysis, lemmatisation, and dependency parsing
COMBO
Source code
Beginner's tutorial (collab notebook)
COMBO's performance on test sets for multiple languages from UD.
COMBO demos
Publications
Licensing
The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading them you accept the conditions of that licence.
Acknowledgment
The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure. The computing was performed at Poznań Supercomputing and Networking Center.
Contact
Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.