Locked History Actions

Diff for "PDB/PDBparser"

Differences between revisions 34 and 52 (spanning 18 versions)
Revision 34 as of 2019-01-18 11:29:55
Size: 8293
Comment:
Revision 52 as of 2019-05-06 10:05:19
Size: 9962
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:
== PDB-based dependency parsing models for Polish == == PDB-trained dependency parsing models for Polish ==
Line 6: Line 6:
 * COMBO
  * [[attachment:190115_COMBO_PDB_nosem.pkl]] – PDB-based COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
  * [[attachment: 190115_COMBO_PDB_sem.pkl]] – PDB-based COMBO model for part-of-speech tagging, lemmatisation, dependency parsing and semantic role labelling
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190423_COMBO_PDB_nosem_parseonly.pkl|COMBO model]] for dependency parsing only
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190423_COMBO_PDB_nosem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190423_COMBO_PDB_sem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
Line 10: Line 10:
 {{{#!wiki comment
 * '''NEW!''' PDB-based COMBO model compatible with the tagset of Morfeusz 2: [[attachment:180912_PDBCOMBO.pkl]]
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190125_MATE_PDB.model|MATE model]] for dependency parsing
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190125_MALT_PDB.mco|MaltParser model]] for dependency parsing
Line 13: Line 13:
 * MateParser
Line 15: Line 14:
  * '''NEW!''' PDB-based Mate model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMate.mdl]]
  * PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]]}}}
== PDB-UD-trained dependency parsing models for Polish ==
The PDB-UD-based models are trained on the current version of [[http://git.nlp.ipipan.waw.pl/alina/PDBUD|Polish Depedency Bank in Universal Dependencies format]] with the publicly available parsing systems – [[http://ufal.mff.cuni.cz/udpipe|UDPipe]] and [[https://github.com/360er0/COMBO|COMBO]].
Line 18: Line 17:
 * MaltParser
  * [[attachment:]] PDB-based MaltParser model
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190423_COMBO_PDBUD_nosem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, and dependency parsing
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190125_COMBO_PDBUD_sem.pkl|COMBO model]] for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190423_PDBUD_ttp_embedd.udpipe|UDPipe model]] for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing
 * [[http://mozart.ipipan.waw.pl/~alina/Polish_dependency_parsing_models/190423_PDBUD_tokeniser.udpipe|UDPipe model]] for tokenisation
Line 22: Line 23:
  * '''NEW!''' PDB-based MaltParser model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMalt.mco]]
  * PDB-basd MaltParser model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMalt.mco]]}}}


== PDBUD-based dependency parsing models for Polish ==
The PDBUD-based models are trained on the current version of [[http://git.nlp.ipipan.waw.pl/alina/PDBUD|Polish Depedency Bank in Universal Dependencies format]] with the publicly available parsing systems – [[http://ufal.mff.cuni.cz/udpipe|UDPipe]] and [[https://github.com/360er0/COMBO|COMBO]].
Line 30: Line 24:
 * [[attachment:180606_PDBUDPipe.udpipe|UDPipe]] model for Polish  * [[attachment:180606_PDBUDPipe.udpipe|UDPipe]] model for Polish}}}
Line 37: Line 31:
  * [[attachment:190115_COMBO_PDB_nosem.pkl]] – PDB-based COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing
  * [[attachment: 190115_COMBO_PDB_sem.pkl]] – PDB-based COMBO model for part-of-speech tagging, lemmatisation, dependency parsing and semantic role labelling

 * '''NEW!''' PDB-based COMBO model compatible with the tagset of Morfeusz 2: [[attachment:180912_PDBCOMBO.pkl]]

 * MateParser

  * '''NEW!''' PDB-based Mate model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMate.mdl]]
  * PDB-based Mate model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMate.mdl]]

 * MateParser
  * [[attachment:190125_MATE_PDB.model]] – PDB-based MateParser model for dependency parsing
 * MaltParser
  * [[attachment:190125_MALT_PDB.mco]] – PDB-based MaltParser model for dependency parsing


  * '''NEW!''' PDB-based MaltParser model compatible with the tagset of Morfeusz 2: [[attachment:180322_PDBMalt.mco]]
  * PDB-basd MaltParser model compatible with the tagset of Morfeusz: [[attachment:170608_PDBMalt.mco]]

PDB-trained dependency parsing models for Polish

The PDB-based models are trained on the current version of Polish Depedency Bank with the publicly available parsing systems – COMBO, MateParser and MaltParser.

  • COMBO model for dependency parsing only

  • COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing

  • COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling

  • MATE model for dependency parsing

  • MaltParser model for dependency parsing

PDB-UD-trained dependency parsing models for Polish

The PDB-UD-based models are trained on the current version of Polish Depedency Bank in Universal Dependencies format with the publicly available parsing systems – UDPipe and COMBO.

  • COMBO model for part-of-speech tagging, lemmatisation, and dependency parsing

  • COMBO model for part-of-speech tagging, lemmatisation, dependency parsing, and semantic role labelling

  • UDPipe model for tokenisation, part-of-speech tagging, lemmatisation, and dependency parsing

  • UDPipe model for tokenisation

Parsing performance

See Dependency parsing section.

PDB-based MaltParser in Multiservice

  • The performance of MaltParser model for Polish may be tested in Multiservice NLP – http://multiservice.nlp.ipipan.waw.pl.

  • To parse a Polish text in Multiservice "Select predefined chain of actions": 5: Concraft, DependencyParser, input your text, and press the button "Run".

  • To download the parser's output in CoNLL format, "Select output format:":

Publications

List of publications

List of publications
List of publications
List of publications

Licensing

The dependency parsing models for Polish are released under the CC BY-NC-SA 4.0 licence and by downloading it you accept the conditions of that licence.

Founding

The research was founded by SONATA 8 grant no 2014/15/D/HS2/03486 from the National Science Centre Poland and by the Polish Ministry of Science and Higher Education as part of the investment in the CLARIN-PL research infrastructure.

Contact

Any questions, comments? Please send them to <alina AT SPAMFREE ipipan DOT waw DOT pl>.