Revision 2 as of 2014-12-05 17:58:39

Clear message
Locked History Actions


Krzaki (bushes)

A manually annotated for dependency structure corpus of Polish. It consists of ~20000 sentences, the same set used in Składnica.

This treebank has only segment-head links determined, without specifying their functions. Contrary to Składnica (which contains only sentences which could be parsed by Świgra), this treebank was created manually, from a representative set of sentences from the manually disambiguated for morphosyntax subcorpus of NKJP.

The corpus is distributed in CONLL format.