Revision 1 as of 2014-03-07 11:05:59

Clear message
Locked History Actions

Krzaki

Krzaki (bushes)

A manually annotated for dependency structure corpus of Polish. It consists of ~20000 sentences, the same set used in Składnica.

This treebank has only segment-head links determined, without specifying their functions. Contrary to Składnica (which contains only sentences which could be parsed by Świgra), this treebank was created manually, from a representative set of sentences from the manually disambiguated for morphosyntax subcorpus of NKJP.

The corpus is distributed in CONLL format.