Differences between revisions 2 and 4 (spanning 2 versions)
Deleted:
* [[attachment:validateNKJP-2012-03-05-1446.tar.gz]] - validation tool for NKJP_1M
Added:
* [[attachment:validateNKJP-2012-04-12-1640.tar.gz]] - validation tool for NKJP_1M
* [[attachment:tei2pml-2012-04-12-1648.jar]] - conversion of ann_named.xml, ann_groups.xml and ann_words.xml from TEI P5 format to PML (so it can be manually corrected using Tred). Sources can be found in svn://chopin.ipipan.waw.pl/nkjp/michal.lenart/tei2pml2
This page contains tools and resources related to NKJP_1M, the manually annotated sample of the NKJP corpus containing 1 million tokens.
* validateNKJP-2012-04-12-1640.tar.gz - validation tool for NKJP_1M
* tei2pml-2012-04-12-1648.jar - converts ann_named.xml, ann_groups.xml and ann_words.xml from the TEI P5 format to PML, so that the annotation can be corrected manually in TrEd. The source code is available at svn://chopin.ipipan.waw.pl/nkjp/michal.lenart/tei2pml2
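Before running a conversion, it may help to confirm that the three annotation layers named above are present and are well-formed XML. The following is only a minimal sketch of such a pre-check. It is not part of validateNKJP or tei2pml and does not reproduce what either tool does. The function name `check_sample_dir` and the assumption that one directory holds all three files for a sample are hypothetical and are not taken from this page.

```python
import xml.etree.ElementTree as ET
from pathlib import Path

# Annotation layers named on this page as input to the TEI P5 -> PML conversion.
REQUIRED_LAYERS = ("ann_words.xml", "ann_groups.xml", "ann_named.xml")


def check_sample_dir(sample_dir):
    """Return a list of problems found in one sample directory (empty if none).

    Hypothetical helper: assumes all three layers sit in the same directory.
    """
    problems = []
    for name in REQUIRED_LAYERS:
        path = Path(sample_dir) / name
        if not path.is_file():
            problems.append(f"{name}: missing")
            continue
        try:
            # Checks well-formedness only; no TEI schema validation is done.
            ET.parse(path)
        except ET.ParseError as err:
            problems.append(f"{name}: not well-formed ({err})")
    return problems
```

Calling `check_sample_dir("path/to/sample")` returns an empty list when all three files exist and parse. Anything beyond that, such as schema conformance or cross-layer consistency, is left to the actual validation tool.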