Locked History Actions

Diff for "NKJPTools"

Differences between revisions 1 and 8 (spanning 7 versions)
Revision 1 as of 2012-01-26 11:26:47
Size: 201
Editor: MichalLenart
Comment:
Revision 8 as of 2012-04-12 16:01:51
Size: 689
Editor: MichalLenart
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
Line 4: Line 3:
 * [[attachment:validateNKJP.tar.gz]] - validation tool for NKJP_1M  * [[attachment:validateNKJP-2012-04-12-1640.tar.gz]] - validation tool for NKJP_1M
 * [[attachment:tei2pml-2012-04-12-1648.jar]] - conversion of ann_named.xml, ann_groups.xml and ann_words.xml from TEI P5 format to PML (so it can be manually corrected using Tred). Sources can be found in subversion repository at `svn://chopin.ipipan.waw.pl/nkjp/michal.lenart/tei2pml2`
 * [[attachment:pml2tei-2012-04-12-1701.tar.gz]] - conversion from PML to TEI P5 format (words, groups, names). Sources also at `svn://chopin.ipipan.waw.pl/nkjp/michal.lenart/pml2tei`

This page contains tools and resources related to NKJP_1M - NKJP corpus manually annotated sample containing 1 million tokens.

  • validateNKJP-2012-04-12-1640.tar.gz - validation tool for NKJP_1M

  • tei2pml-2012-04-12-1648.jar - conversion of ann_named.xml, ann_groups.xml and ann_words.xml from TEI P5 format to PML (so it can be manually corrected using Tred). Sources can be found in subversion repository at svn://chopin.ipipan.waw.pl/nkjp/michal.lenart/tei2pml2

  • pml2tei-2012-04-12-1701.tar.gz - conversion from PML to TEI P5 format (words, groups, names). Sources also at svn://chopin.ipipan.waw.pl/nkjp/michal.lenart/pml2tei