Locked History Actions

Diff for "NKJPTools"

Differences between revisions 7 and 19 (spanning 12 versions)
Revision 7 as of 2012-04-12 15:53:46
Size: 504
Editor: MichalLenart
Comment:
Revision 19 as of 2015-08-10 11:30:23
Size: 1038
Editor: MateuszKopec
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
This page contains tools and resources related to NKJP_1M - NKJP corpus manually annotated sample containing 1 million tokens. ## page was renamed from NarzedziaNKJP
## page was renamed from NKJP_1M
#acl +All:read Default
= NKJP tools =
Line 3: Line 6:
 * [[attachment:validateNKJP-2012-04-12-1640.tar.gz]] - validation tool for NKJP_1M
 * [[attachment:tei2pml-2012-04-12-1648.jar]] - conversion of ann_named.xml, ann_groups.xml and ann_words.xml from TEI P5 format to PML (so it can be manually corrected using Tred). Sources can be found in subversion repository at `svn://chopin.ipipan.waw.pl/nkjp/michal.lenart/tei2pml2`
This page contains various tools related to NKJP like parsers, converters, validators etc.

* [[attachment:validateNKJP-2012-04-12-1640.tar.gz]] - validation tool for NKJP-compatible TEI P5 directories (described here: http://nlp.ipipan.waw.pl/TEI4NKJP/). Latest version at: `svn://svn.nlp.ipipan.waw.pl/nkjp/michal.lenart/validateNKJP`
 * [[attachment:tei2pml-2012-04-12-1648.jar]] - converter of ann_named.xml, ann_groups.xml and ann_words.xml from TEI P5 format to PML (so it can be manually corrected using Tred). Latest version can be found in subversion repository at `svn://svn.nlp.ipipan.waw.pl/nkjp/michal.lenart/tei2pml`
 * [[attachment:pml2tei-2012-04-12-1701.tar.gz]] - converter from PML to TEI P5 format (words, groups, names). Sources also at `svn://svn.nlp.ipipan.waw.pl/nkjp/michal.lenart/pml2tei`
 * [[TeiAPI]] - Java API for parsing, manipulating and writing NKJP-compatible TEI P5 directories.

NKJP tools

This page contains various tools related to NKJP like parsers, converters, validators etc.

  • validateNKJP-2012-04-12-1640.tar.gz - validation tool for NKJP-compatible TEI P5 directories (described here: http://nlp.ipipan.waw.pl/TEI4NKJP/). Latest version at: svn://svn.nlp.ipipan.waw.pl/nkjp/michal.lenart/validateNKJP

  • tei2pml-2012-04-12-1648.jar - converter of ann_named.xml, ann_groups.xml and ann_words.xml from TEI P5 format to PML (so it can be manually corrected using Tred). Latest version can be found in subversion repository at svn://svn.nlp.ipipan.waw.pl/nkjp/michal.lenart/tei2pml

  • pml2tei-2012-04-12-1701.tar.gz - converter from PML to TEI P5 format (words, groups, names). Sources also at svn://svn.nlp.ipipan.waw.pl/nkjp/michal.lenart/pml2tei

  • TeiAPI - Java API for parsing, manipulating and writing NKJP-compatible TEI P5 directories.