Locked History Actions

Diff for "TextSelector"

Differences between revisions 2 and 3
Revision 2 as of 2014-07-14 14:18:23
Size: 871
Editor: MateuszKopec
Comment:
Revision 3 as of 2014-07-14 14:44:41
Size: 1770
Editor: MateuszKopec
Comment:
Deletions are marked like this. Additions are marked like this.
Line 16: Line 16:
== Usage ==
Each converter may be run using following command:

java -jar TextSelector_standalone_jar input_dir target_dir

where ''TextSelector_standalone_jar'' is the standalone ''.jar'' file presented below, ''input_dir'' is the corpus in the TEI format (only ''text_structure.xml'' and ''header.xml'' files for each text) and ''target_dir'' is the target directory to place the rejected texts.

Usage is simple: when opened, TextSelector shows first text in the corpus directory. One may edit text content of the corpus in the main window and save changes by pressing CTRL+S shortcut. If the text should be rejected, CTRL+R will do it and display next corpus text. Next corpus text may be shown by pressing CTRL+N, previous one by pressing CTRL+P.
Line 17: Line 26:

TextSelector 1.0 will soon be available to download
TextSelector 1.0 is available to download in two versions:
 * [[attachment:textSelector-1.0-SNAPSHOT.one-jar.jar | standalone jar]]
 * [[attachment:textSelector-1.0-src.jar | source code]]

Text Selector

This page offers the official Creative Commons Attribution 3.0 Unported License release of the TextSelector, a tool for manual text inspection and selection. By downloading the TextSelector package you accept the conditions of that licence.

Principal developer: Mateusz Kopeć
Authors: Mateusz Kopeć
License: CC BY v.3

http://i.creativecommons.org/l/by/3.0/88x31.png

Usage

Each converter may be run using following command:

java -jar TextSelector_standalone_jar input_dir target_dir

where TextSelector_standalone_jar is the standalone .jar file presented below, input_dir is the corpus in the TEI format (only text_structure.xml and header.xml files for each text) and target_dir is the target directory to place the rejected texts.

Usage is simple: when opened, TextSelector shows first text in the corpus directory. One may edit text content of the corpus in the main window and save changes by pressing CTRL+S shortcut. If the text should be rejected, CTRL+R will do it and display next corpus text. Next corpus text may be shown by pressing CTRL+N, previous one by pressing CTRL+P.

Downloads

TextSelector 1.0 is available to download in two versions:

You may also want to see other Polish Coreference Tools.

Citing

When using Text Selector, please cite the following article: List of publications

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, and Magdalena Zawisławska. Polish Coreference Corpus. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 494–498, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.