Revision 4 as of 2014-07-14 14:44:53

Clear message
Locked History Actions

TextSelector

Text Selector

This page offers the official Creative Commons Attribution 3.0 Unported License release of the TextSelector, a tool for manual text inspection and selection. By downloading the TextSelector package you accept the conditions of that licence.

Principal developer: Mateusz Kopeć
Authors: Mateusz Kopeć
License: CC BY v.3

http://i.creativecommons.org/l/by/3.0/88x31.png

Usage

Text Selector may be run using following command:

java -jar TextSelector_standalone_jar input_dir target_dir

where TextSelector_standalone_jar is the standalone .jar file presented below, input_dir is the corpus in the TEI format (only text_structure.xml and header.xml files for each text) and target_dir is the target directory to place the rejected texts.

Usage is simple: when opened, TextSelector shows first text in the corpus directory. One may edit text content of the corpus in the main window and save changes by pressing CTRL+S shortcut. If the text should be rejected, CTRL+R will do it and display next corpus text. Next corpus text may be shown by pressing CTRL+N, previous one by pressing CTRL+P.

Downloads

TextSelector 1.0 is available to download in two versions:

You may also want to see other Polish Coreference Tools.

Citing

When using Text Selector, please cite the following article: List of publications

Maciej Ogrodniczuk, Katarzyna Głowińska, Mateusz Kopeć, Agata Savary, and Magdalena Zawisławska. Polish Coreference Corpus. In Zygmunt Vetulani, editor, Proceedings of the 6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, pages 494–498, Poznań, Poland, 2013. Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza.