Locked History Actions

Diff for "plWikiEcono"

Differences between revisions 1 and 2
Revision 1 as of 2013-01-07 13:32:56
Size: 84
Comment:
Revision 2 as of 2013-01-07 13:38:35
Size: 759
Comment:
Deletions are marked like this. Additions are marked like this.
Line 3: Line 3:

The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting to TEI format. The articles were downloaded from Wikipedia in April 2011.

 * [[attachment:wikipedia-econo-tei.7z|Full corpus (format TEI)]]
 * [[attachment:wikipedia-econo-tei-senses.7z|A part of the corpus, which has been manually sense-annotated (TEI format)]]
 * [[attachment:wikipedia-econo-tei-senses-wsdde.7z|A part of the corpus, which has been manually sense-annotated (TEI format)]]

plWikiEcono

A corpus of Polish Wikipedia articles from the domain of economy.

The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting to TEI format. The articles were downloaded from Wikipedia in April 2011.