Locked History Actions

Diff for "plWikiEcono"

Differences between revisions 2 and 4 (spanning 2 versions)
Revision 2 as of 2013-01-07 13:38:35
Size: 759
Comment:
Revision 4 as of 2013-01-07 13:39:55
Size: 764
Comment:
Deletions are marked like this. Additions are marked like this.
Line 4: Line 4:
The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting to TEI format. The articles were downloaded from Wikipedia in April 2011. The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting it to TEI format. The articles were downloaded from Wikipedia in April 2011.
Line 8: Line 8:
 * [[attachment:wikipedia-econo-tei-senses-wsdde.7z|A part of the corpus, which has been manually sense-annotated (TEI format)]]  * [[attachment:wikipedia-econo-tei-senses-wsdde.7z|A part of the corpus, which has been manually sense-annotated (WSDDE format)]]

plWikiEcono

A corpus of Polish Wikipedia articles from the domain of economy.

The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting it to TEI format. The articles were downloaded from Wikipedia in April 2011.