Differences between revisions 1 and 2
⇤ ← Revision 1 as of 2013-01-07 13:32:56
Size: 84
Comment:
|
Size: 759
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 3: | Line 3: |
The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting to TEI format. The articles were downloaded from Wikipedia in April 2011. * [[attachment:wikipedia-econo-tei.7z|Full corpus (format TEI)]] * [[attachment:wikipedia-econo-tei-senses.7z|A part of the corpus, which has been manually sense-annotated (TEI format)]] * [[attachment:wikipedia-econo-tei-senses-wsdde.7z|A part of the corpus, which has been manually sense-annotated (TEI format)]] |
plWikiEcono
A corpus of Polish Wikipedia articles from the domain of economy.
The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting to TEI format. The articles were downloaded from Wikipedia in April 2011.