Differences between revisions 2 and 4 (spanning 2 versions)
Size: 759
Comment:
|
Size: 764
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 4: | Line 4: |
The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting to TEI format. The articles were downloaded from Wikipedia in April 2011. | The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting it to TEI format. The articles were downloaded from Wikipedia in April 2011. |
Line 8: | Line 8: |
* [[attachment:wikipedia-econo-tei-senses-wsdde.7z|A part of the corpus, which has been manually sense-annotated (TEI format)]] | * [[attachment:wikipedia-econo-tei-senses-wsdde.7z|A part of the corpus, which has been manually sense-annotated (WSDDE format)]] |
plWikiEcono
A corpus of Polish Wikipedia articles from the domain of economy.
The corpus has been created by selecting economy-related categories from the Polish Wikipedia, including economy-related subcategories, downloading all articles from such a list of categories, stripping Wikipedia annotation, tagging the result with TaKIPI 1.8 and converting it to TEI format. The articles were downloaded from Wikipedia in April 2011.