Differences between revisions 1 and 5 (spanning 4 versions)
Size: 304
Comment:
|
Size: 365
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
= HateSpeech corpus = | |
Line 2: | Line 3: |
HateSpeech corpus in the current version contains over 2000 posts crawled from public Polish web. They represent various types and degrees of offensive language, expressed toward minorities. The data were annotated manually. The official page of the project is at http://www.raportmniejszosci.pl/ | [[http://zil.ipipan.waw.pl/HateSpeech?action=AttachFile&do=get&target=hatespeech.tar.bz2|HateSpeech corpus]] in the current version contains over 2000 posts crawled from public Polish web. They represent various types and degrees of offensive language, expressed toward minorities (eg. ethnical, racial). The data were annotated manually. |
HateSpeech corpus
HateSpeech corpus in the current version contains over 2000 posts crawled from public Polish web. They represent various types and degrees of offensive language, expressed toward minorities (eg. ethnical, racial). The data were annotated manually.