Locked History Actions

Diff for "HateSpeech"

Differences between revisions 2 and 8 (spanning 6 versions)
Revision 2 as of 2013-01-23 23:36:29
Size: 324
Comment:
Revision 8 as of 2017-07-24 11:18:48
Size: 707
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
HateSpeech corpus in the current version contains over 2000 posts crawled from public Polish web. They represent various types and degrees of offensive language, expressed toward minorities (eg. ethincal, racial). The data were annotated manually. #acl +All:read Default
= HateSpeech corpus =
Line 3: Line 4:
The official page of the project is at http://www.raportmniejszosci.pl/ [[http://zil.ipipan.waw.pl/HateSpeech?action=AttachFile&do=get&target=hatespeech.tar.bz2|HateSpeech corpus]] in the current version contains over 2000 posts crawled from public Polish web. They represent various types and degrees of offensive language, expressed toward minorities (eg. ethnical, racial). The data were annotated manually.

==== Bibliography ====
Marek Troszyński, Aleksander Wawer. Czy komputer rozpozna hejtera? Wykorzystanie uczenia maszynowego (ML) w jakościowej analizie danych. Przegląd Socjologii Jakościowej. 2017. Tom XIII Numer 2.
[[http://www.qualitativesociologyreview.org/PL/Volume38/PSJ_13_2_Troszynski_Wawer.pdf|PDF]]

HateSpeech corpus

HateSpeech corpus in the current version contains over 2000 posts crawled from public Polish web. They represent various types and degrees of offensive language, expressed toward minorities (eg. ethnical, racial). The data were annotated manually.

Bibliography

Marek Troszyński, Aleksander Wawer. Czy komputer rozpozna hejtera? Wykorzystanie uczenia maszynowego (ML) w jakościowej analizie danych. Przegląd Socjologii Jakościowej. 2017. Tom XIII Numer 2. PDF