Pregled bibliografske jedinice broj: 1085826
Detecting hate speech online: a case of Croatian
Detecting hate speech online: a case of Croatian // Formalizing natural languages with NooJ 2019 and its natural language processing applications : revised selected papers / Fehri, Hela ; Mesfar, Slim ; Silberztein, Max (ur.).
Hammamet, Tunis: Springer, 2020. str. 185-197 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 1085826 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Detecting hate speech online: a case of Croatian
Autori
Kocijan, Kristina ; Košković, Lucija ; Bajac, Petra
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Formalizing natural languages with NooJ 2019 and its natural language processing applications : revised selected papers
/ Fehri, Hela ; Mesfar, Slim ; Silberztein, Max - : Springer, 2020, 185-197
ISBN
978-3-030-38832-4
Skup
International Conference on Automatic Processing of Natural-Language Electronic Texts (NooJ 2019)
Mjesto i datum
Hammamet, Tunis, 07.06.2019. - 09.06.2019
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
hate speech ; insults ; pattern detection ; information extraction ; syntactic grammars ; Croatian ; NooJ
Sažetak
This project proposes a NooJ algorithm with the task to find and categorize various slurs, insults and ultimately, hate speech in Croatian. The results also provide a more detailed insight into inappropriate language in Croatian. We strongly emphasize the ethical considerations of (mis) identifying hate speech and as a result, an unethical and undeserved censorship of inappropriate, but free speech. Thus, we tried to make a clear distinction between insults and hate speech. The test corpus consists of written online comments and remarks posted on five Croatian Facebook news pages during one week period. Given the differences between the standard Croatian grammar and syntax, and what is actually being used in informal on-line communication, the false negatives present the biggest difficulty since some variations (substandard usages of cases, spelling errors, colloquialisms) are impossible to predict, and therefore, extremely hard to implement into the algorithm.
Izvorni jezik
Engleski
Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija
Citiraj ovu publikaciju:
Časopis indeksira:
- Scopus