Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context

Karan, Mladen; Šnajder, Jan

Bibliographic record number: 1022280

Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context // Proceedings of the Third Workshop on Abusive Language Online
Florence : Munich: Association for Computational Linguistics (ACL), 2019. pp. 129-134 doi:10.18653/v1/w19-3514 (poster, international peer review, full paper (in extenso), scientific)

CROSBI ID: 1022280

Title
Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context

Authors
Karan, Mladen ; Šnajder, Jan

Type, subtype, and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Proceedings of the Third Workshop on Abusive Language Online / - Florence : Munich : Association for Computational Linguistics (ACL), 2019, 129-134

Conference
7th Workshop on Balto-Slavic Natural Language Processing. Association for Computational Linguistics

Place and date
Florence, Italy, 28 July 2019 - 2 August 2019

Type of participation
Poster

Type of review
International peer review

Keywords
hate speech, natural language processing

Abstract
We address the task of automatically detecting toxic content in user-generated texts. We focus on exploring the potential for preemptive moderation, i.e., predicting whether a particular conversation thread will, in the future, incite a toxic comment. Moreover, we perform a preliminary investigation of whether a model that jointly considers all comments in a conversation thread outperforms a model that considers only individual comments. Using an existing dataset of conversations among Wikipedia contributors as a starting point, we compile a new large-scale dataset for this task consisting of labeled comments and comments from their conversation threads.
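
The preemptive setup described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Comment` class, the `preemptive_examples` function, the per-comment `toxic` flag, and the choice of "the immediately preceding comment" as the comment-level baseline are all assumptions made for illustration. The sketch only shows how a thread might be turned into (context, label) pairs, where the label says whether the next comment is toxic.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Comment:
    """Hypothetical record: comment text plus an assumed toxicity label."""
    text: str
    toxic: bool


def preemptive_examples(
    thread: List[Comment], use_thread_context: bool
) -> List[Tuple[List[str], bool]]:
    """Build (context, label) pairs from one thread.

    For each comment after the first, the context is what was written
    before it and the label is whether that comment is toxic.
    Thread-level: the context is every earlier comment.
    Comment-level (assumed baseline): only the immediately preceding one.
    """
    examples = []
    for i in range(1, len(thread)):
        history = thread[:i]
        if not use_thread_context:
            history = history[-1:]
        examples.append(([c.text for c in history], thread[i].toxic))
    return examples


thread = [
    Comment("I reverted your edit.", False),
    Comment("Why? The source was fine.", False),
    Comment("You clearly cannot read.", True),
]
thread_level = preemptive_examples(thread, use_thread_context=True)
comment_level = preemptive_examples(thread, use_thread_context=False)
```

Under these assumptions, both variants yield one example per non-initial comment with identical labels; they differ only in how much of the earlier conversation a classifier would see.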

Original language
English

Scientific fields
Computer science

AFFILIATIONS

Institutions:
Faculty of Electrical Engineering and Computing, Zagreb

Profiles:

Jan Šnajder (author)

Mladen Karan (author)

Links to the full text of the paper:

doi www.aclweb.org
