Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1132113

PANDORA Talks: Personality and Demographics on Reddit


Gjurković, Matej; Karan, Mladen; Vukojević, Iva; Bošnjak, Mihaela; Šnajder, Jan
PANDORA Talks: Personality and Demographics on Reddit // Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media / Association for Computational Linguistics (ur.).
Online: Association for Computational Linguistics, 2021. str. 138-152 doi:10.18653/v1/2021.socialnlp-1.12 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 1132113 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
PANDORA Talks: Personality and Demographics on Reddit

Autori
Gjurković, Matej ; Karan, Mladen ; Vukojević, Iva ; Bošnjak, Mihaela ; Šnajder, Jan

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media / Association for Computational Linguistics - Online : Association for Computational Linguistics, 2021, 138-152

Skup
Ninth International Workshop on Natural Language Processing for Social Media

Mjesto i datum
Online, 6-11.6.2021

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
Personality ; Text Analysis ; Natural Language Processing ; Social Network Analysis ; Reddit ; Computational Social Science

Sažetak
Personality and demographics are important variables in social sciences and computational sociolinguistics. However, datasets with both personality and demographic labels are scarce. To address this, we present PANDORA, the first dataset of Reddit comments of 10k users partially labeled with three personality models and demographics (age, gender, and location), including 1.6k users labeled with the well- established Big 5 personality model. We showcase the usefulness of this dataset on three experiments, where we leverage the more readily available data from other personality models to predict the Big 5 traits, analyze gender classification biases arising from psycho- demographic variables, and carry out a confirmatory and exploratory analysis based on psychological theories. Finally, we present benchmark prediction models for all personality and demographic variables.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo, Interdisciplinarne tehničke znanosti, Psihologija



POVEZANOST RADA


Projekti:
HRZZ-IP-2020-02-8671 - Računalni modeli za predviđanje i analizu ličnosti na temelju teksta (psy.txt) (Šnajder, Jan, HRZZ - 2020-02) ( POIROT)

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Avatar Url Jan Šnajder (autor)

Avatar Url Mladen Karan (autor)

Avatar Url Matej Gjurković (autor)

Avatar Url Iva Vukojević (autor)

Citiraj ovu publikaciju:

Gjurković, Matej; Karan, Mladen; Vukojević, Iva; Bošnjak, Mihaela; Šnajder, Jan
PANDORA Talks: Personality and Demographics on Reddit // Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media / Association for Computational Linguistics (ur.).
Online: Association for Computational Linguistics, 2021. str. 138-152 doi:10.18653/v1/2021.socialnlp-1.12 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Gjurković, M., Karan, M., Vukojević, I., Bošnjak, M. & Šnajder, J. (2021) PANDORA Talks: Personality and Demographics on Reddit. U: Association for Computational Linguistics (ur.)Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media doi:10.18653/v1/2021.socialnlp-1.12.
@article{article, author = {Gjurkovi\'{c}, Matej and Karan, Mladen and Vukojevi\'{c}, Iva and Bo\v{s}njak, Mihaela and \v{S}najder, Jan}, year = {2021}, pages = {138-152}, DOI = {10.18653/v1/2021.socialnlp-1.12}, keywords = {Personality, Text Analysis, Natural Language Processing, Social Network Analysis, Reddit, Computational Social Science}, doi = {10.18653/v1/2021.socialnlp-1.12}, title = {PANDORA Talks: Personality and Demographics on Reddit}, keyword = {Personality, Text Analysis, Natural Language Processing, Social Network Analysis, Reddit, Computational Social Science}, publisher = {Association for Computational Linguistics}, publisherplace = {Online} }
@article{article, author = {Gjurkovi\'{c}, Matej and Karan, Mladen and Vukojevi\'{c}, Iva and Bo\v{s}njak, Mihaela and \v{S}najder, Jan}, year = {2021}, pages = {138-152}, DOI = {10.18653/v1/2021.socialnlp-1.12}, keywords = {Personality, Text Analysis, Natural Language Processing, Social Network Analysis, Reddit, Computational Social Science}, doi = {10.18653/v1/2021.socialnlp-1.12}, title = {PANDORA Talks: Personality and Demographics on Reddit}, keyword = {Personality, Text Analysis, Natural Language Processing, Social Network Analysis, Reddit, Computational Social Science}, publisher = {Association for Computational Linguistics}, publisherplace = {Online} }

Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font