Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1281622

Data Analysis of the Web News Headlines based on Natural Language Processing


Karna, Hrvoje; Braović, Maja; Vicković, Linda; Krstinić, Damir
Data Analysis of the Web News Headlines based on Natural Language Processing // Journal of Communications Software and Systems, 19 (2023), 2; 158-167 doi:10.24138/jcomss-2023-0047 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1281622 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Data Analysis of the Web News Headlines based on Natural Language Processing

Autori
Karna, Hrvoje ; Braović, Maja ; Vicković, Linda ; Krstinić, Damir

Izvornik
Journal of Communications Software and Systems (1845-6421) 19 (2023), 2; 158-167

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
data mining, information extraction, natural language processing, news portals, text analysis

Sažetak
This paper explores the problem of media content data analysis with the focus on the phenomenon of vaccination, closely related to the COVID-19 pandemic. The presented research is an extension of the previous work, but it differs in two main areas. Firstly, the text corpus submitted to the analysis has been considerably increased. Secondly, the previous data analysis was performed on the body part of the posts, while now it is focused on the most prominent part of the news posts, their headlines. This change from body to headline analysis was provoked by significant differences in their characteristics and the fact that most people read only headlines. Described data acquisition uses an advanced content collection approach followed by the modeling process, during which a set of natural language processing algorithms were applied. To enable the comparison, the model uses the same set of algorithms in the modeling phase like in previous work. The main contributions of the work are manifested in: i) approaching the problem from a new perspective, ii) applying more efficient method of data collection, and crucially iii) enabling the comparison of analysis results for individual parts of the content, which ensured a comprehensive insight into the characteristics of news posts.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti



POVEZANOST RADA


Profili:

Avatar Url Maja Braović (autor)

Avatar Url Damir Krstinić (autor)

Avatar Url Linda Vicković (autor)

Avatar Url Hrvoje Karna (autor)

Poveznice na cjeloviti tekst rada:

doi jcoms.fesb.unist.hr

Citiraj ovu publikaciju:

Karna, Hrvoje; Braović, Maja; Vicković, Linda; Krstinić, Damir
Data Analysis of the Web News Headlines based on Natural Language Processing // Journal of Communications Software and Systems, 19 (2023), 2; 158-167 doi:10.24138/jcomss-2023-0047 (međunarodna recenzija, članak, znanstveni)
Karna, H., Braović, M., Vicković, L. & Krstinić, D. (2023) Data Analysis of the Web News Headlines based on Natural Language Processing. Journal of Communications Software and Systems, 19 (2), 158-167 doi:10.24138/jcomss-2023-0047.
@article{article, author = {Karna, Hrvoje and Braovi\'{c}, Maja and Vickovi\'{c}, Linda and Krstini\'{c}, Damir}, year = {2023}, pages = {158-167}, DOI = {10.24138/jcomss-2023-0047}, keywords = {data mining, information extraction, natural language processing, news portals, text analysis}, journal = {Journal of Communications Software and Systems}, doi = {10.24138/jcomss-2023-0047}, volume = {19}, number = {2}, issn = {1845-6421}, title = {Data Analysis of the Web News Headlines based on Natural Language Processing}, keyword = {data mining, information extraction, natural language processing, news portals, text analysis} }
@article{article, author = {Karna, Hrvoje and Braovi\'{c}, Maja and Vickovi\'{c}, Linda and Krstini\'{c}, Damir}, year = {2023}, pages = {158-167}, DOI = {10.24138/jcomss-2023-0047}, keywords = {data mining, information extraction, natural language processing, news portals, text analysis}, journal = {Journal of Communications Software and Systems}, doi = {10.24138/jcomss-2023-0047}, volume = {19}, number = {2}, issn = {1845-6421}, title = {Data Analysis of the Web News Headlines based on Natural Language Processing}, keyword = {data mining, information extraction, natural language processing, news portals, text analysis} }

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • Emerging Sources Citation Index (ESCI)
  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font