Overview of bibliographic record number: 1217279
Analysis of the Textual Information Extracted from News Portals
Analysis of the Textual Information Extracted from News Portals // Proceedings of the 30th Conference on Software, Telecommunications and Computer Networks (SoftCOM 2022) / Rožić, Nikola ; Begušić, Dinko (eds.).
Split: Institute of Electrical and Electronics Engineers (IEEE), 2022. pp. 1-6 doi:10.23919/SoftCOM55329.2022.9911444 (lecture, international peer review, full paper (in extenso), scientific)
CROSBI ID: 1217279
Title
Analysis of the Textual Information Extracted from News Portals
Authors
Lovrić, Petra ; Vicković, Linda ; Karna, Hrvoje
Type, subtype and category of the work
Papers in conference proceedings, full paper (in extenso), scientific
Source
Proceedings of the 30th Conference on Software, Telecommunications and Computer Networks (SoftCOM 2022)
/ Rožić, Nikola ; Begušić, Dinko (eds.). - Split : Institute of Electrical and Electronics Engineers (IEEE), 2022, 1-6
Conference
30th International Conference on Software, Telecommunications and Computer Networks (SoftCOM 2022)
Place and date
Split, Croatia, 22.09.2022 - 24.09.2022
Type of participation
Lecture
Type of review
International peer review
Keywords
Digital News, Information Extraction, Natural Language Processing, Text Mining, Web-scraping
Abstract
News portals are the primary means of informing the population in modern society. This paper analyses the characteristics and effects of this mode of communication. The influence was studied in particular using the example of the current global phenomenon of “vaccination” (Croatian: cijepljenje). The research method follows the CRISP-DM process, adapted to the digitalized form of textual data. The analysed corpus, in the form of natural spoken language, was scraped from Croatian news portals. The subsequent processing extracts information from unstructured textual sources and provides valuable insights, such as how strongly a particular topic is represented in an article. Modeling is based on the application of multiple text mining techniques, such as Word Cloud, Topic Modelling, Concordance and Sentiment Analysis. The implemented model produces indicators for objective interpretation of information. The findings suggest that the portals associated the notion of vaccination with the COVID-19 pandemic. Furthermore, the term was often used in a political context. The words used and the predominantly negative character of texts dealing with vaccination have led to the transmission of negative emotions to readers. A significant aspect of the study is that it was conducted on a corpus of texts written in Croatian, a relatively small and morphologically complex language.
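As an illustration of two of the ideas named in the abstract (concordance, and how strongly a topic is represented in an article), the following minimal Python sketch may help. It is not the authors' implementation: the function names, the crude prefix match used in place of proper Croatian lemmatization, and the sample sentence are all assumptions made for this example.

```python
import re

# Made-up sample text (not taken from the studied corpus).
SAMPLE = ("Cijepljenje protiv bolesti počinje sutra. "
          "Vlada preporučuje cijepljenje svim građanima.")

def tokenize(text):
    # Lowercase word tokens; \w is Unicode-aware for str patterns,
    # so Croatian letters such as č, ć, š, ž, đ stay inside tokens.
    return re.findall(r"\w+", text.lower())

def concordance(text, stem, width=2):
    # Keyword-in-context: every token starting with `stem`, together
    # with up to `width` tokens on each side. The prefix match is a
    # rough stand-in for lemmatization (an assumption of this sketch).
    tokens = tokenize(text)
    hits = []
    for i, tok in enumerate(tokens):
        if tok.startswith(stem):
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tok, right))
    return hits

def term_share(text, stem):
    # Fraction of tokens that start with `stem`: one simple indicator
    # of how strongly a topic is represented in an article.
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    return sum(tok.startswith(stem) for tok in tokens) / len(tokens)

for left, keyword, right in concordance(SAMPLE, "cijepljenj"):
    print(f"{left:>20} [{keyword}] {right}")
print(term_share(SAMPLE, "cijepljenj"))
```

A prefix match is used here because Croatian nouns change their endings by case (cijepljenje, cijepljenja, cijepljenju); a real pipeline would more likely rely on a lemmatizer or stemmer for the language.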
Original language
English
Scientific fields
Computing, Information and communication sciences