Bibliographic record no.: 1114794
Through the Limits of Newspeak: an Analysis of the Vector Representation of Words in George Orwell’s 1984
Through the Limits of Newspeak: an Analysis of the Vector Representation of Words in George Orwell’s 1984 // 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) - proceedings / Skala, Karolj (ed.).
Rijeka: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2019. pp. 583-588 doi:10.23919/MIPRO.2019.8756892 (lecture, international peer review, full paper (in extenso), scientific)
CROSBI ID: 1114794
Title
Through the Limits of Newspeak: an Analysis of the Vector Representation of Words in George Orwell’s 1984
Authors
Dunđer, Ivan ; Pavlovski, Marko
Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific
Source
2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) - proceedings
/ Skala, Karolj - Rijeka : Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2019, 583-588
ISBN
978-953-233-098-4
Conference
42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO 2019)
Place and date
Opatija, Croatia, 20.05.2019 - 24.05.2019
Type of participation
Lecture
Type of peer review
International peer review
Keywords
natural language processing (NLP) ; word vector representation ; word2vec ; word embeddings ; vector space model ; George Orwell’s 1984 ; dystopia ; Newspeak ; machine learning ; information and communication sciences
Abstract
The era of fake news, media manipulation and information wars has been beneficial to the lasting fame and continuous acclaim of George Orwell's 1984. The novel, published in 1949, influences the terminology of various political and social analysts to the present day through its fictional language called “Newspeak”. The question arises: can the inner connections of the concepts present in Orwell's 1984, when the text is analysed on the semantic level of words in their contextual environment, be used to further the understanding of the inner workings of the novel's language itself? More specifically, the aim of this paper is to examine whether a reader without knowledge of the subject of a fictional work of art, exemplified by Orwell's 1984, could gain deeper comprehension of the text just from analysing word vector representations, without the use of external resources and only through an overview of the established similarities on the semantic level of words in a given text. Word vector representations, as a form of word embeddings in a vector space model, are a machine learning technique sometimes applied in natural language processing, which attempts to identify semantically similar words.
Original language
English
Scientific fields
Computer science, Information and communication sciences
RELATED PROJECTS AND INSTITUTIONS
Projects:
NadSve-Sveučilište u Zagrebu-43-922-1011 - Disruptive technologies in the creation of new knowledge: machine learning and data analytics with application models in specialized domains (Seljan, Sanja, NadSve - Call for the allocation of funds for financing basic scientific activity in 2019, awarded to the Faculty of Humanities and Social Sciences, University of Zagreb) (CroRIS)
Institutions:
Filozofski fakultet (Faculty of Humanities and Social Sciences), Zagreb