Pregled bibliografske jedinice broj: 655162
Preliminary Report on the Structure of Croatian Linguistic Co-occurrence Networks
Preliminary Report on the Structure of Croatian Linguistic Co-occurrence Networks // 5th International Conference on Information Technologies and Information Society -ITIS 2013 / Levnajić, Zoran (ur.).
Novo Mesto: Fakulteta za informacijske študije v Novem mestu, 2013. str. 89-96 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 655162 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Preliminary Report on the Structure of Croatian Linguistic Co-occurrence Networks
Autori
Margan, Domagoj ; Martinčić-Ipšić, Sanda ; Meštrović, Ana
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
5th International Conference on Information Technologies and Information Society -ITIS 2013
/ Levnajić, Zoran - Novo Mesto : Fakulteta za informacijske študije v Novem mestu, 2013, 89-96
Skup
5th International Conference on Information Technologies and Information Society -ITIS 2013
Mjesto i datum
Novo Mesto, Slovenija, 07.11.2013. - 09.11.2013
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
complex networks; linguistic co-occurrence networks; Croatian corpus; stopwords
Sažetak
In this article, we investigate the structure of Croatian linguistic co-occurrence networks. We examine the change of network structure properties by systematically varying the co-occurrence window sizes, the corpus sizes and removing stopwords. In a cooccurrence window of size n we establish a link between the current word and n − 1 subsequent words. The results point out that the increase of the co-occurrence window size is followed by a decrease in diameter, average path shortening and expectedly condensing the average clustering coefficient. The same can be noticed for the removal of the stopwords. Finally, since the size of texts is reflected in the network properties, our results suggest that the corpus influence can be reduced by increasing the co-occurrence window size.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti
POVEZANOST RADA
Ustanove:
Fakultet informatike i digitalnih tehnologija, Rijeka