Napredna pretraga

Pregled bibliografske jedinice broj: 655162

Preliminary Report on the Structure of Croatian Linguistic Co-occurrence Networks


Margan, Domagoj; Martinčić-Ipšić, Sanda; Meštrović, Ana
Preliminary Report on the Structure of Croatian Linguistic Co-occurrence Networks // 5th International Conference on Information Technologies and Information Society -ITIS 2013 / Levnajić, Zoran (ur.).
Novo mesto, Slovenija: Faculty of Information Studies in Novo mesto, 2013. str. 89-96 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


Naslov
Preliminary Report on the Structure of Croatian Linguistic Co-occurrence Networks

Autori
Margan, Domagoj ; Martinčić-Ipšić, Sanda ; Meštrović, Ana

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
5th International Conference on Information Technologies and Information Society -ITIS 2013 / Levnajić, Zoran - Novo mesto, Slovenija : Faculty of Information Studies in Novo mesto, 2013, 89-96

Skup
5th International Conference on Information Technologies and Information Society -ITIS 2013

Mjesto i datum
Novo mesto, Slovenija, 7-9.11.2013

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
Complex networks; linguistic co-occurrence networks; Croatian corpus; stopwords

Sažetak
In this article, we investigate the structure of Croatian linguistic co-occurrence networks. We examine the change of network structure properties by systematically varying the co-occurrence window sizes, the corpus sizes and removing stopwords. In a cooccurrence window of size n we establish a link between the current word and n − 1 subsequent words. The results point out that the increase of the co-occurrence window size is followed by a decrease in diameter, average path shortening and expectedly condensing the average clustering coefficient. The same can be noticed for the removal of the stopwords. Finally, since the size of texts is reflected in the network properties, our results suggest that the corpus influence can be reduced by increasing the co-occurrence window size.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove
Sveučilište u Rijeci - Odjel za informatiku