
Bibliographic record number: 695345

Differentiation between Normal and Shuffled Croatian Texts


Margan, Domagoj; Meštrović, Ana; Martinčić-Ipšić, Sanda
Differentiation between Normal and Shuffled Croatian Texts // Adriatic Conference on Graph Theory and Complexity / Vukičević, Damir (ed.).
Split: PMF Split, 2014. pp. 9-9 (lecture, not peer-reviewed, abstract, other)


Title
Differentiation between Normal and Shuffled Croatian Texts

Authors
Margan, Domagoj ; Meštrović, Ana ; Martinčić-Ipšić, Sanda

Type, subtype and category of work
Conference abstracts, abstract, other

Source
Adriatic Conference on Graph Theory and Complexity / Vukičević, Damir - Split : PMF Split, 2014, 9-9

Conference
2014 Adriatic Conference on Graph Theory and Complexity

Place and date
MedILS, Split, Croatia, 25-27 April 2014

Type of participation
Lecture

Type of review
Not peer-reviewed

Keywords
Complex networks; linguistic co-occurrence networks; shuffled texts; Croatian corpus

Abstract
Properties of normal and shuffled Croatian texts can be examined with complex network measures. Here we present network properties of normal and shuffled Croatian texts for two shuffling principles: shuffling on the sentence level and shuffling on the text level. In both experiments we preserved the vocabulary size and the word and sentence frequency distributions. Additionally, in the first shuffling approach we preserved the sentence structure of the text and the number of words per sentence. The results showed that degree rank distributions exhibit no substantial deviation in shuffled networks, and that strength rank distributions are preserved because the word frequencies are the same. Therefore, the standard approach to studying the structure of linguistic co-occurrence networks showed no clear difference between the topologies of normal and shuffled texts. Finally, we showed that the in- and out-selectivity values from shuffled texts are consistently below the selectivity values calculated from normal texts. Our results corroborate that the node selectivity measure can capture structural differences between original and shuffled Croatian texts.
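
The two shuffling principles and the selectivity measure named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It rests on several assumptions the abstract does not state: links are directed and join adjacent words within a sentence, link weight is the co-occurrence count, text-level shuffling treats the whole text as one word sequence, and node selectivity is taken as strength divided by degree (the usual definition in the co-occurrence network literature). All function names are hypothetical.

```python
import random
from collections import defaultdict


def shuffle_within_sentences(sentences, rng):
    """Sentence-level shuffling: permute words inside each sentence.

    Sentence boundaries and the number of words per sentence are kept.
    """
    shuffled = []
    for sentence in sentences:
        words = list(sentence)
        rng.shuffle(words)
        shuffled.append(words)
    return shuffled


def shuffle_whole_text(sentences, rng):
    """Text-level shuffling: permute all words of the text together.

    Assumption for this sketch: the result is one long word sequence,
    so sentence boundaries are dropped.
    """
    words = [word for sentence in sentences for word in sentence]
    rng.shuffle(words)
    return [words]


def cooccurrence_edges(sentences):
    """Directed, weighted links between adjacent words (assumed window of 1)."""
    weights = defaultdict(int)
    for sentence in sentences:
        for left, right in zip(sentence, sentence[1:]):
            weights[(left, right)] += 1
    return dict(weights)


def selectivity(weights):
    """Return (in_selectivity, out_selectivity), each as strength / degree."""
    in_strength, out_strength = defaultdict(int), defaultdict(int)
    in_degree, out_degree = defaultdict(int), defaultdict(int)
    for (source, target), weight in weights.items():
        out_strength[source] += weight
        out_degree[source] += 1
        in_strength[target] += weight
        in_degree[target] += 1
    in_sel = {node: in_strength[node] / in_degree[node] for node in in_degree}
    out_sel = {node: out_strength[node] / out_degree[node] for node in out_degree}
    return in_sel, out_sel


if __name__ == "__main__":
    # Toy data invented for illustration; not taken from the Croatian corpus.
    toy_text = [["a", "b", "a", "b"], ["a", "c"]]
    rng = random.Random(0)
    for label, variant in [
        ("normal", toy_text),
        ("sentence-level", shuffle_within_sentences(toy_text, rng)),
        ("text-level", shuffle_whole_text(toy_text, rng)),
    ]:
        in_sel, out_sel = selectivity(cooccurrence_edges(variant))
        print(label, in_sel, out_sel)
```

Both shuffling functions leave the vocabulary and the word frequencies unchanged, which matches the abstract's remark that frequency-driven measures are preserved. Selectivity depends on how often the same word pairs recur, so under these assumptions it is the quantity expected to drop once word order is randomized.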

Original language
English

Scientific fields
Computer science, Information and communication sciences



WORK ASSOCIATIONS


Project / topic
UniRi - 13.13.2.2.07

Institutions
University of Rijeka - Department of Informatics