Napredna pretraga

Pregled bibliografske jedinice broj: 655168

Initial Comparison of Linguistic Networks Measures for Parallel Texts


Ban, Kristina; Meštrović, Ana; Martinčić-Ipšić, Sanda
Initial Comparison of Linguistic Networks Measures for Parallel Texts // 5th International Conference on Information Technologies and Information Society -ITIS 2013 / Levnajić, Zoran (ur.).
Novo mesto, Slovenija: Faculty of Information Studies in Novo mesto, 2013. str. 97-104 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


Naslov
Initial Comparison of Linguistic Networks Measures for Parallel Texts

Autori
Ban, Kristina ; Meštrović, Ana ; Martinčić-Ipšić, Sanda

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
5th International Conference on Information Technologies and Information Society -ITIS 2013 / Levnajić, Zoran - Novo mesto, Slovenija : Faculty of Information Studies in Novo mesto, 2013, 97-104

Skup
5th International Conference on Information Technologies and Information Society -ITIS 2013

Mjesto i datum
Novo mesto, Slovenija, 7-9.11.2013

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
Linguistic networks; co-occurrence networks; small-world; parallel texts

Sažetak
In this paper we compared the properties of linguistic networks for Croatian, English and Italian languages. We constructed co- occurrence networks from parallel text corpora, consisting of the translations of five books in the three languages. We generated an Erdös-Rényi random graph with the same number of nodes and links, which enabled the comparison with linguistic co- occurrence networks, showing small- world properties. Furthermore, the comparison of Croatian, English and Italian linguistic networks showed that, besides expected commonalities of networks, there are also certain differences. The networks’ measures across the three studied languages differ particularly in the shortest path length. The results indicate that size of the corpus and anomalies in text affect the network structure.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove
Sveučilište u Rijeci - Odjel za informatiku