Pregled bibliografske jedinice broj: 765392
LaNCoA: A Python Toolkit for Language Networks Construction and Analysis
LaNCoA: A Python Toolkit for Language Networks Construction and Analysis // Proceedings MIPRO junior - Student Papers / Biljanović, Petar (ur.).
Opatija: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2015. str. 1961-1966 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 765392 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
LaNCoA: A Python Toolkit for Language Networks Construction and Analysis
Autori
Margan, Domagoj ; meštrović, Ana
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings MIPRO junior - Student Papers
/ Biljanović, Petar - Opatija : Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2015, 1961-1966
ISBN
978-953-233-083-0
Skup
IEEE 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO 2015)
Mjesto i datum
Opatija, Hrvatska, 25.05.2015. - 29.05.2015
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
complex networks; language networks; NetworkX
Sažetak
In this paper we describe LaNCoA, Language Networks Construction and Analysis toolkit implemented in Python. The toolkit provides various procedures for network construction from the text: on the word-level (co-occurrence networks, syntactic networks, shuffled networks), and on the subword level (syllable networks, grapheme networks). Furthermore, we implement functions for the language networks analysis on the global and local level. The toolkit is organized in several modules that enable various aspects of language analysis: analysis of global network measures for different co-occurrence window, comparison of networks based on original and shuffled texts, comparison of networks constructed on different language levels, etc. Text manipulation methods, like corpora cleaning, lemmatization and stopwords removal, are also implemented. For the basic network representation we use available NetworkX functions and methods. However, language network analysis is specific and it requires implementation of additional functions and methods. That was the main motivation for this research.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti
POVEZANOST RADA
Projekti:
UniRi - 13.13.2.2.07
Ustanove:
Fakultet informatike i digitalnih tehnologija, Rijeka
Profili:
Ana Meštrović
(autor)