Toward Selectivity-Based Keyword Extraction for Croatian News (CROSBI ID 616070)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Beliga, Slobodan ; Meštrović, Ana ; Martinčić- Ipšić, Sanda
engleski
Toward Selectivity-Based Keyword Extraction for Croatian News
Our approach proposes a novel network measure - the node selectivity for the task of keyword extraction. The node selectivity is de- ned as the average strength of the node. Firstly, we show that selectivity- based keyword extraction slightly outperforms the extraction based on the standard centrality measures: in-degree, out- degree, betweenness, and closeness. Furthermore, from the data set of Croatian news we ex- tract keyword candidates and expand extracted nodes to word-tuples ranked with the highest in/out selectivity values. The obtained sets are evaluated on manually annotated keywords: for the set of extracted key- word candidates the average F1 score is 24.63%, and the average F2 score is 21.19% ; for the exacted word-tuples candidates the average F1 score is 25.9% and the average F2 score is 24.47%. Selectivity-based ex- traction does not require linguistic knowledge while it is purely derived from statistical and structural information of the network.
keyword extraction ; complex network ; centrality measures ; selectivity ; Croatian news texts
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
1-14.
2014.
objavljeno
Podaci o matičnoj publikaciji
Surfacing the Deep and the Social Web (SDSW 2014)
Rupino da Cunha, Paulo ; Nguyen, Ngoc Thanh ; Boucelma, Omar ; Cautis, Bogdan ; Velegrakis, Yannis
Lahti: CEUR Proc. vol. 1310
1613-0073
Podaci o skupu
Surfacing the Deep and the Social Web
predavanje
01.01.2014-01.01.2014
Italija