
Bibliographic record no. 723769

Toward Selectivity-Based Keyword Extraction for Croatian News


Beliga, Slobodan; Meštrović, Ana; Martinčić-Ipšić, Sanda
Toward Selectivity-Based Keyword Extraction for Croatian News // Surfacing the Deep and the Social Web (SDSW 2014) / Rupino da Cunha, Paulo ; Nguyen, Ngoc Thanh ; Boucelma, Omar ; Cautis, Bogdan ; Velegrakis, Yannis (eds.).
Lahti: CEUR Proc. vol. 1310, 2014. pp. 1-14 (lecture, international peer review, full paper (in extenso), scientific)


CROSBI ID: 723769

Title
Toward Selectivity-Based Keyword Extraction for Croatian News

Authors
Beliga, Slobodan ; Meštrović, Ana ; Martinčić-Ipšić, Sanda

Type, subtype and category of the work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Surfacing the Deep and the Social Web (SDSW 2014) / Rupino da Cunha, Paulo ; Nguyen, Ngoc Thanh ; Boucelma, Omar ; Cautis, Bogdan ; Velegrakis, Yannis - Lahti : CEUR Proc. vol. 1310, 2014, 1-14

Conference
Surfacing the Deep and the Social Web

Place and date
Italy, 19.10

Type of participation
Lecture

Review type
International peer review

Keywords
keyword extraction ; complex network ; centrality measures ; selectivity ; Croatian news texts

Abstract
Our approach proposes a novel network measure, the node selectivity, for the task of keyword extraction. The node selectivity is defined as the average strength of the node. Firstly, we show that selectivity-based keyword extraction slightly outperforms the extraction based on the standard centrality measures: in-degree, out-degree, betweenness, and closeness. Furthermore, from the data set of Croatian news we extract keyword candidates and expand extracted nodes to word-tuples ranked with the highest in/out selectivity values. The obtained sets are evaluated on manually annotated keywords: for the set of extracted keyword candidates the average F1 score is 24.63%, and the average F2 score is 21.19%; for the extracted word-tuple candidates the average F1 score is 25.9% and the average F2 score is 24.47%. Selectivity-based extraction does not require linguistic knowledge, as it is derived purely from statistical and structural information of the network.
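
As a rough illustration of the measure named in the abstract, the sketch below computes in- and out-selectivity as node strength divided by node degree on a directed, weighted word network, and scores an extracted set against annotated keywords with the standard F-beta formula. Several things here are assumptions not stated in this record: that edges link directly adjacent words, that edge weights are co-occurrence counts, and that F2 means F-beta with beta = 2. The function names and the toy token list are hypothetical.

```python
from collections import defaultdict


def build_cooccurrence_network(tokens):
    """Directed weighted edges: (u, v) counts how often v directly follows u.

    Assumption: adjacency-based co-occurrence; the record does not say
    how the network is built.
    """
    weights = defaultdict(int)
    for u, v in zip(tokens, tokens[1:]):
        weights[(u, v)] += 1
    return dict(weights)


def selectivity(weights):
    """Return (in_selectivity, out_selectivity): strength / degree per node."""
    in_strength, in_degree = defaultdict(int), defaultdict(int)
    out_strength, out_degree = defaultdict(int), defaultdict(int)
    for (u, v), w in weights.items():
        out_strength[u] += w   # total weight of edges leaving u
        out_degree[u] += 1     # number of distinct successors of u
        in_strength[v] += w    # total weight of edges entering v
        in_degree[v] += 1      # number of distinct predecessors of v
    in_sel = {n: in_strength[n] / in_degree[n] for n in in_degree}
    out_sel = {n: out_strength[n] / out_degree[n] for n in out_degree}
    return in_sel, out_sel


def f_beta(extracted, annotated, beta=1.0):
    """Standard F-beta score between an extracted set and an annotated set."""
    extracted, annotated = set(extracted), set(annotated)
    true_positives = len(extracted & annotated)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(extracted)
    recall = true_positives / len(annotated)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Toy token sequence (hypothetical data, not from the Croatian news set).
tokens = ["a", "b", "a", "b", "a", "c"]
network = build_cooccurrence_network(tokens)
in_sel, out_sel = selectivity(network)

# Rank nodes by in-selectivity, highest first, as keyword candidates.
ranked = sorted(in_sel, key=in_sel.get, reverse=True)
```

In this toy network the node `a` has two outgoing edges with total weight 3, so its out-selectivity is 1.5, while `b` has a single outgoing edge of weight 2 and therefore scores 2.0. A node that repeatedly co-occurs with few distinct neighbours ranks above one that spreads the same strength over many.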

Original language
English

Scientific fields
Computing, Information and communication sciences



WORK AFFILIATIONS


Projects:
Uniri-LangNet

Institutions:
Faculty of Informatics and Digital Technologies, Rijeka

Cite this publication:

Beliga, Slobodan; Meštrović, Ana; Martinčić-Ipšić, Sanda
Toward Selectivity-Based Keyword Extraction for Croatian News // Surfacing the Deep and the Social Web (SDSW 2014) / Rupino da Cunha, Paulo ; Nguyen, Ngoc Thanh ; Boucelma, Omar ; Cautis, Bogdan ; Velegrakis, Yannis (eds.).
Lahti: CEUR Proc. vol. 1310, 2014. pp. 1-14 (lecture, international peer review, full paper (in extenso), scientific)
Beliga, S., Meštrović, A. & Martinčić-Ipšić, S. (2014) Toward Selectivity-Based Keyword Extraction for Croatian News. In: Rupino da Cunha, P., Nguyen, N., Boucelma, O., Cautis, B. & Velegrakis, Y. (eds.) Surfacing the Deep and the Social Web (SDSW 2014).
@article{article,
  author = {Beliga, Slobodan and Me\v{s}trovi\'{c}, Ana and Martin\v{c}i\'{c}-Ip\v{s}i\'{c}, Sanda},
  year = {2014},
  pages = {1-14},
  keywords = {keyword extraction, complex network, centrality measures, selectivity, Croatian news texts},
  title = {Toward Selectivity-Based Keyword Extraction for Croatian News},
  keyword = {keyword extraction, complex network, centrality measures, selectivity, Croatian news texts},
  publisher = {CEUR Proc. vol. 1310},
  publisherplace = {Italy}
}



