Comparing measures of semantic similarity

Ljubešić, Nikola; Boras, Damir; Bakarić, Nikola; Njavro, Jasmina

izvor podataka: crosbi ✓

Comparing measures of semantic similarity (CROSBI ID 571694)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Ljubešić, Nikola ; Boras, Damir ; Bakarić, Nikola ; Njavro, Jasmina Comparing measures of semantic similarity // ITI ... / Hljuz Dobrić, Vesna (ur.). 2008. str. 675-682

Podaci o odgovornosti

Autori

Ljubešić, Nikola ; Boras, Damir ; Bakarić, Nikola ; Njavro, Jasmina

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

Comparing measures of semantic similarity

Sažetak

The aim of this paper is to compare different methods for automatic extraction of semantic similarity measures from corpora. The semantic similarity measure is proven to be very useful for many tasks in natural language processing like information retrieval, information extraction, machine translation etc. Additionally, one of the main problems in natural language processing is data sparseness since no language sample is large enough to seize all possible language combinations. In our research we experiment with four different measures of association with context and eight different measures of vector similarity. The results show that the Jensen-Shannon divergence and L1 and L2 norm outperform other measures of vector similarity regardless of the measure of association with context used. Maximum likelihood estimate and t-test show better results than other measures of association with context.

Ključne riječi

calculating semantic similarity ; context ; association measures ; similarity measures

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o prilogu

Stranice rada

675-682.

Godina izdavanja

2008.

Status objave rada

objavljeno

Podaci o matičnoj publikaciji

Naslov

Proceedings of the 30th International Conference on Information Technology Interfaces

Urednici

Hljuz Dobrić, Vesna

Izdavač

Institute of Electrical and Electronics Engineers (IEEE)

ISBN

978-953-7138-12-7

ISSN

1330-1012

Podaci o skupu

Skup

30th International Conference on Information Technology Interfaces

Vrsta sudjelovanja

predavanje

Datum održavanja skupa

23.06.2008-26.06.2008

Mjesto održavanja skupa

Dubrovnik, Hrvatska

Povezanost rada

Povezane osobe

Nikola Ljubešić (autor/i)

Damir Boras (autor/i)

Nikola Bakarić (autor/i)

Povezane ustanove

Filozofski fakultet u Zagrebu (130) (autorova ustanova)

Povezani projekti

Hrvatska rječnička baština i hrvatski europski identitet (rezultat rada na projektu)

Područje

Informacijske i komunikacijske znanosti