Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 507889

Comparing measures of semantic similarity


Ljubešić, Nikola; Boras, Damir; Bakarić, Nikola; Njavro, Jasmina
Comparing measures of semantic similarity // Proceedings of the 30th International Conference on Information Technology Interfaces / Hljuz Dobrić, Vesna (ur.).
Dubrovnik, Hrvatska: Institute of Electrical and Electronics Engineers (IEEE), 2008. str. 675-682 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 507889 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Comparing measures of semantic similarity

Autori
Ljubešić, Nikola ; Boras, Damir ; Bakarić, Nikola ; Njavro, Jasmina

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the 30th International Conference on Information Technology Interfaces / Hljuz Dobrić, Vesna - : Institute of Electrical and Electronics Engineers (IEEE), 2008, 675-682

ISBN
978-953-7138-12-7

Skup
30th International Conference on Information Technology Interfaces

Mjesto i datum
Dubrovnik, Hrvatska, 23-26.06.2008

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
calculating semantic similarity ; context ; association measures ; similarity measures

Sažetak
The aim of this paper is to compare different methods for automatic extraction of semantic similarity measures from corpora. The semantic similarity measure is proven to be very useful for many tasks in natural language processing like information retrieval, information extraction, machine translation etc. Additionally, one of the main problems in natural language processing is data sparseness since no language sample is large enough to seize all possible language combinations. In our research we experiment with four different measures of association with context and eight different measures of vector similarity. The results show that the Jensen-Shannon divergence and L1 and L2 norm outperform other measures of vector similarity regardless of the measure of association with context used. Maximum likelihood estimate and t-test show better results than other measures of association with context.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Projekti:
MZOS-130-1301679-1380 - Hrvatska rječnička baština i hrvatski europski identitet (Boras, Damir, MZOS ) ( POIROT)

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Nikola Ljubešić (autor)

Avatar Url Nikola Bakaric (autor)

Avatar Url Damir Boras (autor)

Poveznice na cjeloviti tekst rada:

Pristup cjelovitom tekstu rada

Citiraj ovu publikaciju:

Ljubešić, Nikola; Boras, Damir; Bakarić, Nikola; Njavro, Jasmina
Comparing measures of semantic similarity // Proceedings of the 30th International Conference on Information Technology Interfaces / Hljuz Dobrić, Vesna (ur.).
Dubrovnik, Hrvatska: Institute of Electrical and Electronics Engineers (IEEE), 2008. str. 675-682 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Ljubešić, N., Boras, D., Bakarić, N. & Njavro, J. (2008) Comparing measures of semantic similarity. U: Hljuz Dobrić, V. (ur.)Proceedings of the 30th International Conference on Information Technology Interfaces.
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Boras, Damir and Bakari\'{c}, Nikola and Njavro, Jasmina}, editor = {Hljuz Dobri\'{c}, V.}, year = {2008}, pages = {675-682}, keywords = {calculating semantic similarity, context, association measures, similarity measures}, isbn = {978-953-7138-12-7}, title = {Comparing measures of semantic similarity}, keyword = {calculating semantic similarity, context, association measures, similarity measures}, publisher = {Institute of Electrical and Electronics Engineers (IEEE)}, publisherplace = {Dubrovnik, Hrvatska} }
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Boras, Damir and Bakari\'{c}, Nikola and Njavro, Jasmina}, editor = {Hljuz Dobri\'{c}, V.}, year = {2008}, pages = {675-682}, keywords = {calculating semantic similarity, context, association measures, similarity measures}, isbn = {978-953-7138-12-7}, title = {Comparing measures of semantic similarity}, keyword = {calculating semantic similarity, context, association measures, similarity measures}, publisher = {Institute of Electrical and Electronics Engineers (IEEE)}, publisherplace = {Dubrovnik, Hrvatska} }




Contrast
Increase Font
Decrease Font
Dyslexic Font