Bibliographic record number: 199890
Enhanced Thesaurus Terms Extraction for Document Indexing
Enhanced Thesaurus Terms Extraction for Document Indexing // Proceedings of the 27th International Conference on Information Technology Interfaces : ITI 2005 / Lužar - Stiffler, Vesna ; Hljuz Dobrić, Vesna (eds.).
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2005. pp. 227-232 (lecture, international peer review, full paper (in extenso), scientific)
CROSBI ID: 199890
Title
Enhanced Thesaurus Terms Extraction for Document Indexing
Authors
Šarić, Frane ; Šnajder, Jan ; Dalbelo Bašić, Bojana ; Eklić, Hrvoje
Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific
Source
Proceedings of the 27th International Conference on Information Technology Interfaces : ITI 2005
/ Lužar - Stiffler, Vesna ; Hljuz Dobrić, Vesna - Zagreb : Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2005, 227-232
Conference
International Conference on Information Technology Interfaces (27 ; 2005)
Place and date
Cavtat, Croatia, 20.06.2005 - 23.06.2005
Type of participation
Lecture
Type of review
International peer review
Keywords
Information retrieval; term extraction; NLP; lemmatisation; Eurovoc
Abstract
In this paper we present an enhanced method for thesaurus term extraction, regarded as the main support for a semi-automatic indexing system. The enhancement is achieved by neutralising the effect of language morphology through lemmatisation of both the text and the thesaurus, and by implementing an efficient recursive algorithm for term extraction. A formal definition and a statistical evaluation of the experimental results of the proposed method for thesaurus term extraction are given. The need for disambiguation methods and the effect of lemmatisation in the realm of thesaurus term extraction are discussed.
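The abstract names two ingredients: lemmatising both the text and the thesaurus, and extracting terms recursively. This record does not describe the paper's actual algorithm, so the following is only a minimal illustrative sketch of how such a combination could look. The toy lemma dictionary `LEMMAS`, the trie layout, and the function names `lemmatise`, `build_trie`, `match_from` and `extract_terms` are all assumptions introduced here, not taken from the paper.

```python
from typing import Dict, List, Tuple

# Hypothetical toy lemma dictionary; a real system would rely on a
# morphological lexicon for the language being indexed.
LEMMAS: Dict[str, str] = {
    "policies": "policy",
    "markets": "market",
}

END = "$term"  # assumed marker key for "a thesaurus term ends here"


def lemmatise(token: str) -> str:
    """Map a token to its lemma; unknown tokens are only lower-cased."""
    t = token.lower()
    return LEMMAS.get(t, t)


def build_trie(terms: List[str]) -> dict:
    """Index thesaurus terms by their lemmatised token sequences."""
    root: dict = {}
    for term in terms:
        node = root
        for tok in term.split():
            node = node.setdefault(lemmatise(tok), {})
        node[END] = term
    return root


def match_from(node: dict, lemmas: List[str], i: int) -> List[Tuple[str, int]]:
    """Recursively follow the trie from position i, collecting (term, end)."""
    found: List[Tuple[str, int]] = []
    if END in node:
        found.append((node[END], i))
    if i < len(lemmas) and lemmas[i] in node:
        found.extend(match_from(node[lemmas[i]], lemmas, i + 1))
    return found


def extract_terms(text: str, trie: dict) -> List[Tuple[str, int, int]]:
    """Return (term, start, end) for every thesaurus term found in the text."""
    lemmas = [lemmatise(tok) for tok in text.split()]
    hits: List[Tuple[str, int, int]] = []
    for start in range(len(lemmas)):
        for term, end in match_from(trie, lemmas, start):
            hits.append((term, start, end))
    return hits


if __name__ == "__main__":
    trie = build_trie(["agricultural policy", "market"])
    print(extract_terms("Agricultural policies shape markets", trie))
```

Because both sides are lemmatised before matching, the inflected form "policies" in the text still matches the thesaurus entry "agricultural policy". The sketch tokenises on whitespace only and performs no disambiguation, which the abstract identifies as a separate concern.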
Original language
English
Scientific fields
Computing
AFFILIATION OF THE WORK
Institutions:
Fakultet elektrotehnike i računarstva, Zagreb