
Bibliographic record number: 199890

Enhanced Thesaurus Terms Extraction for Document Indexing


Šarić, Frane; Šnajder, Jan; Dalbelo Bašić, Bojana; Eklić, Hrvoje
Enhanced Thesaurus Terms Extraction for Document Indexing // Proceedings of the 27th International Conference on Information Technology Interfaces : ITI 2005 / Lužar - Stiffler, Vesna ; Hljuz Dobrić, Vesna (eds.).
Zagreb: SRCE University Computing Centre, University of Zagreb, 2005. pp. 227-232 (lecture, international peer review, full paper (in extenso), scientific)


CROSBI ID: 199890

Title
Enhanced Thesaurus Terms Extraction for Document Indexing

Authors
Šarić, Frane ; Šnajder, Jan ; Dalbelo Bašić, Bojana ; Eklić, Hrvoje

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Proceedings of the 27th International Conference on Information Technology Interfaces : ITI 2005 / Lužar - Stiffler, Vesna ; Hljuz Dobrić, Vesna - Zagreb : SRCE University Computing Centre, University of Zagreb, 2005, 227-232

Conference
International Conference on Information Technology Interfaces (27 ; 2005)

Place and date
Cavtat, Croatia, 20-23 June 2005

Type of participation
Lecture

Type of review
International peer review

Keywords
Information retrieval; term extraction; NLP; lemmatisation; Eurovoc

Abstract
In this paper we present an enhanced method for thesaurus term extraction, regarded as the main support for a semi-automatic indexing system. The enhancement is achieved by neutralising the effect of language morphology through lemmatisation of both the text and the thesaurus, and by implementing an efficient recursive algorithm for term extraction. A formal definition and a statistical evaluation of the experimental results of the proposed method for thesaurus term extraction are given. The need for disambiguation methods and the effect of lemmatisation in the realm of thesaurus term extraction are discussed.
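The abstract names two ingredients: lemmatising both the text and the thesaurus, and a recursive term-matching step. The record does not give the authors' algorithm, so the following is only a minimal sketch of the general idea under stated assumptions: the lemma table `LEMMAS`, the `Node` trie, and the functions `lemmatise`, `build_trie`, `match_from` and `extract_terms` are all hypothetical names introduced here for illustration, and a real system would use a proper morphological lexicon rather than a toy dictionary.

```python
import re
from typing import Dict, List, Optional, Tuple

# Hypothetical toy lemma table (assumption); stands in for a real lemmatiser.
LEMMAS: Dict[str, str] = {"policies": "policy", "markets": "market", "prices": "price"}


class Node:
    """Trie node keyed by lemma; `term` holds the thesaurus term ending here."""

    def __init__(self) -> None:
        self.children: Dict[str, "Node"] = {}
        self.term: Optional[str] = None


def lemmatise(text: str) -> List[str]:
    """Split text into lower-case word tokens and map each to its lemma."""
    return [LEMMAS.get(tok, tok) for tok in re.findall(r"\w+", text.lower())]


def build_trie(terms: List[str]) -> Node:
    """Index thesaurus terms by their lemmatised token sequences."""
    root = Node()
    for term in terms:
        lemmas = lemmatise(term)
        if not lemmas:
            continue  # skip empty terms
        node = root
        for lemma in lemmas:
            node = node.children.setdefault(lemma, Node())
        node.term = term
    return root


def match_from(node: Node, lemmas: List[str], pos: int) -> List[Tuple[int, str]]:
    """Recursively follow the trie from `pos`, collecting (end, term) pairs."""
    found: List[Tuple[int, str]] = []
    if node.term is not None:
        found.append((pos, node.term))
    if pos < len(lemmas) and lemmas[pos] in node.children:
        found.extend(match_from(node.children[lemmas[pos]], lemmas, pos + 1))
    return found


def extract_terms(text: str, trie: Node) -> List[Tuple[int, int, str]]:
    """Return (start, end, term) for every thesaurus term found in the text."""
    lemmas = lemmatise(text)
    return [
        (start, end, term)
        for start in range(len(lemmas))
        for end, term in match_from(trie, lemmas, start)
    ]
```

Because both sides are lemmatised before matching, the inflected phrase "market prices" in a document would match a thesaurus entry "market price" in this sketch. Overlapping matches (for example "market price" and "price") are all returned, which is where a disambiguation step of the kind the abstract mentions would come in.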

Original language
English

Scientific fields
Computing



AFFILIATIONS OF THE WORK


Institutions:
Faculty of Electrical Engineering and Computing, Zagreb

Profiles:

Jan Šnajder (author)

Bojana Dalbelo-Bašić (author)

Frane Šarić (author)

Cite this publication

Šarić, F., Šnajder, J., Dalbelo Bašić, B. & Eklić, H. (2005) Enhanced Thesaurus Terms Extraction for Document Indexing. In: Lužar - Stiffler, V. & Hljuz Dobrić, V. (eds.) Proceedings of the 27th International Conference on Information Technology Interfaces : ITI 2005.
@article{article, year = {2005}, pages = {227-232}, keywords = {Information retrieval, term extraction, NLP, lemmatisation, Eurovoc}, title = {Enhanced Thesaurus Terms Extraction for Document Indexing}, keyword = {Information retrieval, term extraction, NLP, lemmatisation, Eurovoc}, publisher = {SRCE University Computing Centre, University of Zagreb}, publisherplace = {Cavtat, Hrvatska} }



