Bibliographic record number: 386760
Addition of Documents' Representations in the Latent Semantic Space
Addition of Documents' Representations in the Latent Semantic Space // Proceedings of International Conference Applied Statistics 2008 / Lusa, Lara ; Stare, Janez (eds.).
Ljubljana: Statistical Society of Slovenia, 2008. p. 67 (lecture, international peer review, abstract, other)
CROSBI ID: 386760
Title
Addition of Documents' Representations in the Latent Semantic Space
Authors
Dobša, Jasminka
Type, subtype and category of work
Conference abstracts, abstract, other
Source
Proceedings of International Conference Applied Statistics 2008
/ Lusa, Lara ; Stare, Janez - Ljubljana : Statistical Society of Slovenia, 2008, 67-67
Conference
International Conference Applied Statistics 2008
Place and date
Ribno, Slovenia, 21-24 September 2008
Type of participation
Lecture
Type of review
International peer review
Keywords
information retrieval; latent semantic indexing; addition of documents
Abstract
Latent semantic indexing (LSI) is the most popular method for reducing the dimensionality of the original representation of textual documents in the vector space model. Document collections are very often dynamic, because new documents are constantly being added. The vectors onto which documents are projected during dimensionality reduction are constructed from the representations of all documents in the collection, so computing representations of new documents in the reduced space requires recomputing the singular value decomposition. To overcome this problem, Berry and coworkers (1995) suggested approximating the representations of added documents by projecting them onto the existing left singular vectors. They also proposed a method for the approximate representation of added index terms. Here, a modification of the approximate representations of terms and documents is proposed that combines these two methods. It is shown that representing documents by an extended list of index terms does not significantly improve information retrieval performance.
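The approximation described in the abstract is commonly called folding-in. The following is a minimal sketch of that idea, not the modified method proposed in the paper. It assumes a term-by-document matrix with terms in rows and scales the projections by the inverse singular values, which is one common convention. The function names and the toy matrix are hypothetical and chosen only for illustration.

```python
import numpy as np


def lsi_fit(A, k):
    """Rank-k truncated SVD of a term-by-document matrix A (terms in rows)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    # Columns of U_k: left singular vectors; rows of V_k: existing documents.
    return U[:, :k], s[:k], Vt[:k, :].T


def fold_in_document(d, U_k, s_k):
    """Approximate k-dimensional coordinates of a new document vector d
    (one weight per existing term), without recomputing the SVD."""
    return (U_k.T @ d) / s_k


def fold_in_term(t, V_k, s_k):
    """Approximate k-dimensional coordinates of a new term vector t
    (one weight per existing document), without recomputing the SVD."""
    return (V_k.T @ t) / s_k


# Toy collection (illustrative values only): 4 terms x 3 documents.
A = np.array([
    [1.0, 0.0, 1.0],
    [0.0, 1.0, 1.0],
    [1.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])
U_k, s_k, V_k = lsi_fit(A, k=2)

# A new document expressed over the existing 4 index terms.
d_new = np.array([1.0, 0.0, 1.0, 0.0])
d_hat = fold_in_document(d_new, U_k, s_k)

# A new index term expressed over the existing 3 documents.
t_new = np.array([0.0, 1.0, 1.0])
t_hat = fold_in_term(t_new, V_k, s_k)
```

Under these assumptions, folding in a document that is already in the collection reproduces its existing row of `V_k`, which is a convenient sanity check. Folded-in vectors do not update the singular vectors themselves, so the approximation can degrade as many new documents or terms are added.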
Original language
English
Scientific fields
Computing, Information and communication sciences
RELATED TO THIS WORK
Projects:
016-0161741-1739 - Development of Information Infrastructure and Deductive Mechanisms of the Semantic Web (Čubrilo, Mirko, MZOS)
016-0361935-1728 - Semantic Modeling of Multi-Agent Systems (Maleković, Mirko, MZOS)
Institutions:
Faculty of Organization and Informatics, Varaždin
Profiles:
Jasminka Dobša
(author)