Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 302118

Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval


Skobeltsyn, Gleb; Luu, Toan; Podnar Žarko, Ivana; Rajman, Martin; Aberer, Karl
Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval // Infoscale: the Second International Conference on Scalable Information Systems
New York (NY): The Association for Computing Machinery (ACM), 2007. (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 302118 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval

Autori
Skobeltsyn, Gleb ; Luu, Toan ; Podnar Žarko, Ivana ; Rajman, Martin ; Aberer, Karl

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Infoscale: the Second International Conference on Scalable Information Systems / - New York (NY) : The Association for Computing Machinery (ACM), 2007

Skup
Infoscale: the Second International Conference on Scalable Information Systems

Mjesto i datum
Suzhou, Kina, 06.06.2007. - 08.06.2007

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
P2P; DHT; IR; Query-Driven Indexing; Scalability

Sažetak
We present a query-driven algorithm for the distributed indexing of large document collections within structured P2P networks. To cope with bandwidth consumption that has been identified as the major problem for the standard P2P approach with single term indexing, we leverage a distributed index that stores up to top-k document references only for carefully chosen indexing term combinations. In addition, since the number of possible term combinations extracted from a document collection can be very large, we propose to use query statistics to index only such combinations that are indeed frequently requested by the users. Thus, by avoiding the maintenance of superfluous indexing information, we achieve a substantial reduction in bandwidth and storage. A specific activation mechanism is applied to continuously update the indexing information according to changes in the query distribution, resulting in an efficient, constantly evolving query-driven indexing structure. We show that the size of the index and the generated indexing/retrieval traffic remains manageable even for web-size document collections at the price of a marginal loss in precision for rare queries. Our theoretical analysis and experimental results provide convincing evidence about the feasibility of the query-driven indexing strategy for large scale P2P text retrieval. Moreover, our experiments confirm that the retrieval performance is only slightly lower than the one obtained with state-of-the-art centralized query engines.

Izvorni jezik
Engleski

Znanstvena područja
Elektrotehnika, Računarstvo



POVEZANOST RADA


Projekti:
036-0362027-1639 - Isporuka sadržaja i pokretljivost korisnika i usluga u mrežama nove generacije (Matijašević, Maja, MZO ) ( CroRIS)

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Avatar Url Ivana Podnar Žarko (autor)


Citiraj ovu publikaciju:

Skobeltsyn, Gleb; Luu, Toan; Podnar Žarko, Ivana; Rajman, Martin; Aberer, Karl
Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval // Infoscale: the Second International Conference on Scalable Information Systems
New York (NY): The Association for Computing Machinery (ACM), 2007. (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Skobeltsyn, G., Luu, T., Podnar Žarko, I., Rajman, M. & Aberer, K. (2007) Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval. U: Infoscale: the Second International Conference on Scalable Information Systems.
@article{article, author = {Skobeltsyn, Gleb and Luu, Toan and Podnar \v{Z}arko, Ivana and Rajman, Martin and Aberer, Karl}, year = {2007}, keywords = {P2P, DHT, IR, Query-Driven Indexing, Scalability}, title = {Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval}, keyword = {P2P, DHT, IR, Query-Driven Indexing, Scalability}, publisher = {The Association for Computing Machinery (ACM)}, publisherplace = {Suzhou, Kina} }
@article{article, author = {Skobeltsyn, Gleb and Luu, Toan and Podnar \v{Z}arko, Ivana and Rajman, Martin and Aberer, Karl}, year = {2007}, keywords = {P2P, DHT, IR, Query-Driven Indexing, Scalability}, title = {Query-Driven Indexing for Scalable Peer-to-Peer Text Retrieval}, keyword = {P2P, DHT, IR, Query-Driven Indexing, Scalability}, publisher = {The Association for Computing Machinery (ACM)}, publisherplace = {Suzhou, Kina} }




Contrast
Increase Font
Decrease Font
Dyslexic Font