Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 315216

Web Text Retrieval with a P2P Query-Driven Index


Skobeltsyn, Gleb; Luu, Toan; Podnar Žarko, Ivana; Rajman, Martin; Aberer, Karl
Web Text Retrieval with a P2P Query-Driven Index // Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval / Clarke, Charles L. A. ; Fuhr, Norbert ; Kando, Noriko (ur.).
New York (NY): The Association for Computing Machinery (ACM), 2007. str. 679 - 686 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 315216 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Web Text Retrieval with a P2P Query-Driven Index

Autori
Skobeltsyn, Gleb ; Luu, Toan ; Podnar Žarko, Ivana ; Rajman, Martin ; Aberer, Karl

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval / Clarke, Charles L. A. ; Fuhr, Norbert ; Kando, Noriko - New York (NY) : The Association for Computing Machinery (ACM), 2007, 679 - 686

ISBN
978-1-59593-597-7

Skup
SIGIR '07: 30th annual international ACM SIGIR conference on Research and development in information retrieval

Mjesto i datum
Amsterdam, Nizozemska, 23.07.2007. - 27.07.2007

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
P2P; DHT; Text Retrieval; Query-Driven Indexing; TREC

Sažetak
In this paper, we present a query-driven indexing/retrieval strategy for efficient full text retrieval from large document collections distributed within a structured P2P network. Our indexing strategy is based on two important properties: (1) the generated distributed index stores posting lists for carefully chosen indexing term combinations, and (2) the posting lists containing too many document references are truncated to a bounded number of their top-ranked elements. These two properties guarantee acceptable storage and bandwidth requirements, essentially because the number of indexing term combinations remains scalable and the transmitted posting lists never exceed a constant size. However, as the number of generated term combinations can still become quite large, we also use term statistics extracted from available query logs to index only such combinations that are frequently present in user queries. Thus, by avoiding the generation of superfluous indexing term combinations, we achieve an additional substantial reduction in bandwidth and storage consumption. As a result, the generated distributed index corresponds to a constantly evolving query-driven indexing structure that efficiently follows current information needs of the users. More precisely, our theoretical analysis and experimental results indicate that, at the price of a marginal loss in retrieval quality for rare queries, the generated index size and network tra± c remain manageable even for web-size document collections. Furthermore, our experiments show that at the same time the achieved retrieval quality is fully comparable to the one obtained with a state-of-the-art centralized query engine.

Izvorni jezik
Engleski

Znanstvena područja
Elektrotehnika, Računarstvo



POVEZANOST RADA


Projekti:
036-0362027-1639 - Isporuka sadržaja i pokretljivost korisnika i usluga u mrežama nove generacije (Matijašević, Maja, MZO ) ( CroRIS)

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Avatar Url Ivana Podnar Žarko (autor)


Citiraj ovu publikaciju:

Skobeltsyn, Gleb; Luu, Toan; Podnar Žarko, Ivana; Rajman, Martin; Aberer, Karl
Web Text Retrieval with a P2P Query-Driven Index // Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval / Clarke, Charles L. A. ; Fuhr, Norbert ; Kando, Noriko (ur.).
New York (NY): The Association for Computing Machinery (ACM), 2007. str. 679 - 686 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Skobeltsyn, G., Luu, T., Podnar Žarko, I., Rajman, M. & Aberer, K. (2007) Web Text Retrieval with a P2P Query-Driven Index. U: Clarke, C., Fuhr, N. & Kando, N. (ur.)Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval.
@article{article, author = {Skobeltsyn, Gleb and Luu, Toan and Podnar \v{Z}arko, Ivana and Rajman, Martin and Aberer, Karl}, year = {2007}, pages = {679 - 686}, keywords = {P2P, DHT, Text Retrieval, Query-Driven Indexing, TREC}, isbn = {978-1-59593-597-7}, title = {Web Text Retrieval with a P2P Query-Driven Index}, keyword = {P2P, DHT, Text Retrieval, Query-Driven Indexing, TREC}, publisher = {The Association for Computing Machinery (ACM)}, publisherplace = {Amsterdam, Nizozemska} }
@article{article, author = {Skobeltsyn, Gleb and Luu, Toan and Podnar \v{Z}arko, Ivana and Rajman, Martin and Aberer, Karl}, year = {2007}, pages = {679 - 686}, keywords = {P2P, DHT, Text Retrieval, Query-Driven Indexing, TREC}, isbn = {978-1-59593-597-7}, title = {Web Text Retrieval with a P2P Query-Driven Index}, keyword = {P2P, DHT, Text Retrieval, Query-Driven Indexing, TREC}, publisher = {The Association for Computing Machinery (ACM)}, publisherplace = {Amsterdam, Nizozemska} }




Contrast
Increase Font
Decrease Font
Dyslexic Font