Pregled bibliografske jedinice broj: 76126
Using inverted files to compress text
Using inverted files to compress text // Proceedings of the 24th Conference on Information Technology Interfaces / Glavinic, Vlado; Hljuz Dobric, vesna; Šimic, Diana (ur.).
Cavtat: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2002. str. 443-447 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 76126 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Using inverted files to compress text
Autori
Ristov, Strahil
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the 24th Conference on Information Technology Interfaces
/ Glavinic, Vlado; Hljuz Dobric, vesna; Šimic, Diana - Cavtat : Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2002, 443-447
Skup
24th Conference on Information Technology Interfaces
Mjesto i datum
Cavtat, Hrvatska, 24.06.2002. - 27.06.2002
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
text compression; inverted file; index compression; lexicon compression
Sažetak
This is the first report on a new approach to text compression. It consists of representing the text file with compressed inverted file index in conjunction with very compact lexicon, where lexicon includes every word in the text. The index is compressed using standard index compression techniques, and lexicon is compressed with original dictionary compression method that gives better compression results than existing procedures. Compression procedure is complex, but decompression time is linear with the file size, although it requires two passes and hence can not be performed online. First experiments show that this method, when refined, can be competitive for larger texts that only need to be decompressed in the real time.
Izvorni jezik
Engleski
Znanstvena područja
Elektrotehnika
POVEZANOST RADA