Using inverted files to compress text (CROSBI ID 482817)
Conference paper in proceedings | original scientific paper | international peer review
Responsibility details
Ristov, Strahil
English
Using inverted files to compress text
This is the first report on a new approach to text compression. The approach represents the text file with a compressed inverted file index in conjunction with a very compact lexicon that includes every word in the text. The index is compressed using standard index compression techniques, and the lexicon is compressed with an original dictionary compression method that gives better compression results than existing procedures. The compression procedure is complex, but decompression time is linear in the file size, although decompression requires two passes and hence cannot be performed online. First experiments show that this method, when refined, can be competitive for larger texts that only need to be decompressed in real time.
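The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a lexicon that holds every token (words and the separator runs between them) and postings stored as gaps between successive positions, which is one common index compression step. The function names `build_inverted_file` and `restore_text` are hypothetical, and no bit-level coding of the gaps or of the lexicon is attempted.

```python
import re


def build_inverted_file(text):
    """Map each distinct token to its gap-encoded list of token positions."""
    # Words and separator runs are both kept as tokens so the round trip is lossless.
    tokens = re.findall(r"\w+|\W+", text)
    postings = {}
    for pos, tok in enumerate(tokens):
        postings.setdefault(tok, []).append(pos)
    index = {}
    for tok, positions in postings.items():
        # First entry is the absolute position, the rest are differences (gaps).
        index[tok] = [positions[0]] + [b - a for a, b in zip(positions, positions[1:])]
    return index


def restore_text(index):
    """Rebuild the text from the inverted file in two passes over the postings."""
    # Pass 1: count all occurrences to learn how many token slots are needed.
    total = sum(len(gaps) for gaps in index.values())
    slots = [None] * total
    # Pass 2: undo the gap encoding and drop each token into its positions.
    for tok, gaps in index.items():
        pos = 0
        for gap in gaps:
            pos += gap
            slots[pos] = tok
    return "".join(slots)


sample = "to be or not to be"
index = build_inverted_file(sample)
restored = restore_text(index)
```

In this sketch the output can only be written once every posting list has been placed, which is consistent with the abstract's remark that decompression cannot be performed online. Both passes touch each token occurrence once, so the work grows linearly with the number of tokens.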
text compression; inverted file; index compression; lexicon compression
Contribution details
443-447-x.
2002.
published
Parent publication details
Proceedings of the 24th Conference on Information Technology Interfaces
Glavinic, Vlado; Hljuz Dobric, Vesna; Šimic, Diana
Cavtat: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce)
Conference details
24th Conference on Information Technology Interfaces
lecture
24.06.2002-27.06.2002
Cavtat, Croatia