A Note on Indexing DNA and Protein Sequences

Ristov, Strahil

izvor podataka: crosbi !

A Note on Indexing DNA and Protein Sequences (CROSBI ID 495206)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Ristov, Strahil A Note on Indexing DNA and Protein Sequences // Proceedings 6th Intl. Multi-Conference Information Society IS 2003, Vol A, Intelligent and Computer Systems / Bohanec, Marko ; Filipič, Bogdan ; Gams, Matjaž (ur.). Ljubljana: Institut Jožef Stefan, 2003. str. 121-126-x

Podaci o odgovornosti

Autori

Ristov, Strahil

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

A Note on Indexing DNA and Protein Sequences

Sažetak

Many applications in computational biology rely on indexing biological sequences. Indexing the sequences greatly reduces the time complexity of a search. However, good index structures, such as suffix trees, require inordinate amounts of space. We describe a work in progress on a new approach to indexing using truncated suffix tree implemented with a LZ compressed trie. The index would require about 4 bytes per symbol for the largest collection of protein sequences (over 450 M amino acids) and about 5 bytes for the largest collection of DNA sequences (over 20 G bases).

Ključne riječi

DNA indexing; protein sequence indexing; suffix trees; LZ trie; sequence matching; truncated suffix tree; suffix sequoia

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o prilogu

Stranice rada

121-126-x.

Godina izdavanja

2003.

Status objave rada

objavljeno

Podaci o matičnoj publikaciji

Naslov

Proceedings 6th Intl. Multi-Conference Information Society IS 2003, Vol A, Intelligent and Computer Systems

Urednici

Bohanec, Marko ; Filipič, Bogdan ; Gams, Matjaž

Izdavač

Ljubljana: Institut Jožef Stefan

Podaci o skupu

Skup

6th International Multi-Conference Information Society IS 2003, Intelligent and Computer Systems

Vrsta sudjelovanja

predavanje

Datum održavanja skupa

13.10.2003-17.10.2003

Mjesto održavanja skupa

Ljubljana, Slovenija

Povezanost rada

Povezane osobe

Strahil Ristov (autor/i)

Povezane ustanove

Institut Ruđer Bošković (098) (autorova ustanova)

Područje

Računarstvo