Pregled bibliografske jedinice broj: 1268907
Sažimanje genoma korištenjem referentnog genoma
Sažimanje genoma korištenjem referentnog genoma, 2020., diplomski rad, preddiplomski, Fakultet elektrotehnike i računarstva, Zagreb
CROSBI ID: 1268907 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Sažimanje genoma korištenjem referentnog genoma
(Referential genome compression)
Autori
Čeple, Kristijan
Vrsta, podvrsta i kategorija rada
Ocjenski radovi, diplomski rad, preddiplomski
Fakultet
Fakultet elektrotehnike i računarstva
Mjesto
Zagreb
Datum
07.07
Godina
2020
Stranica
63
Mentor
Domazet-Lošo, Mirjana
Ključne riječi
DNA ; kompresija ; dekompresija ; genom ; bioinformatika
(DNA ; compression ; decompression ; genome ; bioinformatics)
Sažetak
HiRGC is a 2017 genome compression algorithm, which is explored in this thesis. Before compressing the DNA sequences as raw textual data, one can first pursue certain DNA qualities to enhance the compression before-hand the traditional text compression methods. One such way is to process and store the sequences as a list of similarities and differences between a reference sequence, and 1(or more!) target sequences. Human DNA is mutually (between 2 units) 99.9% similar, and this algorithm takes advantage of that. After the DNA is processed in such a manner, it can then be converted using traditional text compression methods. This produces outstanding results – such as reducing a ~3GB human genome into a 200-300MB file.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb
Profili:
Mirjana Domazet Lošo
(mentor)