Data source: CROSBI

DNA Nanopore Sequencing Basecaller (CROSBI ID 447850)

Thesis | master's thesis

Pavlić, Stanislav. DNA Nanopore Sequencing Basecaller / Šikić, Mile (mentor); Stanojević, Dominik (direct supervisor). Zagreb, University of Zagreb, 2021.

Responsibility data

Pavlić, Stanislav

Šikić, Mile

Stanojević, Dominik

English

DNA Nanopore Sequencing Basecaller

Nanopore sequencing is a state-of-the-art sequencing technology. A DNA sample is passed through a pore, and its passage changes the ionic current in the pore. Because of the size of the pore, there are usually five nucleotides (a 5-mer) in the pore at once, and together they influence the measured signal. Each of the 1024 possible 5-mers produces a different signal, and this information is used for basecalling (converting the raw signal into a sequence of nucleotides). The signal is approximately rectangular because the 5-mer changes one nucleotide at a time, but it contains a lot of noise. The goal of this thesis was to develop a DNA nanopore sequencing basecaller using modern deep learning architectures with self-supervised learning in mind. The architecture is mainly based on transformers. The basecaller was evaluated on publicly available datasets. The solution, called AttentionCall, was implemented in Python using the PyTorch library. The source code is available on GitHub at github.com/StanislavPavlic/attentioncall.
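
The 5-mer counting in the abstract can be illustrated with a minimal Python sketch. It is not taken from the AttentionCall source code. The function names `all_kmers` and `kmer_windows` and the example sequence are hypothetical, chosen only for illustration. The sketch enumerates all 5-mers over the four nucleotides and lists the successive 5-mers seen as a strand advances one base at a time.

```python
from itertools import product

NUCLEOTIDES = "ACGT"
K = 5  # number of nucleotides in the pore, as stated in the abstract


def all_kmers(k=K):
    """Return every possible k-mer over the four DNA nucleotides (hypothetical helper)."""
    return ["".join(p) for p in product(NUCLEOTIDES, repeat=k)]


def kmer_windows(sequence, k=K):
    """Return the successive k-mers as the strand advances one base at a time (hypothetical helper)."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


kmers = all_kmers()
windows = kmer_windows("ACGTTGCA")  # made-up example sequence

print(len(kmers))  # 4**5 = 1024
print(windows)     # ['ACGTT', 'CGTTG', 'GTTGC', 'TTGCA']
```

Each window shares four nucleotides with the next one, which is why the idealized signal moves between levels one step at a time.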

bioinformatics ; basecalling ; nanopore sequencing ; deep learning ; transformers ; CTC

Publication data

47

01.07.2021.

defended

Degree-granting institution data

University of Zagreb

Zagreb

Related fields

Computing