DNA Nanopore Sequencing Basecaller (CROSBI ID 447850)
Ocjenski rad | diplomski rad
Podaci o odgovornosti
Pavlić, Stanislav
Šikić, Mile
Stanojević, Dominik
engleski
DNA Nanopore Sequencing Basecaller
Nanopore sequencing is one of the state-of-the-art sequencing technologies. It passes a DNA sample through a pore which changes the ionic current in the pore. Due to the size of the pore, there are usually five nucleotides (5-mer) present in the pore influencing the measured signal. Each of the 1024 possible 5-mers produces a different signal, and this information is used for basecalling (converting the raw signal to a sequence of nucleotides). The signal is approximately rectangular because the 5-mer changes one nucleotide at a time, but there is a lot of noise present. The goal of this thesis was to develop a DNA nanopore sequencing basecaller using modern deep learning architectures with self-supervised learning in mind. The architecture is mainly based on transformers. The basecaller was evaluated on publicly available datasets. The solution called AttentionCall was implemented in Python and the PyTorch library. The source code is available on GitHub at github.com/StanislavPavlic/attentioncall.
bioinformatics ; basecalling ; nanopore sequencing ; deep learning ; transformers ; CTC
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o izdanju
47
01.07.2021.
obranjeno
Podaci o ustanovi koja je dodijelila akademski stupanj
Sveučilište u Zagrebu
Zagreb