De Novo Assembly using Semi-Supervised Read Categorization (CROSBI ID 656855)
Prilog sa skupa u zborniku | prošireni sažetak izlaganja sa skupa | međunarodna recenzija
Podaci o odgovornosti
Šebrek, Tomislav ; Tomljanović, Jan ; Šikić, Mile
engleski
De Novo Assembly using Semi-Supervised Read Categorization
In this paper, we propose a semi-supervised deep learning method for categorization of reads that impede the de novo genome assembly process. In- stead of dealing directly with sequenced reads, we analyze their coverage graphs converted to 1D-signals. We noticed that specific signal pat-terns occur in each relevant class of reads. Semi-supervised approach is chosen because manually labelling the data is a very slow and tedious process, so our goal was to facili- tate the assembly process with as little labeled data as possible. We tested two models to learn patterns in the coverage graphs: M1 + M2 and semi-GAN. We evaluated the performance of each model based on a manually labeled dataset that comprises various reads from multiple reference genomes with respect to the number of labeled examples that were used during the training process. In addition, we embedded our detection in the assembly process which improved the quality of assemblies.
Deep-learning ; Semi-supervised learning ; De novo assembly ; Chimeric read ; Repeat read.
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
73-75.
2017.
objavljeno
Podaci o matičnoj publikaciji
Second International Workshop on Data Science
Lončarić, Sven ; Šmuc, Tomislav
Podaci o skupu
Second International Workshop on Data Science
poster
30.11.2017-30.11.2017
Zagreb, Hrvatska