Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Unsupervised Learning of Sequencing Read Types (CROSBI ID 656857)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Tomljanović, Jan ; Šebrek, Tomislav ; Šikić, Mile Unsupervised Learning of Sequencing Read Types // Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics. 2017. str. 12-17 doi: 10.1145/3155077.3155080

Podaci o odgovornosti

Tomljanović, Jan ; Šebrek, Tomislav ; Šikić, Mile

engleski

Unsupervised Learning of Sequencing Read Types

In this work, we present a novel method for improvement of de novo genome assembly which is based on detection of chimeric and repeat reads. Using this information, we can facilitate the detection of unique sequences which results in more contiguous final sequences. We showed that read types can be separated by transforming a coverage graph for each read into 1D signal. We found that signals for repeat and chimeric reads differ significantly from signals for regular reads. Because manual determination of correct read types is a tedious and time-consuming job, we chose unsupervised learning. For feature extraction, we applied and compared variational and denoising autoencoders. Clustering was performed by K-means algorithm. We tested the method on four bacterial genomes sequenced by Pacific Biosciences devices. The achieved results show that using labelled read types can significant improve the contiguity of the assembled final sequence.

Deep-learning ; unsupervised learning ; De novo assembly ; Chimeric read ; Repeat read.

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

12-17.

2017.

objavljeno

10.1145/3155077.3155080

Podaci o matičnoj publikaciji

Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics

978-1-4503-5322-9

Podaci o skupu

ICCBB 2017

predavanje

18.10.2017-20.10.2017

Newark (DE), Sjedinjene Američke Države

Povezanost rada

Biologija, Računarstvo

Poveznice