Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1209164

SFQ: Constructing and Querying a Succinct Representation of FASTQ Files


Bakarić, Robert; Korenčić, Damir; Hršak, Dalibor; Ristov, Strahil
SFQ: Constructing and Querying a Succinct Representation of FASTQ Files // Electronics, 11 (2022), 11; 1783, 12 doi:10.3390/electronics11111783 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1209164 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
SFQ: Constructing and Querying a Succinct Representation of FASTQ Files

Autori
Bakarić, Robert ; Korenčić, Damir ; Hršak, Dalibor ; Ristov, Strahil

Izvornik
Electronics (2079-9292) 11 (2022), 11; 1783, 12

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
bioinformatics ; FASTQ data compression ; random access

Sažetak
A large and ever increasing quantity of high throughput sequencing (HTS) data is stored in FASTQ files. Various methods for data compression are used to mitigate the storage and transmission costs, from the still prevalent general purpose Gzip to state-of-the-art specialized methods. However, all of the existing methods for FASTQ file compression require the decompression stage before the HTS data can be used. This is particularly costly with the random access to specific records in FASTQ files. We propose the sFASTQ format, a succinct representation of FASTQ files that can be used without decompression (i.e., the records can be retrieved and listed online), and that supports random access to individual records. The sFASTQ format can be searched on the disk, which eliminates the need for any additional memory resources. The searchable sFASTQ archive is of comparable size to the corresponding Gzip file. sFASTQ format outputs (interleaved) FASTQ records to the STDOUT stream. We provide SFQ, a software for the construction and usage of the sFASTQ format that supports variable length reads, pairing of records, and both lossless and lossy compression of quality scores.

Izvorni jezik
Engleski

Znanstvena područja
Biologija, Računarstvo, Temeljne tehničke znanosti



POVEZANOST RADA


Projekti:
HRZZ-IP-2018-01-7317 - Napredni deterministički i hibridni algoritmi na nizovima, sljedovima i stablima s primjenama u tehničkim znanostima i znanostima o životu (ALGSEQ18) (Ristov, Strahil, HRZZ - 2018-01) ( CroRIS)
IP-2018-01-8708 - Primjena NGS metoda u procjeni genomske varijabilnosti preživača (ANAGRAMS) (Čubrić Čurik, Vlatka, HRZZ - 2018-01) ( CroRIS)

Ustanove:
Institut "Ruđer Bošković", Zagreb

Profili:

Avatar Url Robert Bakarić (autor)

Avatar Url Damir Korenčić (autor)

Avatar Url Dalibor Hršak (autor)

Avatar Url Strahil Ristov (autor)

Poveznice na cjeloviti tekst rada:

doi www.mdpi.com doi.org fulir.irb.hr

Citiraj ovu publikaciju:

Bakarić, Robert; Korenčić, Damir; Hršak, Dalibor; Ristov, Strahil
SFQ: Constructing and Querying a Succinct Representation of FASTQ Files // Electronics, 11 (2022), 11; 1783, 12 doi:10.3390/electronics11111783 (međunarodna recenzija, članak, znanstveni)
Bakarić, R., Korenčić, D., Hršak, D. & Ristov, S. (2022) SFQ: Constructing and Querying a Succinct Representation of FASTQ Files. Electronics, 11 (11), 1783, 12 doi:10.3390/electronics11111783.
@article{article, author = {Bakari\'{c}, Robert and Koren\v{c}i\'{c}, Damir and Hr\v{s}ak, Dalibor and Ristov, Strahil}, year = {2022}, pages = {12}, DOI = {10.3390/electronics11111783}, chapter = {1783}, keywords = {bioinformatics, FASTQ data compression, random access}, journal = {Electronics}, doi = {10.3390/electronics11111783}, volume = {11}, number = {11}, issn = {2079-9292}, title = {SFQ: Constructing and Querying a Succinct Representation of FASTQ Files}, keyword = {bioinformatics, FASTQ data compression, random access}, chapternumber = {1783} }
@article{article, author = {Bakari\'{c}, Robert and Koren\v{c}i\'{c}, Damir and Hr\v{s}ak, Dalibor and Ristov, Strahil}, year = {2022}, pages = {12}, DOI = {10.3390/electronics11111783}, chapter = {1783}, keywords = {bioinformatics, FASTQ data compression, random access}, journal = {Electronics}, doi = {10.3390/electronics11111783}, volume = {11}, number = {11}, issn = {2079-9292}, title = {SFQ: Constructing and Querying a Succinct Representation of FASTQ Files}, keyword = {bioinformatics, FASTQ data compression, random access}, chapternumber = {1783} }

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • Social Science Citation Index (SSCI)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font