Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1076536

A Fast Algorithm for the Largest Area First Parsing of Real Strings


Katanić, Ivan; Ristov, Strahil; Rosenzweig, Martin
A Fast Algorithm for the Largest Area First Parsing of Real Strings // IEEE access, 8 (2020), 141990-142002 doi:10.1109/access.2020.3013676 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1076536 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
A Fast Algorithm for the Largest Area First Parsing of Real Strings

Autori
Katanić, Ivan ; Ristov, Strahil ; Rosenzweig, Martin

Izvornik
IEEE access (2169-3536) 8 (2020); 141990-142002

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
greedy grammar compression ; largest area first parsing ; dynamic text indexing ; enhanced suffix array

Sažetak
The largest area first parsing of a string often leads to the best results in grammar compression for a variety of input data. However, the fastest existing algorithm has Θ(N2logN) time complexity, which makes it impractical for real-life applications. We present a new largest area first parsing method that has O(N3) complexity in the improbable worst case but works in the quasilinear time for most practical purposes. This result is based on the fact that in the real data, the sum of all depths of an LCP-interval tree, over all of the positions in a suffix array of an input string, is only larger than the size of the input by a small factor α . We present the analysis of the algorithm in terms of α , and the experimental results confirm that our method is practical even for genome sized inputs. We provide the C ++ 11 code for the implementation of our method. Additionally, we show that by a combination of the previous and new algorithms, the worst-case complexity of the largest area first parsing is improved by a factor of N−−√3 .

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo



POVEZANOST RADA


Projekti:
HRZZ-IP-2018-01-7317 - Napredni deterministički i hibridni algoritmi na nizovima, sljedovima i stablima s primjenama u tehničkim znanostima i znanostima o životu (ALGSEQ18) (Ristov, Strahil, HRZZ - 2018-01) ( CroRIS)
KK.01.1.1.01.0009 - Napredne metode i tehnologije u znanosti o podatcima i kooperativnim sustavima (EK )

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb,
Institut "Ruđer Bošković", Zagreb

Profili:

Avatar Url Strahil Ristov (autor)

Citiraj ovu publikaciju:

Katanić, Ivan; Ristov, Strahil; Rosenzweig, Martin
A Fast Algorithm for the Largest Area First Parsing of Real Strings // IEEE access, 8 (2020), 141990-142002 doi:10.1109/access.2020.3013676 (međunarodna recenzija, članak, znanstveni)
Katanić, I., Ristov, S. & Rosenzweig, M. (2020) A Fast Algorithm for the Largest Area First Parsing of Real Strings. IEEE access, 8, 141990-142002 doi:10.1109/access.2020.3013676.
@article{article, author = {Katani\'{c}, Ivan and Ristov, Strahil and Rosenzweig, Martin}, year = {2020}, pages = {141990-142002}, DOI = {10.1109/access.2020.3013676}, keywords = {greedy grammar compression, largest area first parsing, dynamic text indexing, enhanced suffix array}, journal = {IEEE access}, doi = {10.1109/access.2020.3013676}, volume = {8}, issn = {2169-3536}, title = {A Fast Algorithm for the Largest Area First Parsing of Real Strings}, keyword = {greedy grammar compression, largest area first parsing, dynamic text indexing, enhanced suffix array} }
@article{article, author = {Katani\'{c}, Ivan and Ristov, Strahil and Rosenzweig, Martin}, year = {2020}, pages = {141990-142002}, DOI = {10.1109/access.2020.3013676}, keywords = {greedy grammar compression, largest area first parsing, dynamic text indexing, enhanced suffix array}, journal = {IEEE access}, doi = {10.1109/access.2020.3013676}, volume = {8}, issn = {2169-3536}, title = {A Fast Algorithm for the Largest Area First Parsing of Real Strings}, keyword = {greedy grammar compression, largest area first parsing, dynamic text indexing, enhanced suffix array} }

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font