Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Fast construction of space-optimized recursive automaton (CROSBI ID 209681)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Ristov, Strahil ; Korenčić, Damir Fast construction of space-optimized recursive automaton // Software, practice & experience, 45 (2015), 6; 783-799. doi: 10.1002/spe.2261

Podaci o odgovornosti

Ristov, Strahil ; Korenčić, Damir

engleski

Fast construction of space-optimized recursive automaton

Finite-state automata are important components in information retrieval and natural language processing software. A recursive automaton is the most compact representation of the acyclic deterministic finite-state automata. It is based on merging not only the equivalent states but also the identical substructures in an automaton. The LZ trie variant is the state-of-the-art in automata compression regarding space, but the time needed for its construction was, until now, quadratic, which has made it impractical for large inputs. In this paper, we present the first algorithm for LZ trie construction that runs in effectively linear time thereby making it an attractive choice for finite-state automata implementation. We achieve this goal by adding a new functionality to the enhanced suffix array data structure. We present two variants of the construction procedure – an optimal method regarding the final size and a method that sacrifices some compression for low intermediate memory usage. We have made the implementation of our algorithms available in an open source software package LzLex.

finite-state automata; recursive automaton; lexicon implementation; compressed data structures

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

45 (6)

2015.

783-799

objavljeno

0038-0644

10.1002/spe.2261

Povezanost rada

Računarstvo

Poveznice
Indeksiranost