High Performance Processing for Speech Recognition

Ramljak, Milan; Stella, Maja; Šarić, Matko

Data source: CROSBI

High Performance Processing for Speech Recognition (CROSBI ID 206780)

Journal article | original scientific paper | international peer review

Ramljak, Milan; Stella, Maja; Šarić, Matko. High Performance Processing for Speech Recognition // International journal of circuits, systems and signal processing, 8 (2014), 166-172

Responsibility details

Authors

Ramljak, Milan; Stella, Maja; Šarić, Matko

Basic data in the original language

Language

English

Title

High Performance Processing for Speech Recognition

Abstract

The evolution of computer technology, including operating systems and applications, has led to intelligent machines that can recognize spoken words and determine their meaning. Over the years, the processing time required for speech recognition has decreased significantly, thanks not only to improved algorithms but also to the greater processing power of today's computers. In this paper we analyze the processing time and reconstructed speech quality of three common front-end methods for calculating coefficients: Linear Predictive Coding (LPC), Mel-Frequency Cepstrum (MFC), and Perceptual Linear Prediction (PLP). Reconstructed speech quality is measured with the Perceptual Evaluation of Speech Quality (PESQ) score. Our analysis shows that, if required, a higher number of MFC and PLP coefficients can be used without a significant impact on processing time. Another very important factor for processing time is the choice of back-end. We propose a high-performance neural network back-end implemented on a distributed system based on the Erlang programming language. Erlang processes act as the neurons of the network, and asynchronous message exchange provides the connections between processes, so the Erlang program takes on the structure of a regular neural network. With this kind of neural network implementation we obtained a significant increase in performance.
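
The neuron-as-process idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses Python threads in place of Erlang processes and `queue.Queue` in place of Erlang mailboxes, and the network shape, weights, biases, and the names `Neuron` and `forward` are all hypothetical, chosen only for illustration.

```python
import math
import queue
import threading


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class Neuron(threading.Thread):
    """One neuron as an independent worker with its own mailbox.

    Assumption: a thread stands in for an Erlang process and a
    queue.Queue stands in for that process's mailbox.
    """

    def __init__(self, name, weights, bias):
        super().__init__(name=name, daemon=True)
        self.weights = weights          # {sender_name: weight}
        self.bias = bias
        self.mailbox = queue.Queue()    # incoming (sender, value) messages
        self.targets = []               # mailboxes of downstream receivers

    def connect(self, target_mailbox):
        self.targets.append(target_mailbox)

    def run(self):
        received = {}
        # Wait until every upstream sender has delivered one message;
        # the arrival order does not matter.
        while len(received) < len(self.weights):
            sender, value = self.mailbox.get()
            received[sender] = value
        total = self.bias + sum(self.weights[s] * v for s, v in received.items())
        output = sigmoid(total)
        # Asynchronous send: put() on an unbounded queue returns at once.
        for target in self.targets:
            target.put((self.name, output))


def forward(x1, x2):
    """Run one forward pass through a hypothetical 2-2-1 network."""
    h1 = Neuron("h1", {"x1": 0.5, "x2": -0.5}, 0.0)
    h2 = Neuron("h2", {"x1": -1.0, "x2": 1.0}, 0.1)
    out = Neuron("out", {"h1": 1.0, "h2": 1.0}, -1.0)
    result = queue.Queue()              # mailbox of the caller

    h1.connect(out.mailbox)
    h2.connect(out.mailbox)
    out.connect(result)
    for neuron in (h1, h2, out):
        neuron.start()

    # The caller plays the role of the input layer.
    for neuron in (h1, h2):
        neuron.mailbox.put(("x1", x1))
        neuron.mailbox.put(("x2", x2))

    _, y = result.get(timeout=5)
    return y
```

Calling `forward(1.0, 2.0)` returns the same value as evaluating the three sigmoid units sequentially. The two hidden neurons run concurrently and only synchronize through messages, which is the property the abstract relies on when it distributes neurons across Erlang processes.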

Keywords

speech recognition; coefficients; PESQ; processing time; neural network; Erlang

Note

Not recorded

Basic data in other languages

Language, title, abstract, keywords, note: not recorded

Publication details

Journal

International journal of circuits, systems and signal processing

Volume (issue)

8

Year

2014

Pages

166-172

Publication status

Published

e-ISSN

1998-4464

Related records

Related persons

Matko Šarić (author)

Maja Stella (author)

Related institutions

Fakultet elektrotehnike, strojarstva i brodogradnje u Splitu (023) (author's institution)

Field

Electrical engineering, Computer science

Links

naun.org

Indexing

Scopus