Pregled bibliografske jedinice broj: 702430
Front-End Signal Processing for Speech Recognition
Front-End Signal Processing for Speech Recognition // Recent Advances in Circuits, Systems, Telecommunications and Control
Pariz, Francuska, 2013. str. 102-106 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 702430 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Front-End Signal Processing for Speech Recognition
Autori
Ramljak, Milan ; Stella, Maja ; Šarić, Matko
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Recent Advances in Circuits, Systems, Telecommunications and Control
/ - , 2013, 102-106
ISBN
978-960-474-341-4
Skup
1st International Conference on Wireless and Mobile Communication Systems (WMCS'13)
Mjesto i datum
Pariz, Francuska, 29.10.2013. - 31.10.2013
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
speech recognition; front-end; LPC; MFC; PLP; signal processing; PESQ
Sažetak
The evolution of computer technology, including operating systems and applications, resulted in designing intelligent machines that can recognize the spoken word and find out its meaning. Different front-end models have specific processing time required for calculating the same number of coefficients used for pattern recognition. During the years, it has been significantly improved, not only thanks to improvements in algorithms, but also with more processing power of nowadays computers. In this paper we analyze processing time and reconstructed speech quality of the three common front-end methods (Linear Predictive Coding - LPC, Mel-Frequency Cepstrum - MFC, Perceptual Linear Prediction - PLP) for calculating coefficients. Reconstructed speech quality is measured with Perceptual Evaluation of Speech Quality (PESQ) score. It is visible from our analysis that, if required, higher number of coefficients could be used without significant impact on processing time for MFC and PLP coefficients.
Izvorni jezik
Engleski
Znanstvena područja
Elektrotehnika, Računarstvo
POVEZANOST RADA
Projekti:
023-0231924-1660 - NAPREDNE HETEROGENE MREŽNE TEHNOLOGIJE (Begušić, Dinko, MZOS ) ( CroRIS)
023-0231924-1661 - ICT sustavi i usluge temeljeni na integraciji informacija (Rožić, Nikola, MZOS ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike, strojarstva i brodogradnje, Split