Comparison of Statistical Model-Based Voice Activity Detectors for Mobile Robot Speech Applications (CROSBI ID 587860)
Conference contribution in proceedings | original scientific paper | international peer review
Responsibility details
Marković, Ivan ; Domitrović, Hrvoje ; Petrović, Ivan
English
Comparison of Statistical Model-Based Voice Activity Detectors for Mobile Robot Speech Applications
This paper deals with the problem of voice activity detection in adverse acoustic conditions, namely high and varying noise scenarios. For robotic applications, the voice activity detector needs to be computationally light, robust to varying levels of background noise, and low in latency, especially when tracking moving speakers. We analyze three voice activity detectors: two model the discrete Fourier transform coefficients with Gaussian and generalized Gaussian distributions, respectively, while the third models the spectral envelope as having either a Rayleigh or a Rice distribution. We present them in a unified and consistent manner with respect to a statistical hypotheses ratio measure and a joint noise spectrum estimation algorithm. Moreover, we compare their performance under various noise conditions (three types of noise, three signal-to-noise ratios, and six speakers) by means of receiver operating characteristic curves and the area under the curve score. The results show that the Rayleigh-Rice model achieved better results on average at a medium computational demand.
voice activity detection; statistical model-based detectors; receiver operating characteristic curves
Contribution details
2012.
published
Host publication details
Proceedings of the 10th IFAC Symposium on Robotic Control (SYROCO2012), Volume 10, Part 1
Petrovic, Ivan ; Korondi, Peter
Dubrovnik:
978-3-902823-11-3
Conference details
10th IFAC Symposium on Robotic Control (SYROCO2012)
lecture
05.09.2012-07.09.2012
Dubrovnik, Croatia