Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 607594

Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition


Pobar, Miran; Martinčić-Ipšić, Sanda; Ipšić, Ivo
Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition // Neural network world, 22 (2012), 5; 429-441 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 607594 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition

Autori
Pobar, Miran ; Martinčić-Ipšić, Sanda ; Ipšić, Ivo

Izvornik
Neural network world (1210-0552) 22 (2012), 5; 429-441

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
speech synthesis; statistical parametrical synthesis; unit selection; weight tuning

Sažetak
A well known problem in unit selection speech synthesis is designing the join and target function sub-costs and optimizing their corresponding weights so that they reflect the human listeners' preferences. To achieve this, we propose a procedure where an objective criterion for optimal speech unit selection is used. The objective criterion for tuning the cost function weights is based on automatic speech recognition results. In order to demonstrate the effectiveness of the proposed method listening tests with 31 naïve listeners were performed. The experimental results have shown that the proposed method improves speech quality and intelligibility. In order to evaluate the quality of synthesized speech, the unit selection speech synthesis system is compared with two other Croatian speech synthesis systems with voices built using the same recorded speech corpus. One of these voices was built with the Festival speech synthesis system using the statistical parametric method and the other is a diphone concatenation based text-to-speech system. The comparison is based on subjective tests using MOS (mean opinion score) evaluation. The system using the proposed method used for cost function weights optimization performs better than other compared systems according to the subjective tests.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti



POVEZANOST RADA


Projekti:
318-0361935-0852 - Govorne tehnologije (Ipšić, Ivo, MZOS ) ( CroRIS)

Ustanove:
Fakultet informatike i digitalnih tehnologija, Rijeka

Citiraj ovu publikaciju:

Pobar, Miran; Martinčić-Ipšić, Sanda; Ipšić, Ivo
Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition // Neural network world, 22 (2012), 5; 429-441 (međunarodna recenzija, članak, znanstveni)
Pobar, M., Martinčić-Ipšić, S. & Ipšić, I. (2012) Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition. Neural network world, 22 (5), 429-441.
@article{article, author = {Pobar, Miran and Martin\v{c}i\'{c}-Ip\v{s}i\'{c}, Sanda and Ip\v{s}i\'{c}, Ivo}, year = {2012}, pages = {429-441}, keywords = {speech synthesis, statistical parametrical synthesis, unit selection, weight tuning}, journal = {Neural network world}, volume = {22}, number = {5}, issn = {1210-0552}, title = {Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition}, keyword = {speech synthesis, statistical parametrical synthesis, unit selection, weight tuning} }
@article{article, author = {Pobar, Miran and Martin\v{c}i\'{c}-Ip\v{s}i\'{c}, Sanda and Ip\v{s}i\'{c}, Ivo}, year = {2012}, pages = {429-441}, keywords = {speech synthesis, statistical parametrical synthesis, unit selection, weight tuning}, journal = {Neural network world}, volume = {22}, number = {5}, issn = {1210-0552}, title = {Optimization of Cost Function Weights for Unit Selection Speech Synthesis Using Speech Recognition}, keyword = {speech synthesis, statistical parametrical synthesis, unit selection, weight tuning} }

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Uključenost u ostale bibliografske baze podataka::


  • Compu-Math Citation Index





Contrast
Increase Font
Decrease Font
Dyslexic Font