A Comparison of Two Approaches to Bilingual HMM- Based Speech Synthesis

Pobar, Miran; Justin, Tadej; Žibert, Janez; Mihelič, France; Ipšić, Ivo

izvor podataka: crosbi !

A Comparison of Two Approaches to Bilingual HMM- Based Speech Synthesis (CROSBI ID 602293)

Prilog sa skupa u časopisu | izvorni znanstveni rad | međunarodna recenzija

Pobar, Miran ; Justin, Tadej ; Žibert, Janez ; Mihelič, France ; Ipšić, Ivo A Comparison of Two Approaches to Bilingual HMM- Based Speech Synthesis // Lecture notes in computer science / Habernal, Ivan ; Matoušek, Václav (ur.). 2013. str. 44-51

Podaci o odgovornosti

Autori

Pobar, Miran ; Justin, Tadej ; Žibert, Janez ; Mihelič, France ; Ipšić, Ivo

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

A Comparison of Two Approaches to Bilingual HMM- Based Speech Synthesis

Sažetak

We compare the performance of two approaches when using cross-lingual data from different speakers to build bilingual speech synthesis systems capable of producing speech with the same speaker identity. One approach treats data from both languages as monolingual, by labeling all data with a manually joined phoneme set. Speaker independent voice is trained using the joined data, and adapted to the target speaker using the CMLLR adaptation. In the second approach, speaker independent voices are trained for each language separately. State mapping between these voices is derived automatically from minimum Kullback–Leibler divergence between state distributions. The mapping is used to apply the adaptation transformations calculated within one language across languages to the other speaker independent voice. We evaluate the quality of speech on MOS scale and similarity of synthesized speech characteristics to the target speaker using DMOS on the example of Croatian-Slovene language pair.

Ključne riječi

bilingual; HMM; speech synthesis; phoneme mapping; state mapping; speaker adaptation; Kullback-Leibler divergence

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o prilogu

Stranice rada

44-51.

Godina izdavanja

2013.

Volumen (broj)

nije evidentirano

Status objave rada

objavljeno

Podaci o matičnoj publikaciji

Naslov

Lecture notes in computer science

Urednici

Habernal, Ivan ; Matoušek, Václav

Izdavač

Plzeň: Springer

ISBN

978-3-642-40584-6

ISSN

0302-9743

Podaci o skupu

Skup

16th International Conference, TSD 2013, Pilsen, Czech Republic, September 1-5, 2013. Proceedings

Vrsta sudjelovanja

predavanje

Datum održavanja skupa

01.09.2013-05.09.2013

Mjesto održavanja skupa

Brno, Češka Republika

Povezanost rada

Povezane osobe

Miran Pobar (autor/i)

Ivo Ipšić (autor/i)

Povezane ustanove

Sveučilište u Rijeci, Fakultet informatike i digitalnih tehnologija (318) (autorova ustanova)

Povezani projekti

Govorne tehnologije (rezultat rada na projektu)

Područje

Računarstvo, Informacijske i komunikacijske znanosti

Indeksiranost

Scopus