Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

A Method for Estimating Variations in Speech Tempo from Recorded Speech (CROSBI ID 680433)

Prilog sa skupa u zborniku | sažetak izlaganja sa skupa | međunarodna recenzija

Stojanović, Aleksandar ; Lazić, Nikolaj A Method for Estimating Variations in Speech Tempo from Recorded Speech // MIPRO / Ribarić, Slobodan (ur.). 2019. str. 1277-1282

Podaci o odgovornosti

Stojanović, Aleksandar ; Lazić, Nikolaj

engleski

A Method for Estimating Variations in Speech Tempo from Recorded Speech

In this paper we describe a method for measuring variations in speech tempo from speech samples recorded from Croatian news channels, where the text of what was spoken is available through subtitles. For speech recognition we use a feed- forward neural network trained with about 150 seconds of speech. To extract word boundaries, we created a speech-to-text aligner capable of finding an acceptable match between text and sequence of phonemes classified by the neural network. The aligner takes into consideration certain categories of phonemes for which the neural network has higher accuracy. Preliminary experiments show average alignment miss of about one to three phonemes, depending on the speaker, the content, and recording quality.

speech recognition, text-to-speech alignment, speech tempo, neural network

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

1277-1282.

2019.

objavljeno

Podaci o matičnoj publikaciji

MIPRO 2019

Ribarić, Slobodan

Rijeka:

1847-3946

1847-3946

Podaci o skupu

MIPRO 2019

predavanje

20.05.2019-24.05.2019

Opatija, Hrvatska

Povezanost rada

Informacijske i komunikacijske znanosti, Interdisciplinarne humanističke znanosti, Računarstvo