Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1018680

A Method for Estimating Variations in Speech Tempo from Recorded Speech


Stojanović, Aleksandar; Lazić, Nikolaj
A Method for Estimating Variations in Speech Tempo from Recorded Speech // MIPRO 2019 / Ribarić, Slobodan (ur.).
Rijeka, 2019. str. 1277-1282 (predavanje, međunarodna recenzija, sažetak, znanstveni)


CROSBI ID: 1018680 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
A Method for Estimating Variations in Speech Tempo from Recorded Speech

Autori
Stojanović, Aleksandar ; Lazić, Nikolaj

Vrsta, podvrsta i kategorija rada
Sažeci sa skupova, sažetak, znanstveni

Izvornik
MIPRO 2019 / Ribarić, Slobodan - Rijeka, 2019, 1277-1282

Skup
42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO 2019)

Mjesto i datum
Opatija, Hrvatska, 20.05.2019. - 24.05.2019

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
speech recognition, text-to-speech alignment, speech tempo, neural network

Sažetak
In this paper we describe a method for measuring variations in speech tempo from speech samples recorded from Croatian news channels, where the text of what was spoken is available through subtitles. For speech recognition we use a feed- forward neural network trained with about 150 seconds of speech. To extract word boundaries, we created a speech-to-text aligner capable of finding an acceptable match between text and sequence of phonemes classified by the neural network. The aligner takes into consideration certain categories of phonemes for which the neural network has higher accuracy. Preliminary experiments show average alignment miss of about one to three phonemes, depending on the speaker, the content, and recording quality.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti, Interdisciplinarne humanističke znanosti



POVEZANOST RADA


Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Nikolaj Lazić (autor)

Avatar Url Aleksandar Stojanović (autor)

Poveznice na cjeloviti tekst rada:

Pristup cjelovitom tekstu rada

Citiraj ovu publikaciju:

Stojanović, Aleksandar; Lazić, Nikolaj
A Method for Estimating Variations in Speech Tempo from Recorded Speech // MIPRO 2019 / Ribarić, Slobodan (ur.).
Rijeka, 2019. str. 1277-1282 (predavanje, međunarodna recenzija, sažetak, znanstveni)
Stojanović, A. & Lazić, N. (2019) A Method for Estimating Variations in Speech Tempo from Recorded Speech. U: Ribarić, S. (ur.)MIPRO 2019.
@article{article, author = {Stojanovi\'{c}, Aleksandar and Lazi\'{c}, Nikolaj}, editor = {Ribari\'{c}, S.}, year = {2019}, pages = {1277-1282}, keywords = {speech recognition, text-to-speech alignment, speech tempo, neural network}, title = {A Method for Estimating Variations in Speech Tempo from Recorded Speech}, keyword = {speech recognition, text-to-speech alignment, speech tempo, neural network}, publisherplace = {Opatija, Hrvatska} }
@article{article, author = {Stojanovi\'{c}, Aleksandar and Lazi\'{c}, Nikolaj}, editor = {Ribari\'{c}, S.}, year = {2019}, pages = {1277-1282}, keywords = {speech recognition, text-to-speech alignment, speech tempo, neural network}, title = {A Method for Estimating Variations in Speech Tempo from Recorded Speech}, keyword = {speech recognition, text-to-speech alignment, speech tempo, neural network}, publisherplace = {Opatija, Hrvatska} }




Contrast
Increase Font
Decrease Font
Dyslexic Font