Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 483626

INTRODUCTION TO SPEECH RECOGNITION, Exercise in ASR using HTK


Petrinović, Davor; Dropuljić, Branimir
INTRODUCTION TO SPEECH RECOGNITION, Exercise in ASR using HTK
Zagreb: Fakultet elektrotehnike i računarstva Sveučilišta u Zagrebu, 2010


CROSBI ID: 483626 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
INTRODUCTION TO SPEECH RECOGNITION, Exercise in ASR using HTK

Autori
Petrinović, Davor ; Dropuljić, Branimir

Vrsta obrazovnog materijala
Ostalo (nedefinirano)

Izdavač
Fakultet elektrotehnike i računarstva Sveučilišta u Zagrebu

Grad
Zagreb

Godina
2010

Stranica
19

Ključne riječi
Automatic speech recognition; ASR; HTK

Sažetak
These course notes are intended to cover the practical part of the course “Introduction to speech recognition”. The course covers the basics about hidden Markov models (HMM), how to build a new acoustic model for English language using HMMs and how to test the quality of this model (recognition accuracy). Model testing will be performed on utterances with limited vocabulary and strictly defined word-to-word transitions. Utterances will be recorded during these exercises by each student individually. Finally, quality and complexity of the acoustic model will be compared with another model, which will be constructed from all the utterances of all students. Furthermore, students will learn how to work with the Hidden Markov Model Toolkit (HTK) which is a widespread tool for building and testing of acoustic and language models. It provides an opportunity to build models from scratch and more importantly, step by step, thus gaining insight into the structure of a typical ASR system. The main goal of these exercises is to teach the students to research and explore the ASR world and gain hands-on experience using their own speech examples.

Izvorni jezik
Engleski

Znanstvena područja
Elektrotehnika, Računarstvo

Napomena
Course on „Automatic speech recognition“ (ASR) was prepared and given as a one of 12 courses given on the summer school entitled: “Interdisciplinary Summer School, Workshop and Round Table in Computational Linguistics, Cognitive and Information Science”, Zadar, 2010. The course was comprised of 5 hours of lectures, giving theoretical background of ASR, and 5 hours of exercises (covered by these course notes). All exercises were based on individual student work, using Hidden Markov Model Toolkit (HTK). The course covered main aspects of ASR systems, such as feature vector extraction, building of acoustic models, actual recognition as well as recognition accuracy evaluation.



POVEZANOST RADA


Projekti:
0036054
036-0000000-2029 - Adaptivno upravljanje scenarijima u VR terapiji PTSP-a (Ćosić, Krešimir, MZO ) ( CroRIS)

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Avatar Url Davor Petrinović (autor)

Avatar Url Branimir Dropuljić (autor)

Poveznice na cjeloviti tekst rada:

Pristup cjelovitom tekstu rada ling.unizd.hr

Citiraj ovu publikaciju:

Petrinović, Davor; Dropuljić, Branimir
INTRODUCTION TO SPEECH RECOGNITION, Exercise in ASR using HTK
Zagreb: Fakultet elektrotehnike i računarstva Sveučilišta u Zagrebu, 2010
Petrinović, D. & Dropuljić, B. (2010) INTRODUCTION TO SPEECH RECOGNITION, Exercise in ASR using HTK. Zagreb. Fakultet elektrotehnike i računarstva Sveučilišta u Zagrebu.
@unknown{unknown, author = {Petrinovi\'{c}, Davor and Dropulji\'{c}, Branimir}, year = {2010}, pages = {19}, keywords = {Automatic speech recognition, ASR, HTK}, title = {INTRODUCTION TO SPEECH RECOGNITION, Exercise in ASR using HTK}, keyword = {Automatic speech recognition, ASR, HTK}, publisher = {Fakultet elektrotehnike i ra\v{c}unarstva Sveu\v{c}ili\v{s}ta u Zagrebu}, publisherplace = {Zagreb} }
@unknown{unknown, author = {Petrinovi\'{c}, Davor and Dropulji\'{c}, Branimir}, year = {2010}, pages = {19}, keywords = {Automatic speech recognition, ASR, HTK}, title = {INTRODUCTION TO SPEECH RECOGNITION, Exercise in ASR using HTK}, keyword = {Automatic speech recognition, ASR, HTK}, publisher = {Fakultet elektrotehnike i ra\v{c}unarstva Sveu\v{c}ili\v{s}ta u Zagrebu}, publisherplace = {Zagreb} }




Contrast
Increase Font
Decrease Font
Dyslexic Font