Bibliographic record number: 252279
Automatic Lip Synchronization by Speech Signal Analysis
Automatic Lip Synchronization by Speech Signal Analysis, 2005, master's thesis, Fakultet elektrotehnike i računarstva, Zagreb
CROSBI ID: 252279
Title
Automatic Lip Synchronization by Speech Signal Analysis
Authors
Zorić, Goranka
Type, subtype and category of work
Theses, master's thesis
Faculty
Fakultet elektrotehnike i računarstva
Place
Zagreb
Date
21.10
Year
2005
Pages
74
Mentor
Pandžić, Igor
Keywords
lip synchronization; facial animation; MPEG-4 FBA; virtual characters; speech processing; neural networks; genetic algorithms
Abstract
This master's thesis investigates automatic lip synchronization, a method for generating an animation of a 3D human face model in which the animation is driven only by a speech signal. The whole process is fully automatic and starts from the speech signal. Automatic lip synchronization consists of two main parts: audio-to-visual mapping and face synthesis. The thesis proposes and implements a system for automatic lip synchronization of synthetic 3D avatars based only on speech input. The speech signal is classified into viseme classes using neural networks, whose topology is configured automatically using genetic algorithms. Visemes, the visual representations of phonemes as defined in MPEG-4 FBA, are used for face synthesis. The system is adapted to the specifics of the Croatian language. A detailed validation of the system using three different evaluation methods is presented, and potential applications of these technologies are discussed in detail. The method is suitable for real-time and offline applications. It is speaker independent and multilingual.
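As an illustration only of how the two parts named in the abstract might connect, the following minimal Python sketch collapses a per-frame sequence of viseme class ids (the kind of output a classifier could produce) into timed segments that a face animation stage could consume. It is not taken from the thesis: the function name `frames_to_segments`, the frame length, and the example class ids are all assumptions, and the neural network classifier itself is not shown.

```python
from itertools import groupby


def frames_to_segments(frame_classes, frame_ms=16):
    """Collapse per-frame viseme ids into (viseme_id, start_ms, duration_ms) runs.

    frame_classes: one viseme class id per audio frame (hypothetical classifier output).
    frame_ms: assumed frame length in milliseconds, not a value from the thesis.
    """
    segments = []
    start = 0  # index of the first frame in the current run
    for viseme_id, run in groupby(frame_classes):
        n = len(list(run))  # number of consecutive frames with this viseme
        segments.append((viseme_id, start * frame_ms, n * frame_ms))
        start += n
    return segments


# Hypothetical classifier output for eight 20 ms frames.
frames = [0, 0, 4, 4, 4, 1, 1, 0]
segments = frames_to_segments(frames, frame_ms=20)
```

Each resulting tuple gives a viseme id with its start time and duration, which is one plausible hand-off format between audio-to-visual mapping and face synthesis.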
Original language
English
Scientific fields
Electrical Engineering, Computing, Information and Communication Sciences