Bibliographic record number: 255852
Intelligent Content Production for a Virtual Speaker
Intelligent Content Production for a Virtual Speaker // Lecture Notes in Computer Science, 3490 (2005), 163-174 doi:10.1007/11558637_17 (international peer review, article, scientific)
CROSBI ID: 255852
Title
Intelligent Content Production for a Virtual Speaker
Authors
Šmid, Karlo ; Pandžić, Igor ; Radman, Viktorija
Source
Lecture Notes in Computer Science (ISSN 0302-9743) 3490 (2005); 163-174
Type, subtype and category of work
Journal papers, article, scientific
Keywords
graphically embodied animated agent ; speech ; facial gestures ; lexical analysis ; statistical models
Abstract
We present a graphically embodied animated agent (a virtual speaker) capable of reading plain English text and rendering it as speech accompanied by appropriate facial gestures. Our system uses lexical analysis of the English text and statistical models of facial gestures to automatically generate the gestures related to the spoken text. It is intended for the automatic creation of realistically animated virtual speakers, such as newscasters and storytellers, and incorporates the characteristics of such speakers captured from training video clips. Our system is based on a visual text-to-speech system that generates lip movement synchronised with the generated speech. This is extended to include eye blinks, head and eyebrow motion, and a simple gaze-following behaviour. The result is a full face animation produced automatically from plain English text.
Original language
English
Scientific fields
Electrical engineering
Journal indexed in:
- Web of Science Core Collection (WoSCC)
- Science Citation Index Expanded (SCI-EXP)
- SCI-EXP, SSCI and/or A&HCI
- Scopus