Talking Faces - Technologies and Applications

Ostermann, Jörn; Weissenfeld, Axel; Liu, Kang

View/Open

157-157.pdf (37.30Kb)

Date

2005

Author

Ostermann, Jörn

Weissenfeld, Axel

Liu, Kang

Pay-Per-View via TIB Hannover:

Try if this item/paper is available.

Metadata

Show full item record

Abstract

Facial animation has been combined with text-to-speech synthesis to create innovative multimodal interfaces. In this lecture, we present the technology and architecture in order to use this multimodal interface in an web-based environment to support education, entertainment and e-commerce applications. Modern text to speech synthesizers using concatenative speech synthesis are able to generate high quality speech. Face animation uses the phoneme and timing information provided by such a speech synthesizer in order to animate the mouth. There are 2 basic technologies that are used to render talking faces: 3D face models as described in MPEG-4 may be used to provide the impression of a talking cartoon or human-like character. Sample-based face models generated from recorded video enable the synthesis of a talking head that cannot be distinguished from a real person. Depending on the chosen face animation technology and latency requirements, different architectures for delivering the talking head over the Internet are required for interactive applications. Keywords: Face animation, visual speech

BibTeX

@inproceedings {10.2312:vvg.20051020,
booktitle = {Vision, Video, and Graphics (2005)},
editor = {Mike Chantler},
title = {{Talking Faces - Technologies and Applications}},
author = {Ostermann, Jörn and Weissenfeld, Axel and Liu, Kang},
year = {2005},
publisher = {The Eurographics Association},
ISBN = {3-905673-57-6},
DOI = {10.2312/vvg.20051020}
}

URI

http://dx.doi.org/10.2312/vvg.20051020

Collections

VVG05