Paper
21 April 1995 Speech recognition for acoustic-assisted video coding and animation
Homer H. Chen, Wu Chou, Barry G. Haskell, Tsuhan Chen
Author Affiliations +
Proceedings Volume 2501, Visual Communications and Image Processing '95; (1995) https://doi.org/10.1117/12.206731
Event: Visual Communications and Image Processing '95, 1995, Taipei, Taiwan
Abstract
In this paper, we discuss issues related to analysis and synthesis of facial images using speech information. An approach to speaker independent acoustic-assisted image coding and animation is studied. A perceptually based sliding window encoder is proposed. It utilizes the high rate (or oversampled) acoustic viseme sequence from the audio domain for image domain viseme interpolation and smoothing. The image domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate the mismatch between auditory and visual perceptions.
© (1995) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Homer H. Chen, Wu Chou, Barry G. Haskell, and Tsuhan Chen "Speech recognition for acoustic-assisted video coding and animation", Proc. SPIE 2501, Visual Communications and Image Processing '95, (21 April 1995); https://doi.org/10.1117/12.206731
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Acoustics

Mouth

3D modeling

Visualization

Video

Laser induced plasma spectroscopy

Computer programming

Back to Top