Paper
2 May 2006 Comparison of weighting strategies in early and late fusion approaches to audio-visual person authentication
Author Affiliations +
Abstract
Person authentication can be strongly enhanced by the combination of different modalities. This is also true for the face and voice signals, which can be obtained with minimal inconvenience for the user. However, features from each modality can be combined at various different levels of processing and for face and voice signals the advantage of fusion depends strongly on the way they are combined. The aim of the work presented is to investigate the optimal strategy for combining voice and face modalities for signals of varying quality. The experimental data are taken from a newly acquired database using a PDA, which contains audio-visual recordings in different conditions. Voice features use mel-frequency cepstral coefficients, while the face signal is parameterised using wavelet coefficients in certain subbands. Results are presented for both early (feature-level) and late (score-level) fusion. At each level different fixed and variable weightings are used, both to weight between frames within each modality and to weight between modalities, where weights are based on some measure of signal reliability, such as the accuracy of automatic face detection or the audio signal to noise ratio. In addition, the contribution to authentication of information from different areas of the face is explored to determine a regional weighting for the face coefficients.
© (2006) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Harin Sellahewa, Naseer Al-Jawad, Andrew C. Morris, Dalei Wu, Jacques Koreman, and Sabah A. Jassim "Comparison of weighting strategies in early and late fusion approaches to audio-visual person authentication", Proc. SPIE 6250, Mobile Multimedia/Image Processing for Military and Security Applications, 62500C (2 May 2006); https://doi.org/10.1117/12.667214
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Biometrics

Databases

Personal digital assistants

Wavelets

Mouth

Video

Visualization

RELATED CONTENT

Perceptual tools for quality-aware video networks
Proceedings of SPIE (February 03 2014)
Multi-stream face recognition for crime-fighting
Proceedings of SPIE (April 12 2007)
Wavelet-based face verification for constrained platforms
Proceedings of SPIE (March 28 2005)
Wavelet library for constrained devices
Proceedings of SPIE (May 02 2007)

Back to Top