Paper
15 March 2019 Bimodal person identification using voice data and face images
V. V. Khryashchev, A. I. Topnikov, A. F. Stefanidi, A. L. Priorov
Author Affiliations +
Proceedings Volume 11041, Eleventh International Conference on Machine Vision (ICMV 2018); 1104116 (2019) https://doi.org/10.1117/12.2523138
Event: Eleventh International Conference on Machine Vision (ICMV 2018), 2018, Munich, Germany
Abstract
The paper considers bimodal person identification problem by analyzing the speaker’s face and voice. Two speaker identification algorithms are developed and compared. The idea of the first algorithm consists of extracting features from the speech signal in the form of mel frequency cepstral coefficients and, with this basis, forming a speaker model using Gaussian mixtures. Second approach is based on the use of a universal background model obtained from the records of a large number of speakers. For face identification, a neural network with 13 convolutional layers was used. For the learning and testing, the databases of speech signals and face images of 100 people were formed. The final bimodal identification system shows the high level of accuracy identification of more than 95%. The results of this experiment demonstrated the possibility of applying the proposed algorithms to the person identification problem in real-life systems.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
V. V. Khryashchev, A. I. Topnikov, A. F. Stefanidi, and A. L. Priorov "Bimodal person identification using voice data and face images", Proc. SPIE 11041, Eleventh International Conference on Machine Vision (ICMV 2018), 1104116 (15 March 2019); https://doi.org/10.1117/12.2523138
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Algorithm development

Signal to noise ratio

Convolutional neural networks

Facial recognition systems

System identification

Biometrics

Denoising

RELATED CONTENT


Back to Top