Paper
4 May 2010 Real-time speaker identification for video conferencing
Author Affiliations +
Abstract
Automatic speaker identification in a videoconferencing environment will allow conference attendees to focus their attention on the conference rather than having to be engaged manually in identifying which channel is active and who may be the speaker within that channel. In this work we present a real-time, audio-coupled video based approach to address this problem, but focus more on the video analysis side. The system is driven by the need for detecting a talking human via the use of computer vision algorithms. The initial stage consists of a face detector which is subsequently followed by a lip-localization algorithm that segments the lip region. A novel approach for lip movement detection based on image registration and using the Coherent Point Drift (CPD) algorithm is proposed. Coherent Point Drift (CPD) is a technique for rigid and non-rigid registration of point sets. We provide experimental results to analyse the performance of the algorithm when used in monitoring real life videoconferencing data.
© (2010) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
S. Saravi, I. Zafar, E. A. Edirisinghe, and R. S. Kalawsky "Real-time speaker identification for video conferencing", Proc. SPIE 7724, Real-Time Image and Video Processing 2010, 77240D (4 May 2010); https://doi.org/10.1117/12.854846
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Laser induced plasma spectroscopy

Video

Facial recognition systems

Detection and tracking algorithms

Edge detection

Sensors

Image processing algorithms and systems

Back to Top