Paper
15 May 2015 Open-set speaker identification with diverse-duration speech data
Rawande Karadaghi, Heinz Hertlein, Aladdin Ariyaeeinia
Author Affiliations +
Abstract
The concern in this paper is an important category of applications of open-set speaker identification in criminal investigation, which involves operating with short and varied duration speech. The study presents investigations into the adverse effects of such an operating condition on the accuracy of open-set speaker identification, based on both GMMUBM and i-vector approaches. The experiments are conducted using a protocol developed for the identification task, based on the NIST speaker recognition evaluation corpus of 2008. In order to closely cover the real-world operating conditions in the considered application area, the study includes experiments with various combinations of training and testing data duration. The paper details the characteristics of the experimental investigations conducted and provides a thorough analysis of the results obtained.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Rawande Karadaghi, Heinz Hertlein, and Aladdin Ariyaeeinia "Open-set speaker identification with diverse-duration speech data", Proc. SPIE 9457, Biometric and Surveillance Technology for Human and Activity Identification XII, 94570G (15 May 2015); https://doi.org/10.1117/12.2176335
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Speaker recognition

Surveillance

Data modeling

Biometrics

Current controlled current source

Factor analysis

Matrices

Back to Top