20 April 2017 The impact of compression of speech signal, background noise and acoustic disturbances on the effectiveness of speaker identification
Author Affiliations +
Proceedings Volume 10418, XI Conference on Reconnaissance and Electronic Warfare Systems; 104180L (2017) https://doi.org/10.1117/12.2269338
Event: Conference on Reconnaissance and Electronic Warfare Systems, 2016, Oltarzew, Poland
Abstract
The paper presents the architecture and the results of optimization of selected elements of the Automatic Speaker Recognition (ASR) system that uses Gaussian Mixture Models (GMM) in the classification process. Optimization was performed on the process of selection of individual characteristics using the genetic algorithm and the parameters of Gaussian distributions used to describe individual voices. The system that was developed was tested in order to evaluate the impact of different compression methods used, among others, in landline, mobile, and VoIP telephony systems, on effectiveness of the speaker identification. Also, the results were presented of effectiveness of speaker identification at specific levels of noise with the speech signal and occurrence of other disturbances that could appear during phone calls, which made it possible to specify the spectrum of applications of the presented ASR system.
© (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
K. Kamiński, A. P. Dobrowolski, "The impact of compression of speech signal, background noise and acoustic disturbances on the effectiveness of speaker identification", Proc. SPIE 10418, XI Conference on Reconnaissance and Electronic Warfare Systems, 104180L (20 April 2017); doi: 10.1117/12.2269338; https://doi.org/10.1117/12.2269338
PROCEEDINGS
12 PAGES


SHARE
Back to Top