29 May 2013 Emotional state and its impact on voice authentication accuracy
Author Affiliations +
The paper deals with the increasing accuracy of voice authentication methods. The developed algorithm first extracts segmental parameters, such as Zero Crossing Rate, the Fundamental Frequency and Mel-frequency cepstral coefficients from voice. Based on these parameters, the neural network classifier detects the speaker's emotional state. These parameters shape the distribution of neurons in Kohonen maps, forming clusters of neurons on the map characterizing a particular emotional state. Using regression analysis, we can calculate the function of the parameters of individual emotional states. This relationship increases voice authentication accuracy and prevents unjust rejection.
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Miroslav Voznak, Miroslav Voznak, Pavol Partila, Pavol Partila, Marek Penhaker, Marek Penhaker, Tomas Peterek, Tomas Peterek, Karel Tomala, Karel Tomala, Filip Rezac, Filip Rezac, Jakub Safarik, Jakub Safarik, "Emotional state and its impact on voice authentication accuracy", Proc. SPIE 8750, Independent Component Analyses, Compressive Sampling, Wavelets, Neural Net, Biosystems, and Nanoengineering XI, 875006 (29 May 2013); doi: 10.1117/12.2015719; https://doi.org/10.1117/12.2015719

Back to Top