17 July 2000 Bispectrum analysis for speaker identification in a noisy environment with Karhunen-Loeve transformation technique
Author Affiliations +
Abstract
The work described in this paper addresses the problem for extracting bispectrum feature of speech data. Very often the bispectrum feature extraction and data reduction are complicated due to some limiting constraints, i.e., no prior knowledge of feature's distribution and higher dimensionality of bispectrum data. In this article we developed an adaptive feature extraction mechanism based on cascade neural network in conjunction with feature's dimensionality reduction based on Karhunen-Loeve transformation technique. An adaptive codebook generation algorithm which is a cascade configuration of SOFM (Self Organizing Feature Map) and LVQ (Learning Vector Quantization) was used before the K-L transformation. The transformation was experimentally shown as an effective procedure for orthogonalization and dimensionality reduction of spectrum feature. Performance of our speaker identification system was perceived to be significantly increased even though using limited number of channels in noisy environment. We also tried to improve the capability of adaptive codebook generation algorithm by applying simplified differential competitive learning network.
© (2000) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Benyamin Kusumoputro, Benyamin Kusumoputro, Muhammad Ivan Fanany, Muhammad Ivan Fanany, Dian Indrawati, Dian Indrawati, } "Bispectrum analysis for speaker identification in a noisy environment with Karhunen-Loeve transformation technique", Proc. SPIE 4044, Hybrid Image and Signal Processing VII, (17 July 2000); doi: 10.1117/12.391925; https://doi.org/10.1117/12.391925
PROCEEDINGS
7 PAGES


SHARE
Back to Top