Translator Disclaimer
17 September 2005 The speech scale and spectral transformation
Author Affiliations +
Proceedings Volume 5914, Wavelets XI; 59140G (2005)
Event: Optics and Photonics 2005, 2005, San Diego, California, United States
A fundamental issue of speech is that when different individuals enunciate the same perceived sound, the corresponding spectra are different, but since we perceive them to be the same, there must have a commonality that the ear extracts to recognize the same perceived sound. In previous publications we have established this commonality and have argued that the "spectra of sounds made by different individuals and perceived to be the same can be transformed into each other by a universal warping function". We call the warping function the speech scale and in previous works we have obtained it experimentally from actual speech. In this paper we give the mathematical equation that allows one to obtain the transformation function so that the transformation results in identical warped spectra except for a translation factor.
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Srinivasan Umesh, Leon Cohen, and Douglas Nelson "The speech scale and spectral transformation", Proc. SPIE 5914, Wavelets XI, 59140G (17 September 2005);


The homogeneous dispersive lineshape as a wavelet basis
Proceedings of SPIE (February 08 2007)
Estimating speaker scale factors from vowels
Proceedings of SPIE (November 13 2003)
The speech scale, the Mel scale, and the tube model...
Proceedings of SPIE (December 06 2002)
Spectral BRDF modeling using wavelets
Proceedings of SPIE (March 08 2002)

Back to Top