Translator Disclaimer
1 September 1995 Scale-invariant speech analysis via joint time-frequency-scale processing
Author Affiliations +
Abstract
We argue that an important aspect of the human speech signal is scaling in the frequency domain. We discuss the two physical mechanisms responsible for the scaling. The first mechanism is that when we have a harmonic signal whose fundamental is frequency modulated then the spectrum is the sum of scaled functions. The second comes about from the consideration that while different speakers have very different size vocal tracts (for example an adult and a child), we none the less produce speech which is similar in some sense. We will argue and present evidence to show that the speaker differences result in scaling in the frequency domain. We further discuss how one can handle scale processing.
© (1995) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Leon Cohen, Nenad M. Marinovic, Srinivasan Umesh, and Douglas J. Nelson "Scale-invariant speech analysis via joint time-frequency-scale processing", Proc. SPIE 2569, Wavelet Applications in Signal and Image Processing III, (1 September 1995); https://doi.org/10.1117/12.217607
PROCEEDINGS
16 PAGES


SHARE
Advertisement
Advertisement
RELATED CONTENT

TTCP AG-6: acoustic detection and tracking of UAVs
Proceedings of SPIE (September 01 2004)
Chirp signal analysis with amplitude modulation
Proceedings of SPIE (November 01 2012)
Modulation frequency and efficient audio coding
Proceedings of SPIE (November 20 2001)
The speech scale, the Mel scale, and the tube model...
Proceedings of SPIE (December 06 2002)

Back to Top