13 October 1997 Phoneme fuzzy characterization in speech recognition systems
Author Affiliations +
The acoustic approach to speech recognition has an important advantage compared with pattern recognition approach: it presents a lower complexity because it doesn't require explicit structures such as the hidden Markov model. In this work, we show how to characterize some phonetic classes of the Italian language in order to obtain a speaker and vocabulary independent speech recognition system. A phonetic data base is carried out with 200 continuous speech sentences of 12 speakers, 6 females and 6 males. The sentences are sampled at 8000 Hz and manual labelled with Asystem Sound Impression Software to obtain about 1600 units. We analyzed several speech parameters such as formants, LPC and reflection coefficients, energy, normal/differential zero crossing rate, cepstral and autocorrelation coefficients. The aim is the achievement of a phonetic recognizer to facilitate the so- called lexical access problem, that is to decode phonetic units into complete sense word strings. The knowledge is supplied to the recognizer in terms of fuzzy systems. The utilized software is called adaptive fuzzy modeler and it belongs to the rule generator family. A procedure has been implemented to integrate in the fuzzy system an 'expert' knowledge in order to obtain significant improvements in the recognition accuracy. Up to this point the tests show a recognition rate of 92% for the vocal class, 89% for the fricatives class and 94% for the nasal class, utilizing 1000 phonemes in phase of learning and 600 phonemes in phase of testing. Our intention is to complete the fuzzy recognizer extending this work to the other phonetic classes.
© (1997) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Francesco Beritelli, Francesco Beritelli, Luca Borrometi, Luca Borrometi, Antonino Cuce, Antonino Cuce, "Phoneme fuzzy characterization in speech recognition systems", Proc. SPIE 3165, Applications of Soft Computing, (13 October 1997); doi: 10.1117/12.279597; https://doi.org/10.1117/12.279597

Back to Top