An automatic time-aligned phone transcription toolbox for English speech corpora has been developed. The toolbox
generates robust phone-level transcriptions automatically, using either speaker-independent or speaker-dependent
models, without manual intervention. The system is based on the standard Hidden Markov Model (HMM) approach and
was tested successfully on a large audiovisual speech corpus, the GRID corpus. One of the toolbox's most powerful
features is its flexibility in speech processing: users can import automatic transcriptions generated by the
HMM Toolkit (HTK) into the popular transcription software PRAAT, and vice versa. A statistical evaluation on
GRID data shows that the automatic transcription deviates from the manual transcription by 20 ms on average.
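The HTK-to-PRAAT import and the boundary-deviation measure mentioned above can be illustrated with a minimal sketch. It is not the toolbox's own code. It assumes HTK label lines of the form "start end label" with times in 100 ns units, a single-tier PRAAT TextGrid in the long text format, and a deviation computed as the mean absolute difference between matching segment end times. The function names, the tier name, and the sample labels are hypothetical.

```python
HTK_UNITS_PER_SECOND = 10_000_000  # HTK label times are expressed in 100 ns units


def parse_htk_labels(lab_text):
    """Parse HTK label lines ("start end label") into (start_s, end_s, label) tuples."""
    segments = []
    for line in lab_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        start, end, label = int(fields[0]), int(fields[1]), fields[2]
        segments.append(
            (start / HTK_UNITS_PER_SECOND, end / HTK_UNITS_PER_SECOND, label)
        )
    return segments


def to_textgrid(segments, tier_name="phones"):
    """Render segments as a one-tier PRAAT TextGrid (long text format)."""
    xmin, xmax = segments[0][0], segments[-1][1]
    lines = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        "",
        f"xmin = {xmin}",
        f"xmax = {xmax}",
        "tiers? <exists>",
        "size = 1",
        "item []:",
        "    item [1]:",
        '        class = "IntervalTier"',
        f'        name = "{tier_name}"',
        f"        xmin = {xmin}",
        f"        xmax = {xmax}",
        f"        intervals: size = {len(segments)}",
    ]
    for i, (start, end, label) in enumerate(segments, start=1):
        text = label.replace('"', '""')  # PRAAT escapes a double quote by doubling it
        lines += [
            f"        intervals [{i}]:",
            f"            xmin = {start}",
            f"            xmax = {end}",
            f'            text = "{text}"',
        ]
    return "\n".join(lines) + "\n"


def mean_abs_deviation_ms(auto_segments, manual_segments):
    """Mean absolute difference (ms) between matching segment end times.

    Assumes both transcriptions contain the same phone sequence.
    """
    if len(auto_segments) != len(manual_segments):
        raise ValueError("transcriptions must have the same number of segments")
    diffs = [abs(a[1] - m[1]) for a, m in zip(auto_segments, manual_segments)]
    return 1000.0 * sum(diffs) / len(diffs)


if __name__ == "__main__":
    # Illustrative labels only, not taken from the GRID corpus.
    auto = parse_htk_labels("0 2500000 sil\n2500000 4000000 b\n")
    manual = parse_htk_labels("0 2700000 sil\n2700000 4100000 b\n")
    print(to_textgrid(auto))
    print(mean_abs_deviation_ms(auto, manual))
```

The reverse direction (PRAAT to HTK) would read the interval boundaries back and multiply by the same constant. A real evaluation would also need to handle transcriptions whose phone sequences differ, which this sketch rejects.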