24 August 1999 Video-assisted segmentation of speech and audio track
Author Affiliations +
Proceedings Volume 3846, Multimedia Storage and Archiving Systems IV; (1999) https://doi.org/10.1117/12.360456
Event: Photonics East '99, 1999, Boston, MA, United States
Video database research is commonly concerned with the storage and retrieval of visual information invovling sequence segmentation, shot representation and video clip retrieval. In multimedia applications, video sequences are usually accompanied by a sound track. The sound track contains potential cues to aid shot segmentation such as different speakers, background music, singing and distinctive sounds. These different acoustic categories can be modeled to allow for an effective database retrieval. In this paper, we address the problem of automatic segmentation of audio track of multimedia material. This audio based segmentation can be combined with video scene shot detection in order to achieve partitioning of the multimedia material into semantically significant segments.
© (1999) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Medha Pandit, Medha Pandit, Yusseri Yusoff, Yusseri Yusoff, Josef Kittler, Josef Kittler, William J. Christmas, William J. Christmas, E. H. S. Chilton, E. H. S. Chilton, "Video-assisted segmentation of speech and audio track", Proc. SPIE 3846, Multimedia Storage and Archiving Systems IV, (24 August 1999); doi: 10.1117/12.360456; https://doi.org/10.1117/12.360456


Back to Top