25 October 2004 Segmentation of singing voice within music signals
Proceedings Volume 5601, Internet Multimedia Management Systems V; (2004) https://doi.org/10.1117/12.571280
Event: Optics East, 2004, Philadelphia, Pennsylvania, United States
Abstract
This paper proposes a novel approach to the automatic segmentation of singing voice within music signals, based on the difference between the dynamic harmonic content of singing voice and that of musical instrument signals. The results are compared with those of another approach proposed in the literature, using the same music database. Both techniques achieve an accuracy of around 80%, even though a more rigorous performance measure is applied to our approach only. The new procedure also has the advantage of lower computational complexity. In addition, we discuss further results obtained by extending the tests to the whole database (which maintains the same performance level) and by discriminating the error types (boundaries shifted in time, insertion and deletion of singing segments). Analysis of these errors suggests ways of reducing them, for example by adopting a confidence level based on a minimum harmonic content of the input signals. Considering only signals with a confidence level equal to one, the performance improves to almost 87%.
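The abstract does not define the performance measure behind the reported accuracy figures. As an illustration only, the sketch below shows one common way to score a singing-voice segmentation: a frame-level comparison of reference and predicted segments. The function names, the `hop` frame step, and the split of errors into missed and false-alarm frames are assumptions made for this example, not the authors' measure.

```python
def frame_labels(segments, n_frames, hop):
    """Convert (start, end) singing segments in seconds to per-frame booleans."""
    labels = [False] * n_frames
    for start, end in segments:
        for i in range(n_frames):
            t = i * hop  # time of frame i, in seconds
            if start <= t < end:
                labels[i] = True
    return labels


def frame_scores(reference, predicted, duration, hop=0.5):
    """Return (accuracy, missed, false_alarms) at frame level.

    missed:       frames that are singing in the reference but not predicted
    false_alarms: frames predicted as singing that are not in the reference
    This is a hypothetical measure, assumed for illustration.
    """
    n_frames = int(round(duration / hop))
    ref = frame_labels(reference, n_frames, hop)
    pred = frame_labels(predicted, n_frames, hop)
    missed = sum(r and not p for r, p in zip(ref, pred))
    false_alarms = sum(p and not r for r, p in zip(ref, pred))
    accuracy = (n_frames - missed - false_alarms) / n_frames
    return accuracy, missed, false_alarms


# Reference: singing from 2 s to 6 s in a 10 s excerpt.
# Prediction: boundary shifted to 3 s, plus one inserted segment at 8-9 s.
accuracy, missed, false_alarms = frame_scores(
    reference=[(2.0, 6.0)],
    predicted=[(3.0, 6.0), (8.0, 9.0)],
    duration=10.0,
)
print(accuracy, missed, false_alarms)
```

In this toy case the shifted boundary shows up as missed frames and the inserted segment as false-alarm frames, which loosely mirrors the error types named in the abstract.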
© (2004) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Phabio J. Setubal, Sidnei Noceti Filho, Rui Seara, "Segmentation of singing voice within music signals", Proc. SPIE 5601, Internet Multimedia Management Systems V, (25 October 2004); doi: 10.1117/12.571280; https://doi.org/10.1117/12.571280
PROCEEDINGS
10 PAGES