24 March 2014 Optical music recognition on the International Music Score Library Project
Author Affiliations +
Abstract
A system is presented for optical recognition of music scores. The system processes a document page in three main phases. First it performs a hierarchical decomposition of the page, identifying systems, staves and measures. The second phase, which forms the heart of the system, interprets each measure found in the previous phase as a collection of non-overlapping symbols including both primitive symbols (clefs, rests, etc.) with fixed templates, and composite symbols (chords, beamed groups, etc.) constructed through grammatical composition of primitives (note heads, ledger lines, beams, etc.). This phase proceeds by first building separate top-down recognizers for the symbols of interest. Then, it resolves the inevitable overlap between the recognized symbols by exploring the possible assignment of overlapping regions, seeking globally optimal and grammatically consistent explanations. The third phase interprets the recognized symbols in terms of pitch and rhythm, focusing on the main challenge of rhythm. We present results that compare our system to the leading commercial OMR system using MIDI ground truth for piano music.
© (2014) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Christopher Raphael, Rong Jin, "Optical music recognition on the International Music Score Library Project", Proc. SPIE 9021, Document Recognition and Retrieval XXI, 90210F (24 March 2014); doi: 10.1117/12.2040247; https://doi.org/10.1117/12.2040247
PROCEEDINGS
12 PAGES


SHARE
RELATED CONTENT


Back to Top