Local projection-based character segmentation method for historical Chinese documents
4 February 2013
Proceedings Volume 8658, Document Recognition and Retrieval XX; 86580O (2013)
Event: IS&T/SPIE Electronic Imaging, 2013, Burlingame, California, United States
Digitization of historical Chinese documents relies on two key technologies: character segmentation and character recognition. This paper focuses on developing a character segmentation algorithm. As a preprocessing step, we combine several effective measures to remove noise from a historical Chinese document image. After binarization, a new character segmentation algorithm segments single characters based on projections of a cost image in local windows. The cost image is constructed from stroke bounding boxes and a skeleton image extracted from the binarized image. We evaluate the proposed algorithm by the degree of matching between character bounding boxes in the segmentation results and those in the ground-truth data, and achieve a recall rate of 74.3% on a test set, which shows the effectiveness of the proposed algorithm.
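The idea of projecting a cost image inside a local window can be illustrated with a minimal sketch. The abstract does not give the details of the method, so everything below is an assumption made for illustration: the function name `cut_rows`, the `threshold` parameter, the choice of a vertical band of columns as the local window, and the rule of cutting at the middle of each low-cost run of rows. The cost image is simply taken as a given 2D array; how it is built from stroke bounding boxes and the skeleton image is not shown.

```python
import numpy as np

def cut_rows(cost, x0, x1, threshold=0.0):
    """Hypothetical helper: propose cut rows inside the local window
    formed by columns x0..x1-1 of a 2D cost image.

    The cost is projected onto the vertical axis (summed across the
    window's columns). Each maximal run of rows whose projected cost
    is <= threshold yields one candidate cut at the middle of the run.
    """
    profile = cost[:, x0:x1].sum(axis=1)   # one value per row
    low = profile <= threshold             # rows that are cheap to cut through
    cuts = []
    start = None                           # first row of the current low run
    for y, is_low in enumerate(low):
        if is_low and start is None:
            start = y
        elif not is_low and start is not None:
            cuts.append((start + y - 1) // 2)
            start = None
    if start is not None:                  # run reaching the bottom edge
        cuts.append((start + len(low) - 1) // 2)
    return profile, cuts

# Toy cost image: two 3-row "characters" stacked in the left window.
cost = np.zeros((10, 6))
cost[1:4, 0:3] = 1.0
cost[6:9, 0:3] = 1.0

_, cuts = cut_rows(cost, 0, 3)
print(cuts)  # [0, 4, 9]
```

In this toy case the cut at row 4 separates the two blocks, while rows 0 and 9 mark the empty margins. Restricting the projection to a window rather than the full image width means that a neighbouring column of text with different character heights does not blur the profile.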
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Linjie Yang and Liangrui Peng "Local projection-based character segmentation method for historical Chinese documents", Proc. SPIE 8658, Document Recognition and Retrieval XX, 86580O (4 February 2013);

