15 December 2003 Comprehensive printed Tibetan/English mixed text segmentation method
Author Affiliations +
Abstract
Text segmentation plays a crucial role in a text recognition system. A comprehensive method is proposed to solve Tibetan/English text segmentation. 2 algorithms based on Tibetan inter-syllabic tshegs and discirminant function, respectively, are presented to perform skew detection before text line separation. Then a dynamic recursive character segmentation algorithm integrating multi-level information is developed. The encouraging experimental results on a large-scale Tibetan/English mixed text set show the validity of proposed method.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hua Wang, Xiaoqing Ding, "Comprehensive printed Tibetan/English mixed text segmentation method", Proc. SPIE 5296, Document Recognition and Retrieval XI, (15 December 2003); doi: 10.1117/12.528949; https://doi.org/10.1117/12.528949
PROCEEDINGS
11 PAGES


SHARE
Back to Top