19 June 2017 Recognizing Chinese characters in digital ink from non-native language writers using hierarchical models
Author Affiliations +
Proceedings Volume 10443, Second International Workshop on Pattern Recognition; 104430A (2017); doi: 10.1117/12.2280237
Event: Second International Workshop on Pattern Recognition, 2017, Singapore, Singapore
Abstract
While Chinese is learned as a second language, its characters are taught step by step from their strokes to components, radicals to components, and their complex relations. Chinese Characters in digital ink from non-native language writers are deformed seriously, thus the global recognition approaches are poorer. So a progressive approach from bottom to top is presented based on hierarchical models. Hierarchical information includes strokes and hierarchical components. Each Chinese character is modeled as a hierarchical tree. Strokes in one Chinese characters in digital ink are classified with Hidden Markov Models and concatenated to the stroke symbol sequence. And then the structure of components in one ink character is extracted. According to the extraction result and the stroke symbol sequence, candidate characters are traversed and scored. Finally, the recognition candidate results are listed by descending. The method of this paper is validated by testing 19815 copies of the handwriting Chinese characters written by foreign students.
© (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Hao Bai, Xi-wen Zhang, "Recognizing Chinese characters in digital ink from non-native language writers using hierarchical models", Proc. SPIE 10443, Second International Workshop on Pattern Recognition, 104430A (19 June 2017); doi: 10.1117/12.2280237; http://dx.doi.org/10.1117/12.2280237
PROCEEDINGS
6 PAGES


SHARE
KEYWORDS
Error analysis

Optical character recognition

Chromium

Calcium

Computing systems

Information science

Machine learning

Back to Top