17 October 2017 Review network for scene text recognition
Author Affiliations +
Abstract
Recognizing text in images captured in the wild is a fundamental preprocessing task for many computer vision and machine learning applications and has gained significant attention in recent years. This paper proposes an end-to-end trainable deep review neural network for scene text recognition, which is a combination of feature extraction, feature reviewing, feature attention, and sequence recognition. Our model can generate the predicted text without any segmentation or grouping algorithm. Because the attention model in the feature attention stage lacks global modeling ability, a review network is applied to extract the global context of sequence data in the feature reviewing stage. We perform rigorous experiments across a number of standard benchmarks, including IIIT5K, SVT, ICDAR03, and ICDAR13 datasets. Experimental results show that our model is comparable to or outperforms state-of-the-art techniques.
© 2017 SPIE and IS&T
Shuohao Li, Shuohao Li, Anqi Han, Anqi Han, Xu Chen, Xu Chen, Xiaoqing Yin, Xiaoqing Yin, Jun Zhang, Jun Zhang, } "Review network for scene text recognition," Journal of Electronic Imaging 26(5), 053023 (17 October 2017). https://doi.org/10.1117/1.JEI.26.5.053023 . Submission: Received: 17 June 2017; Accepted: 26 September 2017
Received: 17 June 2017; Accepted: 26 September 2017; Published: 17 October 2017
JOURNAL ARTICLE
9 PAGES


SHARE
Back to Top