13 April 2018 Generation method of synthetic training data for mobile OCR system
Author Affiliations +
Proceedings Volume 10696, Tenth International Conference on Machine Vision (ICMV 2017); 106962G (2018) https://doi.org/10.1117/12.2310119
Event: Tenth International Conference on Machine Vision, 2017, Vienna, Austria
Abstract
This paper addresses one of the fundamental problems of machine learning - training data acquiring. Obtaining enough natural training data is rather difficult and expensive. In last years usage of synthetic images has become more beneficial as it allows to save human time and also to provide a huge number of images which otherwise would be difficult to obtain. However, for successful learning on artificial dataset one should try to reduce the gap between natural and synthetic data distributions. In this paper we describe an algorithm which allows to create artificial training datasets for OCR systems using russian passport as a case study.
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yulia S. Chernyshova, Yulia S. Chernyshova, Alexander V. Gayer, Alexander V. Gayer, Alexander V. Sheshkus, Alexander V. Sheshkus, "Generation method of synthetic training data for mobile OCR system", Proc. SPIE 10696, Tenth International Conference on Machine Vision (ICMV 2017), 106962G (13 April 2018); doi: 10.1117/12.2310119; https://doi.org/10.1117/12.2310119
PROCEEDINGS
7 PAGES


SHARE
Back to Top