1 April 1998 Finding text in color images
Author Affiliations +
Abstract
In this paper, we consider the problem of locating and extracting text from WWW images. A previous algorithm based on color clustering and connected components analysis works well as long as the color of each character is relatively uniform and the typography is fairly simple. It breaks down quickly, however, when these assumptions are violated. In this paper, we describe more robust techniques for dealing with this challenging problem. We present an improved color clustering algorithm that measures similarity based on both RGB and spatial proximity. Layout analysis is also incorporated to handle more complex typography. THese changes significantly enhance the performance of our text detection procedure.
© (1998) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jiangying Zhou, Daniel P. Lopresti, Tolga Tasdizen, "Finding text in color images", Proc. SPIE 3305, Document Recognition V, (1 April 1998); doi: 10.1117/12.304625; https://doi.org/10.1117/12.304625
PROCEEDINGS
11 PAGES


SHARE
Back to Top