24 March 2014 Variational dynamic background model for keyword spotting in handwritten documents
Author Affiliations +
We propose a bayesian framework for keyword spotting in handwritten documents. This work is an extension to our previous work where we proposed dynamic background model, DBM for keyword spotting that takes into account the local character level scores and global word level scores to learn a logistic regression classifier to separate keywords from non-keywords. In this work, we add a bayesian layer on top of the DBM called the variational dynamic background model, VDBM. The logistic regression classifier uses the sigmoid function to separate keywords from non-keywords. The sigmoid function being neither convex nor concave, exact inference of VDBM becomes intractable. An expectation maximization step is proposed to do approximate inference. The advantage of VDBM over the DBM is multi-fold. Firstly, being bayesian, it prevents over-fitting of data. Secondly, it provides better modeling of data and an improved prediction of unseen data. VDBM is evaluated on the IAM dataset and the results prove that it outperforms our prior work and other state of the art line based word spotting system.
© (2014) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Gaurav Kumar, Gaurav Kumar, Safwan Wshah, Safwan Wshah, Venu Govindaraju, Venu Govindaraju, "Variational dynamic background model for keyword spotting in handwritten documents", Proc. SPIE 9021, Document Recognition and Retrieval XXI, 902104 (24 March 2014); doi: 10.1117/12.2041244; https://doi.org/10.1117/12.2041244

Back to Top