Paper
8 February 2015 Exploring multiple feature combination strategies with a recurrent neural network architecture for off-line handwriting recognition
L. Mioulet, G. Bideault, C. Chatelain, T. Paquet, S. Brunessaux
Author Affiliations +
Proceedings Volume 9402, Document Recognition and Retrieval XXII; 94020F (2015) https://doi.org/10.1117/12.2075665
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
The BLSTM-CTC is a novel recurrent neural network architecture that has outperformed previous state of the art algorithms in tasks such as speech recognition or handwriting recognition. It has the ability to process long term dependencies in temporal signals in order to label unsegmented data. This paper describes different ways of combining features using a BLSTM-CTC architecture. Not only do we explore the low level combination (feature space combination) but we also explore high level combination (decoding combination) and mid-level (internal system representation combination). The results are compared on the RIMES word database. Our results show that the low level combination works best, thanks to the powerful data modeling of the LSTM neurons.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
L. Mioulet, G. Bideault, C. Chatelain, T. Paquet, and S. Brunessaux "Exploring multiple feature combination strategies with a recurrent neural network architecture for off-line handwriting recognition", Proc. SPIE 9402, Document Recognition and Retrieval XXII, 94020F (8 February 2015); https://doi.org/10.1117/12.2075665
Lens.org Logo
CITATIONS
Cited by 10 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Neural networks

Databases

Feature extraction

Signal processing

Detection and tracking algorithms

Data processing

Image processing

Back to Top