Volume Table of Contents

PROCEEDINGS VOLUME 9402

SPIE/IS&T ELECTRONIC IMAGING | 8-12 FEBRUARY 2015

Document Recognition and Retrieval XXII

Editor(s): Eric K. Ringger, Bart Lamiroy

Editor Affiliations +

IN THIS VOLUME

8 Sessions, 23 Papers, 0 Presentations

Front Matter: Volume 9402 (1)

Document Layout Analysis and Understanding (3)

Document Structure Semantics, Forms, and Tables (4)

Text Analysis (4)

Handwriting I (5)

Quality and Compression (2)

Graphics and Structure (3)

Handwriting II (1)

SPIE/IS&T ELECTRONIC IMAGING

8-12 February 2015

San Francisco, California, United States

Present at an SPIE Conference

Subscribe to Digital Library

VIEW ALL ABSTRACTS +

Front Matter: Volume 9402

Front Matter: Volume 9402 Open Access

Proceedings Volume Document Recognition and Retrieval XXII, 940201 (2015) https://doi.org/10.1117/12.2185612

Read Abstract +

Document Layout Analysis and Understanding

Ground truth model, tool, and dataset for layout analysis of historical documents Paper

Kai Chen, Mathias Seuret, Hao Wei, Marcus Liwicki, Jean Hennebert, Rolf Ingold

Proceedings Volume Document Recognition and Retrieval XXII, 940204 (2015) https://doi.org/10.1117/12.2075858

Read Abstract +

Use of SLIC superpixels for ancient document image enhancement and segmentation Paper

Maroua Mehri, Nabil Sliti, Pierre Héroux, Petra Gomez-Krämer, Najoua Essoukri Ben Amara, Rémy Mullot

Proceedings Volume Document Recognition and Retrieval XXII, 940205 (2015) https://doi.org/10.1117/12.2076020

Read Abstract +

Software workflow for the automatic tagging of medieval manuscript images (SWATI) Paper

Swati Chandna, Danah Tonne, Thomas Jejkal, Rainer Stotzka, Celia Krause, Philipp Vanscheidt, Hannah Busch, Ajinkya Prabhune

Proceedings Volume Document Recognition and Retrieval XXII, 940206 (2015) https://doi.org/10.1117/12.2076124

Read Abstract +

Digital methods, tools and algorithms are gaining in importance for the analysis of digitized manuscript collections in the arts and humanities. One example is the BMBF-funded research project “eCodicology” which aims to design, evaluate and optimize algorithms for the automatic identification of macro- and micro-structural layout features of medieval manuscripts. The main goal of this research project is to provide better insights into high-dimensional datasets of medieval manuscripts for humanities scholars. The heterogeneous nature and size of the humanities data and the need to create a database of automatically extracted reproducible features for better statistical and visual analysis are the main challenges in designing a workflow for the arts and humanities. This paper presents a concept of a workflow for the automatic tagging of medieval manuscripts. As a starting point, the workflow uses medieval manuscripts digitized within the scope of the project Virtual Scriptorium St. Matthias". Firstly, these digitized manuscripts are ingested into a data repository. Secondly, specific algorithms are adapted or designed for the identification of macro- and micro-structural layout elements like page size, writing space, number of lines etc. And lastly, a statistical analysis and scientific evaluation of the manuscripts groups are performed. The workflow is designed generically to process large amounts of data automatically with any desired algorithm for feature extraction. As a result, a database of objectified and reproducible features is created which helps to analyze and visualize hidden relationships of around 170,000 pages. The workflow shows the potential of automatic image analysis by enabling the processing of a single page in less than a minute. Furthermore, the accuracy tests of the workflow on a small set of manuscripts with respect to features like page size and text areas show that automatic and manual analysis are comparable. The usage of a computer cluster will allow the highly performant processing of large amounts of data. The software framework itself will be integrated as a service into the DARIAH infrastructure to make it adaptable for wider range of communities.

Document Structure Semantics, Forms, and Tables

Math expression retrieval using an inverted index over symbol pairs Paper

David Stalnaker, Richard Zanibbi

Proceedings Volume Document Recognition and Retrieval XXII, 940207 (2015) https://doi.org/10.1117/12.2074084

Read Abstract +

Min-cut segmentation of cursive handwriting in tabular documents Paper

Brian L. Davis, William A. Barrett, Scott D. Swingle

Proceedings Volume Document Recognition and Retrieval XXII, 940208 (2015) https://doi.org/10.1117/12.2076228

Read Abstract +

Cross-reference identification within a PDF document Paper

Sida Li, Liangcai Gao, Zhi Tang, Yinyan Yu

Proceedings Volume Document Recognition and Retrieval XXII, 940209 (2015) https://doi.org/10.1117/12.2076237

Read Abstract +

Intelligent indexing: a semi-automated, trainable system for field labeling Paper

Robert Clawson, William Barrett

Proceedings Volume Document Recognition and Retrieval XXII, 94020A (2015) https://doi.org/10.1117/12.2076862

Read Abstract +

Text Analysis

Re-typograph phase I: a proof-of-concept for typeface parameter extraction from historical documents Paper

Bart Lamiroy, Thomas Bouville, Julien Blégean, Hongliu Cao, Salah Ghamizi, Romain Houpin, Matthias Lloyd

Proceedings Volume Document Recognition and Retrieval XXII, 94020B (2015) https://doi.org/10.1117/12.2075813

Read Abstract +

Clustering of Farsi sub-word images for whole-book recognition Paper

Mohammad Reza Soheili, Ehsanollah Kabir, Didier Stricker

Proceedings Volume Document Recognition and Retrieval XXII, 94020C (2015) https://doi.org/10.1117/12.2075931

Read Abstract +

Gaussian process style transfer mapping for historical Chinese character recognition Paper

Jixiong Feng, Liangrui Peng, Franck Lebourgeois

Proceedings Volume Document Recognition and Retrieval XXII, 94020D (2015) https://doi.org/10.1117/12.2076119

Read Abstract +

Boost OCR accuracy using iVector based system combination approach Paper

Xujun Peng, Huaigu Cao, Prem Natarajan

Proceedings Volume Document Recognition and Retrieval XXII, 94020E (2015) https://doi.org/10.1117/12.2076241

Read Abstract +

Handwriting I

Exploring multiple feature combination strategies with a recurrent neural network architecture for off-line handwriting recognition Paper

L. Mioulet, G. Bideault, C. Chatelain, T. Paquet, S. Brunessaux

Proceedings Volume Document Recognition and Retrieval XXII, 94020F (2015) https://doi.org/10.1117/12.2075665

Read Abstract +

Spotting handwritten words and REGEX using a two stage BLSTM-HMM architecture Paper

Gautier Bideault, Luc Mioulet, Clément Chatelain, Thierry Paquet

Proceedings Volume Document Recognition and Retrieval XXII, 94020G (2015) https://doi.org/10.1117/12.2075796

Read Abstract +

A comparison of 1D and 2D LSTM architectures for the recognition of handwritten Arabic Paper

Mohammad Reza Yousefi, Mohammad Reza Soheili, Thomas M. Breuel, Didier Stricker

Proceedings Volume Document Recognition and Retrieval XXII, 94020H (2015) https://doi.org/10.1117/12.2075930

Read Abstract +

Aligning transcript of historical documents using dynamic programming Paper

Irina Rabaev, Rafi Cohen, Jihad El-Sana, Klara Kedem

Proceedings Volume Document Recognition and Retrieval XXII, 94020I (2015) https://doi.org/10.1117/12.2076062

Read Abstract +

Offline handwritten word recognition using MQDF-HMMs Paper

Sitaram Ramachandrula, Mangesh Hambarde, Ajay Patial, Dushyant Sahoo, Shaivi Kochar

Proceedings Volume Document Recognition and Retrieval XXII, 94020J (2015) https://doi.org/10.1117/12.2076144

Read Abstract +

Quality and Compression

Separation of text and background regions for high performance document image compression Paper

Wei Fan, Jun Sun, Satoshi Naoi

Proceedings Volume Document Recognition and Retrieval XXII, 94020K (2015) https://doi.org/10.1117/12.2075416

Read Abstract +

Metric-based no-reference quality assessment of heterogeneous document images Paper

Nibal Nayef, Jean-Marc Ogier

Proceedings Volume Document Recognition and Retrieval XXII, 94020L (2015) https://doi.org/10.1117/12.2076150

Read Abstract +

Graphics and Structure

Clustering header categories extracted from web tables Paper

George Nagy, David W. Embley, Mukkai Krishnamoorthy, Sharad Seth

Proceedings Volume Document Recognition and Retrieval XXII, 94020M (2015) https://doi.org/10.1117/12.2076209

Read Abstract +

A diagram retrieval method with multi-label learning Paper

Songping Fu, Xiaoqing Lu, Lu Liu, Jingwei Qu, Zhi Tang

Proceedings Volume Document Recognition and Retrieval XXII, 94020N (2015) https://doi.org/10.1117/12.2075848

Read Abstract +

Detection of electrical circuit elements from documents images Paper

Paramita De, Sekhar Mandal, Amit Das, Bhabatosh Chanda

Proceedings Volume Document Recognition and Retrieval XXII, 94020O (2015) https://doi.org/10.1117/12.2078211

Read Abstract +

Handwriting II

Missing value imputation: with application to handwriting data Paper

Zhen Xu, Sargur N. Srihari

Proceedings Volume Document Recognition and Retrieval XXII, 94020P (2015) https://doi.org/10.1117/12.2075842

Read Abstract +

Keywords/Phrases

Search In:

Publication Years