Paper
8 February 2015 Min-cut segmentation of cursive handwriting in tabular documents
Brian L. Davis, William A. Barrett, Scott D. Swingle
Author Affiliations +
Proceedings Volume 9402, Document Recognition and Retrieval XXII; 940208 (2015) https://doi.org/10.1117/12.2076228
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
Handwritten tabular documents, such as census, birth, death and marriage records, contain a wealth of information vital to genealogical and related research. Much work has been done in segmenting freeform handwriting, however, segmentation of cursive handwriting in tabular documents is still an unsolved problem. Tabular documents present unique segmentation challenges caused by handwriting overlapping cell-boundaries and other words, both horizontally and vertically, as “ascenders” and “descenders” overlap into adjacent cells. This paper presents a method for segmenting handwriting in tabular documents using a min-cut/max-flow algorithm on a graph formed from a distance map and connected components of handwriting. Specifically, we focus on line, word and first letter segmentation. Additionally, we include the angles of strokes of the handwriting as a third dimension to our graph to enable the resulting segments to share pixels of overlapping letters. Word segmentation accuracy is 89.5% evaluating lines of the data set used in the ICDAR2013 Handwriting Segmentation Contest. Accuracy is 92.6% for a specific application of segmenting first and last names from noisy census records. Accuracy for segmenting lines of names from noisy census records is 80.7%. The 3D graph cutting shows promise in segmenting overlapping letters, although highly convoluted or overlapping handwriting remains an ongoing challenge.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Brian L. Davis, William A. Barrett, and Scott D. Swingle "Min-cut segmentation of cursive handwriting in tabular documents", Proc. SPIE 9402, Document Recognition and Retrieval XXII, 940208 (8 February 2015); https://doi.org/10.1117/12.2076228
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

3D image processing

Neodymium

Cameras

Clouds

Algorithm development

Computer science

Back to Top