Document compression using rate-distortion optimized segmentation
1 April 2001
Effective document compression algorithms require that scanned document images first be segmented into regions such as text, pictures, and background. In this paper, we present a multilayer compression algorithm for document images. This compression algorithm first segments a scanned document image into different classes, then compresses each class using an algorithm specifically designed for that class. Two algorithms are investigated for segmenting document images: a direct image segmentation algorithm called the trainable sequential MAP (TSMAP) segmentation algorithm, and a rate-distortion optimized segmentation (RDOS) algorithm. The RDOS algorithm works in a closed-loop fashion by applying each coding method to each region of the document and then selecting the method that yields the best rate-distortion trade-off. Compared with the TSMAP algorithm, the RDOS algorithm can often achieve a better rate-distortion trade-off and produce more robust segmentations by eliminating misclassifications that can cause severe artifacts. At similar bit rates, the multilayer compression algorithm using RDOS can achieve a much higher subjective quality than state-of-the-art compression algorithms such as DjVu and SPIHT.
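The closed-loop selection described in the abstract can be illustrated with a small sketch. The abstract does not state the cost function or the coders used, so everything below is an assumption made for illustration: the Lagrangian cost D + lambda * R, the squared-error distortion, the one-dimensional blocks, and the three toy coders (`flat_coder`, `bilevel_coder`, `raw_coder`) are hypothetical stand-ins, not the authors' method.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# A coder maps a block of pixel values to (rate in bits, reconstructed block).
Coder = Callable[[Sequence[int]], Tuple[float, List[int]]]


def squared_error(block: Sequence[int], recon: Sequence[int]) -> float:
    """Distortion measure assumed for this sketch: sum of squared errors."""
    return float(sum((a - b) ** 2 for a, b in zip(block, recon)))


def flat_coder(block: Sequence[int]) -> Tuple[float, List[int]]:
    """Hypothetical 'background' coder: one 8-bit mean value per block."""
    mean = round(sum(block) / len(block))
    return 8.0, [mean] * len(block)


def bilevel_coder(block: Sequence[int]) -> Tuple[float, List[int]]:
    """Hypothetical 'text' coder: two 8-bit levels plus a 1-bit mask per pixel."""
    lo, hi = min(block), max(block)
    threshold = (lo + hi) / 2
    recon = [hi if p > threshold else lo for p in block]
    return 16.0 + len(block), recon


def raw_coder(block: Sequence[int]) -> Tuple[float, List[int]]:
    """Hypothetical 'picture' coder: 8 bits per pixel, no distortion."""
    return 8.0 * len(block), list(block)


def rd_select(blocks: Sequence[Sequence[int]],
              coders: Dict[str, Coder],
              lam: float) -> List[str]:
    """Try every coder on every block and keep the one with the lowest
    assumed cost D + lam * R. The chosen labels form the segmentation."""
    labels = []
    for block in blocks:
        best_name, best_cost = None, float("inf")
        for name, coder in coders.items():
            rate, recon = coder(block)
            cost = squared_error(block, recon) + lam * rate
            if cost < best_cost:
                best_name, best_cost = name, cost
        labels.append(best_name)
    return labels


coders = {"flat": flat_coder, "bilevel": bilevel_coder, "raw": raw_coder}
blocks = [
    [100] * 8,                             # uniform region
    [0, 255, 0, 255, 0, 255, 0, 255],      # two-level region
    [10, 60, 120, 180, 240, 30, 90, 200],  # continuous-tone region
]
print(rd_select(blocks, coders, lam=1.0))
```

In this toy setting the class label of each block is a by-product of the coder choice rather than the output of a separate classifier, which is the sense in which the segmentation is driven by the rate-distortion trade-off. Raising `lam` weights rate more heavily and pushes the selection toward the cheapest coder.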
©(2001) Society of Photo-Optical Instrumentation Engineers (SPIE)
Hui Cheng and Charles A. Bouman "Document compression using rate-distortion optimized segmentation," Journal of Electronic Imaging 10(2), (1 April 2001). https://doi.org/10.1117/1.1344590
Published: 1 April 2001
KEYWORDS
Image segmentation

Image compression

Image processing algorithms and systems

Distortion

Quantization

Binary data

Visualization
