An Improved Adaptive Document Image Binarization Method

Document image binarization is the basis of Optical Character Recognition (OCR). For B. Gatos' adaptive binarization method exists the shortcomings, we propose an improved adaptive document image binarization method which consists of four steps. The first step is dedicated to a denoising proced...

Full description

Saved in:
Bibliographic Details
Published in2009 2nd International Congress on Image and Signal Processing pp. 1 - 5
Main Authors Shuang-fei Zhou, Chun-ping Liu, Zhi-ming Cui, Sheng-rong Gong
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.10.2009
Subjects
Online AccessGet full text
ISBN1424441293
9781424441297
DOI10.1109/CISP.2009.5302270

Cover

Loading…
More Information
Summary:Document image binarization is the basis of Optical Character Recognition (OCR). For B. Gatos' adaptive binarization method exists the shortcomings, we propose an improved adaptive document image binarization method which consists of four steps. The first step is dedicated to a denoising procedure using a low-pass Wiener filter based on local statistics. In the second step, we use a first rough estimation of foreground regions using binarization method based on the Laplacian-Gauss algorithm. As a third step, we compute the background surface of the image by interpolating neighboring background intensities into the foreground areas that result from the previous step. In the fourth step, we proceed to the final binarization by combining information from the calculated background surface and the original image, and accomplish final binary. The method has good robustness for uneven illumination, using the algorithm extract foreground regions, it will get fewer lost strokes and can be effective to retain the edge information. The experimental results show that the improved method has better characteristics than other four kinds of typical document image binarization methods.
ISBN:1424441293
9781424441297
DOI:10.1109/CISP.2009.5302270