Binarization of Document Images: A Comprehensive Review

Document image binarization is one important pre-processing step, especially for data analysis. Extraction of text from images and its recognition may be challenging due to the presence of noise and degradation in document images. In this paper, seven (7) types of binarization method were discussed...

Full description

Saved in:
Bibliographic Details
Published inJournal of physics. Conference series Vol. 1019; no. 1; pp. 12023 - 12031
Main Authors Mustafa, Wan Azani, Abdul Kader, Mohamed Mydin M.
Format Journal Article
LanguageEnglish
Published Bristol IOP Publishing 01.06.2018
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Document image binarization is one important pre-processing step, especially for data analysis. Extraction of text from images and its recognition may be challenging due to the presence of noise and degradation in document images. In this paper, seven (7) types of binarization method were discussed and tested on Handwritten Document Image Binarization Contest (H-DIBCO 2012). The aim of this paper is to provide comprehensive review methods in order to binary document images in the damaging background. The results of the numerical simulation indicate that the Gradient Based method most effective and efficient compared to other methods. Hopefully, the implications of this review give future research directions for the researchers.
ISSN:1742-6588
1742-6596
DOI:10.1088/1742-6596/1019/1/012023