Preprint Review Version 1 Preserved in Portico This version is not peer-reviewed

Summary of Document Image Binarization

Version 1 : Received: 20 January 2024 / Approved: 21 January 2024 / Online: 22 January 2024 (04:32:59 CET)

A peer-reviewed article of this Preprint also exists.

Yang, Z.; Zuo, S.; Zhou, Y.; He, J.; Shi, J. A Review of Document Binarization: Main Techniques, New Challenges, and Trends. Electronics 2024, 13, 1394. Yang, Z.; Zuo, S.; Zhou, Y.; He, J.; Shi, J. A Review of Document Binarization: Main Techniques, New Challenges, and Trends. Electronics 2024, 13, 1394.

Abstract

Document image binarization is a challenging task, especially when it comes to text segmentation in degraded document images. The binarization, as a pre-processing step of optical character recognition (OCR), is one of the most fundamental and commonly used segmentation methods. It separates the foreground text from the background of the document image to facilitate subsequent image processing. In view of the different degradation degree of document image, researchers have proposed a variety of solutions. This paper reviews the main binarization techniques, including both traditional algorithms and deep learning-based algorithms. We also summarize some difficulties and challenges in the field of document image binarization. Here, we evaluate various image binarization techniques to identify shortcomings in current methods and provide some help for future research.

Keywords

degraded document images; binarization; threshold processing; deep learning

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.