Title: Optimization of Degraded Document Image Binarization Method Based on Background Estimation
Authors: Michalak, Hubert
Okarma, Krzysztof
Citation: WSCG 2020: full papers proceedings: 28th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, p. 89-98.
Issue Date: 2020
Publisher: Václav Skala - UNION Agency
Document type: konferenční příspěvek
URI: http://wscg.zcu.cz/WSCG2020/2020-CSRN-3001.pdf
ISBN: 978-80-86943-35-0
ISSN: 2464–4617 (print)
2464–4625 (CD-ROM)
Keywords: prahování obrazu;obrázky dokumentů;odhad pozadí;binarizace
Keywords in different language: image thresholding;document images;background estimation;binarization
Abstract in different language: Binarization of document images is one of the most relevant pre-processing operations, leading to a significant decrease of the amount of information used during their further analysis. Since many document images, particularly historical, may be degraded over time, the application of some simple global thresholding methods usually lead to highly unsatisfactory results. A similar situation may occur for unevenly illuminated images, limiting the visibility of various shapes, representing not only the alphanumerical characters. A typical solution of this problem is the application of some adaptive thresholding methods, as well as more sophisticated solutions, proposed recently e.g. during Document Image Binarization Competitions (DIBCO) or TQ-DIB 2019 competition. Nevertheless, due to their relatively high computational demands, there is still a need of some faster methods, leading to high binarization accuracy for challenging benchmark datasets, such as DIBCO or Nabuco. Hence, the adaptation and optimization of the parameters of the fast thresholding method utilizing background estimation, proposed originally for the OCR purposes and verified for unevenly illuminated printed documents, is presented in this paper. The proposed solution has been optimized and verified using the state-of-the-art datasets containing 166 degraded document images together with their ground-truth binary equivalents, leading to better results, also in comparison to much slower adaptive thresholding methods. The performance of all methods used in comparisons has been determined using commonly accepted metrics, such as F-Measure, Accuracy, Distance Reciprocal Distortion (DRD) or Misclassification Penalty Metric (MPM), and relative execution time, calculated for all used image datasets.
Rights: © Václav Skala - UNION Agency
Appears in Collections:WSCG 2020: Full Papers Proceedings

Files in This Item:
File Description SizeFormat 
G05.pdfPlný text7,91 MBAdobe PDFView/Open

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/38455

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.