{"title":"A system for document binarization","authors":"E. Badekas, N. Papamarkos","doi":"10.1109/ISPA.2003.1296408","DOIUrl":null,"url":null,"abstract":"This paper presents a system for binarization of digital documents. The system comprises the benefits of a set of other binarization techniques by combining their results. This is necessary for bad illuminated and degraded document where there are many pixels that cannot be easily classified as foreground or background. For this reason, it is necessary to perform the final binarization by exploiting the results of a set of binarization algorithms, especially for the document pixels that have high vagueness. Also, in this paper significant improvements are proposed for two of the methods used, i.e. for the Adaptive Logical Level Technique (ALLT) and the Improvement of Integrated Function Algorithm (IIFA). The entire system is extensively tested with a variety of degraded and bad-illuminated documents.","PeriodicalId":218932,"journal":{"name":"3rd International Symposium on Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the","volume":"338 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"3rd International Symposium on Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPA.2003.1296408","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
This paper presents a system for binarization of digital documents. The system comprises the benefits of a set of other binarization techniques by combining their results. This is necessary for bad illuminated and degraded document where there are many pixels that cannot be easily classified as foreground or background. For this reason, it is necessary to perform the final binarization by exploiting the results of a set of binarization algorithms, especially for the document pixels that have high vagueness. Also, in this paper significant improvements are proposed for two of the methods used, i.e. for the Adaptive Logical Level Technique (ALLT) and the Improvement of Integrated Function Algorithm (IIFA). The entire system is extensively tested with a variety of degraded and bad-illuminated documents.