{"title":"Error Diffusion Halftone Classification using Contrastive Learning","authors":"Jing-Ming Guo, S. Sankarasrinivasan","doi":"10.1109/ICCE-Taiwan55306.2022.9869191","DOIUrl":null,"url":null,"abstract":"Error diffusion halftoning is one of the widely adopted technique in printers, to transform the gray-scale image into its approximate binary version. Further, the classification of halftones is very important to facilitate inverse halftoning, source printer identification, forensics analysis and other halftone processing tasks. Practically, majority of the printed documents are unlabeled and hence hard to train using supervised approach. This study exploits the advantage of self-supervised learning (SSL), in particular the simplified framework for contrastive learning of visual representation in learning best representation features for halftones. As the data augmentation play a critical role in SSL models, and this study focus on optimization of the existing augmentations and also added new random augmentation techniques to enhance the feature learning. In addition, different variants of ResNet backbone is tried to find the ideal case, and the error diffusion dataset is also generated for analysis. From detailed experiments, it has been found that the proposed method can perform consistent with supervised learning approach without large labelled data.","PeriodicalId":164671,"journal":{"name":"2022 IEEE International Conference on Consumer Electronics - Taiwan","volume":"159 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Consumer Electronics - Taiwan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-Taiwan55306.2022.9869191","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Error diffusion halftoning is one of the widely adopted technique in printers, to transform the gray-scale image into its approximate binary version. Further, the classification of halftones is very important to facilitate inverse halftoning, source printer identification, forensics analysis and other halftone processing tasks. Practically, majority of the printed documents are unlabeled and hence hard to train using supervised approach. This study exploits the advantage of self-supervised learning (SSL), in particular the simplified framework for contrastive learning of visual representation in learning best representation features for halftones. As the data augmentation play a critical role in SSL models, and this study focus on optimization of the existing augmentations and also added new random augmentation techniques to enhance the feature learning. In addition, different variants of ResNet backbone is tried to find the ideal case, and the error diffusion dataset is also generated for analysis. From detailed experiments, it has been found that the proposed method can perform consistent with supervised learning approach without large labelled data.