{"title":"Convolutional Neural Network for Visual Artifacts Classification","authors":"A. Holesova, P. Sykora, P. Kamencay, M. Uhrina","doi":"10.1109/ELEKTRO53996.2022.9803377","DOIUrl":null,"url":null,"abstract":"In this paper, we present an effective convolutional classifier for recognition of visual artifacts. The proposed deep-learned model is simple in architecture and number of learnable parameters while retaining a sufficient generalization ability. The latter was achieved by compilation of extensive dataset based on ImageNet database. The model was trained sequentially on over 3 million images with 3 types of distortions with various severity, specifically 10 levels of Gaussian noise, 10 levels of Gaussian blur and 5 levels of blocking effect caused by JPEG compression. The model achieved excellent 99.88% accuracy on generated images. The performance of the model was evaluated on 4 well-known IQA datasets, where it reached 71.95% accuracy in average. Furthermore, after transferring the weights from the proposed model and short training on IQA datasets, its accuracy increased by more than 7%.","PeriodicalId":396752,"journal":{"name":"2022 ELEKTRO (ELEKTRO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 ELEKTRO (ELEKTRO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELEKTRO53996.2022.9803377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we present an effective convolutional classifier for recognition of visual artifacts. The proposed deep-learned model is simple in architecture and number of learnable parameters while retaining a sufficient generalization ability. The latter was achieved by compilation of extensive dataset based on ImageNet database. The model was trained sequentially on over 3 million images with 3 types of distortions with various severity, specifically 10 levels of Gaussian noise, 10 levels of Gaussian blur and 5 levels of blocking effect caused by JPEG compression. The model achieved excellent 99.88% accuracy on generated images. The performance of the model was evaluated on 4 well-known IQA datasets, where it reached 71.95% accuracy in average. Furthermore, after transferring the weights from the proposed model and short training on IQA datasets, its accuracy increased by more than 7%.