{"title":"Image data augmentation method based on maximum activation point guided erasure","authors":"Yun Jiang, Shengxin Tao, Hai Zhang, Simin Cao","doi":"10.1109/CTISC49998.2020.00027","DOIUrl":null,"url":null,"abstract":"Deep neural networks usually contain tens to hundreds of millions of orders of learning parameters that provide the necessary representation to solve various visual tasks. But with the increase of the representational ability, the possibility of over-fitting also increase, which bring about poor generalization. In this paper, we propose MA (Maximum Activation point processing) algorithm, a new image data augmentation method which is designed to improve the generalization ability of the model and reduce the risk of overfitting. During the training process, the most discriminative part of the input image is searched for, and the model is driven to search for the supplementary information of the most important feature information by erasing the maximum attention image block. During this process, training images with different occlusion levels are generated as new inputs to the network and the model continues to be trained. The image erasure method based on the maximum activation point guidance only needs to modify the input image, which can effectively improve the robustness of the model to occluded image recognition, and can be integrated with various network structures. The effectiveness of our method is verified on the Cifar10, Cifar100 and Fashion-MNIST datasets.","PeriodicalId":266384,"journal":{"name":"2020 2nd International Conference on Advances in Computer Technology, Information Science and Communications (CTISC)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Advances in Computer Technology, Information Science and Communications (CTISC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CTISC49998.2020.00027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Deep neural networks usually contain tens to hundreds of millions of orders of learning parameters that provide the necessary representation to solve various visual tasks. But with the increase of the representational ability, the possibility of over-fitting also increase, which bring about poor generalization. In this paper, we propose MA (Maximum Activation point processing) algorithm, a new image data augmentation method which is designed to improve the generalization ability of the model and reduce the risk of overfitting. During the training process, the most discriminative part of the input image is searched for, and the model is driven to search for the supplementary information of the most important feature information by erasing the maximum attention image block. During this process, training images with different occlusion levels are generated as new inputs to the network and the model continues to be trained. The image erasure method based on the maximum activation point guidance only needs to modify the input image, which can effectively improve the robustness of the model to occluded image recognition, and can be integrated with various network structures. The effectiveness of our method is verified on the Cifar10, Cifar100 and Fashion-MNIST datasets.
深度神经网络通常包含数千万到数亿阶的学习参数,这些参数为解决各种视觉任务提供了必要的表示。但随着表征能力的提高,过度拟合的可能性也随之增加,导致泛化效果较差。本文提出了一种新的图像数据增强方法MA (Maximum Activation point processing,最大激活点处理)算法,该算法旨在提高模型的泛化能力,降低过拟合的风险。在训练过程中,搜索输入图像中最具判别性的部分,并通过擦除最大关注图像块来驱动模型搜索最重要特征信息的补充信息。在此过程中,生成不同遮挡水平的训练图像作为网络的新输入,并继续训练模型。基于最大激活点制导的图像擦除方法只需要修改输入图像,可以有效提高模型对遮挡图像识别的鲁棒性,并且可以与各种网络结构集成。在Cifar10、Cifar100和Fashion-MNIST数据集上验证了该方法的有效性。