{"title":"Class Activation Map based Random Erasing for Data Augmentation","authors":"Juhyeon Oh, Kyujoong Lee","doi":"10.9717/kmms.2023.26.10.1231","DOIUrl":null,"url":null,"abstract":"Random erasing offers various levels of occlusion for data augmentation. However, due to its uniform distribution of random selection, it sometimes occludes regions that are unrelated to the object of interest. In this paper, we propose a novel method that utilizes Gradient Weighted Class Activation Mapping (Grad-CAM) for estimating the location of the object of interest and selectively erasing the surrounding areas. By utilizing Grad-CAM, we improve random erasing for CNN models without requiring additional modules or architectural changes. We generate Grad-CAM after the intermediate epochs where CNN models have sufficient representational power for the training data. The hyperparameter that restrict the erasing to the vicinity of the object is set based on Grad-CAM, and experiments were conducted accordingly. As a result of our experiments, we observed a 0.33% decrease in error-rate for image classification tasks using ResNet-20 on the CIFAR-10 dataset.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"27 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korea Multimedia Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9717/kmms.2023.26.10.1231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Random erasing offers various levels of occlusion for data augmentation. However, due to its uniform distribution of random selection, it sometimes occludes regions that are unrelated to the object of interest. In this paper, we propose a novel method that utilizes Gradient Weighted Class Activation Mapping (Grad-CAM) for estimating the location of the object of interest and selectively erasing the surrounding areas. By utilizing Grad-CAM, we improve random erasing for CNN models without requiring additional modules or architectural changes. We generate Grad-CAM after the intermediate epochs where CNN models have sufficient representational power for the training data. The hyperparameter that restrict the erasing to the vicinity of the object is set based on Grad-CAM, and experiments were conducted accordingly. As a result of our experiments, we observed a 0.33% decrease in error-rate for image classification tasks using ResNet-20 on the CIFAR-10 dataset.