{"title":"基于类激活映射的随机擦除数据增强","authors":"Juhyeon Oh, Kyujoong Lee","doi":"10.9717/kmms.2023.26.10.1231","DOIUrl":null,"url":null,"abstract":"Random erasing offers various levels of occlusion for data augmentation. However, due to its uniform distribution of random selection, it sometimes occludes regions that are unrelated to the object of interest. In this paper, we propose a novel method that utilizes Gradient Weighted Class Activation Mapping (Grad-CAM) for estimating the location of the object of interest and selectively erasing the surrounding areas. By utilizing Grad-CAM, we improve random erasing for CNN models without requiring additional modules or architectural changes. We generate Grad-CAM after the intermediate epochs where CNN models have sufficient representational power for the training data. The hyperparameter that restrict the erasing to the vicinity of the object is set based on Grad-CAM, and experiments were conducted accordingly. As a result of our experiments, we observed a 0.33% decrease in error-rate for image classification tasks using ResNet-20 on the CIFAR-10 dataset.","PeriodicalId":16316,"journal":{"name":"Journal of Korea Multimedia Society","volume":"27 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Class Activation Map based Random Erasing for Data Augmentation\",\"authors\":\"Juhyeon Oh, Kyujoong Lee\",\"doi\":\"10.9717/kmms.2023.26.10.1231\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Random erasing offers various levels of occlusion for data augmentation. However, due to its uniform distribution of random selection, it sometimes occludes regions that are unrelated to the object of interest. In this paper, we propose a novel method that utilizes Gradient Weighted Class Activation Mapping (Grad-CAM) for estimating the location of the object of interest and selectively erasing the surrounding areas. By utilizing Grad-CAM, we improve random erasing for CNN models without requiring additional modules or architectural changes. We generate Grad-CAM after the intermediate epochs where CNN models have sufficient representational power for the training data. The hyperparameter that restrict the erasing to the vicinity of the object is set based on Grad-CAM, and experiments were conducted accordingly. As a result of our experiments, we observed a 0.33% decrease in error-rate for image classification tasks using ResNet-20 on the CIFAR-10 dataset.\",\"PeriodicalId\":16316,\"journal\":{\"name\":\"Journal of Korea Multimedia Society\",\"volume\":\"27 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Korea Multimedia Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.9717/kmms.2023.26.10.1231\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korea Multimedia Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9717/kmms.2023.26.10.1231","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
随机擦除为数据增强提供了不同级别的遮挡。然而,由于其随机选择的均匀分布,有时会遮挡与感兴趣对象无关的区域。在本文中,我们提出了一种利用梯度加权类激活映射(Gradient Weighted Class Activation Mapping, Grad-CAM)来估计感兴趣对象的位置并选择性地擦除周围区域的新方法。通过使用Grad-CAM,我们改进了CNN模型的随机擦除,而不需要额外的模块或架构更改。我们在中间时代之后生成Grad-CAM,其中CNN模型对训练数据具有足够的表征能力。基于Grad-CAM设置了将擦除限制在目标附近的超参数,并进行了相应的实验。通过实验,我们发现在CIFAR-10数据集上使用ResNet-20进行图像分类任务的错误率降低了0.33%。
Class Activation Map based Random Erasing for Data Augmentation
Random erasing offers various levels of occlusion for data augmentation. However, due to its uniform distribution of random selection, it sometimes occludes regions that are unrelated to the object of interest. In this paper, we propose a novel method that utilizes Gradient Weighted Class Activation Mapping (Grad-CAM) for estimating the location of the object of interest and selectively erasing the surrounding areas. By utilizing Grad-CAM, we improve random erasing for CNN models without requiring additional modules or architectural changes. We generate Grad-CAM after the intermediate epochs where CNN models have sufficient representational power for the training data. The hyperparameter that restrict the erasing to the vicinity of the object is set based on Grad-CAM, and experiments were conducted accordingly. As a result of our experiments, we observed a 0.33% decrease in error-rate for image classification tasks using ResNet-20 on the CIFAR-10 dataset.