{"title":"基于类间特征转换的计算机视觉任务数据增强方法","authors":"Jiewen Lin , Gui Hu , Jian Chen","doi":"10.1016/j.compag.2025.109909","DOIUrl":null,"url":null,"abstract":"<div><div>Agricultural samples are unbalanced, complex, and scarce, which is the main factor restricting the popularization and application of agricultural computer vision. This paper proposes a feature conversion between classes method for data augmentation of computer vision tasks. We make contributions in the following three aspects: 1) Proposing an optimization method of attention mechanism to optimize the generator of CycleGAN. Through the module: efficient convolutional block attention model (ECBAM), the generator network structure of CycleGAN is improved to learn the feature transformation from “healthy leaves” to “fake diseased leaves”. 2) An label assignment method based on proportionally assigned receptive field is proposed to realize the label replacement from “healthy leaves” to “fake diseased leaves”. 3) Enhanced the original data by a factor of n <span><math><mrow><mo>×</mo></mrow></math></span> oversampling. The experimental results show that the improved CycleGAN proposed in this paper can effectively generate “fake diseased leaves”, the Inception Score (IS) is 2.3 ± 0.14, the Fréchet Inception Distance (FID) is 41.49, and the Kernel Inception Distance (KID) is 0.025. We have verified the feasibility of the method for classification, object detection, and semantic segmentation tasks. When using the improved CycleGAN for data augmentation, the accuracy of ResNet152 has been improved by 1.71 %. We further verified the effectiveness of improved CycleGAN and reactive field object assignment(RFOA) methods for data augmentation. By testing in the object detection task, when t = 0.75, and n = 1, the mAP reaches 78.97 %. By testing in a semantic segmentation task, when t = 0.50&0.75, and n = 2, the mIOU reaches 81.41 %.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"231 ","pages":"Article 109909"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A data augmentation method for computer vision task with feature conversion between class\",\"authors\":\"Jiewen Lin , Gui Hu , Jian Chen\",\"doi\":\"10.1016/j.compag.2025.109909\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Agricultural samples are unbalanced, complex, and scarce, which is the main factor restricting the popularization and application of agricultural computer vision. This paper proposes a feature conversion between classes method for data augmentation of computer vision tasks. We make contributions in the following three aspects: 1) Proposing an optimization method of attention mechanism to optimize the generator of CycleGAN. Through the module: efficient convolutional block attention model (ECBAM), the generator network structure of CycleGAN is improved to learn the feature transformation from “healthy leaves” to “fake diseased leaves”. 2) An label assignment method based on proportionally assigned receptive field is proposed to realize the label replacement from “healthy leaves” to “fake diseased leaves”. 3) Enhanced the original data by a factor of n <span><math><mrow><mo>×</mo></mrow></math></span> oversampling. The experimental results show that the improved CycleGAN proposed in this paper can effectively generate “fake diseased leaves”, the Inception Score (IS) is 2.3 ± 0.14, the Fréchet Inception Distance (FID) is 41.49, and the Kernel Inception Distance (KID) is 0.025. We have verified the feasibility of the method for classification, object detection, and semantic segmentation tasks. When using the improved CycleGAN for data augmentation, the accuracy of ResNet152 has been improved by 1.71 %. We further verified the effectiveness of improved CycleGAN and reactive field object assignment(RFOA) methods for data augmentation. By testing in the object detection task, when t = 0.75, and n = 1, the mAP reaches 78.97 %. By testing in a semantic segmentation task, when t = 0.50&0.75, and n = 2, the mIOU reaches 81.41 %.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"231 \",\"pages\":\"Article 109909\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925000158\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/8 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925000158","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/8 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
农业样本不平衡、复杂、稀缺,是制约农业计算机视觉推广应用的主要因素。提出了一种用于计算机视觉任务数据增强的类间特征转换方法。我们在以下三个方面做出了贡献:1)提出了一种关注机制的优化方法来优化CycleGAN的生成器。通过高效卷积块注意模型(ECBAM)模块,对CycleGAN的生成器网络结构进行改进,学习从“健康叶片”到“假病叶”的特征转换。2)提出了一种基于比例分配感受野的标签分配方法,实现了“健康叶”到“假病叶”的标签替换。3)对原始数据进行n ×过采样的增强。实验结果表明,本文提出的改进CycleGAN可以有效地生成“假病叶”,初始分数(Inception Score, IS)为2.3±0.14,fr初始距离(FID)为41.49,内核初始距离(KID)为0.025。我们已经验证了该方法在分类、对象检测和语义分割任务中的可行性。当使用改进的CycleGAN进行数据增强时,ResNet152的准确率提高了1.71%。我们进一步验证了改进的CycleGAN和反应性场目标分配(reactive field object assignment, RFOA)方法在数据增强方面的有效性。通过在目标检测任务中测试,当t = 0.75, n = 1时,mAP达到78.97%。通过在语义分割任务中测试,当t = 0.50&0.75, n = 2时,mIOU达到81.41%。
A data augmentation method for computer vision task with feature conversion between class
Agricultural samples are unbalanced, complex, and scarce, which is the main factor restricting the popularization and application of agricultural computer vision. This paper proposes a feature conversion between classes method for data augmentation of computer vision tasks. We make contributions in the following three aspects: 1) Proposing an optimization method of attention mechanism to optimize the generator of CycleGAN. Through the module: efficient convolutional block attention model (ECBAM), the generator network structure of CycleGAN is improved to learn the feature transformation from “healthy leaves” to “fake diseased leaves”. 2) An label assignment method based on proportionally assigned receptive field is proposed to realize the label replacement from “healthy leaves” to “fake diseased leaves”. 3) Enhanced the original data by a factor of n oversampling. The experimental results show that the improved CycleGAN proposed in this paper can effectively generate “fake diseased leaves”, the Inception Score (IS) is 2.3 ± 0.14, the Fréchet Inception Distance (FID) is 41.49, and the Kernel Inception Distance (KID) is 0.025. We have verified the feasibility of the method for classification, object detection, and semantic segmentation tasks. When using the improved CycleGAN for data augmentation, the accuracy of ResNet152 has been improved by 1.71 %. We further verified the effectiveness of improved CycleGAN and reactive field object assignment(RFOA) methods for data augmentation. By testing in the object detection task, when t = 0.75, and n = 1, the mAP reaches 78.97 %. By testing in a semantic segmentation task, when t = 0.50&0.75, and n = 2, the mIOU reaches 81.41 %.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.