基于类间特征转换的计算机视觉任务数据增强方法

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY Computers and Electronics in Agriculture Pub Date : 2025-04-01 Epub Date: 2025-02-08 DOI:10.1016/j.compag.2025.109909
Jiewen Lin , Gui Hu , Jian Chen
{"title":"基于类间特征转换的计算机视觉任务数据增强方法","authors":"Jiewen Lin ,&nbsp;Gui Hu ,&nbsp;Jian Chen","doi":"10.1016/j.compag.2025.109909","DOIUrl":null,"url":null,"abstract":"<div><div>Agricultural samples are unbalanced, complex, and scarce, which is the main factor restricting the popularization and application of agricultural computer vision. This paper proposes a feature conversion between classes method for data augmentation of computer vision tasks. We make contributions in the following three aspects: 1) Proposing an optimization method of attention mechanism to optimize the generator of CycleGAN. Through the module: efficient convolutional block attention model (ECBAM), the generator network structure of CycleGAN is improved to learn the feature transformation from “healthy leaves” to “fake diseased leaves”. 2) An label assignment method based on proportionally assigned receptive field is proposed to realize the label replacement from “healthy leaves” to “fake diseased leaves”. 3) Enhanced the original data by a factor of n <span><math><mrow><mo>×</mo></mrow></math></span> oversampling. The experimental results show that the improved CycleGAN proposed in this paper can effectively generate “fake diseased leaves”, the Inception Score (IS) is 2.3 ± 0.14, the Fréchet Inception Distance (FID) is 41.49, and the Kernel Inception Distance (KID) is 0.025. We have verified the feasibility of the method for classification, object detection, and semantic segmentation tasks. When using the improved CycleGAN for data augmentation, the accuracy of ResNet152 has been improved by 1.71 %. We further verified the effectiveness of improved CycleGAN and reactive field object assignment(RFOA) methods for data augmentation. By testing in the object detection task, when t = 0.75, and n = 1, the mAP reaches 78.97 %. By testing in a semantic segmentation task, when t = 0.50&amp;0.75, and n = 2, the mIOU reaches 81.41 %.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"231 ","pages":"Article 109909"},"PeriodicalIF":8.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A data augmentation method for computer vision task with feature conversion between class\",\"authors\":\"Jiewen Lin ,&nbsp;Gui Hu ,&nbsp;Jian Chen\",\"doi\":\"10.1016/j.compag.2025.109909\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Agricultural samples are unbalanced, complex, and scarce, which is the main factor restricting the popularization and application of agricultural computer vision. This paper proposes a feature conversion between classes method for data augmentation of computer vision tasks. We make contributions in the following three aspects: 1) Proposing an optimization method of attention mechanism to optimize the generator of CycleGAN. Through the module: efficient convolutional block attention model (ECBAM), the generator network structure of CycleGAN is improved to learn the feature transformation from “healthy leaves” to “fake diseased leaves”. 2) An label assignment method based on proportionally assigned receptive field is proposed to realize the label replacement from “healthy leaves” to “fake diseased leaves”. 3) Enhanced the original data by a factor of n <span><math><mrow><mo>×</mo></mrow></math></span> oversampling. The experimental results show that the improved CycleGAN proposed in this paper can effectively generate “fake diseased leaves”, the Inception Score (IS) is 2.3 ± 0.14, the Fréchet Inception Distance (FID) is 41.49, and the Kernel Inception Distance (KID) is 0.025. We have verified the feasibility of the method for classification, object detection, and semantic segmentation tasks. When using the improved CycleGAN for data augmentation, the accuracy of ResNet152 has been improved by 1.71 %. We further verified the effectiveness of improved CycleGAN and reactive field object assignment(RFOA) methods for data augmentation. By testing in the object detection task, when t = 0.75, and n = 1, the mAP reaches 78.97 %. By testing in a semantic segmentation task, when t = 0.50&amp;0.75, and n = 2, the mIOU reaches 81.41 %.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"231 \",\"pages\":\"Article 109909\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925000158\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/8 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925000158","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/8 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

农业样本不平衡、复杂、稀缺,是制约农业计算机视觉推广应用的主要因素。提出了一种用于计算机视觉任务数据增强的类间特征转换方法。我们在以下三个方面做出了贡献:1)提出了一种关注机制的优化方法来优化CycleGAN的生成器。通过高效卷积块注意模型(ECBAM)模块,对CycleGAN的生成器网络结构进行改进,学习从“健康叶片”到“假病叶”的特征转换。2)提出了一种基于比例分配感受野的标签分配方法,实现了“健康叶”到“假病叶”的标签替换。3)对原始数据进行n ×过采样的增强。实验结果表明,本文提出的改进CycleGAN可以有效地生成“假病叶”,初始分数(Inception Score, IS)为2.3±0.14,fr初始距离(FID)为41.49,内核初始距离(KID)为0.025。我们已经验证了该方法在分类、对象检测和语义分割任务中的可行性。当使用改进的CycleGAN进行数据增强时,ResNet152的准确率提高了1.71%。我们进一步验证了改进的CycleGAN和反应性场目标分配(reactive field object assignment, RFOA)方法在数据增强方面的有效性。通过在目标检测任务中测试,当t = 0.75, n = 1时,mAP达到78.97%。通过在语义分割任务中测试,当t = 0.50&0.75, n = 2时,mIOU达到81.41%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A data augmentation method for computer vision task with feature conversion between class
Agricultural samples are unbalanced, complex, and scarce, which is the main factor restricting the popularization and application of agricultural computer vision. This paper proposes a feature conversion between classes method for data augmentation of computer vision tasks. We make contributions in the following three aspects: 1) Proposing an optimization method of attention mechanism to optimize the generator of CycleGAN. Through the module: efficient convolutional block attention model (ECBAM), the generator network structure of CycleGAN is improved to learn the feature transformation from “healthy leaves” to “fake diseased leaves”. 2) An label assignment method based on proportionally assigned receptive field is proposed to realize the label replacement from “healthy leaves” to “fake diseased leaves”. 3) Enhanced the original data by a factor of n × oversampling. The experimental results show that the improved CycleGAN proposed in this paper can effectively generate “fake diseased leaves”, the Inception Score (IS) is 2.3 ± 0.14, the Fréchet Inception Distance (FID) is 41.49, and the Kernel Inception Distance (KID) is 0.025. We have verified the feasibility of the method for classification, object detection, and semantic segmentation tasks. When using the improved CycleGAN for data augmentation, the accuracy of ResNet152 has been improved by 1.71 %. We further verified the effectiveness of improved CycleGAN and reactive field object assignment(RFOA) methods for data augmentation. By testing in the object detection task, when t = 0.75, and n = 1, the mAP reaches 78.97 %. By testing in a semantic segmentation task, when t = 0.50&0.75, and n = 2, the mIOU reaches 81.41 %.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
期刊最新文献
Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning Data-driven optimization of CO2 regulation intervals in controlled environment agriculture EnvGreenAE: A denoising-enhanced transformer autoencoder for greenhouse anomaly detection Multi-band microwave sensor for agricultural microsample analysis using dual asymmetric metamaterial resonators toward enhanced and balanced full-band sensitivity Clustering in mixed broiler chicken batches for improvement of daily average weight
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1