一种用于工业缺陷检测的掩模引导交叉数据增强方法

IF 6.2 2区 计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Future Generation Computer Systems-The International Journal of Escience Pub Date : 2024-12-24 DOI:10.1016/j.future.2024.107676
Xubin Wang, Wenju Li, Chang Lu
{"title":"一种用于工业缺陷检测的掩模引导交叉数据增强方法","authors":"Xubin Wang,&nbsp;Wenju Li,&nbsp;Chang Lu","doi":"10.1016/j.future.2024.107676","DOIUrl":null,"url":null,"abstract":"<div><div>Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a <strong>M</strong>ask <strong>G</strong>uided <strong>C</strong>ross <strong>D</strong>ata <strong>A</strong>ugmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"166 ","pages":"Article 107676"},"PeriodicalIF":6.2000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A mask guided cross data augmentation method for industrial defect detection\",\"authors\":\"Xubin Wang,&nbsp;Wenju Li,&nbsp;Chang Lu\",\"doi\":\"10.1016/j.future.2024.107676\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a <strong>M</strong>ask <strong>G</strong>uided <strong>C</strong>ross <strong>D</strong>ata <strong>A</strong>ugmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.</div></div>\",\"PeriodicalId\":55132,\"journal\":{\"name\":\"Future Generation Computer Systems-The International Journal of Escience\",\"volume\":\"166 \",\"pages\":\"Article 107676\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2024-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Future Generation Computer Systems-The International Journal of Escience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167739X2400640X\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X2400640X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

摘要

深度学习是模式识别的专家,被广泛应用于缺陷检测。数据增强原则上能够通过补充模式来提高这种数据驱动模型的准确性和健壮性。然而,在当前的增强方法中,对真实图像、精确注释和多种缺陷模式的需求往往不能同时得到满足。在这项工作中,提出了一种基于扩散模型的掩模引导交叉数据增强方法MGCDA来增强缺陷检测。首先,提出了一种利用自编码器的潜在扩散空间生成流程,以提高图像的保真度和资源消耗。基于此,我们提出采用条件机制,使样品在特定掩模的引导下合成。为了进一步提高信息增益,提出了一种交叉学习策略,使MGCDA能够学习和概括来自不同类别的不同缺陷模式,使检测更加鲁棒。最后,针对不同情况下的数据扩充需求,提出了两种策略。在8个常见工业数据集上的实验表明,MGCDA对不同场景和检测模型具有较高的适用性,能够生成与指导对齐的高保真样本,有效提高了基线在图像和像素级的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A mask guided cross data augmentation method for industrial defect detection
Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a Mask Guided Cross Data Augmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
19.90
自引率
2.70%
发文量
376
审稿时长
10.6 months
期刊介绍: Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.
期刊最新文献
Self-sovereign identity framework with user-friendly private key generation and rule table Accelerating complex graph queries by summary-based hybrid partitioning for discovering vulnerabilities of distribution equipment DNA: Dual-radio Dual-constraint Node Activation scheduling for energy-efficient data dissemination in IoT Blending lossy and lossless data compression methods to support health data streaming in smart cities Energy–time modelling of distributed multi-population genetic algorithms with dynamic workload in HPC clusters
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1