一种用于工业缺陷检测的掩模引导交叉数据增强方法

IF 6.2 2区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS Future Generation Computer Systems-The International Journal of Escience Pub Date : 2024-12-24 DOI:10.1016/j.future.2024.107676

Xubin Wang, Wenju Li, Chang Lu

{"title":"一种用于工业缺陷检测的掩模引导交叉数据增强方法","authors":"Xubin Wang, Wenju Li, Chang Lu","doi":"10.1016/j.future.2024.107676","DOIUrl":null,"url":null,"abstract":"<div><div>Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a <strong>M</strong>ask <strong>G</strong>uided <strong>C</strong>ross <strong>D</strong>ata <strong>A</strong>ugmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"166 ","pages":"Article 107676"},"PeriodicalIF":6.2000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A mask guided cross data augmentation method for industrial defect detection\",\"authors\":\"Xubin Wang, Wenju Li, Chang Lu\",\"doi\":\"10.1016/j.future.2024.107676\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a <strong>M</strong>ask <strong>G</strong>uided <strong>C</strong>ross <strong>D</strong>ata <strong>A</strong>ugmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.</div></div>\",\"PeriodicalId\":55132,\"journal\":{\"name\":\"Future Generation Computer Systems-The International Journal of Escience\",\"volume\":\"166 \",\"pages\":\"Article 107676\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2024-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Future Generation Computer Systems-The International Journal of Escience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167739X2400640X\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X2400640X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 0

摘要

深度学习是模式识别的专家，被广泛应用于缺陷检测。数据增强原则上能够通过补充模式来提高这种数据驱动模型的准确性和健壮性。然而，在当前的增强方法中，对真实图像、精确注释和多种缺陷模式的需求往往不能同时得到满足。在这项工作中，提出了一种基于扩散模型的掩模引导交叉数据增强方法MGCDA来增强缺陷检测。首先，提出了一种利用自编码器的潜在扩散空间生成流程，以提高图像的保真度和资源消耗。基于此，我们提出采用条件机制，使样品在特定掩模的引导下合成。为了进一步提高信息增益，提出了一种交叉学习策略，使MGCDA能够学习和概括来自不同类别的不同缺陷模式，使检测更加鲁棒。最后，针对不同情况下的数据扩充需求，提出了两种策略。在8个常见工业数据集上的实验表明，MGCDA对不同场景和检测模型具有较高的适用性，能够生成与指导对齐的高保真样本，有效提高了基线在图像和像素级的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A mask guided cross data augmentation method for industrial defect detection

Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a Mask Guided Cross Data Augmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Future Generation Computer Systems-The International Journal of Escience 工程技术-计算机：理论方法

CiteScore

19.90

自引率

2.70%

发文量

376

审稿时长

10.6 months

期刊介绍： Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.

期刊最新文献

Editorial Board A self-organized MoE framework for distributed federated learning Keyed watermarks: A fine-grained watermark generation for Apache Flink Fast and Privacy-Preserving Spatial Keyword Authorization Query with access control Performance and efficiency: A multi-generational benchmark of modern processors on bandwidth-bound HPC applications