{"title":"一种用于工业缺陷检测的掩模引导交叉数据增强方法","authors":"Xubin Wang, Wenju Li, Chang Lu","doi":"10.1016/j.future.2024.107676","DOIUrl":null,"url":null,"abstract":"<div><div>Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a <strong>M</strong>ask <strong>G</strong>uided <strong>C</strong>ross <strong>D</strong>ata <strong>A</strong>ugmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"166 ","pages":"Article 107676"},"PeriodicalIF":6.2000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A mask guided cross data augmentation method for industrial defect detection\",\"authors\":\"Xubin Wang, Wenju Li, Chang Lu\",\"doi\":\"10.1016/j.future.2024.107676\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a <strong>M</strong>ask <strong>G</strong>uided <strong>C</strong>ross <strong>D</strong>ata <strong>A</strong>ugmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.</div></div>\",\"PeriodicalId\":55132,\"journal\":{\"name\":\"Future Generation Computer Systems-The International Journal of Escience\",\"volume\":\"166 \",\"pages\":\"Article 107676\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2024-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Future Generation Computer Systems-The International Journal of Escience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167739X2400640X\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X2400640X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
A mask guided cross data augmentation method for industrial defect detection
Expert in pattern recognition, deep learning is widely used for defect detection. Data augmentation is in principle capable of improving the accuracy and robustness of such data-driven models by supplementing patterns. However, the requirements for realistic images, precise annotations, and diverse defect patterns often cannot be addressed simultaneously in current augmentation methods. In this work, a Mask Guided Cross Data Augmentation method dubbed MGCDA using diffusion model is proposed to boost defect detection. Firstly, a generation pipeline in latent diffusion space utilizing autoencoder is formulated to improve the fidelity and resource effort. Based on this, we propose to adopt conditional mechanism to enable samples being synthesized under the guidance of specific masks. To further enhance the information gain, a cross-learning strategy is proposed to empower MGCDA learning and generalizing diverse defect patterns from different categories, making detection more robust. Finally, two strategies are proposed to tackle the demand for data augmentation in different situations. Experiments on eight common industrial datasets show that MGCDA has high applicability to different scenarios and detection models, it can generate high-fidelity samples aligned to guidance and effectively improve the performance of baselines at both image- and pixel-level.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.