Self-supervised representation learning anomaly detection methodology based on boosting algorithms enhanced by data augmentation using StyleGAN for manufacturing imbalanced data
Yoonseok Kim , Taeheon Lee , Youngjoo Hyun , Eric Coatanea , Siren Mika , Jeonghoon Mo , YoungJun Yoo
{"title":"Self-supervised representation learning anomaly detection methodology based on boosting algorithms enhanced by data augmentation using StyleGAN for manufacturing imbalanced data","authors":"Yoonseok Kim , Taeheon Lee , Youngjoo Hyun , Eric Coatanea , Siren Mika , Jeonghoon Mo , YoungJun Yoo","doi":"10.1016/j.compind.2023.104024","DOIUrl":null,"url":null,"abstract":"<div><p>This study proposes a methodology for detecting anomalies in the manufacturing industry using a self-supervised representation learning approach based on deep generative models. The challenge arises from the limited availability of data on defective products compared with normal data, leading to degradation in the performance of deep learning models owing to data imbalances. To address this limitation, we propose a process that leverages the Gramian angular field to transform time-series data into images, applies StyleGAN for image augmentation of anomalous data, and utilizes a boosting algorithm for classifier selection in supervised learning. Additionally, we compared the accuracy of the classifier before and after data augmentation. In experimental cases involving CNC milling machine data and wire arc additive manufacturing data, the proposed approach outperformed the approach before augmentation, resulting in improved precision, recall, and F1-score for anomaly detection. Furthermore, Bayesian optimization of the hyperparameters of the boosting algorithm further enhanced the performance metrics. The proposed process effectively addresses the data imbalance problem, and demonstrates its applicability to various manufacturing industries.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":null,"pages":null},"PeriodicalIF":8.2000,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361523001744","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
This study proposes a methodology for detecting anomalies in the manufacturing industry using a self-supervised representation learning approach based on deep generative models. The challenge arises from the limited availability of data on defective products compared with normal data, leading to degradation in the performance of deep learning models owing to data imbalances. To address this limitation, we propose a process that leverages the Gramian angular field to transform time-series data into images, applies StyleGAN for image augmentation of anomalous data, and utilizes a boosting algorithm for classifier selection in supervised learning. Additionally, we compared the accuracy of the classifier before and after data augmentation. In experimental cases involving CNC milling machine data and wire arc additive manufacturing data, the proposed approach outperformed the approach before augmentation, resulting in improved precision, recall, and F1-score for anomaly detection. Furthermore, Bayesian optimization of the hyperparameters of the boosting algorithm further enhanced the performance metrics. The proposed process effectively addresses the data imbalance problem, and demonstrates its applicability to various manufacturing industries.
期刊介绍:
The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that:
• Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry;
• Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry;
• Foster connections or integrations across diverse application areas of ICT in industry.