{"title":"基于生成-对抗-网络的时间原始轨迹数据增强框架,用于半导体制造中的故障检测","authors":"Shu-Kai S. Fan , Wei-Yu Chen","doi":"10.1016/j.engappai.2024.109624","DOIUrl":null,"url":null,"abstract":"<div><div>In modern semiconductor manufacturing, where sophisticated process control mechanisms are standard, processing tools are equipped with sensors that generate vast amounts of raw trace data for process monitoring and fault detection. However, one of the major challenges data scientists face is the scarcity of sufficient raw trace data for defective wafers, creating an imbalance that complicates the training of machine learning models for effective fault detection. To address this issue, this paper proposes novel data augmentation structures and strategies utilizing Cycle Generative Adversarial Networks (CycleGANs) as an artificial intelligence application to synthesize temporal raw trace data for defective wafers. The effectiveness of these methods is demonstrated using a real-world dataset from the thin-film process in semiconductor fabrication. Several machine learning classification models—Gaussian Naive Bayes, Adaptive Boosting, eXtreme Gradient Boosting, and Light Gradient Boosting Machine—are employed to evaluate the performance of the augmented data. The paper identifies the optimal augmentation structure and strategy to enhance classification performance within the CycleGAN-based framework. For the thin-film processing dataset under study, the best classification performance achieves an accuracy rate of up to 99.30%, with a notably low false negative rate of 6.45%.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109624"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A generative-adversarial-network-based temporal raw trace data augmentation framework for fault detection in semiconductor manufacturing\",\"authors\":\"Shu-Kai S. Fan , Wei-Yu Chen\",\"doi\":\"10.1016/j.engappai.2024.109624\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In modern semiconductor manufacturing, where sophisticated process control mechanisms are standard, processing tools are equipped with sensors that generate vast amounts of raw trace data for process monitoring and fault detection. However, one of the major challenges data scientists face is the scarcity of sufficient raw trace data for defective wafers, creating an imbalance that complicates the training of machine learning models for effective fault detection. To address this issue, this paper proposes novel data augmentation structures and strategies utilizing Cycle Generative Adversarial Networks (CycleGANs) as an artificial intelligence application to synthesize temporal raw trace data for defective wafers. The effectiveness of these methods is demonstrated using a real-world dataset from the thin-film process in semiconductor fabrication. Several machine learning classification models—Gaussian Naive Bayes, Adaptive Boosting, eXtreme Gradient Boosting, and Light Gradient Boosting Machine—are employed to evaluate the performance of the augmented data. The paper identifies the optimal augmentation structure and strategy to enhance classification performance within the CycleGAN-based framework. For the thin-film processing dataset under study, the best classification performance achieves an accuracy rate of up to 99.30%, with a notably low false negative rate of 6.45%.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"139 \",\"pages\":\"Article 109624\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197624017822\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624017822","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
A generative-adversarial-network-based temporal raw trace data augmentation framework for fault detection in semiconductor manufacturing
In modern semiconductor manufacturing, where sophisticated process control mechanisms are standard, processing tools are equipped with sensors that generate vast amounts of raw trace data for process monitoring and fault detection. However, one of the major challenges data scientists face is the scarcity of sufficient raw trace data for defective wafers, creating an imbalance that complicates the training of machine learning models for effective fault detection. To address this issue, this paper proposes novel data augmentation structures and strategies utilizing Cycle Generative Adversarial Networks (CycleGANs) as an artificial intelligence application to synthesize temporal raw trace data for defective wafers. The effectiveness of these methods is demonstrated using a real-world dataset from the thin-film process in semiconductor fabrication. Several machine learning classification models—Gaussian Naive Bayes, Adaptive Boosting, eXtreme Gradient Boosting, and Light Gradient Boosting Machine—are employed to evaluate the performance of the augmented data. The paper identifies the optimal augmentation structure and strategy to enhance classification performance within the CycleGAN-based framework. For the thin-film processing dataset under study, the best classification performance achieves an accuracy rate of up to 99.30%, with a notably low false negative rate of 6.45%.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.