Ensemble Machine Learning Algorithms for Anomaly Detection in Multivariate Time-Series

Youssef Trardi, B. Ananou, Philip Tchatchoua, M. Ouladsine
{"title":"Ensemble Machine Learning Algorithms for Anomaly Detection in Multivariate Time-Series","authors":"Youssef Trardi, B. Ananou, Philip Tchatchoua, M. Ouladsine","doi":"10.1109/ICCAD55197.2022.9853995","DOIUrl":null,"url":null,"abstract":"This paper proposes a multivariate time-series anomaly detection approach using multiple transform techniques and ensemble machine learning (EML) algorithms. The objective is to detect the presence of abnormal wafers during the semiconductor manufacturing process. Therefore, we evaluate a set of eleven features derived from an intermediate manufacturing chain to characterize the wafer status. Data from each feature is recorded over a 150-second time frame. To address the computational complexity of large-scale data processing, a dimensionality reduction step is highly desirable. Indeed, independent component analysis (ICA), principal component analysis (PCA), and factor analysis (FA) are used for comparison purposes. As well, to extract the most significant components from each feature sequence and build a thoroughly combined subset of characteristics. In the sequel, decision trees, bootstrap aggregating, boosting, one of the prevalent evolutions of EML algorithms, are fitted to the obtained characteristics to define the best anomaly detection ranking. The selected model is validated using 7000 samples (i.e. wafers) divided into 5000 normal samples and 2000 abnormal samples. The results highlight the strengths of the proposed approach, which could serve as a valuable decision-making support for abnormal wafer detection in the semiconductor manufacturing process.","PeriodicalId":436377,"journal":{"name":"2022 International Conference on Control, Automation and Diagnosis (ICCAD)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Control, Automation and Diagnosis (ICCAD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAD55197.2022.9853995","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

This paper proposes a multivariate time-series anomaly detection approach using multiple transform techniques and ensemble machine learning (EML) algorithms. The objective is to detect the presence of abnormal wafers during the semiconductor manufacturing process. Therefore, we evaluate a set of eleven features derived from an intermediate manufacturing chain to characterize the wafer status. Data from each feature is recorded over a 150-second time frame. To address the computational complexity of large-scale data processing, a dimensionality reduction step is highly desirable. Indeed, independent component analysis (ICA), principal component analysis (PCA), and factor analysis (FA) are used for comparison purposes. As well, to extract the most significant components from each feature sequence and build a thoroughly combined subset of characteristics. In the sequel, decision trees, bootstrap aggregating, boosting, one of the prevalent evolutions of EML algorithms, are fitted to the obtained characteristics to define the best anomaly detection ranking. The selected model is validated using 7000 samples (i.e. wafers) divided into 5000 normal samples and 2000 abnormal samples. The results highlight the strengths of the proposed approach, which could serve as a valuable decision-making support for abnormal wafer detection in the semiconductor manufacturing process.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多元时间序列异常检测的集成机器学习算法
本文提出了一种基于多变换技术和集成机器学习(EML)算法的多元时间序列异常检测方法。目的是在半导体制造过程中检测异常晶圆的存在。因此,我们评估了一组从中间制造链衍生的11个特征来表征晶圆状态。每个特征的数据记录在150秒的时间框架内。为了解决大规模数据处理的计算复杂性,一个降维步骤是非常必要的。事实上,独立成分分析(ICA)、主成分分析(PCA)和因子分析(FA)被用于比较目的。同时,从每个特征序列中提取最重要的成分,并构建一个完全组合的特征子集。其次,将EML算法中最流行的一种进化方法——决策树、自举聚合、增强,拟合到得到的特征上,以定义最佳的异常检测排序。所选模型使用7000个样本(即晶圆片)进行验证,其中5000个正常样本和2000个异常样本。结果突出了该方法的优势,该方法可以作为半导体制造过程中异常晶圆检测的有价值的决策支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Blockchain Information Based Systems in Aviation: The Advantages for Aircraft Records Management Technician Allocation to Base Maintenance of Aircraft Fleet: a computer application Stabilizing Dynamic Output Feedback Control for Takagi-Sugeno Fuzzy Systems Human-Guided Safe and Efficient Trajectory Replanning for Unmanned Aerial Vehicles Adaptive Large Neighborhood Search for the Just-In-Time Job-shop Scheduling Problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1