基于 Shapley 的可解释人工智能在故障诊断和预报中的聚类应用

IF 5.9 2区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Journal of Intelligent Manufacturing Pub Date : 2024-07-29 DOI:10.1007/s10845-024-02468-2
Joseph Cohen, Xun Huan, Jun Ni
{"title":"基于 Shapley 的可解释人工智能在故障诊断和预报中的聚类应用","authors":"Joseph Cohen, Xun Huan, Jun Ni","doi":"10.1007/s10845-024-02468-2","DOIUrl":null,"url":null,"abstract":"<p>Data-driven artificial intelligence models require explainability in intelligent manufacturing to streamline adoption and trust in modern industry. However, recently developed explainable artificial intelligence (XAI) techniques that estimate feature contributions on a model-agnostic level such as SHapley Additive exPlanations (SHAP) have not yet been evaluated for semi-supervised fault diagnosis and prognosis problems characterized by class imbalance and weakly labeled datasets. This paper explores the potential of utilizing Shapley values for a new clustering framework compatible with semi-supervised learning problems, loosening the strict supervision requirement of current XAI techniques. This broad methodology is validated on two case studies: a heatmap image dataset obtained from a semiconductor manufacturing process featuring class imbalance, and the benchmark N-CMAPSS dataset. Semi-supervised clustering based on Shapley values significantly improves upon clustering quality compared to the fully unsupervised case, deriving information-dense and meaningful clusters that relate to underlying fault diagnosis model predictions. These clusters can also be characterized by high-precision decision rules in terms of original feature values, as demonstrated in the second case study. The rules, limited to 2 terms utilizing original feature scales, describe 14 out of the 19 derived equipment failure clusters with average precision exceeding 0.85, showcasing the promising utility of the explainable clustering framework for intelligent manufacturing applications.</p>","PeriodicalId":16193,"journal":{"name":"Journal of Intelligent Manufacturing","volume":"21 1","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Shapley-based explainable AI for clustering applications in fault diagnosis and prognosis\",\"authors\":\"Joseph Cohen, Xun Huan, Jun Ni\",\"doi\":\"10.1007/s10845-024-02468-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Data-driven artificial intelligence models require explainability in intelligent manufacturing to streamline adoption and trust in modern industry. However, recently developed explainable artificial intelligence (XAI) techniques that estimate feature contributions on a model-agnostic level such as SHapley Additive exPlanations (SHAP) have not yet been evaluated for semi-supervised fault diagnosis and prognosis problems characterized by class imbalance and weakly labeled datasets. This paper explores the potential of utilizing Shapley values for a new clustering framework compatible with semi-supervised learning problems, loosening the strict supervision requirement of current XAI techniques. This broad methodology is validated on two case studies: a heatmap image dataset obtained from a semiconductor manufacturing process featuring class imbalance, and the benchmark N-CMAPSS dataset. Semi-supervised clustering based on Shapley values significantly improves upon clustering quality compared to the fully unsupervised case, deriving information-dense and meaningful clusters that relate to underlying fault diagnosis model predictions. These clusters can also be characterized by high-precision decision rules in terms of original feature values, as demonstrated in the second case study. The rules, limited to 2 terms utilizing original feature scales, describe 14 out of the 19 derived equipment failure clusters with average precision exceeding 0.85, showcasing the promising utility of the explainable clustering framework for intelligent manufacturing applications.</p>\",\"PeriodicalId\":16193,\"journal\":{\"name\":\"Journal of Intelligent Manufacturing\",\"volume\":\"21 1\",\"pages\":\"\"},\"PeriodicalIF\":5.9000,\"publicationDate\":\"2024-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Intelligent Manufacturing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s10845-024-02468-2\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent Manufacturing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s10845-024-02468-2","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

数据驱动的人工智能模型需要在智能制造中具有可解释性,以简化现代工业的采用和信任。然而,最近开发的可解释人工智能(XAI)技术(如 Shapley Additive exPlanations (SHAP))可在模型无关的层面上估计特征贡献,但尚未针对以类不平衡和弱标记数据集为特征的半监督故障诊断和预报问题进行评估。本文探讨了利用夏普利值建立一个与半监督学习问题兼容的新聚类框架的可能性,从而放宽了当前 XAI 技术的严格监督要求。这种广泛的方法在两个案例研究中得到了验证:一个从半导体制造过程中获得的热图图像数据集,具有类不平衡的特点;另一个是基准 N-CMAPSS 数据集。与完全无监督的情况相比,基于 Shapley 值的半监督聚类显著提高了聚类质量,得到了与底层故障诊断模型预测相关的信息密集且有意义的聚类。这些聚类还可以通过原始特征值的高精度决策规则来表征,第二个案例研究就证明了这一点。这些规则仅限于利用原始特征标度的 2 个术语,描述了 19 个衍生设备故障聚类中的 14 个,平均精度超过 0.85,展示了可解释聚类框架在智能制造应用中的巨大作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Shapley-based explainable AI for clustering applications in fault diagnosis and prognosis

Data-driven artificial intelligence models require explainability in intelligent manufacturing to streamline adoption and trust in modern industry. However, recently developed explainable artificial intelligence (XAI) techniques that estimate feature contributions on a model-agnostic level such as SHapley Additive exPlanations (SHAP) have not yet been evaluated for semi-supervised fault diagnosis and prognosis problems characterized by class imbalance and weakly labeled datasets. This paper explores the potential of utilizing Shapley values for a new clustering framework compatible with semi-supervised learning problems, loosening the strict supervision requirement of current XAI techniques. This broad methodology is validated on two case studies: a heatmap image dataset obtained from a semiconductor manufacturing process featuring class imbalance, and the benchmark N-CMAPSS dataset. Semi-supervised clustering based on Shapley values significantly improves upon clustering quality compared to the fully unsupervised case, deriving information-dense and meaningful clusters that relate to underlying fault diagnosis model predictions. These clusters can also be characterized by high-precision decision rules in terms of original feature values, as demonstrated in the second case study. The rules, limited to 2 terms utilizing original feature scales, describe 14 out of the 19 derived equipment failure clusters with average precision exceeding 0.85, showcasing the promising utility of the explainable clustering framework for intelligent manufacturing applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Intelligent Manufacturing
Journal of Intelligent Manufacturing 工程技术-工程:制造
CiteScore
19.30
自引率
9.60%
发文量
171
审稿时长
5.2 months
期刊介绍: The Journal of Nonlinear Engineering aims to be a platform for sharing original research results in theoretical, experimental, practical, and applied nonlinear phenomena within engineering. It serves as a forum to exchange ideas and applications of nonlinear problems across various engineering disciplines. Articles are considered for publication if they explore nonlinearities in engineering systems, offering realistic mathematical modeling, utilizing nonlinearity for new designs, stabilizing systems, understanding system behavior through nonlinearity, optimizing systems based on nonlinear interactions, and developing algorithms to harness and leverage nonlinear elements.
期刊最新文献
Industrial vision inspection using digital twins: bridging CAD models and realistic scenarios Reliability-improved machine learning model using knowledge-embedded learning approach for smart manufacturing Smart scheduling for next generation manufacturing systems: a systematic literature review An overview of traditional and advanced methods to detect part defects in additive manufacturing processes A systematic multi-layer cognitive model for intelligent machine tool
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1