Data-driven global sensitivity analysis for group of random variables through knowledge-enhanced machine learning with normalizing flows

IF 11 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Reliability Engineering & System Safety Pub Date : 2025-08-01 Epub Date: 2025-03-13 DOI:10.1016/j.ress.2025.111007
Ziluo Xiong, Gaofeng Jia
{"title":"Data-driven global sensitivity analysis for group of random variables through knowledge-enhanced machine learning with normalizing flows","authors":"Ziluo Xiong,&nbsp;Gaofeng Jia","doi":"10.1016/j.ress.2025.111007","DOIUrl":null,"url":null,"abstract":"<div><div>Different approaches have been developed for evaluating Sobol’ indices for global sensitivity analysis (GSA). Among them sample-based approaches are extremely attractive because they can be purely driven by data and estimate various Sobol’ indices (e.g., first-order, higher-order, total-effects) for any individual or group of random variables using only one set of samples. However, such approaches usually rely on an accurate density estimation for the interested groups of random variables, which can be challenging for high-dimensional groups. For example, the commonly used kernel density estimation (KDE) suffers from curse of dimensionality. In this regard, this paper proposes a novel knowledge-enhanced machine learning approach for data-driven GSA for groups of random variables using sample-based approach and an emerging generative machine learning model, i.e., normalizing flows (NFs), for high-dimensional density estimation. To facilitate reliable and robust NFs training, a knowledge distillation-based two-stage training strategy is developed. Two customized loss functions are introduced, which are inspired by domain knowledge in the context of sample-based approach for GSA. Two examples are considered to illustrate and verify the efficacy of the proposed approach. Results show that introducing NFs can significantly alleviate the curse of dimensionality in the traditional sample-based approach for GSA and improve accuracy of density estimation and estimation of Sobol’ indices.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":"260 ","pages":"Article 111007"},"PeriodicalIF":11.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095183202500208X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

Abstract

Different approaches have been developed for evaluating Sobol’ indices for global sensitivity analysis (GSA). Among them sample-based approaches are extremely attractive because they can be purely driven by data and estimate various Sobol’ indices (e.g., first-order, higher-order, total-effects) for any individual or group of random variables using only one set of samples. However, such approaches usually rely on an accurate density estimation for the interested groups of random variables, which can be challenging for high-dimensional groups. For example, the commonly used kernel density estimation (KDE) suffers from curse of dimensionality. In this regard, this paper proposes a novel knowledge-enhanced machine learning approach for data-driven GSA for groups of random variables using sample-based approach and an emerging generative machine learning model, i.e., normalizing flows (NFs), for high-dimensional density estimation. To facilitate reliable and robust NFs training, a knowledge distillation-based two-stage training strategy is developed. Two customized loss functions are introduced, which are inspired by domain knowledge in the context of sample-based approach for GSA. Two examples are considered to illustrate and verify the efficacy of the proposed approach. Results show that introducing NFs can significantly alleviate the curse of dimensionality in the traditional sample-based approach for GSA and improve accuracy of density estimation and estimation of Sobol’ indices.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于规范化流程的知识增强机器学习的随机变量组数据驱动全局敏感性分析
已经开发了不同的方法来评估Sobol指数的全球敏感性分析(GSA)。其中,基于样本的方法非常有吸引力,因为它们可以纯粹由数据驱动,并且仅使用一组样本就可以估计任何个体或随机变量组的各种Sobol指数(例如,一阶,高阶,总效应)。然而,这种方法通常依赖于对感兴趣的随机变量组的精确密度估计,这对于高维组来说可能是具有挑战性的。例如,常用的核密度估计(KDE)就存在维数诅咒的问题。在这方面,本文提出了一种新的知识增强机器学习方法,用于随机变量组的数据驱动GSA,使用基于样本的方法和新兴的生成机器学习模型,即用于高维密度估计的归一化流(NFs)。为了实现可靠、鲁棒的神经网络训练,提出了一种基于知识提炼的两阶段训练策略。在基于样本的GSA方法中,引入了两个受领域知识启发的自定义损失函数。通过两个实例来说明和验证所提出方法的有效性。结果表明,引入NFs可以显著缓解传统基于样本的GSA方法的维数诅咒,提高密度估计和Sobol指数估计的精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Reliability Engineering & System Safety
Reliability Engineering & System Safety 管理科学-工程:工业
CiteScore
15.20
自引率
39.50%
发文量
621
审稿时长
67 days
期刊介绍: Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.
期刊最新文献
Quantifying potential cyber-attack risks in CNC systems under zero-subjectivity closed-loop Dempster–Shafer theory FMECA and rule-based Bayesian network modelling Inactivity times of components upon system failure with application to missing data problems Domain knowledge-enhanced dual-stream graph joint learning network for aeroengine remaining useful life prediction Revealing the dynamics and multidimensional resilience of rainstorm-flood cascade disasters in mountain valley cities: An interpretable machine learning case study from Southwestern China Robustness of spatial interdependent networks under extreme geographically localized attacks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1