用函数线性模型解释和概括基于物理问题的深度学习

IF 8.7 2区工程技术 Q1 Mathematics Engineering with Computers Pub Date : 2024-05-08 DOI:10.1007/s00366-024-01987-z

Amirhossein Arzani, Lingxiao Yuan, Pania Newell, Bei Wang

{"title":"用函数线性模型解释和概括基于物理问题的深度学习","authors":"Amirhossein Arzani, Lingxiao Yuan, Pania Newell, Bei Wang","doi":"10.1007/s00366-024-01987-z","DOIUrl":null,"url":null,"abstract":"<p>Although deep learning has achieved remarkable success in various scientific machine learning applications, its opaque nature poses concerns regarding interpretability and generalization capabilities beyond the training data. Interpretability is crucial and often desired in modeling physical systems. Moreover, acquiring extensive datasets that encompass the entire range of input features is challenging in many physics-based learning tasks, leading to increased errors when encountering out-of-distribution (OOD) data. In this work, motivated by the field of functional data analysis (FDA), we propose generalized functional linear models as an interpretable surrogate for a trained deep learning model. We demonstrate that our model could be trained either based on a trained neural network (post-hoc interpretation) or directly from training data (interpretable operator learning). A library of generalized functional linear models with different kernel functions is considered and sparse regression is used to discover an interpretable surrogate model that could be analytically presented. We present test cases in solid mechanics, fluid mechanics, and transport. Our results demonstrate that our model can achieve comparable accuracy to deep learning and can improve OOD generalization while providing more transparency and interpretability. Our study underscores the significance of interpretable representation in scientific machine learning and showcases the potential of functional linear models as a tool for interpreting and generalizing deep learning.</p>","PeriodicalId":11696,"journal":{"name":"Engineering with Computers","volume":"43 1","pages":""},"PeriodicalIF":8.7000,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpreting and generalizing deep learning in physics-based problems with functional linear models\",\"authors\":\"Amirhossein Arzani, Lingxiao Yuan, Pania Newell, Bei Wang\",\"doi\":\"10.1007/s00366-024-01987-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Although deep learning has achieved remarkable success in various scientific machine learning applications, its opaque nature poses concerns regarding interpretability and generalization capabilities beyond the training data. Interpretability is crucial and often desired in modeling physical systems. Moreover, acquiring extensive datasets that encompass the entire range of input features is challenging in many physics-based learning tasks, leading to increased errors when encountering out-of-distribution (OOD) data. In this work, motivated by the field of functional data analysis (FDA), we propose generalized functional linear models as an interpretable surrogate for a trained deep learning model. We demonstrate that our model could be trained either based on a trained neural network (post-hoc interpretation) or directly from training data (interpretable operator learning). A library of generalized functional linear models with different kernel functions is considered and sparse regression is used to discover an interpretable surrogate model that could be analytically presented. We present test cases in solid mechanics, fluid mechanics, and transport. Our results demonstrate that our model can achieve comparable accuracy to deep learning and can improve OOD generalization while providing more transparency and interpretability. Our study underscores the significance of interpretable representation in scientific machine learning and showcases the potential of functional linear models as a tool for interpreting and generalizing deep learning.</p>\",\"PeriodicalId\":11696,\"journal\":{\"name\":\"Engineering with Computers\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":8.7000,\"publicationDate\":\"2024-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering with Computers\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s00366-024-01987-z\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering with Computers","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s00366-024-01987-z","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

摘要

虽然深度学习在各种科学机器学习应用中取得了显著的成功，但其不透明的特性也引发了人们对训练数据之外的可解释性和泛化能力的担忧。在物理系统建模中，可解释性是至关重要的，而且往往是人们所期望的。此外，在许多基于物理的学习任务中，获取涵盖整个输入特征范围的广泛数据集具有挑战性，导致在遇到分布外（OOD）数据时误差增加。在这项工作中，受函数数据分析（FDA）领域的启发，我们提出了广义函数线性模型，作为训练有素的深度学习模型的可解释替代物。我们证明，我们的模型既可以基于训练有素的神经网络（事后解释）进行训练，也可以直接从训练数据（可解释算子学习）进行训练。我们考虑了具有不同核函数的广义函数线性模型库，并利用稀疏回归发现了一个可以分析呈现的可解释代用模型。我们介绍了固体力学、流体力学和运输方面的测试案例。结果表明，我们的模型可以达到与深度学习相当的精度，并能提高 OOD 的泛化能力，同时提供更高的透明度和可解释性。我们的研究强调了可解释表征在科学机器学习中的重要性，并展示了函数线性模型作为解释和泛化深度学习工具的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Interpreting and generalizing deep learning in physics-based problems with functional linear models

Although deep learning has achieved remarkable success in various scientific machine learning applications, its opaque nature poses concerns regarding interpretability and generalization capabilities beyond the training data. Interpretability is crucial and often desired in modeling physical systems. Moreover, acquiring extensive datasets that encompass the entire range of input features is challenging in many physics-based learning tasks, leading to increased errors when encountering out-of-distribution (OOD) data. In this work, motivated by the field of functional data analysis (FDA), we propose generalized functional linear models as an interpretable surrogate for a trained deep learning model. We demonstrate that our model could be trained either based on a trained neural network (post-hoc interpretation) or directly from training data (interpretable operator learning). A library of generalized functional linear models with different kernel functions is considered and sparse regression is used to discover an interpretable surrogate model that could be analytically presented. We present test cases in solid mechanics, fluid mechanics, and transport. Our results demonstrate that our model can achieve comparable accuracy to deep learning and can improve OOD generalization while providing more transparency and interpretability. Our study underscores the significance of interpretable representation in scientific machine learning and showcases the potential of functional linear models as a tool for interpreting and generalizing deep learning.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering with Computers 工程技术-工程：机械

CiteScore

16.50

自引率

2.30%

发文量

203

审稿时长

9 months

期刊介绍： Engineering with Computers is an international journal dedicated to simulation-based engineering. It features original papers and comprehensive reviews on technologies supporting simulation-based engineering, along with demonstrations of operational simulation-based engineering systems. The journal covers various technical areas such as adaptive simulation techniques, engineering databases, CAD geometry integration, mesh generation, parallel simulation methods, simulation frameworks, user interface technologies, and visualization techniques. It also encompasses a wide range of application areas where engineering technologies are applied, spanning from automotive industry applications to medical device design.