High accuracy uncertainty-aware interatomic force modeling with equivariant Bayesian neural networks†

IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY Digital discovery Pub Date : 2024-10-15 DOI:10.1039/D4DD00183D

Tim Rensmeyer, Ben Craig, Denis Kramer and Oliver Niggemann

{"title":"High accuracy uncertainty-aware interatomic force modeling with equivariant Bayesian neural networks†","authors":"Tim Rensmeyer, Ben Craig, Denis Kramer and Oliver Niggemann","doi":"10.1039/D4DD00183D","DOIUrl":null,"url":null,"abstract":"\r\n Ab initio molecular dynamics simulations of material properties have become a cornerstone in the development of novel materials for a wide range of applications such as battery technology and catalysis. Unfortunately, their high computational demand can make them unsuitable in many applications. Consequently, surrogate modeling via neural networks has become an active field of research. Two of the major obstacles to their practical application in many cases are assessing the reliability of the neural network predictions and the difficulty of generating suitable datasets to train the neural network in the first place. Bayesian neural networks offer a promising framework for modeling uncertainty, active learning and improving data efficiency and robustness by incorporating prior physical knowledge. However, due to the high computational demand and slow convergence of the gold standard approach of Monte Carlo Markov Chain (MCMC) sampling methods, variational inference via Monte Carlo dropout is currently the only sampling method successfully applied in this domain. Since MCMC methods have often displayed a superior quality in their uncertainty quantification, developing a suitable MCMC method in this domain would be a significant advance in making neural network-based molecular dynamics simulations more practically viable. In this paper, we demonstrate that convergence for state-of-the-art models with high-quality MCMC methods can still be achieved in a practical amount of time by introducing a novel parameter-specific adaptive step size scheme. In addition, we introduce a new stochastic neural network model based on the NequIP architecture and demonstrate that, when combined with our novel sampling algorithm, we obtain predictions with state-of-the-art accuracy as well as a significantly improved measure of uncertainty over Monte Carlo dropout. Lastly, we show that the proposed algorithm can even outperform deep ensembles while sampling from a single Markov chain.","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 11","pages":" 2356-2366"},"PeriodicalIF":6.2000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/dd/d4dd00183d?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital discovery","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2024/dd/d4dd00183d","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Ab initio molecular dynamics simulations of material properties have become a cornerstone in the development of novel materials for a wide range of applications such as battery technology and catalysis. Unfortunately, their high computational demand can make them unsuitable in many applications. Consequently, surrogate modeling via neural networks has become an active field of research. Two of the major obstacles to their practical application in many cases are assessing the reliability of the neural network predictions and the difficulty of generating suitable datasets to train the neural network in the first place. Bayesian neural networks offer a promising framework for modeling uncertainty, active learning and improving data efficiency and robustness by incorporating prior physical knowledge. However, due to the high computational demand and slow convergence of the gold standard approach of Monte Carlo Markov Chain (MCMC) sampling methods, variational inference via Monte Carlo dropout is currently the only sampling method successfully applied in this domain. Since MCMC methods have often displayed a superior quality in their uncertainty quantification, developing a suitable MCMC method in this domain would be a significant advance in making neural network-based molecular dynamics simulations more practically viable. In this paper, we demonstrate that convergence for state-of-the-art models with high-quality MCMC methods can still be achieved in a practical amount of time by introducing a novel parameter-specific adaptive step size scheme. In addition, we introduce a new stochastic neural network model based on the NequIP architecture and demonstrate that, when combined with our novel sampling algorithm, we obtain predictions with state-of-the-art accuracy as well as a significantly improved measure of uncertainty over Monte Carlo dropout. Lastly, we show that the proposed algorithm can even outperform deep ensembles while sampling from a single Markov chain.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用等变贝叶斯神经网络进行高精度不确定性感知原子力建模†。

材料特性的 Ab initio 分子动力学模拟已成为开发新型材料的基石，广泛应用于电池技术和催化等领域。遗憾的是，由于计算量大，在许多应用中并不适用。因此，通过神经网络进行代用建模已成为一个活跃的研究领域。在许多情况下，神经网络实际应用的两个主要障碍是评估神经网络预测的可靠性，以及难以首先生成合适的数据集来训练神经网络。贝叶斯神经网络为不确定性建模、主动学习以及通过结合先验物理知识提高数据效率和鲁棒性提供了一个前景广阔的框架。然而，由于蒙特卡洛马尔可夫链（MCMC）采样方法这一黄金标准方法的计算需求高、收敛速度慢，通过蒙特卡洛剔除进行变分推理是目前成功应用于这一领域的唯一采样方法。由于 MCMC 方法通常在不确定性量化方面表现出更高的质量，因此在这一领域开发一种合适的 MCMC 方法将是一项重大进步，可使基于神经网络的分子动力学模拟更加切实可行。在本文中，我们通过引入新颖的特定参数自适应步长方案，证明了高质量 MCMC 方法仍可在实际时间内实现最先进模型的收敛。此外，我们还引入了基于 NequIP 架构的新型随机神经网络模型，并证明结合我们的新型采样算法，我们不仅能获得最先进的预测精度，还能显著改善蒙特卡罗遗漏的不确定性度量。最后，我们证明了所提出的算法甚至可以在从单个马尔可夫链采样时超越深度集合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Digital discovery

CiteScore

2.80

自引率

0.00%

发文量