首页 > 最新文献

Machine Learning Science and Technology最新文献

英文 中文
Autoencoders for discovering manifold dimension and coordinates in data from complex dynamical systems 发现复杂动力系统数据中流形维度和坐标的自动编码器
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-29 DOI: 10.1088/2632-2153/ad4ba5
Kevin Zeng, Carlos E Pérez De Jesús, Andrew J Fox and Michael D Graham
While many phenomena in physics and engineering are formally high-dimensional, their long-time dynamics often live on a lower-dimensional manifold. The present work introduces an autoencoder framework that combines implicit regularization with internal linear layers and L2 regularization (weight decay) to automatically estimate the underlying dimensionality of a data set, produce an orthogonal manifold coordinate system, and provide the mapping functions between the ambient space and manifold space, allowing for out-of-sample projections. We validate our framework’s ability to estimate the manifold dimension for a series of datasets from dynamical systems of varying complexities and compare to other state-of-the-art estimators. We analyze the training dynamics of the network to glean insight into the mechanism of low-rank learning and find that collectively each of the implicit regularizing layers compound the low-rank representation and even self-correct during training. Analysis of gradient descent dynamics for this architecture in the linear case reveals the role of the internal linear layers in leading to faster decay of a ‘collective weight variable’ incorporating all layers, and the role of weight decay in breaking degeneracies and thus driving convergence along directions in which no decay would occur in its absence. We show that this framework can be naturally extended for applications of state-space modeling and forecasting by generating a data-driven dynamic model of a spatiotemporally chaotic partial differential equation using only the manifold coordinates. Finally, we demonstrate that our framework is robust to hyperparameter choices.
虽然物理学和工程学中的许多现象在形式上都是高维的,但它们的长期动态变化往往是在低维流形上进行的。本研究介绍了一种自动编码器框架,它将隐式正则化与内部线性层和 L2 正则化(权重衰减)相结合,自动估算数据集的底层维度,生成正交流形坐标系,并提供环境空间与流形空间之间的映射函数,从而实现样本外投影。我们验证了我们的框架估算流形维度的能力,并与其他最先进的估算器进行了比较。我们分析了网络的训练动态,以深入了解低秩学习的机制,并发现每个隐式正则化层都在集体地复合低秩表示,甚至在训练过程中进行自我修正。对线性情况下该架构的梯度下降动态分析揭示了内部线性层在导致包含所有层的 "集体权重变量 "更快衰减中的作用,以及权重衰减在打破退化中的作用,从而推动沿着没有衰减时不会发生的方向收敛。我们展示了这一框架可以自然地扩展到状态空间建模和预测的应用中,只需使用流形坐标就能生成时空混沌偏微分方程的数据驱动动态模型。最后,我们证明了我们的框架对超参数选择的鲁棒性。
{"title":"Autoencoders for discovering manifold dimension and coordinates in data from complex dynamical systems","authors":"Kevin Zeng, Carlos E Pérez De Jesús, Andrew J Fox and Michael D Graham","doi":"10.1088/2632-2153/ad4ba5","DOIUrl":"https://doi.org/10.1088/2632-2153/ad4ba5","url":null,"abstract":"While many phenomena in physics and engineering are formally high-dimensional, their long-time dynamics often live on a lower-dimensional manifold. The present work introduces an autoencoder framework that combines implicit regularization with internal linear layers and L2 regularization (weight decay) to automatically estimate the underlying dimensionality of a data set, produce an orthogonal manifold coordinate system, and provide the mapping functions between the ambient space and manifold space, allowing for out-of-sample projections. We validate our framework’s ability to estimate the manifold dimension for a series of datasets from dynamical systems of varying complexities and compare to other state-of-the-art estimators. We analyze the training dynamics of the network to glean insight into the mechanism of low-rank learning and find that collectively each of the implicit regularizing layers compound the low-rank representation and even self-correct during training. Analysis of gradient descent dynamics for this architecture in the linear case reveals the role of the internal linear layers in leading to faster decay of a ‘collective weight variable’ incorporating all layers, and the role of weight decay in breaking degeneracies and thus driving convergence along directions in which no decay would occur in its absence. We show that this framework can be naturally extended for applications of state-space modeling and forecasting by generating a data-driven dynamic model of a spatiotemporally chaotic partial differential equation using only the manifold coordinates. Finally, we demonstrate that our framework is robust to hyperparameter choices.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"9 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141195813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised learning of quantum many-body scars using intrinsic dimension 利用本征维度对量子多体伤痕进行无监督学习
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-28 DOI: 10.1088/2632-2153/ad4d3f
Harvey Cao, Dimitris G Angelakis and Daniel Leykam
Quantum many-body scarred systems contain both thermal and non-thermal scar eigenstates in their spectra. When these systems are quenched from special initial states which share high overlap with scar eigenstates, the system undergoes dynamics with atypically slow relaxation and periodic revival. This scarring phenomenon poses a potential avenue for circumventing decoherence in various quantum engineering applications. Given access to an unknown scar system, current approaches for identification of special states leading to non-thermal dynamics rely on costly measures such as entanglement entropy. In this work, we show how two dimensionality reduction techniques, multidimensional scaling and intrinsic dimension estimation, can be used to learn structural properties of dynamics in the PXP model and distinguish between thermal and scar initial states. The latter method is shown to be robust against limited sample sizes and experimental measurement errors.
量子多体瘢痕系统的光谱中同时包含热瘢痕和非热瘢痕特征状态。当这些系统从与疤痕特征态高度重叠的特殊初始态淬火时,系统会发生异常缓慢的弛豫和周期性恢复的动力学过程。这种疤痕现象为在各种量子工程应用中规避退相干现象提供了潜在的途径。在获取未知痕量系统的情况下,目前识别导致非热动力学的特殊状态的方法依赖于昂贵的测量方法,如纠缠熵。在这项工作中,我们展示了如何利用多维缩放和本征维度估计这两种降维技术来学习 PXP 模型的动力学结构特性,并区分热态和疤痕初始态。后一种方法对有限的样本量和实验测量误差具有鲁棒性。
{"title":"Unsupervised learning of quantum many-body scars using intrinsic dimension","authors":"Harvey Cao, Dimitris G Angelakis and Daniel Leykam","doi":"10.1088/2632-2153/ad4d3f","DOIUrl":"https://doi.org/10.1088/2632-2153/ad4d3f","url":null,"abstract":"Quantum many-body scarred systems contain both thermal and non-thermal scar eigenstates in their spectra. When these systems are quenched from special initial states which share high overlap with scar eigenstates, the system undergoes dynamics with atypically slow relaxation and periodic revival. This scarring phenomenon poses a potential avenue for circumventing decoherence in various quantum engineering applications. Given access to an unknown scar system, current approaches for identification of special states leading to non-thermal dynamics rely on costly measures such as entanglement entropy. In this work, we show how two dimensionality reduction techniques, multidimensional scaling and intrinsic dimension estimation, can be used to learn structural properties of dynamics in the PXP model and distinguish between thermal and scar initial states. The latter method is shown to be robust against limited sample sizes and experimental measurement errors.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"24 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141168597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning the dynamics of a one-dimensional plasma model with graph neural networks 利用图神经网络学习一维等离子体模型的动力学特性
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-27 DOI: 10.1088/2632-2153/ad4ba6
Diogo D Carvalho, Diogo R Ferreira and Luís O Silva
We explore the possibility of fully replacing a plasma physics kinetic simulator with a graph neural network-based simulator. We focus on this class of surrogate models given the similarity between their message-passing update mechanism and the traditional physics solver update, and the possibility of enforcing known physical priors into the graph construction and update. We show that our model learns the kinetic plasma dynamics of the one-dimensional plasma model, a predecessor of contemporary kinetic plasma simulation codes, and recovers a wide range of well-known kinetic plasma processes, including plasma thermalization, electrostatic fluctuations about thermal equilibrium, and the drag on a fast sheet and Landau damping. We compare the performance against the original plasma model in terms of run-time, conservation laws, and temporal evolution of key physical quantities. The limitations of the model are presented and possible directions for higher-dimensional surrogate models for kinetic plasmas are discussed.
我们探索了用基于图神经网络的模拟器完全取代等离子体物理动力学模拟器的可能性。鉴于其消息传递更新机制与传统物理求解器更新之间的相似性,以及在图构建和更新中强制执行已知物理先验的可能性,我们将重点放在这类代理模型上。我们的研究表明,我们的模型学习了一维等离子体模型(当代动力学等离子体模拟代码的前身)的动力学等离子体动力学,并恢复了一系列众所周知的动力学等离子体过程,包括等离子体热化、关于热平衡的静电波动、快片上的阻力和朗道阻尼。我们从运行时间、守恒定律和关键物理量的时间演化等方面比较了原始等离子体模型的性能。介绍了模型的局限性,并讨论了动力学等离子体高维代用模型的可能方向。
{"title":"Learning the dynamics of a one-dimensional plasma model with graph neural networks","authors":"Diogo D Carvalho, Diogo R Ferreira and Luís O Silva","doi":"10.1088/2632-2153/ad4ba6","DOIUrl":"https://doi.org/10.1088/2632-2153/ad4ba6","url":null,"abstract":"We explore the possibility of fully replacing a plasma physics kinetic simulator with a graph neural network-based simulator. We focus on this class of surrogate models given the similarity between their message-passing update mechanism and the traditional physics solver update, and the possibility of enforcing known physical priors into the graph construction and update. We show that our model learns the kinetic plasma dynamics of the one-dimensional plasma model, a predecessor of contemporary kinetic plasma simulation codes, and recovers a wide range of well-known kinetic plasma processes, including plasma thermalization, electrostatic fluctuations about thermal equilibrium, and the drag on a fast sheet and Landau damping. We compare the performance against the original plasma model in terms of run-time, conservation laws, and temporal evolution of key physical quantities. The limitations of the model are presented and possible directions for higher-dimensional surrogate models for kinetic plasmas are discussed.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"9 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141168450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid quantum physics-informed neural networks for simulating computational fluid dynamics in complex shapes 用于模拟复杂形状计算流体动力学的混合量子物理信息神经网络
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-21 DOI: 10.1088/2632-2153/ad43b2
Alexandr Sedykh, Maninadh Podapaka, Asel Sagingalieva, Karan Pinto, Markus Pflitsch and Alexey Melnikov
Finding the distribution of the velocities and pressures of a fluid by solving the Navier–Stokes equations is a principal task in the chemical, energy, and pharmaceutical industries, as well as in mechanical engineering and in design of pipeline systems. With existing solvers, such as OpenFOAM and Ansys, simulations of fluid dynamics in intricate geometries are computationally expensive and require re-simulation whenever the geometric parameters or the initial and boundary conditions are altered. Physics-informed neural networks (PINNs) are a promising tool for simulating fluid flows in complex geometries, as they can adapt to changes in the geometry and mesh definitions, allowing for generalization across fluid parameters and transfer learning across different shapes. We present a hybrid quantum PINN (HQPINN) that simulates laminar fluid flow in 3D Y-shaped mixers. Our approach combines the expressive power of a quantum model with the flexibility of a PINN, resulting in a 21% higher accuracy compared to a purely classical neural network. Our findings highlight the potential of machine learning approaches, and in particular HQPINN, for complex shape optimization tasks in computational fluid dynamics. By improving the accuracy of fluid simulations in complex geometries, our research using hybrid quantum models contributes to the development of more efficient and reliable fluid dynamics solvers.
通过求解纳维-斯托克斯方程来找到流体的速度和压力分布是化学、能源和制药行业以及机械工程和管道系统设计中的一项主要任务。利用现有的求解器(如 OpenFOAM 和 Ansys)模拟复杂几何形状中的流体动力学计算成本很高,而且每当几何参数或初始条件和边界条件发生变化时,都需要重新模拟。物理信息神经网络(PINN)是模拟复杂几何体中流体流动的一种很有前途的工具,因为它能适应几何体和网格定义的变化,从而实现流体参数的泛化和不同形状的迁移学习。我们提出的混合量子 PINN(HQPINN)可以模拟三维 Y 型混合器中的层流流体流动。我们的方法结合了量子模型的表现力和 PINN 的灵活性,与纯经典神经网络相比,准确率提高了 21%。我们的研究结果凸显了机器学习方法,特别是 HQPINN,在计算流体动力学复杂形状优化任务中的潜力。通过提高复杂几何形状中流体模拟的准确性,我们使用混合量子模型的研究有助于开发更高效、更可靠的流体动力学求解器。
{"title":"Hybrid quantum physics-informed neural networks for simulating computational fluid dynamics in complex shapes","authors":"Alexandr Sedykh, Maninadh Podapaka, Asel Sagingalieva, Karan Pinto, Markus Pflitsch and Alexey Melnikov","doi":"10.1088/2632-2153/ad43b2","DOIUrl":"https://doi.org/10.1088/2632-2153/ad43b2","url":null,"abstract":"Finding the distribution of the velocities and pressures of a fluid by solving the Navier–Stokes equations is a principal task in the chemical, energy, and pharmaceutical industries, as well as in mechanical engineering and in design of pipeline systems. With existing solvers, such as OpenFOAM and Ansys, simulations of fluid dynamics in intricate geometries are computationally expensive and require re-simulation whenever the geometric parameters or the initial and boundary conditions are altered. Physics-informed neural networks (PINNs) are a promising tool for simulating fluid flows in complex geometries, as they can adapt to changes in the geometry and mesh definitions, allowing for generalization across fluid parameters and transfer learning across different shapes. We present a hybrid quantum PINN (HQPINN) that simulates laminar fluid flow in 3D Y-shaped mixers. Our approach combines the expressive power of a quantum model with the flexibility of a PINN, resulting in a 21% higher accuracy compared to a purely classical neural network. Our findings highlight the potential of machine learning approaches, and in particular HQPINN, for complex shape optimization tasks in computational fluid dynamics. By improving the accuracy of fluid simulations in complex geometries, our research using hybrid quantum models contributes to the development of more efficient and reliable fluid dynamics solvers.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"56 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unifying O(3) equivariant neural networks design with tensor-network formalism 用张量网络形式主义统一 O(3) 等变神经网络设计
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-20 DOI: 10.1088/2632-2153/ad4a04
Zimu Li, Zihan Pengmei, Han Zheng, Erik Thiede, Junyu Liu and Risi Kondor
Many learning tasks, including learning potential energy surfaces from ab initio calculations, involve global spatial symmetries and permutational symmetry between atoms or general particles. Equivariant graph neural networks are a standard approach to such problems, with one of the most successful methods employing tensor products between various tensors that transform under the spatial group. However, as the number of different tensors and the complexity of relationships between them increase, maintaining parsimony and equivariance becomes increasingly challenging. In this paper, we propose using fusion diagrams, a technique widely employed in simulating SU(2)-symmetric quantum many-body problems, to design new spatial equivariant components for neural networks. This results in a diagrammatic approach to constructing novel neural network architectures. When applied to particles within a given local neighborhood, the resulting components, which we term ‘fusion blocks,’ serve as universal approximators of any continuous equivariant function defined on the neighborhood. We incorporate a fusion block into pre-existing equivariant architectures (Cormorant and MACE), leading to improved performance with fewer parameters on a range of challenging chemical problems. Furthermore, we apply group-equivariant neural networks to study non-adiabatic molecular dynamics of stilbene cis-trans isomerization. Our approach, which combines tensor networks with equivariant neural networks, suggests a potentially fruitful direction for designing more expressive equivariant neural networks.
许多学习任务,包括从原子动力学计算中学习势能面,都涉及原子或一般粒子之间的全局空间对称性和排列对称性。等变图神经网络是解决此类问题的标准方法,其中最成功的方法之一是利用在空间群下变换的各种张量之间的张量乘积。然而,随着不同张量的数量和它们之间关系的复杂性的增加,保持解析性和等差性变得越来越具有挑战性。在本文中,我们建议使用融合图(一种广泛应用于模拟 SU(2)-symmetric 量子多体问题的技术)为神经网络设计新的空间等差数元件。这就产生了一种构建新型神经网络架构的图解方法。当应用于给定局部邻域内的粒子时,由此产生的组件(我们称之为 "融合块")可作为邻域上定义的任何连续等变函数的通用近似值。我们在已有的等变架构(Cormorant 和 MACE)中加入了融合块,从而在一系列具有挑战性的化学问题上以更少的参数提高了性能。此外,我们还将群等变神经网络用于研究二苯乙烯顺反异构的非绝热分子动力学。我们的方法将张量网络与等变神经网络相结合,为设计更具表现力的等变神经网络提供了一个潜在的富有成效的方向。
{"title":"Unifying O(3) equivariant neural networks design with tensor-network formalism","authors":"Zimu Li, Zihan Pengmei, Han Zheng, Erik Thiede, Junyu Liu and Risi Kondor","doi":"10.1088/2632-2153/ad4a04","DOIUrl":"https://doi.org/10.1088/2632-2153/ad4a04","url":null,"abstract":"Many learning tasks, including learning potential energy surfaces from ab initio calculations, involve global spatial symmetries and permutational symmetry between atoms or general particles. Equivariant graph neural networks are a standard approach to such problems, with one of the most successful methods employing tensor products between various tensors that transform under the spatial group. However, as the number of different tensors and the complexity of relationships between them increase, maintaining parsimony and equivariance becomes increasingly challenging. In this paper, we propose using fusion diagrams, a technique widely employed in simulating SU(2)-symmetric quantum many-body problems, to design new spatial equivariant components for neural networks. This results in a diagrammatic approach to constructing novel neural network architectures. When applied to particles within a given local neighborhood, the resulting components, which we term ‘fusion blocks,’ serve as universal approximators of any continuous equivariant function defined on the neighborhood. We incorporate a fusion block into pre-existing equivariant architectures (Cormorant and MACE), leading to improved performance with fewer parameters on a range of challenging chemical problems. Furthermore, we apply group-equivariant neural networks to study non-adiabatic molecular dynamics of stilbene cis-trans isomerization. Our approach, which combines tensor networks with equivariant neural networks, suggests a potentially fruitful direction for designing more expressive equivariant neural networks.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"46 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141146255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature selection for high-dimensional neural network potentials with the adaptive group lasso 利用自适应群套索为高维神经网络电位进行特征选择
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-16 DOI: 10.1088/2632-2153/ad450e
Johannes Sandberg, Thomas Voigtmann, Emilie Devijver and Noel Jakse
Neural network potentials are a powerful tool for atomistic simulations, allowing to accurately reproduce ab initio potential energy surfaces with computational performance approaching classical force fields. A central component of such potentials is the transformation of atomic positions into a set of atomic features in a most efficient and informative way. In this work, a feature selection method is introduced for high dimensional neural network potentials, based on the adaptive group lasso (AGL) approach. It is shown that the use of an embedded method, taking into account the interplay between features and their action in the estimator, is necessary to optimize the number of features. The method’s efficiency is tested on three different monoatomic systems, including Lennard–Jones as a simple test case, Aluminium as a system characterized by predominantly radial interactions, and Boron as representative of a system with strongly directional components in the interactions. The AGL is compared with unsupervised filter methods and found to perform consistently better in reducing the number of features needed to reproduce the reference simulation data at a similar level of accuracy as the starting feature set. In particular, our results show the importance of taking into account model predictions in feature selection for interatomic potentials.
神经网络势能是原子模拟的强大工具,可以精确再现原子势能面,计算性能接近经典力场。这类势能的一个核心组成部分是以最有效、信息量最大的方式将原子位置转换成一组原子特征。在这项工作中,基于自适应群套索(AGL)方法,为高维神经网络势能引入了一种特征选择方法。研究表明,考虑到特征之间的相互作用及其在估计器中的作用,有必要使用嵌入式方法来优化特征数量。该方法的效率在三个不同的单原子系统上进行了测试,包括作为简单测试案例的伦纳德-琼斯系统、主要以径向相互作用为特征的铝系统,以及代表相互作用具有强烈方向性成分的硼系统。我们将 AGL 与无监督滤波方法进行了比较,结果发现 AGL 在减少重现参考模拟数据所需的特征数量方面一直表现较好,其准确度与起始特征集的准确度相当。特别是,我们的结果表明了在原子间电位的特征选择中考虑模型预测的重要性。
{"title":"Feature selection for high-dimensional neural network potentials with the adaptive group lasso","authors":"Johannes Sandberg, Thomas Voigtmann, Emilie Devijver and Noel Jakse","doi":"10.1088/2632-2153/ad450e","DOIUrl":"https://doi.org/10.1088/2632-2153/ad450e","url":null,"abstract":"Neural network potentials are a powerful tool for atomistic simulations, allowing to accurately reproduce ab initio potential energy surfaces with computational performance approaching classical force fields. A central component of such potentials is the transformation of atomic positions into a set of atomic features in a most efficient and informative way. In this work, a feature selection method is introduced for high dimensional neural network potentials, based on the adaptive group lasso (AGL) approach. It is shown that the use of an embedded method, taking into account the interplay between features and their action in the estimator, is necessary to optimize the number of features. The method’s efficiency is tested on three different monoatomic systems, including Lennard–Jones as a simple test case, Aluminium as a system characterized by predominantly radial interactions, and Boron as representative of a system with strongly directional components in the interactions. The AGL is compared with unsupervised filter methods and found to perform consistently better in reducing the number of features needed to reproduce the reference simulation data at a similar level of accuracy as the starting feature set. In particular, our results show the importance of taking into account model predictions in feature selection for interatomic potentials.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"51 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141060611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multifidelity approach to continual learning for physical systems 物理系统持续学习的多保真方法
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-15 DOI: 10.1088/2632-2153/ad45b2
Amanda Howard, Yucheng Fu and Panos Stinis
We introduce a novel continual learning method based on multifidelity deep neural networks. This method learns the correlation between the output of previously trained models and the desired output of the model on the current training dataset, limiting catastrophic forgetting. On its own the multifidelity continual learning method shows robust results that limit forgetting across several datasets. Additionally, we show that the multifidelity method can be combined with existing continual learning methods, including replay and memory aware synapses, to further limit catastrophic forgetting. The proposed continual learning method is especially suited for physical problems where the data satisfy the same physical laws on each domain, or for physics-informed neural networks, because in these cases we expect there to be a strong correlation between the output of the previous model and the model on the current training domain.
我们介绍了一种基于多保真深度神经网络的新型持续学习方法。这种方法可以学习先前训练模型的输出与当前训练数据集上模型期望输出之间的相关性,从而限制灾难性遗忘。多保真度持续学习方法本身就能在多个数据集上显示出限制遗忘的稳健结果。此外,我们还展示了多保真度方法可以与现有的持续学习方法相结合,包括重放和记忆感知突触,以进一步限制灾难性遗忘。所提出的持续学习方法尤其适用于数据在每个域上都满足相同物理定律的物理问题,或者适用于物理信息神经网络,因为在这些情况下,我们希望上一个模型的输出与当前训练域上的模型之间存在很强的相关性。
{"title":"A multifidelity approach to continual learning for physical systems","authors":"Amanda Howard, Yucheng Fu and Panos Stinis","doi":"10.1088/2632-2153/ad45b2","DOIUrl":"https://doi.org/10.1088/2632-2153/ad45b2","url":null,"abstract":"We introduce a novel continual learning method based on multifidelity deep neural networks. This method learns the correlation between the output of previously trained models and the desired output of the model on the current training dataset, limiting catastrophic forgetting. On its own the multifidelity continual learning method shows robust results that limit forgetting across several datasets. Additionally, we show that the multifidelity method can be combined with existing continual learning methods, including replay and memory aware synapses, to further limit catastrophic forgetting. The proposed continual learning method is especially suited for physical problems where the data satisfy the same physical laws on each domain, or for physics-informed neural networks, because in these cases we expect there to be a strong correlation between the output of the previous model and the model on the current training domain.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"67 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141060597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive machine learning-based investigation for the index-value prediction of 2G HTS coated conductor tapes 基于机器学习的 2G HTS 涂层导体带指数值预测综合研究
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-12 DOI: 10.1088/2632-2153/ad45b1
Shahin Alipour Bonab, Giacomo Russo, Antonio Morandi and Mohammad Yazdani-Asrami
Index-value, or so-called n-value prediction is of paramount importance for understanding the superconductors’ behaviour specially when modeling of superconductors is needed. This parameter is dependent on several physical quantities including temperature, the magnetic field’s density and orientation, and affects the behaviour of high-temperature superconducting devices made out of coated conductors in terms of losses and quench propagation. In this paper, a comprehensive analysis of many machine learning (ML) methods for estimating the n-value has been carried out. The results demonstrated that cascade forward neural network (CFNN) excels in this scope. Despite needing considerably higher training time when compared to the other attempted models, it performs at the highest accuracy, with 0.48 root mean squared error (RMSE) and 99.72% Pearson coefficient for goodness of fit (R-squared). In contrast, the rigid regression method had the worst predictions with 4.92 RMSE and 37.29% R-squared. Also, random forest, boosting methods, and simple feed forward neural network can be considered as a middle accuracy model with faster training time than CFNN. The findings of this study not only advance modeling of superconductors but also pave the way for applications and further research on ML plug-and-play codes for superconducting studies including modeling of superconducting devices.
索引值或所谓的 n 值预测对于理解超导体的行为至关重要,特别是在需要对超导体进行建模时。该参数取决于多个物理量,包括温度、磁场密度和方向,并在损耗和淬火传播方面影响由涂层导体制成的高温超导设备的行为。本文全面分析了用于估算 n 值的多种机器学习(ML)方法。结果表明,级联前向神经网络(CFNN)在这一领域表现出色。尽管与其他尝试过的模型相比,它需要的训练时间要长得多,但它的准确度却最高,均方根误差(RMSE)为 0.48,皮尔逊拟合系数(R-squared)为 99.72%。相比之下,刚性回归法的预测结果最差,均方根误差为 4.92,R 方为 37.29%。此外,随机森林、提升法和简单前馈神经网络可被视为中等精度模型,其训练时间比 CFNN 更快。本研究的发现不仅推进了超导建模,还为超导研究(包括超导设备建模)中的 ML 即插即用代码的应用和进一步研究铺平了道路。
{"title":"A comprehensive machine learning-based investigation for the index-value prediction of 2G HTS coated conductor tapes","authors":"Shahin Alipour Bonab, Giacomo Russo, Antonio Morandi and Mohammad Yazdani-Asrami","doi":"10.1088/2632-2153/ad45b1","DOIUrl":"https://doi.org/10.1088/2632-2153/ad45b1","url":null,"abstract":"Index-value, or so-called n-value prediction is of paramount importance for understanding the superconductors’ behaviour specially when modeling of superconductors is needed. This parameter is dependent on several physical quantities including temperature, the magnetic field’s density and orientation, and affects the behaviour of high-temperature superconducting devices made out of coated conductors in terms of losses and quench propagation. In this paper, a comprehensive analysis of many machine learning (ML) methods for estimating the n-value has been carried out. The results demonstrated that cascade forward neural network (CFNN) excels in this scope. Despite needing considerably higher training time when compared to the other attempted models, it performs at the highest accuracy, with 0.48 root mean squared error (RMSE) and 99.72% Pearson coefficient for goodness of fit (R-squared). In contrast, the rigid regression method had the worst predictions with 4.92 RMSE and 37.29% R-squared. Also, random forest, boosting methods, and simple feed forward neural network can be considered as a middle accuracy model with faster training time than CFNN. The findings of this study not only advance modeling of superconductors but also pave the way for applications and further research on ML plug-and-play codes for superconducting studies including modeling of superconducting devices.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"5 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning a general model of single phase flow in complex 3D porous media 学习复杂三维多孔介质中单相流的一般模型
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-10 DOI: 10.1088/2632-2153/ad45af
Javier E Santos, Agnese Marcato, Qinjun Kang, Mohamed Mehana, Daniel O’Malley, Hari Viswanathan, Nicholas Lubbers
Modeling effective transport properties of 3D porous media, such as permeability, at multiple scales is challenging as a result of the combined complexity of the pore structures and fluid physics—in particular, confinement effects which vary across the nanoscale to the microscale. While numerical simulation is possible, the computational cost is prohibitive for realistic domains, which are large and complex. Although machine learning (ML) models have been proposed to circumvent simulation, none so far has simultaneously accounted for heterogeneous 3D structures, fluid confinement effects, and multiple simulation resolutions. By utilizing numerous computer science techniques to improve the scalability of training, we have for the first time developed a general flow model that accounts for the pore-structure and corresponding physical phenomena at scales from Angstrom to the micrometer. Using synthetic computational domains for training, our ML model exhibits strong performance (R2 = 0.9) when tested on extremely diverse real domains at multiple scales.
由于孔隙结构和流体物理--尤其是从纳米尺度到微米尺度各不相同的约束效应--的综合复杂性,对三维多孔介质的有效传输特性(如渗透性)进行多尺度建模极具挑战性。虽然可以进行数值模拟,但对于庞大而复杂的现实领域来说,计算成本过高。虽然已经提出了机器学习(ML)模型来规避模拟,但迄今为止还没有一个模型能同时考虑异质三维结构、流体约束效应和多种模拟分辨率。通过利用大量计算机科学技术来提高训练的可扩展性,我们首次开发了一种通用流动模型,该模型考虑了从埃到微米尺度上的孔隙结构和相应的物理现象。我们的 ML 模型使用合成计算域进行训练,在多种尺度的极其多样化的真实域上进行测试时,表现出很强的性能(R2 = 0.9)。
{"title":"Learning a general model of single phase flow in complex 3D porous media","authors":"Javier E Santos, Agnese Marcato, Qinjun Kang, Mohamed Mehana, Daniel O’Malley, Hari Viswanathan, Nicholas Lubbers","doi":"10.1088/2632-2153/ad45af","DOIUrl":"https://doi.org/10.1088/2632-2153/ad45af","url":null,"abstract":"Modeling effective transport properties of 3D porous media, such as permeability, at multiple scales is challenging as a result of the combined complexity of the pore structures and fluid physics—in particular, confinement effects which vary across the nanoscale to the microscale. While numerical simulation is possible, the computational cost is prohibitive for realistic domains, which are large and complex. Although machine learning (ML) models have been proposed to circumvent simulation, none so far has simultaneously accounted for heterogeneous 3D structures, fluid confinement effects, and multiple simulation resolutions. By utilizing numerous computer science techniques to improve the scalability of training, we have for the first time developed a general flow model that accounts for the pore-structure and corresponding physical phenomena at scales from Angstrom to the micrometer. Using synthetic computational domains for training, our ML model exhibits strong performance (<italic toggle=\"yes\">R</italic>\u0000<sup>2</sup> = 0.9) when tested on extremely diverse real domains at multiple scales.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"187 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Closed-loop Koopman operator approximation 闭环库普曼算子近似值
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-10 DOI: 10.1088/2632-2153/ad45b0
Steven Dahdah, James Richard Forbes
This paper proposes a method to identify a Koopman model of a feedback-controlled system given a known controller. The Koopman operator allows a nonlinear system to be rewritten as an infinite-dimensional linear system by viewing it in terms of an infinite set of lifting functions. A finite-dimensional approximation of the Koopman operator can be identified from data by choosing a finite subset of lifting functions and solving a regression problem in the lifted space. Existing methods are designed to identify open-loop systems. However, it is impractical or impossible to run experiments on some systems, such as unstable systems, in an open-loop fashion. The proposed method leverages the linearity of the Koopman operator, along with knowledge of the controller and the structure of the closed-loop (CL) system, to simultaneously identify the CL and plant systems. The advantages of the proposed CL Koopman operator approximation method are demonstrated in simulation using a Duffing oscillator and experimentally using a rotary inverted pendulum system. An open-source software implementation of the proposed method is publicly available, along with the experimental dataset generated for this paper.
本文提出了一种在已知控制器的情况下识别反馈控制系统库普曼模型的方法。通过库普曼算子,可将非线性系统视为一个无穷大的提升函数集,从而将其重写为一个无穷维的线性系统。通过选择有限的提升函数子集,并解决提升空间中的回归问题,就能从数据中识别出 Koopman 算子的有限维近似值。现有方法旨在识别开环系统。然而,对某些系统(如不稳定系统)进行开环实验是不切实际或不可能的。所提出的方法利用 Koopman 算子的线性以及控制器知识和闭环 (CL) 系统结构,可同时识别闭环系统和工厂系统。通过使用 Duffing 振荡器进行仿真和使用旋转倒立摆系统进行实验,证明了所提出的 CL Koopman 算子近似方法的优势。本文公开了所提方法的开源软件实现,以及为本文生成的实验数据集。
{"title":"Closed-loop Koopman operator approximation","authors":"Steven Dahdah, James Richard Forbes","doi":"10.1088/2632-2153/ad45b0","DOIUrl":"https://doi.org/10.1088/2632-2153/ad45b0","url":null,"abstract":"This paper proposes a method to identify a Koopman model of a feedback-controlled system given a known controller. The Koopman operator allows a nonlinear system to be rewritten as an infinite-dimensional linear system by viewing it in terms of an infinite set of lifting functions. A finite-dimensional approximation of the Koopman operator can be identified from data by choosing a finite subset of lifting functions and solving a regression problem in the lifted space. Existing methods are designed to identify open-loop systems. However, it is impractical or impossible to run experiments on some systems, such as unstable systems, in an open-loop fashion. The proposed method leverages the linearity of the Koopman operator, along with knowledge of the controller and the structure of the closed-loop (CL) system, to simultaneously identify the CL and plant systems. The advantages of the proposed CL Koopman operator approximation method are demonstrated in simulation using a Duffing oscillator and experimentally using a rotary inverted pendulum system. An open-source software implementation of the proposed method is publicly available, along with the experimental dataset generated for this paper.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"44 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140932006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine Learning Science and Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1