首页 > 最新文献

Machine Learning Science and Technology最新文献

英文 中文
Chaotic attractor reconstruction using small reservoirs—the influence of topology 利用小型水库重构混沌吸引子--拓扑结构的影响
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-27 DOI: 10.1088/2632-2153/ad6ee8
Lina Jaurigue
Forecasting timeseries based upon measured data is needed in a wide range of applications and has been the subject of extensive research. A particularly challenging task is the forecasting of timeseries generated by chaotic dynamics. In recent years reservoir computing has been shown to be an effective method of forecasting chaotic dynamics and reconstructing chaotic attractors from data. In this work strides are made toward smaller and lower complexity reservoirs with the goal of improved hardware implementability and more reliable production of adequate surrogate models. We show that a reservoir of uncoupled nodes more reliably produces long term timeseries predictions than more complex reservoir topologies. We then link the improved attractor reconstruction of the uncoupled reservoir with smaller spectral radii of the resulting surrogate systems. These results indicate that, the node degree plays an important role in determining whether the desired dynamics will be stable in the autonomous surrogate system which is attained via closed-loop operation of the trained reservoir. In terms of hardware implementability, uncoupled nodes would allow for greater freedom in the hardware architecture because no complex coupling setups are needed and because, for uncoupled nodes, the system response is equivalent for space and time multiplexing.
在广泛的应用中,需要根据测量数据对时序进行预测,这已成为广泛研究的主题。一项特别具有挑战性的任务是预测由混沌动力学产生的时序。近年来,储层计算已被证明是预测混沌动力学和从数据中重建混沌吸引子的有效方法。在这项工作中,我们正朝着更小、更低复杂度的蓄水池迈进,目标是提高硬件的可实施性,并更可靠地生成适当的代用模型。我们表明,与更复杂的储层拓扑结构相比,无耦合节点储层能更可靠地生成长期时间序列预测。然后,我们将无耦合水库的改进吸引子重构与由此产生的代用系统较小的频谱半径联系起来。这些结果表明,节点度在决定自主代理系统中预期动态是否稳定方面起着重要作用,而自主代理系统是通过训练水库的闭环运行实现的。就硬件的可实施性而言,无耦合节点将使硬件结构具有更大的自由度,因为不需要复杂的耦合设置,而且对于无耦合节点,系统响应在空间和时间多路复用方面是等效的。
{"title":"Chaotic attractor reconstruction using small reservoirs—the influence of topology","authors":"Lina Jaurigue","doi":"10.1088/2632-2153/ad6ee8","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6ee8","url":null,"abstract":"Forecasting timeseries based upon measured data is needed in a wide range of applications and has been the subject of extensive research. A particularly challenging task is the forecasting of timeseries generated by chaotic dynamics. In recent years reservoir computing has been shown to be an effective method of forecasting chaotic dynamics and reconstructing chaotic attractors from data. In this work strides are made toward smaller and lower complexity reservoirs with the goal of improved hardware implementability and more reliable production of adequate surrogate models. We show that a reservoir of uncoupled nodes more reliably produces long term timeseries predictions than more complex reservoir topologies. We then link the improved attractor reconstruction of the uncoupled reservoir with smaller spectral radii of the resulting surrogate systems. These results indicate that, the node degree plays an important role in determining whether the desired dynamics will be stable in the autonomous surrogate system which is attained via closed-loop operation of the trained reservoir. In terms of hardware implementability, uncoupled nodes would allow for greater freedom in the hardware architecture because no complex coupling setups are needed and because, for uncoupled nodes, the system response is equivalent for space and time multiplexing.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"13 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
JefiAtten: an attention-based neural network model for solving Maxwell’s equations with charge and current sources JefiAtten:基于注意力的神经网络模型,用于求解带电荷源和电流源的麦克斯韦方程
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1088/2632-2153/ad6ee9
Ming-Yan Sun, Peng Xu, Jun-Jie Zhang, Tai-Jiao Du, Jian-Guo Wang
We present JefiAtten, a novel neural network model employing the attention mechanism to solve Maxwell’s equations efficiently. JefiAtten uses self-attention and cross-attention modules to understand the interplay between charge density, current density, and electromagnetic fields. Our results indicate that JefiAtten can generalize well to a range of scenarios, maintaining accuracy across various spatial distribution and handling amplitude variations. The model showcases an improvement in computation speed after training, compared to traditional integral methods. The adaptability of the model suggests potential for broader applications in computational physics, with further refinements to enhance its predictive capabilities and computational efficiency. Our work is a testament to the efficacy of integrating attention mechanisms with numerical simulations, marking a step forward in the quest for data-driven solutions to physical phenomena.
我们介绍的 JefiAtten 是一种新型神经网络模型,它利用注意力机制高效地求解麦克斯韦方程。JefiAtten 使用自我注意和交叉注意模块来理解电荷密度、电流密度和电磁场之间的相互作用。我们的研究结果表明,JefiAtten 可以很好地适用于各种情况,在各种空间分布情况下都能保持精度,并能处理振幅变化。与传统的积分方法相比,该模型在训练后的计算速度有所提高。该模型的适应性表明,它有潜力在计算物理学中得到更广泛的应用,并通过进一步改进来提高其预测能力和计算效率。我们的工作证明了将注意力机制与数值模拟相结合的有效性,标志着在寻求数据驱动的物理现象解决方案方面向前迈进了一步。
{"title":"JefiAtten: an attention-based neural network model for solving Maxwell’s equations with charge and current sources","authors":"Ming-Yan Sun, Peng Xu, Jun-Jie Zhang, Tai-Jiao Du, Jian-Guo Wang","doi":"10.1088/2632-2153/ad6ee9","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6ee9","url":null,"abstract":"We present JefiAtten, a novel neural network model employing the attention mechanism to solve Maxwell’s equations efficiently. JefiAtten uses self-attention and cross-attention modules to understand the interplay between charge density, current density, and electromagnetic fields. Our results indicate that JefiAtten can generalize well to a range of scenarios, maintaining accuracy across various spatial distribution and handling amplitude variations. The model showcases an improvement in computation speed after training, compared to traditional integral methods. The adaptability of the model suggests potential for broader applications in computational physics, with further refinements to enhance its predictive capabilities and computational efficiency. Our work is a testament to the efficacy of integrating attention mechanisms with numerical simulations, marking a step forward in the quest for data-driven solutions to physical phenomena.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"30 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven sparse modeling of oscillations in plasma space propulsion 等离子体空间推进器振荡的数据驱动稀疏建模
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1088/2632-2153/ad6d29
Borja Bayón-Buján, Mario Merino
An algorithm to obtain data-driven models of oscillatory phenomena in plasma space propulsion systems is presented, based on sparse regression (SINDy) and Pareto front analysis. The algorithm can incorporate physical constraints, use data bootstrapping for additional robustness, and fine-tuning to different metrics. Standard, weak and integral SINDy formulations are discussed and compared. The scheme is benchmarked for the case of breathing-mode oscillations in Hall effect thrusters, using particle-in-cell/fluid simulation data. Models of varying complexity are obtained for the average plasma properties, and shown to have a clear physical interpretability and agreement with existing 0D models in the literature. Lastly, the algorithm applied is also shown to enable the identification of physical subdomains with qualitatively different plasma dynamics, providing valuable information for more advanced modeling approaches.
基于稀疏回归(SINDy)和帕累托前沿分析,介绍了一种获得等离子空间推进系统振荡现象数据驱动模型的算法。该算法可以结合物理约束条件,使用数据引导以获得额外的鲁棒性,并根据不同的指标进行微调。对标准、弱和积分 SINDy 公式进行了讨论和比较。针对霍尔效应推进器中的呼吸模式振荡情况,利用舱内粒子/流体模拟数据对该方案进行了基准测试。针对等离子体的平均特性获得了不同复杂度的模型,结果表明这些模型与文献中现有的 0D 模型具有明确的物理可解释性和一致性。最后,所应用的算法还证明能够识别具有质的不同等离子体动力学的物理子域,为更先进的建模方法提供有价值的信息。
{"title":"Data-driven sparse modeling of oscillations in plasma space propulsion","authors":"Borja Bayón-Buján, Mario Merino","doi":"10.1088/2632-2153/ad6d29","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6d29","url":null,"abstract":"An algorithm to obtain data-driven models of oscillatory phenomena in plasma space propulsion systems is presented, based on sparse regression (SINDy) and Pareto front analysis. The algorithm can incorporate physical constraints, use data bootstrapping for additional robustness, and fine-tuning to different metrics. Standard, weak and integral SINDy formulations are discussed and compared. The scheme is benchmarked for the case of breathing-mode oscillations in Hall effect thrusters, using particle-in-cell/fluid simulation data. Models of varying complexity are obtained for the average plasma properties, and shown to have a clear physical interpretability and agreement with existing 0D models in the literature. Lastly, the algorithm applied is also shown to enable the identification of physical subdomains with qualitatively different plasma dynamics, providing valuable information for more advanced modeling approaches.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"11 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142225270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active causal learning for decoding chemical complexities with targeted interventions 通过主动因果学习解码复杂化学物质,进行有针对性的干预
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1088/2632-2153/ad6feb
Zachary R Fox, Ayana Ghosh
Predicting and enhancing inherent properties based on molecular structures is paramount to design tasks in medicine, materials science, and environmental management. Most of the current machine learning and deep learning approaches have become standard for predictions, but they face challenges when applied across different datasets due to reliance on correlations between molecular representation and target properties. These approaches typically depend on large datasets to capture the diversity within the chemical space, facilitating a more accurate approximation, interpolation, or extrapolation of the chemical behavior of molecules. In our research, we introduce an active learning approach that discerns underlying cause-effect relationships through strategic sampling with the use of a graph loss function. This method identifies the smallest subset of the dataset capable of encoding the most information representative of a much larger chemical space. The identified causal relations are then leveraged to conduct systematic interventions, optimizing the design task within a chemical space that the models have not encountered previously. While our implementation focused on the QM9 quantum-chemical dataset for a specific design task—finding molecules with a large dipole moment—our active causal learning approach, driven by intelligent sampling and interventions, holds potential for broader applications in molecular, materials design and discovery.
根据分子结构预测和增强固有特性对于医学、材料科学和环境管理领域的设计任务至关重要。目前大多数机器学习和深度学习方法已成为预测的标准,但由于依赖分子表征和目标特性之间的相关性,它们在应用于不同数据集时面临挑战。这些方法通常依赖于大型数据集来捕捉化学空间内的多样性,从而有助于更准确地近似、内插或外推分子的化学行为。在我们的研究中,我们引入了一种主动学习方法,通过使用图损失函数进行策略性采样来辨别潜在的因果关系。这种方法能识别出数据集的最小子集,该子集能够编码代表更大化学空间的最多信息。然后,利用确定的因果关系进行系统干预,在模型以前未曾接触过的化学空间内优化设计任务。虽然我们的实施侧重于 QM9 量子化学数据集的特定设计任务--寻找具有大偶极矩的分子--但我们的主动因果学习方法在智能采样和干预的驱动下,有望在分子、材料设计和发现领域得到更广泛的应用。
{"title":"Active causal learning for decoding chemical complexities with targeted interventions","authors":"Zachary R Fox, Ayana Ghosh","doi":"10.1088/2632-2153/ad6feb","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6feb","url":null,"abstract":"Predicting and enhancing inherent properties based on molecular structures is paramount to design tasks in medicine, materials science, and environmental management. Most of the current machine learning and deep learning approaches have become standard for predictions, but they face challenges when applied across different datasets due to reliance on correlations between molecular representation and target properties. These approaches typically depend on large datasets to capture the diversity within the chemical space, facilitating a more accurate approximation, interpolation, or extrapolation of the chemical behavior of molecules. In our research, we introduce an active learning approach that discerns underlying cause-effect relationships through strategic sampling with the use of a graph loss function. This method identifies the smallest subset of the dataset capable of encoding the most information representative of a much larger chemical space. The identified causal relations are then leveraged to conduct systematic interventions, optimizing the design task within a chemical space that the models have not encountered previously. While our implementation focused on the QM9 quantum-chemical dataset for a specific design task—finding molecules with a large dipole moment—our active causal learning approach, driven by intelligent sampling and interventions, holds potential for broader applications in molecular, materials design and discovery.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"68 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emergence of chemotactic strategies with multi-agent reinforcement learning 多代理强化学习催化策略的出现
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-21 DOI: 10.1088/2632-2153/ad5f73
Samuel Tovey, Christoph Lohrmann, Christian Holm
Reinforcement learning (RL) is a flexible and efficient method for programming micro-robots in complex environments. Here we investigate whether RL can provide insights into biological systems when trained to perform chemotaxis. Namely, whether we can learn about how intelligent agents process given information in order to swim towards a target. We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners’ training fails. We find that the RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the stochastic environment. We study the efficiency of the emergent policy and identify convergence in agent size and swim speeds. Finally, we study the strategy adopted by the RL algorithm to explain how the agents perform their tasks. To this end, we identify three emerging dominant strategies and several rare approaches taken. These strategies, whilst producing almost identical trajectories in simulation, are distinct and give insight into the possible mechanisms behind which biological agents explore their environment and respond to changing conditions.
强化学习(RL)是在复杂环境中对微型机器人进行编程的一种灵活高效的方法。在此,我们将研究当强化学习被训练用于执行趋化时,它是否能为生物系统提供洞察力。也就是说,我们能否了解智能代理如何处理给定信息以游向目标。我们运行了涵盖一系列代理形状、大小和游速的模拟,以确定生物游泳者的物理限制(即布朗运动)是否会导致强化学习器的训练失败。我们发现,只要物理条件允许,RL 代理就能执行趋化,在某些情况下,甚至在主动游动压倒随机环境之前就能执行趋化。我们研究了新兴策略的效率,并确定了代理规模和游动速度的收敛性。最后,我们研究了 RL 算法采用的策略,以解释代理如何执行任务。为此,我们确定了三种新出现的主导策略和几种罕见的方法。这些策略虽然在模拟中产生了几乎相同的轨迹,但却各具特色,让我们深入了解了生物制剂探索环境和应对不断变化的条件的可能机制。
{"title":"Emergence of chemotactic strategies with multi-agent reinforcement learning","authors":"Samuel Tovey, Christoph Lohrmann, Christian Holm","doi":"10.1088/2632-2153/ad5f73","DOIUrl":"https://doi.org/10.1088/2632-2153/ad5f73","url":null,"abstract":"Reinforcement learning (RL) is a flexible and efficient method for programming micro-robots in complex environments. Here we investigate whether RL can provide insights into biological systems when trained to perform chemotaxis. Namely, whether we can learn about how intelligent agents process given information in order to swim towards a target. We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners’ training fails. We find that the RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the stochastic environment. We study the efficiency of the emergent policy and identify convergence in agent size and swim speeds. Finally, we study the strategy adopted by the RL algorithm to explain how the agents perform their tasks. To this end, we identify three emerging dominant strategies and several rare approaches taken. These strategies, whilst producing almost identical trajectories in simulation, are distinct and give insight into the possible mechanisms behind which biological agents explore their environment and respond to changing conditions.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"97 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantum support vector data description for anomaly detection 用于异常检测的量子支持向量数据描述
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-21 DOI: 10.1088/2632-2153/ad6be8
Hyeondo Oh, Daniel K Park
Anomaly detection is a critical problem in data analysis and pattern recognition, finding applications in various domains. We introduce quantum support vector data description (QSVDD), an unsupervised learning algorithm designed for anomaly detection. QSVDD utilizes a shallow-depth quantum circuit to learn a minimum-volume hypersphere that tightly encloses normal data, tailored for the constraints of noisy intermediate-scale quantum (NISQ) computing. Simulation results on the MNIST and Fashion MNIST image datasets, as well as credit card fraud detection, demonstrate that QSVDD outperforms both quantum autoencoder and deep learning-based approaches under similar training conditions. Notably, QSVDD requires an extremely small number of model parameters, which increases logarithmically with the number of input qubits. This enables efficient learning with a simple training landscape, presenting a compact quantum machine learning model with strong performance for anomaly detection.
异常检测是数据分析和模式识别中的一个关键问题,在各个领域都有应用。我们介绍了量子支持向量数据描述(QSVDD),这是一种专为异常检测设计的无监督学习算法。QSVDD 利用浅深度量子电路来学习一个最小体积的超球,该超球紧紧包裹着正常数据,专为噪声中等规模量子计算(NISQ)的限制而量身定制。MNIST 和时尚 MNIST 图像数据集以及信用卡欺诈检测的仿真结果表明,在类似的训练条件下,QSVDD 的性能优于量子自动编码器和基于深度学习的方法。值得注意的是,QSVDD 只需要极少量的模型参数,这些参数随输入量子比特数量的增加而呈对数增长。这样就能通过简单的训练环境实现高效学习,从而为异常检测提供了一个性能强大的紧凑型量子机器学习模型。
{"title":"Quantum support vector data description for anomaly detection","authors":"Hyeondo Oh, Daniel K Park","doi":"10.1088/2632-2153/ad6be8","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6be8","url":null,"abstract":"Anomaly detection is a critical problem in data analysis and pattern recognition, finding applications in various domains. We introduce quantum support vector data description (QSVDD), an unsupervised learning algorithm designed for anomaly detection. QSVDD utilizes a shallow-depth quantum circuit to learn a minimum-volume hypersphere that tightly encloses normal data, tailored for the constraints of noisy intermediate-scale quantum (NISQ) computing. Simulation results on the MNIST and Fashion MNIST image datasets, as well as credit card fraud detection, demonstrate that QSVDD outperforms both quantum autoencoder and deep learning-based approaches under similar training conditions. Notably, QSVDD requires an extremely small number of model parameters, which increases logarithmically with the number of input qubits. This enables efficient learning with a simple training landscape, presenting a compact quantum machine learning model with strong performance for anomaly detection.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"65 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Normalizing flows as an enhanced sampling method for atomistic supercooled liquids 作为原子论过冷液体强化取样方法的归一化流动
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-21 DOI: 10.1088/2632-2153/ad6ca0
Gerhard Jung, Giulio Biroli, Ludovic Berthier
Normalizing flows can transform a simple prior probability distribution into a more complex target distribution. Here, we evaluate the ability and efficiency of generative machine learning methods to sample the Boltzmann distribution of an atomistic model for glass-forming liquids. This is a notoriously difficult task, as it amounts to ergodically exploring the complex free energy landscape of a disordered and frustrated many-body system. We optimize a normalizing flow model to successfully transform high-temperature configurations of a dense liquid into low-temperature ones, near the glass transition. We perform a detailed comparative analysis with established enhanced sampling techniques developed in the physics literature to assess and rank the performance of normalizing flows against state-of-the-art algorithms. We demonstrate that machine learning methods are very promising, showing a large speedup over conventional molecular dynamics. Normalizing flows show performances comparable to parallel tempering and population annealing, while still falling far behind the swap Monte Carlo algorithm. Our study highlights the potential of generative machine learning models in scientific computing for complex systems, but also points to some of its current limitations and the need for further improvement.
归一化流量可以将简单的先验概率分布转化为更复杂的目标分布。在这里,我们评估了生成式机器学习方法对玻璃形成液体的原子模型的玻尔兹曼分布进行采样的能力和效率。这是一项众所周知的艰巨任务,因为它相当于对无序和受挫多体系统的复杂自由能景观进行遍历式探索。我们优化了归一化流动模型,成功地将致密液体的高温构型转化为接近玻璃化转变的低温构型。我们与物理学文献中开发的成熟增强采样技术进行了详细的比较分析,以评估归一化流动的性能并与最先进的算法进行排名。我们证明,机器学习方法很有前途,与传统分子动力学相比,速度大幅提升。归一化流的性能可与并行回火和群体退火相媲美,但仍远远落后于交换蒙特卡洛算法。我们的研究凸显了生成式机器学习模型在复杂系统科学计算中的潜力,但也指出了其目前的一些局限性和进一步改进的必要性。
{"title":"Normalizing flows as an enhanced sampling method for atomistic supercooled liquids","authors":"Gerhard Jung, Giulio Biroli, Ludovic Berthier","doi":"10.1088/2632-2153/ad6ca0","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6ca0","url":null,"abstract":"Normalizing flows can transform a simple prior probability distribution into a more complex target distribution. Here, we evaluate the ability and efficiency of generative machine learning methods to sample the Boltzmann distribution of an atomistic model for glass-forming liquids. This is a notoriously difficult task, as it amounts to ergodically exploring the complex free energy landscape of a disordered and frustrated many-body system. We optimize a normalizing flow model to successfully transform high-temperature configurations of a dense liquid into low-temperature ones, near the glass transition. We perform a detailed comparative analysis with established enhanced sampling techniques developed in the physics literature to assess and rank the performance of normalizing flows against state-of-the-art algorithms. We demonstrate that machine learning methods are very promising, showing a large speedup over conventional molecular dynamics. Normalizing flows show performances comparable to parallel tempering and population annealing, while still falling far behind the swap Monte Carlo algorithm. Our study highlights the potential of generative machine learning models in scientific computing for complex systems, but also points to some of its current limitations and the need for further improvement.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"12 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coincidence anomaly detection for unsupervised locating of edge localized modes in the DIII-D tokamak dataset 在 DIII-D 托卡马克数据集中对边缘定位模式进行无监督定位的重合异常检测
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-20 DOI: 10.1088/2632-2153/ad6be7
Finn H O’Shea, Semin Joung, David R Smith, Daniel Ratner, Ryan Coffee
Using supervised learning to train a machine learning model to predict an on-coming edge localized mode (ELM) requires a large number of labeled samples. Creating an appropriate data set from the very large database of discharges at a long-running tokamak, such as DIII-D, would be a very time-consuming process for a human. Considering this need and difficulty, we use coincidence anomaly detection, an unsupervised learning technique, to train an ELM-identifier to identify and label ELMs in the DIII-D discharge database. This ELM-identifier shows, simultaneously, a precision of 0.68 and a recall of 0.63 (AUC is 0.73) on identifying ELMs in example time series pulled from thousands of discharges spanning five years. In a test set of 50 discharges, the algorithm finds over 26 thousand ELM candidates, more than 5 times the existing catalog of ELMs labeled by humans.
使用监督学习来训练机器学习模型,以预测即将发生的边缘局部模式(ELM),需要大量的标记样本。从 DIII-D 等长期运行的托卡马克放电的庞大数据库中创建一个适当的数据集,对人类来说是一个非常耗时的过程。考虑到这一需求和困难,我们使用了巧合异常检测(一种无监督学习技术)来训练 ELM 识别器,以识别和标记 DIII-D 放电数据库中的 ELM。该 ELM 识别器同时显示,在从跨越五年的数千个出院数据中提取的示例时间序列中识别 ELM 的精确度为 0.68,召回率为 0.63(AUC 为 0.73)。在一个包含 50 个出院数据的测试集中,该算法发现了超过 2.6 万个 ELM 候选,是现有人工标注 ELM 目录的 5 倍多。
{"title":"Coincidence anomaly detection for unsupervised locating of edge localized modes in the DIII-D tokamak dataset","authors":"Finn H O’Shea, Semin Joung, David R Smith, Daniel Ratner, Ryan Coffee","doi":"10.1088/2632-2153/ad6be7","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6be7","url":null,"abstract":"Using supervised learning to train a machine learning model to predict an on-coming edge localized mode (ELM) requires a large number of labeled samples. Creating an appropriate data set from the very large database of discharges at a long-running tokamak, such as DIII-D, would be a very time-consuming process for a human. Considering this need and difficulty, we use coincidence anomaly detection, an unsupervised learning technique, to train an ELM-identifier to identify and label ELMs in the DIII-D discharge database. This ELM-identifier shows, simultaneously, a precision of 0.68 and a recall of 0.63 (AUC is 0.73) on identifying ELMs in example time series pulled from thousands of discharges spanning five years. In a test set of 50 discharges, the algorithm finds over 26 thousand ELM candidates, more than 5 times the existing catalog of ELMs labeled by humans.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"27 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spectral-bias and kernel-task alignment in physically informed neural networks 物理信息神经网络中的频谱偏置和内核任务对齐
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-20 DOI: 10.1088/2632-2153/ad652d
Inbar Seroussi, Asaf Miron, Zohar Ringel
Physically informed neural networks (PINNs) are a promising emerging method for solving differential equations. As in many other deep learning approaches, the choice of PINN design and training protocol requires careful craftsmanship. Here, we suggest a comprehensive theoretical framework that sheds light on this important problem. Leveraging an equivalence between infinitely over-parameterized neural networks and Gaussian process regression, we derive an integro-differential equation that governs PINN prediction in the large data-set limit—the neurally-informed equation. This equation augments the original one by a kernel term reflecting architecture choices. It allows quantifying implicit bias induced by the network via a spectral decomposition of the source term in the original differential equation.
物理信息神经网络(PINN)是解决微分方程的一种前景广阔的新兴方法。与许多其他深度学习方法一样,PINN 设计和训练协议的选择也需要精雕细琢。在此,我们提出了一个全面的理论框架,以揭示这一重要问题。利用无限过参数化神经网络和高斯过程回归之间的等价关系,我们推导出了一个在大数据集限制下支配 PINN 预测的积分微分方程--神经信息方程。该方程通过一个反映架构选择的内核项来增强原始方程。通过对原始微分方程中的源项进行频谱分解,它可以量化网络引起的隐含偏差。
{"title":"Spectral-bias and kernel-task alignment in physically informed neural networks","authors":"Inbar Seroussi, Asaf Miron, Zohar Ringel","doi":"10.1088/2632-2153/ad652d","DOIUrl":"https://doi.org/10.1088/2632-2153/ad652d","url":null,"abstract":"Physically informed neural networks (PINNs) are a promising emerging method for solving differential equations. As in many other deep learning approaches, the choice of PINN design and training protocol requires careful craftsmanship. Here, we suggest a comprehensive theoretical framework that sheds light on this important problem. Leveraging an equivalence between infinitely over-parameterized neural networks and Gaussian process regression, we derive an integro-differential equation that governs PINN prediction in the large data-set limit—the neurally-informed equation. This equation augments the original one by a kernel term reflecting architecture choices. It allows quantifying implicit bias induced by the network via a spectral decomposition of the source term in the original differential equation.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"398 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering symbolic laws directly from trajectories with hamiltonian graph neural networks 利用哈密顿图神经网络直接从轨迹中发现符号定律
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-20 DOI: 10.1088/2632-2153/ad6be6
Suresh Bishnoi, Ravinder Bhattoo, Jayadeva3jayadeva@ee.iitd.ac.in, Sayan Ranu, N M Anoop Krishnan
The time evolution of physical systems is described by differential equations, which depend on abstract quantities like energy and force. Traditionally, these quantities are derived as functionals based on observables such as positions and velocities. Discovering these governing symbolic laws is the key to comprehending the interactions in nature. Here, we present a Hamiltonian graph neural network (Hgnn), a physics-enforced Gnn that learns the dynamics of systems directly from their trajectory. We demonstrate the performance of Hgnn on nsprings, npendulums, gravitational systems, and binary Lennard Jones systems; Hgnn learns the dynamics in excellent agreement with the ground truth from small amounts of data. We also evaluate the ability of Hgnn to generalize to larger system sizes, and to a hybrid spring-pendulum system that is a combination of two original systems (spring and pendulum) on which the models are trained independently. Finally, employing symbolic regression on the learned Hgnn, we infer the underlying equations relating to the energy functionals, even for complex systems such as the binary Lennard-Jones liquid. Our framework facilitates the interpretable discovery of interaction laws directly from physical system trajectories. Furthermore, this approach can be extended to other systems with topology-dependent dynamics, such as cells, polydisperse gels, or deformable bodies.
物理系统的时间演化由微分方程描述,微分方程取决于能量和力等抽象量。传统上,这些量是根据位置和速度等观测值作为函数推导出来的。发现这些支配符号定律是理解自然界中相互作用的关键。在这里,我们提出了哈密顿图神经网络(Hgnn),这是一种物理强化 Gnn,可直接从系统轨迹学习其动力学。我们展示了 Hgnn 在 n-弹簧、n-钟摆、引力系统和二元伦纳德-琼斯系统上的表现;Hgnn 从少量数据中学习到的动力学与基本事实非常吻合。我们还评估了 Hgnn 对更大系统规模的泛化能力,以及对混合弹簧摆系统的泛化能力,混合弹簧摆系统是两个原始系统(弹簧和摆)的组合,而模型是在这两个原始系统上独立训练的。最后,通过对学习到的 Hgnn 进行符号回归,我们推断出了与能量函数相关的基本方程,即使对于二元伦纳德-琼斯液体等复杂系统也是如此。我们的框架有助于直接从物理系统轨迹中发现可解释的相互作用规律。此外,这种方法还可扩展到其他具有拓扑依赖性动力学的系统,如细胞、多分散凝胶或可变形体。
{"title":"Discovering symbolic laws directly from trajectories with hamiltonian graph neural networks","authors":"Suresh Bishnoi, Ravinder Bhattoo, Jayadeva3jayadeva@ee.iitd.ac.in, Sayan Ranu, N M Anoop Krishnan","doi":"10.1088/2632-2153/ad6be6","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6be6","url":null,"abstract":"The time evolution of physical systems is described by differential equations, which depend on abstract quantities like energy and force. Traditionally, these quantities are derived as functionals based on observables such as positions and velocities. Discovering these governing symbolic laws is the key to comprehending the interactions in nature. Here, we present a Hamiltonian graph neural network (<sc>Hgnn</sc>), a physics-enforced <sc>Gnn</sc> that learns the dynamics of systems directly from their trajectory. We demonstrate the performance of <sc>Hgnn</sc> on <inline-formula>\u0000<tex-math><?CDATA $n-$?></tex-math><mml:math overflow=\"scroll\"><mml:mrow><mml:mi>n</mml:mi><mml:mo>−</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href=\"mlstad6be6ieqn1.gif\"></inline-graphic></inline-formula>springs, <inline-formula>\u0000<tex-math><?CDATA $n-$?></tex-math><mml:math overflow=\"scroll\"><mml:mrow><mml:mi>n</mml:mi><mml:mo>−</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href=\"mlstad6be6ieqn2.gif\"></inline-graphic></inline-formula>pendulums, gravitational systems, and binary Lennard Jones systems; <sc>Hgnn</sc> learns the dynamics in excellent agreement with the ground truth from small amounts of data. We also evaluate the ability of <sc>Hgnn</sc> to generalize to larger system sizes, and to a hybrid spring-pendulum system that is a combination of two original systems (spring and pendulum) on which the models are trained independently. Finally, employing symbolic regression on the learned <sc>Hgnn</sc>, we infer the underlying equations relating to the energy functionals, even for complex systems such as the binary Lennard-Jones liquid. Our framework facilitates the interpretable discovery of interaction laws directly from physical system trajectories. Furthermore, this approach can be extended to other systems with topology-dependent dynamics, such as cells, polydisperse gels, or deformable bodies.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"210 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine Learning Science and Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1