首页 > 最新文献

Machine Learning Science and Technology最新文献

英文 中文
Data-driven sparse modeling of oscillations in plasma space propulsion 等离子体空间推进器振荡的数据驱动稀疏建模
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1088/2632-2153/ad6d29
Borja Bayón-Buján, Mario Merino
An algorithm to obtain data-driven models of oscillatory phenomena in plasma space propulsion systems is presented, based on sparse regression (SINDy) and Pareto front analysis. The algorithm can incorporate physical constraints, use data bootstrapping for additional robustness, and fine-tuning to different metrics. Standard, weak and integral SINDy formulations are discussed and compared. The scheme is benchmarked for the case of breathing-mode oscillations in Hall effect thrusters, using particle-in-cell/fluid simulation data. Models of varying complexity are obtained for the average plasma properties, and shown to have a clear physical interpretability and agreement with existing 0D models in the literature. Lastly, the algorithm applied is also shown to enable the identification of physical subdomains with qualitatively different plasma dynamics, providing valuable information for more advanced modeling approaches.
基于稀疏回归(SINDy)和帕累托前沿分析,介绍了一种获得等离子空间推进系统振荡现象数据驱动模型的算法。该算法可以结合物理约束条件,使用数据引导以获得额外的鲁棒性,并根据不同的指标进行微调。对标准、弱和积分 SINDy 公式进行了讨论和比较。针对霍尔效应推进器中的呼吸模式振荡情况,利用舱内粒子/流体模拟数据对该方案进行了基准测试。针对等离子体的平均特性获得了不同复杂度的模型,结果表明这些模型与文献中现有的 0D 模型具有明确的物理可解释性和一致性。最后,所应用的算法还证明能够识别具有质的不同等离子体动力学的物理子域,为更先进的建模方法提供有价值的信息。
{"title":"Data-driven sparse modeling of oscillations in plasma space propulsion","authors":"Borja Bayón-Buján, Mario Merino","doi":"10.1088/2632-2153/ad6d29","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6d29","url":null,"abstract":"An algorithm to obtain data-driven models of oscillatory phenomena in plasma space propulsion systems is presented, based on sparse regression (SINDy) and Pareto front analysis. The algorithm can incorporate physical constraints, use data bootstrapping for additional robustness, and fine-tuning to different metrics. Standard, weak and integral SINDy formulations are discussed and compared. The scheme is benchmarked for the case of breathing-mode oscillations in Hall effect thrusters, using particle-in-cell/fluid simulation data. Models of varying complexity are obtained for the average plasma properties, and shown to have a clear physical interpretability and agreement with existing 0D models in the literature. Lastly, the algorithm applied is also shown to enable the identification of physical subdomains with qualitatively different plasma dynamics, providing valuable information for more advanced modeling approaches.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"11 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142225270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Active causal learning for decoding chemical complexities with targeted interventions 通过主动因果学习解码复杂化学物质,进行有针对性的干预
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-23 DOI: 10.1088/2632-2153/ad6feb
Zachary R Fox, Ayana Ghosh
Predicting and enhancing inherent properties based on molecular structures is paramount to design tasks in medicine, materials science, and environmental management. Most of the current machine learning and deep learning approaches have become standard for predictions, but they face challenges when applied across different datasets due to reliance on correlations between molecular representation and target properties. These approaches typically depend on large datasets to capture the diversity within the chemical space, facilitating a more accurate approximation, interpolation, or extrapolation of the chemical behavior of molecules. In our research, we introduce an active learning approach that discerns underlying cause-effect relationships through strategic sampling with the use of a graph loss function. This method identifies the smallest subset of the dataset capable of encoding the most information representative of a much larger chemical space. The identified causal relations are then leveraged to conduct systematic interventions, optimizing the design task within a chemical space that the models have not encountered previously. While our implementation focused on the QM9 quantum-chemical dataset for a specific design task—finding molecules with a large dipole moment—our active causal learning approach, driven by intelligent sampling and interventions, holds potential for broader applications in molecular, materials design and discovery.
根据分子结构预测和增强固有特性对于医学、材料科学和环境管理领域的设计任务至关重要。目前大多数机器学习和深度学习方法已成为预测的标准,但由于依赖分子表征和目标特性之间的相关性,它们在应用于不同数据集时面临挑战。这些方法通常依赖于大型数据集来捕捉化学空间内的多样性,从而有助于更准确地近似、内插或外推分子的化学行为。在我们的研究中,我们引入了一种主动学习方法,通过使用图损失函数进行策略性采样来辨别潜在的因果关系。这种方法能识别出数据集的最小子集,该子集能够编码代表更大化学空间的最多信息。然后,利用确定的因果关系进行系统干预,在模型以前未曾接触过的化学空间内优化设计任务。虽然我们的实施侧重于 QM9 量子化学数据集的特定设计任务--寻找具有大偶极矩的分子--但我们的主动因果学习方法在智能采样和干预的驱动下,有望在分子、材料设计和发现领域得到更广泛的应用。
{"title":"Active causal learning for decoding chemical complexities with targeted interventions","authors":"Zachary R Fox, Ayana Ghosh","doi":"10.1088/2632-2153/ad6feb","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6feb","url":null,"abstract":"Predicting and enhancing inherent properties based on molecular structures is paramount to design tasks in medicine, materials science, and environmental management. Most of the current machine learning and deep learning approaches have become standard for predictions, but they face challenges when applied across different datasets due to reliance on correlations between molecular representation and target properties. These approaches typically depend on large datasets to capture the diversity within the chemical space, facilitating a more accurate approximation, interpolation, or extrapolation of the chemical behavior of molecules. In our research, we introduce an active learning approach that discerns underlying cause-effect relationships through strategic sampling with the use of a graph loss function. This method identifies the smallest subset of the dataset capable of encoding the most information representative of a much larger chemical space. The identified causal relations are then leveraged to conduct systematic interventions, optimizing the design task within a chemical space that the models have not encountered previously. While our implementation focused on the QM9 quantum-chemical dataset for a specific design task—finding molecules with a large dipole moment—our active causal learning approach, driven by intelligent sampling and interventions, holds potential for broader applications in molecular, materials design and discovery.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"68 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emergence of chemotactic strategies with multi-agent reinforcement learning 多代理强化学习催化策略的出现
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-21 DOI: 10.1088/2632-2153/ad5f73
Samuel Tovey, Christoph Lohrmann, Christian Holm
Reinforcement learning (RL) is a flexible and efficient method for programming micro-robots in complex environments. Here we investigate whether RL can provide insights into biological systems when trained to perform chemotaxis. Namely, whether we can learn about how intelligent agents process given information in order to swim towards a target. We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners’ training fails. We find that the RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the stochastic environment. We study the efficiency of the emergent policy and identify convergence in agent size and swim speeds. Finally, we study the strategy adopted by the RL algorithm to explain how the agents perform their tasks. To this end, we identify three emerging dominant strategies and several rare approaches taken. These strategies, whilst producing almost identical trajectories in simulation, are distinct and give insight into the possible mechanisms behind which biological agents explore their environment and respond to changing conditions.
强化学习(RL)是在复杂环境中对微型机器人进行编程的一种灵活高效的方法。在此,我们将研究当强化学习被训练用于执行趋化时,它是否能为生物系统提供洞察力。也就是说,我们能否了解智能代理如何处理给定信息以游向目标。我们运行了涵盖一系列代理形状、大小和游速的模拟,以确定生物游泳者的物理限制(即布朗运动)是否会导致强化学习器的训练失败。我们发现,只要物理条件允许,RL 代理就能执行趋化,在某些情况下,甚至在主动游动压倒随机环境之前就能执行趋化。我们研究了新兴策略的效率,并确定了代理规模和游动速度的收敛性。最后,我们研究了 RL 算法采用的策略,以解释代理如何执行任务。为此,我们确定了三种新出现的主导策略和几种罕见的方法。这些策略虽然在模拟中产生了几乎相同的轨迹,但却各具特色,让我们深入了解了生物制剂探索环境和应对不断变化的条件的可能机制。
{"title":"Emergence of chemotactic strategies with multi-agent reinforcement learning","authors":"Samuel Tovey, Christoph Lohrmann, Christian Holm","doi":"10.1088/2632-2153/ad5f73","DOIUrl":"https://doi.org/10.1088/2632-2153/ad5f73","url":null,"abstract":"Reinforcement learning (RL) is a flexible and efficient method for programming micro-robots in complex environments. Here we investigate whether RL can provide insights into biological systems when trained to perform chemotaxis. Namely, whether we can learn about how intelligent agents process given information in order to swim towards a target. We run simulations covering a range of agent shapes, sizes, and swim speeds to determine if the physical constraints on biological swimmers, namely Brownian motion, lead to regions where reinforcement learners’ training fails. We find that the RL agents can perform chemotaxis as soon as it is physically possible and, in some cases, even before the active swimming overpowers the stochastic environment. We study the efficiency of the emergent policy and identify convergence in agent size and swim speeds. Finally, we study the strategy adopted by the RL algorithm to explain how the agents perform their tasks. To this end, we identify three emerging dominant strategies and several rare approaches taken. These strategies, whilst producing almost identical trajectories in simulation, are distinct and give insight into the possible mechanisms behind which biological agents explore their environment and respond to changing conditions.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"97 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantum support vector data description for anomaly detection 用于异常检测的量子支持向量数据描述
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-21 DOI: 10.1088/2632-2153/ad6be8
Hyeondo Oh, Daniel K Park
Anomaly detection is a critical problem in data analysis and pattern recognition, finding applications in various domains. We introduce quantum support vector data description (QSVDD), an unsupervised learning algorithm designed for anomaly detection. QSVDD utilizes a shallow-depth quantum circuit to learn a minimum-volume hypersphere that tightly encloses normal data, tailored for the constraints of noisy intermediate-scale quantum (NISQ) computing. Simulation results on the MNIST and Fashion MNIST image datasets, as well as credit card fraud detection, demonstrate that QSVDD outperforms both quantum autoencoder and deep learning-based approaches under similar training conditions. Notably, QSVDD requires an extremely small number of model parameters, which increases logarithmically with the number of input qubits. This enables efficient learning with a simple training landscape, presenting a compact quantum machine learning model with strong performance for anomaly detection.
异常检测是数据分析和模式识别中的一个关键问题,在各个领域都有应用。我们介绍了量子支持向量数据描述(QSVDD),这是一种专为异常检测设计的无监督学习算法。QSVDD 利用浅深度量子电路来学习一个最小体积的超球,该超球紧紧包裹着正常数据,专为噪声中等规模量子计算(NISQ)的限制而量身定制。MNIST 和时尚 MNIST 图像数据集以及信用卡欺诈检测的仿真结果表明,在类似的训练条件下,QSVDD 的性能优于量子自动编码器和基于深度学习的方法。值得注意的是,QSVDD 只需要极少量的模型参数,这些参数随输入量子比特数量的增加而呈对数增长。这样就能通过简单的训练环境实现高效学习,从而为异常检测提供了一个性能强大的紧凑型量子机器学习模型。
{"title":"Quantum support vector data description for anomaly detection","authors":"Hyeondo Oh, Daniel K Park","doi":"10.1088/2632-2153/ad6be8","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6be8","url":null,"abstract":"Anomaly detection is a critical problem in data analysis and pattern recognition, finding applications in various domains. We introduce quantum support vector data description (QSVDD), an unsupervised learning algorithm designed for anomaly detection. QSVDD utilizes a shallow-depth quantum circuit to learn a minimum-volume hypersphere that tightly encloses normal data, tailored for the constraints of noisy intermediate-scale quantum (NISQ) computing. Simulation results on the MNIST and Fashion MNIST image datasets, as well as credit card fraud detection, demonstrate that QSVDD outperforms both quantum autoencoder and deep learning-based approaches under similar training conditions. Notably, QSVDD requires an extremely small number of model parameters, which increases logarithmically with the number of input qubits. This enables efficient learning with a simple training landscape, presenting a compact quantum machine learning model with strong performance for anomaly detection.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"65 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Normalizing flows as an enhanced sampling method for atomistic supercooled liquids 作为原子论过冷液体强化取样方法的归一化流动
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-21 DOI: 10.1088/2632-2153/ad6ca0
Gerhard Jung, Giulio Biroli, Ludovic Berthier
Normalizing flows can transform a simple prior probability distribution into a more complex target distribution. Here, we evaluate the ability and efficiency of generative machine learning methods to sample the Boltzmann distribution of an atomistic model for glass-forming liquids. This is a notoriously difficult task, as it amounts to ergodically exploring the complex free energy landscape of a disordered and frustrated many-body system. We optimize a normalizing flow model to successfully transform high-temperature configurations of a dense liquid into low-temperature ones, near the glass transition. We perform a detailed comparative analysis with established enhanced sampling techniques developed in the physics literature to assess and rank the performance of normalizing flows against state-of-the-art algorithms. We demonstrate that machine learning methods are very promising, showing a large speedup over conventional molecular dynamics. Normalizing flows show performances comparable to parallel tempering and population annealing, while still falling far behind the swap Monte Carlo algorithm. Our study highlights the potential of generative machine learning models in scientific computing for complex systems, but also points to some of its current limitations and the need for further improvement.
归一化流量可以将简单的先验概率分布转化为更复杂的目标分布。在这里,我们评估了生成式机器学习方法对玻璃形成液体的原子模型的玻尔兹曼分布进行采样的能力和效率。这是一项众所周知的艰巨任务,因为它相当于对无序和受挫多体系统的复杂自由能景观进行遍历式探索。我们优化了归一化流动模型,成功地将致密液体的高温构型转化为接近玻璃化转变的低温构型。我们与物理学文献中开发的成熟增强采样技术进行了详细的比较分析,以评估归一化流动的性能并与最先进的算法进行排名。我们证明,机器学习方法很有前途,与传统分子动力学相比,速度大幅提升。归一化流的性能可与并行回火和群体退火相媲美,但仍远远落后于交换蒙特卡洛算法。我们的研究凸显了生成式机器学习模型在复杂系统科学计算中的潜力,但也指出了其目前的一些局限性和进一步改进的必要性。
{"title":"Normalizing flows as an enhanced sampling method for atomistic supercooled liquids","authors":"Gerhard Jung, Giulio Biroli, Ludovic Berthier","doi":"10.1088/2632-2153/ad6ca0","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6ca0","url":null,"abstract":"Normalizing flows can transform a simple prior probability distribution into a more complex target distribution. Here, we evaluate the ability and efficiency of generative machine learning methods to sample the Boltzmann distribution of an atomistic model for glass-forming liquids. This is a notoriously difficult task, as it amounts to ergodically exploring the complex free energy landscape of a disordered and frustrated many-body system. We optimize a normalizing flow model to successfully transform high-temperature configurations of a dense liquid into low-temperature ones, near the glass transition. We perform a detailed comparative analysis with established enhanced sampling techniques developed in the physics literature to assess and rank the performance of normalizing flows against state-of-the-art algorithms. We demonstrate that machine learning methods are very promising, showing a large speedup over conventional molecular dynamics. Normalizing flows show performances comparable to parallel tempering and population annealing, while still falling far behind the swap Monte Carlo algorithm. Our study highlights the potential of generative machine learning models in scientific computing for complex systems, but also points to some of its current limitations and the need for further improvement.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"12 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coincidence anomaly detection for unsupervised locating of edge localized modes in the DIII-D tokamak dataset 在 DIII-D 托卡马克数据集中对边缘定位模式进行无监督定位的重合异常检测
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-20 DOI: 10.1088/2632-2153/ad6be7
Finn H O’Shea, Semin Joung, David R Smith, Daniel Ratner, Ryan Coffee
Using supervised learning to train a machine learning model to predict an on-coming edge localized mode (ELM) requires a large number of labeled samples. Creating an appropriate data set from the very large database of discharges at a long-running tokamak, such as DIII-D, would be a very time-consuming process for a human. Considering this need and difficulty, we use coincidence anomaly detection, an unsupervised learning technique, to train an ELM-identifier to identify and label ELMs in the DIII-D discharge database. This ELM-identifier shows, simultaneously, a precision of 0.68 and a recall of 0.63 (AUC is 0.73) on identifying ELMs in example time series pulled from thousands of discharges spanning five years. In a test set of 50 discharges, the algorithm finds over 26 thousand ELM candidates, more than 5 times the existing catalog of ELMs labeled by humans.
使用监督学习来训练机器学习模型,以预测即将发生的边缘局部模式(ELM),需要大量的标记样本。从 DIII-D 等长期运行的托卡马克放电的庞大数据库中创建一个适当的数据集,对人类来说是一个非常耗时的过程。考虑到这一需求和困难,我们使用了巧合异常检测(一种无监督学习技术)来训练 ELM 识别器,以识别和标记 DIII-D 放电数据库中的 ELM。该 ELM 识别器同时显示,在从跨越五年的数千个出院数据中提取的示例时间序列中识别 ELM 的精确度为 0.68,召回率为 0.63(AUC 为 0.73)。在一个包含 50 个出院数据的测试集中,该算法发现了超过 2.6 万个 ELM 候选,是现有人工标注 ELM 目录的 5 倍多。
{"title":"Coincidence anomaly detection for unsupervised locating of edge localized modes in the DIII-D tokamak dataset","authors":"Finn H O’Shea, Semin Joung, David R Smith, Daniel Ratner, Ryan Coffee","doi":"10.1088/2632-2153/ad6be7","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6be7","url":null,"abstract":"Using supervised learning to train a machine learning model to predict an on-coming edge localized mode (ELM) requires a large number of labeled samples. Creating an appropriate data set from the very large database of discharges at a long-running tokamak, such as DIII-D, would be a very time-consuming process for a human. Considering this need and difficulty, we use coincidence anomaly detection, an unsupervised learning technique, to train an ELM-identifier to identify and label ELMs in the DIII-D discharge database. This ELM-identifier shows, simultaneously, a precision of 0.68 and a recall of 0.63 (AUC is 0.73) on identifying ELMs in example time series pulled from thousands of discharges spanning five years. In a test set of 50 discharges, the algorithm finds over 26 thousand ELM candidates, more than 5 times the existing catalog of ELMs labeled by humans.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"27 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spectral-bias and kernel-task alignment in physically informed neural networks 物理信息神经网络中的频谱偏置和内核任务对齐
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-20 DOI: 10.1088/2632-2153/ad652d
Inbar Seroussi, Asaf Miron, Zohar Ringel
Physically informed neural networks (PINNs) are a promising emerging method for solving differential equations. As in many other deep learning approaches, the choice of PINN design and training protocol requires careful craftsmanship. Here, we suggest a comprehensive theoretical framework that sheds light on this important problem. Leveraging an equivalence between infinitely over-parameterized neural networks and Gaussian process regression, we derive an integro-differential equation that governs PINN prediction in the large data-set limit—the neurally-informed equation. This equation augments the original one by a kernel term reflecting architecture choices. It allows quantifying implicit bias induced by the network via a spectral decomposition of the source term in the original differential equation.
物理信息神经网络(PINN)是解决微分方程的一种前景广阔的新兴方法。与许多其他深度学习方法一样,PINN 设计和训练协议的选择也需要精雕细琢。在此,我们提出了一个全面的理论框架,以揭示这一重要问题。利用无限过参数化神经网络和高斯过程回归之间的等价关系,我们推导出了一个在大数据集限制下支配 PINN 预测的积分微分方程--神经信息方程。该方程通过一个反映架构选择的内核项来增强原始方程。通过对原始微分方程中的源项进行频谱分解,它可以量化网络引起的隐含偏差。
{"title":"Spectral-bias and kernel-task alignment in physically informed neural networks","authors":"Inbar Seroussi, Asaf Miron, Zohar Ringel","doi":"10.1088/2632-2153/ad652d","DOIUrl":"https://doi.org/10.1088/2632-2153/ad652d","url":null,"abstract":"Physically informed neural networks (PINNs) are a promising emerging method for solving differential equations. As in many other deep learning approaches, the choice of PINN design and training protocol requires careful craftsmanship. Here, we suggest a comprehensive theoretical framework that sheds light on this important problem. Leveraging an equivalence between infinitely over-parameterized neural networks and Gaussian process regression, we derive an integro-differential equation that governs PINN prediction in the large data-set limit—the neurally-informed equation. This equation augments the original one by a kernel term reflecting architecture choices. It allows quantifying implicit bias induced by the network via a spectral decomposition of the source term in the original differential equation.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"398 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering symbolic laws directly from trajectories with hamiltonian graph neural networks 利用哈密顿图神经网络直接从轨迹中发现符号定律
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-20 DOI: 10.1088/2632-2153/ad6be6
Suresh Bishnoi, Ravinder Bhattoo, Jayadeva3jayadeva@ee.iitd.ac.in, Sayan Ranu, N M Anoop Krishnan
The time evolution of physical systems is described by differential equations, which depend on abstract quantities like energy and force. Traditionally, these quantities are derived as functionals based on observables such as positions and velocities. Discovering these governing symbolic laws is the key to comprehending the interactions in nature. Here, we present a Hamiltonian graph neural network (Hgnn), a physics-enforced Gnn that learns the dynamics of systems directly from their trajectory. We demonstrate the performance of Hgnn on nsprings, npendulums, gravitational systems, and binary Lennard Jones systems; Hgnn learns the dynamics in excellent agreement with the ground truth from small amounts of data. We also evaluate the ability of Hgnn to generalize to larger system sizes, and to a hybrid spring-pendulum system that is a combination of two original systems (spring and pendulum) on which the models are trained independently. Finally, employing symbolic regression on the learned Hgnn, we infer the underlying equations relating to the energy functionals, even for complex systems such as the binary Lennard-Jones liquid. Our framework facilitates the interpretable discovery of interaction laws directly from physical system trajectories. Furthermore, this approach can be extended to other systems with topology-dependent dynamics, such as cells, polydisperse gels, or deformable bodies.
物理系统的时间演化由微分方程描述,微分方程取决于能量和力等抽象量。传统上,这些量是根据位置和速度等观测值作为函数推导出来的。发现这些支配符号定律是理解自然界中相互作用的关键。在这里,我们提出了哈密顿图神经网络(Hgnn),这是一种物理强化 Gnn,可直接从系统轨迹学习其动力学。我们展示了 Hgnn 在 n-弹簧、n-钟摆、引力系统和二元伦纳德-琼斯系统上的表现;Hgnn 从少量数据中学习到的动力学与基本事实非常吻合。我们还评估了 Hgnn 对更大系统规模的泛化能力,以及对混合弹簧摆系统的泛化能力,混合弹簧摆系统是两个原始系统(弹簧和摆)的组合,而模型是在这两个原始系统上独立训练的。最后,通过对学习到的 Hgnn 进行符号回归,我们推断出了与能量函数相关的基本方程,即使对于二元伦纳德-琼斯液体等复杂系统也是如此。我们的框架有助于直接从物理系统轨迹中发现可解释的相互作用规律。此外,这种方法还可扩展到其他具有拓扑依赖性动力学的系统,如细胞、多分散凝胶或可变形体。
{"title":"Discovering symbolic laws directly from trajectories with hamiltonian graph neural networks","authors":"Suresh Bishnoi, Ravinder Bhattoo, Jayadeva3jayadeva@ee.iitd.ac.in, Sayan Ranu, N M Anoop Krishnan","doi":"10.1088/2632-2153/ad6be6","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6be6","url":null,"abstract":"The time evolution of physical systems is described by differential equations, which depend on abstract quantities like energy and force. Traditionally, these quantities are derived as functionals based on observables such as positions and velocities. Discovering these governing symbolic laws is the key to comprehending the interactions in nature. Here, we present a Hamiltonian graph neural network (<sc>Hgnn</sc>), a physics-enforced <sc>Gnn</sc> that learns the dynamics of systems directly from their trajectory. We demonstrate the performance of <sc>Hgnn</sc> on <inline-formula>\u0000<tex-math><?CDATA $n-$?></tex-math><mml:math overflow=\"scroll\"><mml:mrow><mml:mi>n</mml:mi><mml:mo>−</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href=\"mlstad6be6ieqn1.gif\"></inline-graphic></inline-formula>springs, <inline-formula>\u0000<tex-math><?CDATA $n-$?></tex-math><mml:math overflow=\"scroll\"><mml:mrow><mml:mi>n</mml:mi><mml:mo>−</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href=\"mlstad6be6ieqn2.gif\"></inline-graphic></inline-formula>pendulums, gravitational systems, and binary Lennard Jones systems; <sc>Hgnn</sc> learns the dynamics in excellent agreement with the ground truth from small amounts of data. We also evaluate the ability of <sc>Hgnn</sc> to generalize to larger system sizes, and to a hybrid spring-pendulum system that is a combination of two original systems (spring and pendulum) on which the models are trained independently. Finally, employing symbolic regression on the learned <sc>Hgnn</sc>, we infer the underlying equations relating to the energy functionals, even for complex systems such as the binary Lennard-Jones liquid. Our framework facilitates the interpretable discovery of interaction laws directly from physical system trajectories. Furthermore, this approach can be extended to other systems with topology-dependent dynamics, such as cells, polydisperse gels, or deformable bodies.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"210 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automating the discovery of partial differential equations in dynamical systems 自动发现动力系统中的偏微分方程
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-14 DOI: 10.1088/2632-2153/ad682f
Weizhen Li, Rui Carvalho
Identifying partial differential equations (PDEs) from data is crucial for understanding the governing mechanisms of natural phenomena, yet it remains a challenging task. We present an extension to the ARGOS framework, ARGOS-RAL, which leverages sparse regression with the recurrent adaptive lasso to identify PDEs from limited prior knowledge automatically. Our method automates calculating partial derivatives, constructing a candidate library, and estimating a sparse model. We rigorously evaluate the performance of ARGOS-RAL in identifying canonical PDEs under various noise levels and sample sizes, demonstrating its robustness in handling noisy and non-uniformly distributed data. We also test the algorithm’s performance on datasets consisting solely of random noise to simulate scenarios with severely compromised data quality. Our results show that ARGOS-RAL effectively and reliably identifies the underlying PDEs from data, outperforming the sequential threshold ridge regression method in most cases. We highlight the potential of combining statistical methods, machine learning, and dynamical systems theory to automatically discover governing equations from collected data, streamlining the scientific modeling process.
从数据中识别偏微分方程(PDE)对于理解自然现象的支配机制至关重要,但这仍然是一项具有挑战性的任务。我们介绍了 ARGOS 框架的扩展 ARGOS-RAL,它利用稀疏回归和递归自适应套索从有限的先验知识中自动识别偏微分方程。我们的方法可以自动计算偏导数、构建候选库和估计稀疏模型。我们严格评估了 ARGOS-RAL 在各种噪声水平和样本大小下识别规范 PDE 的性能,证明了它在处理噪声和非均匀分布数据时的鲁棒性。我们还测试了该算法在完全由随机噪声组成的数据集上的性能,以模拟数据质量严重受损的情况。我们的结果表明,ARGOS-RAL 能有效、可靠地从数据中识别出底层 PDE,在大多数情况下都优于顺序阈值脊回归方法。我们强调了将统计方法、机器学习和动力系统理论相结合,从收集的数据中自动发现治理方程,简化科学建模过程的潜力。
{"title":"Automating the discovery of partial differential equations in dynamical systems","authors":"Weizhen Li, Rui Carvalho","doi":"10.1088/2632-2153/ad682f","DOIUrl":"https://doi.org/10.1088/2632-2153/ad682f","url":null,"abstract":"Identifying partial differential equations (PDEs) from data is crucial for understanding the governing mechanisms of natural phenomena, yet it remains a challenging task. We present an extension to the ARGOS framework, ARGOS-RAL, which leverages sparse regression with the recurrent adaptive lasso to identify PDEs from limited prior knowledge automatically. Our method automates calculating partial derivatives, constructing a candidate library, and estimating a sparse model. We rigorously evaluate the performance of ARGOS-RAL in identifying canonical PDEs under various noise levels and sample sizes, demonstrating its robustness in handling noisy and non-uniformly distributed data. We also test the algorithm’s performance on datasets consisting solely of random noise to simulate scenarios with severely compromised data quality. Our results show that ARGOS-RAL effectively and reliably identifies the underlying PDEs from data, outperforming the sequential threshold ridge regression method in most cases. We highlight the potential of combining statistical methods, machine learning, and dynamical systems theory to automatically discover governing equations from collected data, streamlining the scientific modeling process.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"14 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smart pixel sensors: towards on-sensor filtering of pixel clusters with deep learning 智能像素传感器:利用深度学习对像素群进行传感器上过滤
IF 6.8 2区 物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-14 DOI: 10.1088/2632-2153/ad6a00
Jieun Yoo, Jennet Dickinson, Morris Swartz, Giuseppe Di Guglielmo, Alice Bean, Douglas Berry, Manuel Blanco Valentin, Karri DiPetrillo, Farah Fahim, Lindsey Gray, James Hirschauer, Shruti R Kulkarni, Ron Lipton, Petar Maksimovic, Corrinne Mills, Mark S Neubauer, Benjamin Parpillon, Gauri Pradhan, Chinar Syal, Nhan Tran, Dahai Wen, Aaron Young
Highly granular pixel detectors allow for increasingly precise measurements of charged particle tracks. Next-generation detectors require that pixel sizes will be further reduced, leading to unprecedented data rates exceeding those foreseen at the High- Luminosity Large Hadron Collider. Signal processing that handles data incoming at a rate of O(40 MHz) and intelligently reduces the data within the pixelated region of the detector at rate will enhance physics performance at high luminosity and enable physics analyses that are not currently possible. Using the shape of charge clusters deposited in an array of small pixels, the physical properties of the traversing particle can be extracted with locally customized neural networks. In this first demonstration, we present a neural network that can be embedded into the on-sensor readout and filter out hits from low momentum tracks, reducing the detector’s data volume by 57.1%–75.7%. The network is designed and simulated as a custom readout integrated circuit with 28 nm CMOS technology and is expected to operate at less than 300 μW with an area of less than 0.2 mm2. The temporal development of charge clusters is investigated to demonstrate possible future performance gains, and there is also a discussion of future algorithmic and technological improvements that could enhance efficiency, data reduction, and power per area.
高精细像素探测器可以对带电粒子轨道进行越来越精确的测量。下一代探测器要求进一步缩小像素尺寸,这将导致前所未有的数据传输速率,超过高亮度大型强子对撞机的预期数据传输速率。信号处理如果能以 O(40 MHz) 的速率处理输入的数据,并在探测器的像素化区域内智能地按速率缩小数据,将提高高亮度下的物理学性能,并实现目前无法实现的物理学分析。利用沉积在小像素阵列中的电荷团的形状,可以通过本地定制的神经网络提取穿越粒子的物理特性。在首次演示中,我们介绍了一种神经网络,它可以嵌入到传感器读出中,过滤掉低动量轨道的命中率,从而将探测器的数据量减少 57.1%-75.7%。该网络是以 28 纳米 CMOS 技术设计和模拟的定制读出集成电路,预计运行功耗小于 300 μW,面积小于 0.2 mm2。对电荷簇的时间发展进行了研究,以展示未来可能的性能提升,同时还讨论了未来可提高效率、减少数据和单位面积功耗的算法和技术改进。
{"title":"Smart pixel sensors: towards on-sensor filtering of pixel clusters with deep learning","authors":"Jieun Yoo, Jennet Dickinson, Morris Swartz, Giuseppe Di Guglielmo, Alice Bean, Douglas Berry, Manuel Blanco Valentin, Karri DiPetrillo, Farah Fahim, Lindsey Gray, James Hirschauer, Shruti R Kulkarni, Ron Lipton, Petar Maksimovic, Corrinne Mills, Mark S Neubauer, Benjamin Parpillon, Gauri Pradhan, Chinar Syal, Nhan Tran, Dahai Wen, Aaron Young","doi":"10.1088/2632-2153/ad6a00","DOIUrl":"https://doi.org/10.1088/2632-2153/ad6a00","url":null,"abstract":"Highly granular pixel detectors allow for increasingly precise measurements of charged particle tracks. Next-generation detectors require that pixel sizes will be further reduced, leading to unprecedented data rates exceeding those foreseen at the High- Luminosity Large Hadron Collider. Signal processing that handles data incoming at a rate of <inline-formula>\u0000<tex-math><?CDATA $mathcal{O}$?></tex-math><mml:math overflow=\"scroll\"><mml:mrow><mml:mrow><mml:mi>O</mml:mi></mml:mrow></mml:mrow></mml:math><inline-graphic xlink:href=\"mlstad6a00ieqn1.gif\"></inline-graphic></inline-formula>(40 MHz) and intelligently reduces the data within the pixelated region of the detector <italic toggle=\"yes\">at rate</italic> will enhance physics performance at high luminosity and enable physics analyses that are not currently possible. Using the shape of charge clusters deposited in an array of small pixels, the physical properties of the traversing particle can be extracted with locally customized neural networks. In this first demonstration, we present a neural network that can be embedded into the on-sensor readout and filter out hits from low momentum tracks, reducing the detector’s data volume by 57.1%–75.7%. The network is designed and simulated as a custom readout integrated circuit with 28 nm CMOS technology and is expected to operate at less than 300 <inline-formula>\u0000<tex-math><?CDATA $mu W$?></tex-math><mml:math overflow=\"scroll\"><mml:mrow><mml:mi>μ</mml:mi><mml:mi>W</mml:mi></mml:mrow></mml:math><inline-graphic xlink:href=\"mlstad6a00ieqn2.gif\"></inline-graphic></inline-formula> with an area of less than 0.2 mm<sup>2</sup>. The temporal development of charge clusters is investigated to demonstrate possible future performance gains, and there is also a discussion of future algorithmic and technological improvements that could enhance efficiency, data reduction, and power per area.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"1 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine Learning Science and Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1