Hierarchical Bayesian pharmacometrics analysis of Baclofen for alcohol use disorder
Pub Date: 2023-09-04 | DOI: 10.1088/2632-2153/acf6aa
Nina Baldy, Nicolas Simon, Viktor Jirsa, Meysam Hashemi
Alcohol use disorder (AUD), also called alcohol dependence, is a major public health problem, affecting almost 10% of the world’s population. Baclofen, a selective GABAB receptor agonist, has emerged as a promising drug for the treatment of AUD. However, the inter-trial, inter-individual and residual variability in drug concentration over time in a population of patients with AUD is unknown. In this study, we use a hierarchical Bayesian workflow to estimate the parameters of a pharmacokinetic (PK) population model of Baclofen administration in patients with AUD. By monitoring various convergence diagnostics, the probabilistic methodology is first validated on synthetic longitudinal datasets and then applied to infer the PK model parameters from clinical data retrospectively collected from outpatients treated with oral Baclofen. We show that state-of-the-art advances in automatic Bayesian inference using self-tuning Hamiltonian Monte Carlo (HMC) algorithms provide accurate and decisive predictions of Baclofen plasma concentration at both the individual and group levels. Importantly, leveraging the information in the prior provides faster computation, better convergence diagnostics, and substantially higher out-of-sample prediction accuracy. Moreover, the root mean squared error, as a measure of within-sample predictive accuracy, can be misleading for model evaluation, whereas fully Bayesian information criteria correctly select the true data-generating parameters. This study demonstrates the capability of non-parametric Bayesian estimation using adaptive HMC sampling methods for easy and reliable estimation in clinical settings to optimize dosing regimens and efficiently treat AUD.
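The workflow above, a population PK model with group-level and subject-level parameters fitted by self-tuning HMC, can be sketched in a few lines of Python. This is a minimal illustration only: it assumes a one-compartment model with first-order absorption, synthetic data for three subjects, and PyMC's NUTS sampler; the paper's actual model structure, priors, and data differ.

```python
import numpy as np
import pymc as pm

# toy longitudinal design: 5 sampling times (h) for each of 3 subjects
t = np.tile([0.5, 1.0, 2.0, 4.0, 8.0], 3)
subj = np.repeat(np.arange(3), 5)
dose = 20.0  # mg, single oral dose (illustrative)

# simulate synthetic observations from known "true" parameters
rng = np.random.default_rng(0)
ka_t, ke_t, V_t = 1.2, 0.2, 30.0
y = dose * ka_t / (V_t * (ka_t - ke_t)) * (np.exp(-ke_t * t) - np.exp(-ka_t * t))
y += rng.normal(0.0, 0.02, size=y.size)

with pm.Model():
    # group-level (population) parameters
    mu_ka = pm.LogNormal("mu_ka", mu=0.0, sigma=0.5)   # absorption rate (1/h)
    mu_ke = pm.LogNormal("mu_ke", mu=-1.5, sigma=0.5)  # elimination rate (1/h)
    mu_V = pm.LogNormal("mu_V", mu=3.4, sigma=0.5)     # volume of distribution (L)
    # subject-level parameters (inter-individual variability)
    ka = pm.LogNormal("ka", mu=pm.math.log(mu_ka), sigma=0.3, shape=3)
    ke = pm.LogNormal("ke", mu=pm.math.log(mu_ke), sigma=0.3, shape=3)
    V = pm.LogNormal("V", mu=pm.math.log(mu_V), sigma=0.3, shape=3)
    # one-compartment, first-order-absorption concentration curve
    conc = dose * ka[subj] / (V[subj] * (ka[subj] - ke[subj])) * (
        pm.math.exp(-ke[subj] * t) - pm.math.exp(-ka[subj] * t)
    )
    sigma = pm.HalfNormal("sigma", 1.0)  # residual variability
    pm.Normal("obs", mu=conc, sigma=sigma, observed=y)
    idata = pm.sample(1000, tune=1000)   # NUTS, a self-tuning HMC variant
```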
{"title":"Hierarchical Bayesian pharmacometrics analysis of Baclofen for alcohol use disorder","authors":"Nina Baldy, Nicolas Simon, Viktor Jirsa, Meysam Hashemi","doi":"10.1088/2632-2153/acf6aa","DOIUrl":"https://doi.org/10.1088/2632-2153/acf6aa","url":null,"abstract":"Alcohol use disorder (AUD), also called alcohol dependence, is a major public health problem, affecting almost 10% of the world’s population. Baclofen, as a selective GABAB receptor agonist, has emerged as a promising drug for the treatment of AUD. However, the inter-trial, inter-individual and residual variability in drug concentration over time in a population of patients with AUD is unknown. In this study, we use a hierarchical Bayesian workflow to estimate the parameters of a pharmacokinetic (PK) population model from Baclofen administration to patients with AUD. By monitoring various convergence diagnostics, the probabilistic methodology is first validated on synthetic longitudinal datasets and then applied to infer the PK model parameters based on the clinical data that were retrospectively collected from outpatients treated with oral Baclofen. We show that state-of-the-art advances in automatic Bayesian inference using self-tuning Hamiltonian Monte Carlo (HMC) algorithms provide accurate and decisive predictions on Baclofen plasma concentration at both individual and group levels. Importantly, leveraging the information in prior provides faster computation, better convergence diagnostics, and substantially higher out-of-sample prediction accuracy. Moreover, the root mean squared error as a measure of within-sample predictive accuracy can be misleading for model evaluation, whereas the fully Bayesian information criteria correctly select the true data generating parameters. This study points out the capability of non-parametric Bayesian estimation using adaptive HMC sampling methods for easy and reliable estimation in clinical settings to optimize dosing regimens and efficiently treat AUD.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47187991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data-driven modeling of noise time series with convolutional generative adversarial networks
Pub Date: 2023-09-01 | DOI: 10.1088/2632-2153/acee44 | Open access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10484071/pdf/
Adam Wunderlich, Jack Sklar
Random noise arising from physical processes is an inherent characteristic of measurements and a limiting factor for most signal processing and data analysis tasks. Given the recent interest in generative adversarial networks (GANs) for data-driven modeling, it is important to determine to what extent GANs can faithfully reproduce noise in target data sets. In this paper, we present an empirical investigation that aims to shed light on this issue for time series. Namely, we assess two general-purpose GANs for time series that are based on the popular deep convolutional GAN architecture: a direct time-series model and an image-based model that uses a short-time Fourier transform data representation. The GAN models are trained and quantitatively evaluated using distributions of simulated noise time series with known ground-truth parameters. Target time series distributions include a broad range of noise types commonly encountered in physical measurements, electronics, and communication systems: band-limited thermal noise, power law noise, shot noise, and impulsive noise. We find that GANs are capable of learning many noise types, although they predictably struggle when the GAN architecture is not well suited to some aspects of the noise, e.g. impulsive time series with extreme outliers. Our findings provide insights into the capabilities and potential limitations of current approaches to time-series GANs and highlight areas for further research. In addition, our battery of tests provides a useful benchmark to aid the development of deep generative models for time series.
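As a concrete example of such ground-truth targets, power-law noise with a known spectral exponent can be simulated by spectrally shaping white Gaussian noise. This is a standard recipe, not the paper's simulation code; the function name and unit-variance normalization are illustrative choices.

```python
import numpy as np

def powerlaw_noise(n, alpha, rng):
    """Gaussian noise with power spectral density S(f) ~ 1/f**alpha."""
    spec = np.fft.rfft(rng.standard_normal(n))
    f = np.fft.rfftfreq(n)
    f[0] = f[1]                      # avoid division by zero at DC
    spec *= f ** (-alpha / 2.0)      # shape the amplitude spectrum
    x = np.fft.irfft(spec, n)
    return x / x.std()               # normalize to unit variance

rng = np.random.default_rng(42)
pink = powerlaw_noise(4096, alpha=1.0, rng=rng)   # pink (1/f) noise
red = powerlaw_noise(4096, alpha=2.0, rng=rng)    # red/Brownian (1/f^2) noise
```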
{"title":"Data-driven modeling of noise time series with convolutional generative adversarial networks.","authors":"Adam Wunderlich, Jack Sklar","doi":"10.1088/2632-2153/acee44","DOIUrl":"10.1088/2632-2153/acee44","url":null,"abstract":"<p><p>Random noise arising from physical processes is an inherent characteristic of measurements and a limiting factor for most signal processing and data analysis tasks. Given the recent interest in generative adversarial networks (GANs) for data-driven modeling, it is important to determine to what extent GANs can faithfully reproduce noise in target data sets. In this paper, we present an empirical investigation that aims to shed light on this issue for time series. Namely, we assess two general-purpose GANs for time series that are based on the popular deep convolutional GAN architecture, a direct time-series model and an image-based model that uses a short-time Fourier transform data representation. The GAN models are trained and quantitatively evaluated using distributions of simulated noise time series with known ground-truth parameters. Target time series distributions include a broad range of noise types commonly encountered in physical measurements, electronics, and communication systems: band-limited thermal noise, power law noise, shot noise, and impulsive noise. We find that GANs are capable of learning many noise types, although they predictably struggle when the GAN architecture is not well suited to some aspects of the noise, e.g. impulsive time-series with extreme outliers. Our findings provide insights into the capabilities and potential limitations of current approaches to time-series GANs and highlight areas for further research. In addition, our battery of tests provides a useful benchmark to aid the development of deep generative models for time series.</p>","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"4 3","pages":""},"PeriodicalIF":6.3,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10484071/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10605697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Interpretable delta-learning of GW quasiparticle energies from GGA-DFT
Pub Date: 2023-08-30 | DOI: 10.1088/2632-2153/acf545
A. Fediai, Patrick Reiser, Jorge Enrique Olivares Peña, W. Wenzel, Pascal Friederich
Accurate prediction of the ionization potential and electron affinity energies of small molecules is important for many applications. Density functional theory (DFT) is computationally inexpensive, but can be very inaccurate for frontier orbital energies or ionization energies. The GW method is sufficiently accurate for many relevant applications, but much more expensive than DFT. Here we study how we can learn to predict orbital energies with GW accuracy by applying machine learning (ML) to molecular graphs and fingerprints in an interpretable delta-learning approach. The ML models presented here can be used to predict quasiparticle energies of small organic molecules even beyond the size of the molecules used for training. We furthermore analyze the learned DFT-to-GW corrections by mapping them to specific localized fragments of the molecules, in order to develop an intuitive interpretation of the learned corrections, and thus to better understand DFT errors.
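The delta-learning pattern regresses the correction between the cheap and the accurate method rather than the accurate energies themselves. A hedged sketch, assuming precomputed fingerprints and energies stored in hypothetical .npy files and a kernel ridge regressor (the paper's descriptors and model may differ):

```python
import numpy as np
from sklearn.kernel_ridge import KernelRidge

# hypothetical precomputed inputs: one row per molecule
X = np.load("fingerprints.npy")   # (n_molecules, n_bits) molecular fingerprints
e_dft = np.load("e_dft.npy")      # GGA-DFT frontier-orbital energies (eV)
e_gw = np.load("e_gw.npy")        # reference GW quasiparticle energies (eV)

# learn only the DFT-to-GW correction (the "delta")
model = KernelRidge(kernel="laplacian", alpha=1e-3, gamma=1e-3)
model.fit(X, e_gw - e_dft)

# prediction = cheap DFT value + learned correction, approximating GW
e_pred = e_dft + model.predict(X)
```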
{"title":"Interpretable delta-learning of GW quasiparticle energies from GGA-DFT","authors":"A. Fediai, Patrick Reiser, Jorge Enrique Olivares Peña, W. Wenzel, Pascal Friederich","doi":"10.1088/2632-2153/acf545","DOIUrl":"https://doi.org/10.1088/2632-2153/acf545","url":null,"abstract":"Accurate prediction of the ionization potential and electron affinity energies of small molecules are important for many applications. Density functional theory (DFT) is computationally inexpensive, but can be very inaccurate for frontier orbital energies or ionization energies. The GW method is sufficiently accurate for many relevant applications, but much more expensive than DFT. Here we study how we can learn to predict orbital energies with GW accuracy using machine learning (ML) on molecular graphs and fingerprints using an interpretable delta-learning approach. ML models presented here can be used to predict quasiparticle energies of small organic molecules even beyond the size of the molecules used for training. We furthermore analyze the learned DFT-to-GW corrections by mapping them to specific localized fragments of the molecules, in order to develop an intuitive interpretation of the learned corrections, and thus to better understand DFT errors.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47580192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Conditioning Boltzmann generators for rare event sampling
Pub Date: 2023-08-30 | DOI: 10.1088/2632-2153/acf55c
S. Falkner, Alessandro Coretti, Salvatore Romano, P. Geissler, C. Dellago
Understanding the dynamics of complex molecular processes is often linked to the study of infrequent transitions between long-lived stable states. The standard approach to sampling such rare events is to generate an ensemble of transition paths using a random walk in trajectory space. This, however, comes with the drawback of strong correlations between subsequently sampled paths and with an intrinsic difficulty in parallelizing the sampling process. We propose a transition path sampling scheme based on neural-network-generated configurations. These are obtained by employing normalizing flows, a class of neural networks able to generate statistically independent samples from a given distribution. With this approach, not only are correlations between visited paths removed, but the sampling process becomes easily parallelizable. Moreover, by conditioning the normalizing flow, the sampling of configurations can be steered towards regions of interest. We show that this approach enables the resolution of both the thermodynamics and kinetics of the transition region for systems that can be sampled using exact-likelihood generative models.
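The exact-likelihood property this relies on can be illustrated with the basic Boltzmann-generator reweighting step: proposals drawn from a flow are importance-weighted to the target Boltzmann distribution. The sketch below shows only this reweighting idea, not the conditioning scheme itself; `flow` (assumed to expose sample() and log_prob() methods) and `potential_energy` are hypothetical stand-ins.

```python
import torch

def boltzmann_weights(flow, potential_energy, n_samples, beta=1.0):
    """Draw independent proposals from a flow and reweight them to exp(-beta*U)."""
    x = flow.sample((n_samples,))                 # statistically independent samples
    log_q = flow.log_prob(x)                      # exact likelihood of each proposal
    log_w = -beta * potential_energy(x) - log_q   # log importance weights
    return x, torch.softmax(log_w, dim=0)         # self-normalized weights
```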
{"title":"Conditioning Boltzmann generators for rare event sampling","authors":"S. Falkner, Alessandro Coretti, Salvatore Romano, P. Geissler, C. Dellago","doi":"10.1088/2632-2153/acf55c","DOIUrl":"https://doi.org/10.1088/2632-2153/acf55c","url":null,"abstract":"Understanding the dynamics of complex molecular processes is often linked to the study of infrequent transitions between long-lived stable states. The standard approach to the sampling of such rare events is to generate an ensemble of transition paths using a random walk in trajectory space. This, however, comes with the drawback of strong correlations between subsequently sampled paths and with an intrinsic difficulty in parallelizing the sampling process. We propose a transition path sampling scheme based on neural-network generated configurations. These are obtained employing normalizing flows, a neural network class able to generate statistically independent samples from a given distribution. With this approach, not only are correlations between visited paths removed, but the sampling process becomes easily parallelizable. Moreover, by conditioning the normalizing flow, the sampling of configurations can be steered towards regions of interest. We show that this approach enables the resolution of both the thermodynamics and kinetics of the transition region for systems that can be sampled using exact-likelihood generative models.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49374979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
DIM: long-tailed object detection and instance segmentation via dynamic instance memory
Pub Date: 2023-08-23 | DOI: 10.1088/2632-2153/acf362
Zhao-Min Chen, Xin Jin, Xiaoqin Zhang, C. Xia, Zhiyong Pan, Ruoxi Deng, Jie Hu, Heng Chen
Object detection and instance segmentation have been successful on benchmarks with relatively balanced category distributions (e.g. MSCOCO). However, state-of-the-art object detection and segmentation methods still struggle to generalize on long-tailed datasets (e.g. LVIS), where a few classes (head classes) dominate the instance samples, while most classes (tailed classes) have only a few samples. To address this challenge, we propose a plug-and-play module within the Mask R-CNN framework called dynamic instance memory (DIM). Specifically, we augment Mask R-CNN with an auxiliary branch for training. It maintains a dynamic memory bank storing an instance-level prototype representation for each category, and shares the classifier with the existing instance branch. With a simple metric loss, the representations in DIM are dynamically updated by the instance proposals in the mini-batch during training. Together with a class-frequency-reversed sampler, DIM introduces a bias toward tailed classes into classifier learning, complementing the existing instance branch, which learns generalizable representations from the original data distribution. Comprehensive experiments on LVIS demonstrate the effectiveness of DIM, as well as its significant advantages over the baseline Mask R-CNN.
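A hedged sketch of the memory mechanism: per-class prototypes updated by momentum from mini-batch instance features, plus a simple metric loss pulling features toward their class prototype. The momentum value and the cosine-based loss are assumptions, not the paper's exact choices.

```python
import torch
import torch.nn.functional as F

class DynamicInstanceMemory:
    """Memory bank of per-class instance prototypes with momentum updates."""

    def __init__(self, num_classes, dim, momentum=0.9):
        self.protos = torch.zeros(num_classes, dim)
        self.momentum = momentum

    @torch.no_grad()
    def update(self, feats, labels):
        # momentum update of each class prototype from this mini-batch
        for c in labels.unique():
            mean = feats[labels == c].mean(dim=0)
            self.protos[c] = self.momentum * self.protos[c] + (1 - self.momentum) * mean

    def metric_loss(self, feats, labels):
        # pull each instance feature toward its class prototype
        return (1.0 - F.cosine_similarity(feats, self.protos[labels])).mean()
```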
{"title":"DIM: long-tailed object detection and instance segmentation via dynamic instance memory","authors":"Zhao-Min Chen, Xin Jin, Xiaoqin Zhang, C. Xia, Zhiyong Pan, Ruoxi Deng, Jie Hu, Heng Chen","doi":"10.1088/2632-2153/acf362","DOIUrl":"https://doi.org/10.1088/2632-2153/acf362","url":null,"abstract":"Object detection and instance segmentation have been successful on benchmarks with relatively balanced category distribution (e.g. MSCOCO). However, state-of-the-art object detection and segmentation methods still struggle to generalize on long-tailed datasets (e.g. LVIS), where a few classes (head classes) dominate the instance samples, while most classes (tailed classes) have only a few samples. To address this challenge, we propose a plug-and-play module within the Mask R-CNN framework called dynamic instance memory (DIM). Specifically, we augment Mask R-CNN with an auxiliary branch for training. It maintains a dynamic memory bank storing an instance-level prototype representation for each category, and shares the classifier with the existing instance branch. With a simple metric loss, the representations in DIM can be dynamically updated by the instance proposals in the mini-batch during training. Our DIM introduces a bias toward tailed classes to the classifier learning along with a class frequency reversed sampler, which learns generalizable representations from the original data distribution, complementing the existing instance branch. Comprehensive experiments on LVIS demonstrate the effectiveness of DIM, as well as the significant advantages of DIM over the baseline Mask R-CNN.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46649535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Physics-informed neural networks for modeling astrophysical shocks
Pub Date: 2023-08-16 | DOI: 10.1088/2632-2153/acf116
S. Moschou, Elliot Hicks, Rishi Parekh, Dhruv Mathew, Shoumik Majumdar, N. Vlahakis
Physics-informed neural networks (PINNs) are machine learning models that integrate data-based learning with partial differential equations (PDEs). In this work, for the first time we extend PINNs to model the numerically challenging case of astrophysical shock waves in the presence of a stellar gravitational field. Notably, PINNs suffer from competing losses during gradient descent that can lead to poor performance, especially in physical setups involving multiple scales, which is the case for shocks in the gravitationally stratified solar atmosphere. We applied PINNs in three different setups, ranging from modeling astrophysical shocks in cases with little or no data to data-intensive cases. Namely, we used PINNs (a) to determine the effective polytropic index controlling the heating mechanism of the space plasma within 1% error, (b) to quantitatively show that data assimilation is seamless in PINNs and small amounts of data can significantly increase the model’s accuracy, and (c) to solve the forward time-dependent problem for different temporal horizons. We addressed the poor performance of PINNs through an effective normalization approach by reformulating the fluid dynamics PDE system to absorb the gravity-caused variability. This led to a huge improvement in the overall model performance, with the density accuracy improving between 2 and 16 times. Finally, we present a detailed critique of the strengths and drawbacks of PINNs in tackling realistic physical problems in astrophysics and conclude that PINNs can be a powerful complementary modeling approach to classical fluid dynamics solvers.
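The competing-loss structure at the heart of PINN training can be made concrete with a toy example. The sketch below assembles a data term and a PDE-residual term for a 1D advection equation u_t + c*u_x = 0, standing in for the paper's stratified fluid-dynamics system; the equation and the weight w_pde are illustrative assumptions.

```python
import torch

def pinn_loss(model, xt_data, u_data, xt_col, w_pde=1.0, c=1.0):
    """Data misfit plus PDE residual for a toy advection equation u_t + c*u_x = 0."""
    # data term: fit the network to observed solution values
    loss_data = ((model(xt_data).squeeze() - u_data) ** 2).mean()
    # physics term: PDE residual at collocation points via autograd
    xt = xt_col.clone().requires_grad_(True)      # columns: (x, t)
    u = model(xt).squeeze()
    grads = torch.autograd.grad(u.sum(), xt, create_graph=True)[0]
    u_x, u_t = grads[:, 0], grads[:, 1]
    loss_pde = ((u_t + c * u_x) ** 2).mean()
    # the two competing objectives discussed above
    return loss_data + w_pde * loss_pde
```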
{"title":"Physics-informed neural networks for modeling astrophysical shocks","authors":"S. Moschou, Elliot Hicks, Rishi Parekh, Dhruv Mathew, Shoumik Majumdar, N. Vlahakis","doi":"10.1088/2632-2153/acf116","DOIUrl":"https://doi.org/10.1088/2632-2153/acf116","url":null,"abstract":"Physics-informed neural networks (PINNs) are machine learning models that integrate data-based learning with partial differential equations (PDEs). In this work, for the first time we extend PINNs to model the numerically challenging case of astrophysical shock waves in the presence of a stellar gravitational field. Notably, PINNs suffer from competing losses during gradient descent that can lead to poor performance especially in physical setups involving multiple scales, which is the case for shocks in the gravitationally stratified solar atmosphere. We applied PINNs in three different setups ranging from modeling astrophysical shocks in cases with no or little data to data-intensive cases. Namely, we used PINNs (a) to determine the effective polytropic index controlling the heating mechanism of the space plasma within 1% error, (b) to quantitatively show that data assimilation is seamless in PINNs and small amounts of data can significantly increase the model’s accuracy, and (c) to solve the forward time-dependent problem for different temporal horizons. We addressed the poor performance of PINNs through an effective normalization approach by reformulating the fluid dynamics PDE system to absorb the gravity-caused variability. This led to a huge improvement in the overall model performance with the density accuracy improving between 2 and 16 times. Finally, we present a detailed critique on the strengths and drawbacks of PINNs in tackling realistic physical problems in astrophysics and conclude that PINNs can be a powerful complimentary modeling approach to classical fluid dynamics solvers.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41443900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A study of transfer learning in digital rock properties measurement
Pub Date: 2023-08-16 | DOI: 10.1088/2632-2153/acf117
M. I. K. Haq, I. Yulita, I. A. Dharmawan
The measurement of the physical parameters of the porous rocks that constitute reservoirs is an essential part of hydrocarbon exploration. Typically, these physical parameters are measured through core analysis in a laboratory, which requires considerable time and high costs. Another approach involves digital rock models, where the physical parameters are calculated through image processing and numerical simulations. However, this method also requires a significant amount of time to estimate the physical parameters of each rock sample. Machine learning, specifically convolutional neural network (CNN) algorithms, has been developed as an alternative method for estimating the physical parameters of porous rock in a shorter time frame. The advancement of CNNs, particularly through transfer learning with pre-trained models, has enabled rapid prediction. However, not all pre-trained models are suitable for estimating the physical parameters of porous rock. In this study, transfer learning was applied to estimate parameters of sandstones such as porosity, specific surface area, average grain size, average coordination number, and average throat radius. Six pre-trained models were evaluated: ResNet152, DenseNet201, Xception, InceptionV3, InceptionResNetV2, and MobileNetV2. The results indicate that the DenseNet201 model achieved the best performance, with an error rate of 2.11%. Overall, this study highlights the potential of transfer learning to enable more efficient and effective computation.
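A hedged sketch of this transfer-learning setup, assuming Keras with an ImageNet-pretrained DenseNet201 as a frozen feature extractor and a small regression head for the five rock properties; the input size, head architecture, and training details are assumptions, not the study's configuration.

```python
import tensorflow as tf

# ImageNet-pretrained backbone, used as a frozen feature extractor
base = tf.keras.applications.DenseNet201(
    include_top=False, weights="imagenet",
    input_shape=(224, 224, 3), pooling="avg",
)
base.trainable = False

# small regression head for the five rock properties
model = tf.keras.Sequential([
    base,
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(5),  # porosity, surface area, grain size, coordination, throat radius
])
model.compile(optimizer="adam", loss="mse")
# model.fit(rock_images, rock_properties, ...)  # hypothetical training data
```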
{"title":"A study of transfer learning in digital rock properties measurement","authors":"M. I. K. Haq, I. Yulita, I. A. Dharmawan","doi":"10.1088/2632-2153/acf117","DOIUrl":"https://doi.org/10.1088/2632-2153/acf117","url":null,"abstract":"The measurement of physical parameters of porous rock, which constitute reservoirs, is an essential part of hydrocarbon exploration. Typically, the measurement of these physical parameters is carried out through core analysis in a laboratory, which requires considerable time and high costs. Another approach involves using digital rock models, where the physical parameters are calculated through image processing and numerical simulations. However, this method also requires a significant amount of time for estimating the physical parameters of each rock sample. Machine learning, specifically convolutional neural network (CNN) algorithms, has been developed as an alternative method for estimating the physical parameters of porous rock in a shorter time frame. The advancement of CNN, particularly through transfer learning using pre-trained models, has contributed to rapid prediction capabilities. However, not all pre-trained models are suitable for estimating the physical parameters of porous rock. In this study, transfer learning was applied to estimate parameters of sandstones such as porosity, specific surface area, average grain size, average coordination number, and average throat radius. Six types of pre-trained models were utilized: ResNet152, DenseNet201, Xception, InceptionV3, InceptionResNetV2, and MobileNetV2. The results of this study indicate that the DenseNet201 model achieved the best performance with an error rate of 2.11%. Overall, this study highlights the potential of transfer learning to ultimately lead to more efficient and effective computation.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41584758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Automated gadget discovery in the quantum domain
Pub Date: 2023-08-15 | DOI: 10.1088/2632-2153/acf098
Lea M. Trenkwalder, Andrea López-Incera, Hendrik Poulsen Nautrup, Fulvio Flamini, H. Briegel
In recent years, reinforcement learning (RL) has become increasingly successful in its application to the quantum domain and the process of scientific discovery in general. However, while RL algorithms learn to solve increasingly complex problems, interpreting the solutions they provide becomes ever more challenging. In this work, we gain insights into an RL agent’s learned behavior through a post-hoc analysis based on sequence mining and clustering. Specifically, frequent and compact subroutines, used by the agent to solve a given task, are distilled as gadgets and then grouped by various metrics. This process of gadget discovery develops in three stages: first, we use an RL agent to generate data; then, we employ a mining algorithm to extract gadgets; and finally, the obtained gadgets are grouped by a density-based clustering algorithm. We demonstrate our method by applying it to two quantum-inspired RL environments. First, we consider simulated quantum optics experiments for the design of high-dimensional multipartite entangled states, where the algorithm finds gadgets that correspond to modern interferometer setups. Second, we consider a circuit-based quantum computing environment, where the algorithm discovers various gadgets for quantum information processing, such as quantum teleportation. This approach for analyzing the policy of a learned agent is agent- and environment-agnostic and can yield interesting insights into any agent’s policy.
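The three-stage pipeline can be outlined compactly. In the sketch below, episodes are lists of discrete action IDs, the mining step is a simple frequent n-gram count standing in for the paper's sequence-mining algorithm, and `featurize` is a hypothetical mapping from a gadget to a feature vector for DBSCAN.

```python
from collections import Counter
import numpy as np
from sklearn.cluster import DBSCAN

def mine_gadgets(episodes, n=3, min_support=5):
    """Stage 2: extract frequent length-n action subsequences ("gadgets")."""
    counts = Counter(
        tuple(ep[i:i + n]) for ep in episodes for i in range(len(ep) - n + 1)
    )
    return [gadget for gadget, c in counts.items() if c >= min_support]

def cluster_gadgets(gadgets, featurize, eps=0.5):
    """Stage 3: group gadgets by density-based clustering on feature vectors."""
    X = np.array([featurize(g) for g in gadgets])
    return DBSCAN(eps=eps, min_samples=2).fit_predict(X)  # label -1 marks noise
```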
{"title":"Automated gadget discovery in the quantum domain","authors":"Lea M. Trenkwalder, Andrea López-Incera, Hendrik Poulsen Nautrup, Fulvio Flamini, H. Briegel","doi":"10.1088/2632-2153/acf098","DOIUrl":"https://doi.org/10.1088/2632-2153/acf098","url":null,"abstract":"In recent years, reinforcement learning (RL) has become increasingly successful in its application to the quantum domain and the process of scientific discovery in general. However, while RL algorithms learn to solve increasingly complex problems, interpreting the solutions they provide becomes ever more challenging. In this work, we gain insights into an RL agent’s learned behavior through a post-hoc analysis based on sequence mining and clustering. Specifically, frequent and compact subroutines, used by the agent to solve a given task, are distilled as gadgets and then grouped by various metrics. This process of gadget discovery develops in three stages: First, we use an RL agent to generate data, then, we employ a mining algorithm to extract gadgets and finally, the obtained gadgets are grouped by a density-based clustering algorithm. We demonstrate our method by applying it to two quantum-inspired RL environments. First, we consider simulated quantum optics experiments for the design of high-dimensional multipartite entangled states where the algorithm finds gadgets that correspond to modern interferometer setups. Second, we consider a circuit-based quantum computing environment where the algorithm discovers various gadgets for quantum information processing, such as quantum teleportation. This approach for analyzing the policy of a learned agent is agent and environment agnostic and can yield interesting insights into any agent’s policy.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44342855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Deep learning-based detection and identification of brain tumor biomarkers in quantitative MR-images
Pub Date: 2023-08-15 | DOI: 10.1088/2632-2153/acf095
I. Tampu, N. Haj-Hosseini, I. Blystad, A. Eklund
The infiltrative nature of malignant gliomas results in active tumor spreading into the peritumoral edema, which is not visible in conventional magnetic resonance imaging (cMRI) even after contrast injection. MR relaxometry (qMRI) measures relaxation rates dependent on tissue properties and can offer additional contrast mechanisms to highlight the non-enhancing infiltrative tumor. To investigate if qMRI data provides additional information compared to cMRI sequences when considering deep learning-based brain tumor detection and segmentation, preoperative conventional (T1w pre- and post-contrast, T2w and FLAIR) and quantitative (pre- and post-contrast R1, R2 and proton density) MR data was obtained from 23 patients with typical radiological findings suggestive of a high-grade glioma. 2D deep learning models were trained on transversal slices (n = 528) for tumor detection and segmentation using either cMRI or qMRI. Moreover, trends in quantitative R1 and R2 rates of regions identified as relevant for tumor detection by model explainability methods were qualitatively analyzed. Tumor detection and segmentation performance was highest for models trained with a combination of qMRI pre- and post-contrast (detection Matthews correlation coefficient (MCC) = 0.72, segmentation Dice similarity coefficient (DSC) = 0.90); however, the difference compared to cMRI was not statistically significant. Overall analysis of the relevant regions identified using model explainability showed no differences between models trained on cMRI or qMRI. When looking at the individual cases, relaxation rates of brain regions outside the annotation that were identified as relevant for tumor detection exhibited changes after contrast injection similar to regions inside the annotation in the majority of cases. In conclusion, models trained on qMRI data obtained similar detection and segmentation performance to those trained on cMRI data, with the advantage of quantitatively measuring brain tissue properties within a similar scan time. When considering individual patients, the analysis of relaxation rates of regions identified by model explainability suggests the presence of infiltrative tumor outside the cMRI-based tumor annotation.
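For reference, the two reported metrics can be computed as follows. This is a minimal sketch with toy binary labels and masks, not the study's evaluation code.

```python
import numpy as np
from sklearn.metrics import matthews_corrcoef

def dice(pred, truth):
    """Dice similarity coefficient between two binary masks."""
    inter = np.logical_and(pred, truth).sum()
    return 2.0 * inter / (pred.sum() + truth.sum())

# detection scored per slice (tumor present / absent), toy labels
y_true = np.array([1, 0, 1, 1, 0])
y_pred = np.array([1, 0, 0, 1, 0])
print("MCC:", matthews_corrcoef(y_true, y_pred))
print("DSC:", dice(y_pred, y_true))
```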
{"title":"Deep learning-based detection and identification of brain tumor biomarkers in quantitative MR-images","authors":"I. Tampu, N. Haj-Hosseini, I. Blystad, A. Eklund","doi":"10.1088/2632-2153/acf095","DOIUrl":"https://doi.org/10.1088/2632-2153/acf095","url":null,"abstract":"The infiltrative nature of malignant gliomas results in active tumor spreading into the peritumoral edema, which is not visible in conventional magnetic resonance imaging (cMRI) even after contrast injection. MR relaxometry (qMRI) measures relaxation rates dependent on tissue properties and can offer additional contrast mechanisms to highlight the non-enhancing infiltrative tumor. To investigate if qMRI data provides additional information compared to cMRI sequences when considering deep learning-based brain tumor detection and segmentation, preoperative conventional (T1w per- and post-contrast, T2w and FLAIR) and quantitative (pre- and post-contrast R1, R2 and proton density) MR data was obtained from 23 patients with typical radiological findings suggestive of a high-grade glioma. 2D deep learning models were trained on transversal slices (n = 528) for tumor detection and segmentation using either cMRI or qMRI. Moreover, trends in quantitative R1 and R2 rates of regions identified as relevant for tumor detection by model explainability methods were qualitatively analyzed. Tumor detection and segmentation performance for models trained with a combination of qMRI pre- and post-contrast was the highest (detection Matthews correlation coefficient (MCC) = 0.72, segmentation dice similarity coefficient (DSC) = 0.90), however, the difference compared to cMRI was not statistically significant. Overall analysis of the relevant regions identified using model explainability showed no differences between models trained on cMRI or qMRI. When looking at the individual cases, relaxation rates of brain regions outside the annotation and identified as relevant for tumor detection exhibited changes after contrast injection similar to region inside the annotation in the majority of cases. In conclusion, models trained on qMRI data obtained similar detection and segmentation performance to those trained on cMRI data, with the advantage of quantitatively measuring brain tissue properties within a similar scan time. When considering individual patients, the analysis of relaxation rates of regions identified by model explainability suggests the presence of infiltrative tumor outside the cMRI-based tumor annotation.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43076358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Improving resilience of sensors in planetary exploration using data-driven models
Pub Date: 2023-08-11 | DOI: 10.1088/2632-2153/acefaa
Dileep Kumar, M. Dominguez-Pumar, Elisa Sayrol-Clols, J. Torres, M. Marín, J. Gómez-Elvira, L. Mora, S. Navarro, J. Rodriguez-Manfredi
Improving the resilience of sensor systems in space exploration is a key objective, since the environmental conditions to which they are exposed are very harsh. For example, it is known that the presence of flying debris and dust devils on the Martian surface can partially damage sensors on rovers and landers. The objective of this work is to show how data-driven methods can improve sensor resilience, particularly in the case of complex sensors with multiple intermediate variables feeding an inverse algorithm (IA) based on calibration data. The method considers three phases: an initial phase, in which the sensor is calibrated in the laboratory and an IA is designed; a second phase, in which the sensor is placed at its intended location and sensor data are used to train a data-driven model; and a third phase, once the model has been trained and partial damage is detected, in which the data-driven algorithm reduces errors. The proposed method is tested with the intermediate data of the wind sensor of the TWINS instrument (NASA InSight mission), which consists of two booms placed on the deck of the lander, with three boards per boom. Wind speed and angle are recovered from the intermediate variables provided by the sensor and predicted by the proposed method. A comparative analysis of various data-driven methods, including machine learning and deep learning (DL) methods, is carried out. It is shown that even a simple method such as k-nearest neighbors is capable of successfully recovering the missing data of a board, compared to complex DL models. Depending on the selected missing board, errors are reduced by a factor between 2.43 and 4.78 for horizontal velocity, and by a factor between 1.74 and 4.71 for angle, compared with the situation of using only the two remaining boards.
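The recovery idea, predicting a damaged board's channels from the two healthy boards, maps naturally onto a k-nearest-neighbours regressor fitted on pre-damage telemetry. The sketch below uses synthetic stand-in data; the array shapes, linear mixing, and train/test split are illustrative assumptions.

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

# synthetic stand-in telemetry: two healthy boards predict the third
rng = np.random.default_rng(1)
X_boards12 = rng.normal(size=(1000, 8))           # channels of boards 1 and 2
y_board3 = X_boards12 @ rng.normal(size=(8, 4))   # channels of the board that fails

# fit on pre-damage data, then reconstruct the missing board's channels
knn = KNeighborsRegressor(n_neighbors=5)
knn.fit(X_boards12[:800], y_board3[:800])
board3_recovered = knn.predict(X_boards12[800:])
```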
{"title":"Improving resilience of sensors in planetary exploration using data-driven models","authors":"Dileep Kumar, M. Dominguez-Pumar, Elisa Sayrol-Clols, J. Torres, M. Marín, J. Gómez-Elvira, L. Mora, S. Navarro, J. Rodriguez-Manfredi","doi":"10.1088/2632-2153/acefaa","DOIUrl":"https://doi.org/10.1088/2632-2153/acefaa","url":null,"abstract":"Improving the resilience of sensor systems in space exploration is a key objective since the environmental conditions to which they are exposed are very harsh. For example, it is known that the presence of flying debris and Dust Devils on the Martian surface can partially damage sensors present in rovers/landers. The objective of this work is to show how data-driven methods can improve sensor resilience, particularly in the case of complex sensors, with multiple intermediate variables, feeding an inverse algorithm (IA) based on calibration data. The method considers three phases: an initial phase in which the sensor is calibrated in the laboratory and an IA is designed; a second phase, in which the sensor is placed at its intended location and sensor data is used to train data-driven model; and a third phase, once the model has been trained and partial damage is detected, in which the data-driven algorithm is reducing errors. The proposed method is tested with the intermediate data of the wind sensor of the TWINS instrument (NASA InSight mission), consisting of two booms placed on the deck of the lander, and three boards per boom. Wind speed and angle are recovered from the intermediate variables provided by the sensor and predicted by the proposed method. A comparative analysis of various data-driven methods including machine learning and deep learning (DL) methods is carried out for the proposed research. It is shown that even a simple method such as k-nearest neighbor is capable of successfully recovering missing data of a board compared to complex DL models. Depending on the selected missing board, errors are reduced by a factor between 2.43 and 4.78, for horizontal velocity; and by a factor between 1.74 and 4.71, for angle, compared with the situation of using only the two remaining boards.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.8,"publicationDate":"2023-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47742489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}