Machine Learning Science and Technology最新文献_第4页

Intelligent processing of electromagnetic data using detrended and identification 利用趋势识别技术对电磁数据进行智能处理

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-13 DOI: 10.1088/2632-2153/ad0c40

Xian Zhang, Diquan Li, Bei Liu, Yanfang Hu, Yao Mo

Abstract The application of the electromagnetic method has accelerated due to the demand for the development of mineral resource, however the strong electromagnetic interference seriously lowers the data quality, resolution and detect effect. To suppress the electromagnetic interference, this paper proposes an intelligent processing method based on detrended and identification, and applies for wide field electromagnetic method (WFEM) data. First, we combined the improved intrinsic time scale decomposition (IITD) and detrended fluctuation analysis (DFA) algorithm for removing the trend noise. Then, we extracted the time domain characteristics of the WFEM data after removing the trend noise. Next, the arithmetic optimization algorithm (AOA) was utilized to search for the optimal smoothing factor of the probabilistic neural network (PNN) algorithm, which realized to intelligently identify the noise data and WFEM effective data. Finally, The Fourier transform was performed to extract the spectrum amplitude of the effective frequency points from the reconstructed WFEM data, and the electric field curve was obtained. In these studies and applications, the fuzzy c-mean (FCM) and PNN algorithm are contrasted. The proposed method indicated that the trend noise can be adaptively extracted and eliminated, the abnormal waveform or noise interference can be intelligently identified, the reconstructed WFEM data can effectively recover the pseudo-random signal waveform, and the shape of electric field curves were more stable. Simulation experiments and measured applications has verified that the proposed method can provide technical support for deep underground exploration.

由于矿产资源开发的需要，电磁方法的应用加快，但强电磁干扰严重降低了数据质量、分辨率和探测效果。为了抑制电磁干扰，本文提出了一种基于去趋势和识别的智能处理方法，并应用于广域电磁法(WFEM)数据。首先，结合改进的内禀时间尺度分解(IITD)和去趋势波动分析(DFA)算法去除趋势噪声;然后，在去除趋势噪声后提取WFEM数据的时域特征。其次，利用算术优化算法(AOA)搜索概率神经网络(PNN)算法的最优平滑因子，实现对噪声数据和WFEM有效数据的智能识别;最后，对重构的WFEM数据进行傅里叶变换提取有效频率点的频谱幅值，得到电场曲线。在这些研究和应用中，比较了模糊c均值(FCM)和PNN算法。结果表明，该方法能够自适应提取和消除趋势噪声，智能识别异常波形或噪声干扰，重构后的WFEM数据能够有效恢复伪随机信号波形，电场曲线形状更加稳定。仿真实验和实测应用验证了该方法能够为深部地下勘探提供技术支持。

{"title":"Intelligent processing of electromagnetic data using detrended and identification","authors":"Xian Zhang, Diquan Li, Bei Liu, Yanfang Hu, Yao Mo","doi":"10.1088/2632-2153/ad0c40","DOIUrl":"https://doi.org/10.1088/2632-2153/ad0c40","url":null,"abstract":"Abstract The application of the electromagnetic method has accelerated due to the demand for the development of mineral resource, however the strong electromagnetic interference seriously lowers the data quality, resolution and detect effect. To suppress the electromagnetic interference, this paper proposes an intelligent processing method based on detrended and identification, and applies for wide field electromagnetic method (WFEM) data. First, we combined the improved intrinsic time scale decomposition (IITD) and detrended fluctuation analysis (DFA) algorithm for removing the trend noise. Then, we extracted the time domain characteristics of the WFEM data after removing the trend noise. Next, the arithmetic optimization algorithm (AOA) was utilized to search for the optimal smoothing factor of the probabilistic neural network (PNN) algorithm, which realized to intelligently identify the noise data and WFEM effective data. Finally, The Fourier transform was performed to extract the spectrum amplitude of the effective frequency points from the reconstructed WFEM data, and the electric field curve was obtained. In these studies and applications, the fuzzy c-mean (FCM) and PNN algorithm are contrasted. The proposed method indicated that the trend noise can be adaptively extracted and eliminated, the abnormal waveform or noise interference can be intelligently identified, the reconstructed WFEM data can effectively recover the pseudo-random signal waveform, and the shape of electric field curves were more stable. Simulation experiments and measured applications has verified that the proposed method can provide technical support for deep underground exploration.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"59 16","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136281910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Neural network Gaussian processes as efficient models of potential energy surfaces for polyatomic molecules 神经网络高斯过程作为多原子分子势能面的有效模型

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-10 DOI: 10.1088/2632-2153/ad0652

Jun Dai, Roman V Krems

Abstract Kernel models of potential energy surfaces (PESs) for polyatomic molecules are often restricted by a specific choice of the kernel function. This can be avoided by optimizing the complexity of the kernel function. For regression problems with very expensive data, the functional form of the model kernels can be optimized in the Gaussian process (GP) setting through compositional function search guided by the Bayesian information criterion. However, the compositional kernel search is computationally demanding and relies on greedy strategies, which may yield sub-optimal kernels. An alternative strategy of increasing complexity of GP kernels treats a GP as a Bayesian neural network (NN) with a variable number of hidden layers, which yields NNGP models. Here, we present a direct comparison of GP models with composite kernels and NNGP models for applications aiming at the construction of global PES for polyatomic molecules. We show that NNGP models of PES can be trained much more efficiently and yield better generalization accuracy without relying on any specific form of the kernel function. We illustrate that NNGP models trained by distributions of energy points at low energies produce accurate predictions of PES at high energies. We also illustrate that NNGP models can extrapolate in the input variable space by building the free energy surface of the Heisenberg model trained in the paramagnetic phase and validated in the ferromagnetic phase. By construction, composite kernels yield more accurate models than kernels with a fixed functional form. Therefore, by illustrating that NNGP models outperform GP models with composite kernels, our work suggests that NNGP models should be a preferred choice of kernel models for PES.

摘要多原子分子势能面核模型常常受到核函数选择的限制。这可以通过优化核函数的复杂性来避免。对于数据非常昂贵的回归问题，可以通过贝叶斯信息准则指导下的组合函数搜索，在高斯过程(GP)设置下优化模型核的函数形式。然而，组合核搜索的计算量很大，并且依赖于贪婪策略，这可能会产生次优核。一种增加GP核复杂性的替代策略将GP视为具有可变隐藏层数的贝叶斯神经网络(NN)，从而产生NNGP模型。在这里，我们将GP模型与复合核模型和NNGP模型进行了直接比较，用于构建多原子分子的全局PES。我们证明了PES的NNGP模型可以更有效地训练并产生更好的泛化精度，而不依赖于任何特定形式的核函数。我们证明了由低能量点分布训练的NNGP模型可以准确地预测高能量的PES。我们还通过建立在顺磁相位训练并在铁磁相位验证的海森堡模型的自由能面，说明NNGP模型可以在输入变量空间中进行外推。通过构造，复合核比具有固定函数形式的核产生更精确的模型。因此，通过说明NNGP模型优于具有复合核的GP模型，我们的工作表明NNGP模型应该是PES核模型的首选。

{"title":"Neural network Gaussian processes as efficient models of potential energy surfaces for polyatomic molecules","authors":"Jun Dai, Roman V Krems","doi":"10.1088/2632-2153/ad0652","DOIUrl":"https://doi.org/10.1088/2632-2153/ad0652","url":null,"abstract":"Abstract Kernel models of potential energy surfaces (PESs) for polyatomic molecules are often restricted by a specific choice of the kernel function. This can be avoided by optimizing the complexity of the kernel function. For regression problems with very expensive data, the functional form of the model kernels can be optimized in the Gaussian process (GP) setting through compositional function search guided by the Bayesian information criterion. However, the compositional kernel search is computationally demanding and relies on greedy strategies, which may yield sub-optimal kernels. An alternative strategy of increasing complexity of GP kernels treats a GP as a Bayesian neural network (NN) with a variable number of hidden layers, which yields NNGP models. Here, we present a direct comparison of GP models with composite kernels and NNGP models for applications aiming at the construction of global PES for polyatomic molecules. We show that NNGP models of PES can be trained much more efficiently and yield better generalization accuracy without relying on any specific form of the kernel function. We illustrate that NNGP models trained by distributions of energy points at low energies produce accurate predictions of PES at high energies. We also illustrate that NNGP models can extrapolate in the input variable space by building the free energy surface of the Heisenberg model trained in the paramagnetic phase and validated in the ferromagnetic phase. By construction, composite kernels yield more accurate models than kernels with a fixed functional form. Therefore, by illustrating that NNGP models outperform GP models with composite kernels, our work suggests that NNGP models should be a preferred choice of kernel models for PES.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"71 26","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135088047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

How deep convolutional neural networks lose spatial information with training 深度卷积神经网络如何在训练中丢失空间信息

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-09 DOI: 10.1088/2632-2153/ad092c

Umberto Maria Tomasini, Leonardo Petrini, Francesco Cagnetta, Matthieu Wyart

Abstract A central question of machine learning is how deep nets manage to learn tasks in high dimensions. An appealing hypothesis is that they achieve this feat by building a representation of the data where information irrelevant to the task is lost. For image datasets, this view is supported by the observation that after (and not before) training, the neural representation becomes less and less sensitive to diffeomorphisms acting on images as the signal propagates through the network. This loss of sensitivity correlates with performance and surprisingly correlates with a gain of sensitivity to white noise acquired during training. Which are the mechanisms learned by convolutional neural networks (CNNs) responsible for the these phenomena? In particular, why is the sensitivity to noise heightened with training? Our approach consists of two steps. (1) Analyzing the layer-wise representations of trained CNNs, we disentangle the role of spatial pooling in contrast to channel pooling in decreasing their sensitivity to image diffeomorphisms while increasing their sensitivity to noise. (2) We introduce model scale-detection tasks, which qualitatively reproduce the phenomena reported in our empirical analysis. In these models we can assess quantitatively how spatial pooling affects these sensitivities. We find that the increased sensitivity to noise observed in deep ReLU networks is a mechanistic consequence of the perturbing noise piling up during spatial pooling, after being rectified by ReLU units. Using odd activation functions like tanh drastically reduces the CNNs’ sensitivity to noise.

机器学习的一个核心问题是深度网络如何在高维空间学习任务。一个吸引人的假设是，他们通过建立一个与任务无关的信息丢失的数据表示来实现这一壮举。对于图像数据集，这一观点得到了观察结果的支持，即在训练之后(而不是之前)，随着信号在网络中传播，神经表示对作用于图像的微分同态变得越来越不敏感。这种灵敏度的丧失与表现有关，令人惊讶的是，它与训练中获得的对白噪声的灵敏度的增加有关。卷积神经网络(cnn)学习的哪些机制导致了这些现象?特别是，为什么对噪音的敏感度会随着训练而提高?我们的方法包括两个步骤。(1)分析训练后的cnn的分层表示，我们区分了空间池化与通道池化在降低其对图像差分同态的敏感性而增加其对噪声的敏感性方面的作用。(2)引入模型尺度检测任务，定性再现我们实证分析中报告的现象。在这些模型中，我们可以定量地评估空间池化如何影响这些敏感性。我们发现，在深度ReLU网络中观察到的对噪声的敏感性增加是在空间池化过程中扰动噪声堆积后被ReLU单元纠正的机制结果。使用像tanh这样的奇数激活函数大大降低了cnn对噪声的敏感性。

{"title":"How deep convolutional neural networks lose spatial information with training","authors":"Umberto Maria Tomasini, Leonardo Petrini, Francesco Cagnetta, Matthieu Wyart","doi":"10.1088/2632-2153/ad092c","DOIUrl":"https://doi.org/10.1088/2632-2153/ad092c","url":null,"abstract":"Abstract A central question of machine learning is how deep nets manage to learn tasks in high dimensions. An appealing hypothesis is that they achieve this feat by building a representation of the data where information irrelevant to the task is lost. For image datasets, this view is supported by the observation that after (and not before) training, the neural representation becomes less and less sensitive to diffeomorphisms acting on images as the signal propagates through the network. This loss of sensitivity correlates with performance and surprisingly correlates with a gain of sensitivity to white noise acquired during training. Which are the mechanisms learned by convolutional neural networks (CNNs) responsible for the these phenomena? In particular, why is the sensitivity to noise heightened with training? Our approach consists of two steps. (1) Analyzing the layer-wise representations of trained CNNs, we disentangle the role of spatial pooling in contrast to channel pooling in decreasing their sensitivity to image diffeomorphisms while increasing their sensitivity to noise. (2) We introduce model scale-detection tasks, which qualitatively reproduce the phenomena reported in our empirical analysis. In these models we can assess quantitatively how spatial pooling affects these sensitivities. We find that the increased sensitivity to noise observed in deep ReLU networks is a mechanistic consequence of the perturbing noise piling up during spatial pooling, after being rectified by ReLU units. Using odd activation functions like tanh drastically reduces the CNNs’ sensitivity to noise.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135191273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Looking at the posterior: accuracy and uncertainty of neural-network predictions 后验分析:神经网络预测的准确性和不确定性

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-08 DOI: 10.1088/2632-2153/ad0ab4

Hampus Linander, Oleksandr Balabanov, Henry Yang, Bernhard Mehlig

Abstract Bayesian inference can quantify uncertainty in the predictions of neural networks using posterior distributions for model parameters and network output. By looking at these posterior distributions, one can separate the origin of uncertainty into aleatoric and epistemic contributions. One goal of uncertainty quantification is to inform on prediction accuracy.Here we show that prediction accuracy depends on both epistemic and aleatoric uncertainty in an intricate fashion that cannot be understood in terms of marginalized uncertainty distributions alone. How the accuracy relates to epistemic and aleatoric uncertainties depends not only on the model architecture, but also on the properties of the dataset.We discuss the significance of these results for active learning and introduce a novel acquisition function that outperforms common uncertainty-based methods. To arrive at our results, we approximated the posteriors using deep ensembles, for fully-connected, convolutional and attention-based neural networks.

摘要贝叶斯推理可以利用模型参数和网络输出的后验分布来量化神经网络预测中的不确定性。通过观察这些后验分布，人们可以将不确定性的起源分为任意贡献和认知贡献。不确定性量化的一个目标是告知预测准确性。在这里，我们表明预测准确性以一种复杂的方式依赖于认知不确定性和任意不确定性，这种不确定性不能仅根据边缘不确定性分布来理解。准确性与认知不确定性和任意不确定性的关系不仅取决于模型架构，还取决于数据集的属性。我们讨论了这些结果对主动学习的意义，并引入了一种优于普通基于不确定性方法的新型获取函数。为了得到我们的结果，我们使用深度集成来近似后置，用于完全连接的、卷积的和基于注意力的神经网络。

引用次数: 0

Calibration of uncertainty in the active learning of machine learning force fields 机器学习力场主动学习中不确定度的标定

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-08 DOI: 10.1088/2632-2153/ad0ab5

Adam Thomas-Mitchell, Glenn Ivan Hawe, Paul Popelier

Abstract FFLUX is a Machine Learning Force Field that uses the Maximum Expected Prediction Error (MEPE) active learning algorithm to improve the efficiency of model training. MEPE uses the predictive uncertainty of a Gaussian Process to balance exploration and exploitation when selecting the next training sample. However, the predictive uncertainty of a Gaussian Process is unlikely to be accurate or precise immediately after training. We hypothesize that calibrating the uncertainty quantification within MEPE will improve active learning performance. We develop and test two methods to improve uncertainty estimates: post-hoc calibration of predictive uncertainty using the CRUDE algorithm, and replacing the Gaussian Process with a Student-t Process. We investigate the impact of these methods on MEPE for single sample and batch sample active learning. Our findings suggest that post-hoc calibration does not improve the performance of active learning using the MEPE method. However, we do find that the Student-t Process can outperform active learning strategies and random sampling using a Gaussian Process if the training set is sufficiently large.

摘要:FFLUX是一种机器学习力场，它使用最大期望预测误差(MEPE)主动学习算法来提高模型训练的效率。在选择下一个训练样本时，MEPE使用高斯过程的预测不确定性来平衡探索和利用。然而，高斯过程的预测不确定性不太可能在训练后立即准确或精确。我们假设在MEPE内校准不确定性量化将改善主动学习绩效。我们开发并测试了两种改进不确定性估计的方法:使用CRUDE算法对预测不确定性进行事后校准，并用Student-t过程取代高斯过程。我们研究了这些方法对单样本和批量样本主动学习的MEPE的影响。我们的研究结果表明，事后校准并不能改善使用MEPE方法的主动学习的表现。然而，我们确实发现，如果训练集足够大，学生-t过程可以优于主动学习策略和使用高斯过程的随机抽样。

{"title":"Calibration of uncertainty in the active learning of machine learning force fields","authors":"Adam Thomas-Mitchell, Glenn Ivan Hawe, Paul Popelier","doi":"10.1088/2632-2153/ad0ab5","DOIUrl":"https://doi.org/10.1088/2632-2153/ad0ab5","url":null,"abstract":"Abstract FFLUX is a Machine Learning Force Field that uses the Maximum Expected Prediction Error (MEPE) active learning algorithm to improve the efficiency of model training. MEPE uses the predictive uncertainty of a Gaussian Process to balance exploration and exploitation when selecting the next training sample. However, the predictive uncertainty of a Gaussian Process is unlikely to be accurate or precise immediately after training. We hypothesize that calibrating the uncertainty quantification within MEPE will improve active learning performance. We develop and test two methods to improve uncertainty estimates: post-hoc calibration of predictive uncertainty using the CRUDE algorithm, and replacing the Gaussian Process with a Student-t Process. We investigate the impact of these methods on MEPE for single sample and batch sample active learning. Our findings suggest that post-hoc calibration does not improve the performance of active learning using the MEPE method. However, we do find that the Student-t Process can outperform active learning strategies and random sampling using a Gaussian Process if the training set is sufficiently large.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" 31","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135340811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Machine-learning approach to setting optimal thresholds and its application in rolling bearing fault diagnosis 最优阈值设置的机器学习方法及其在滚动轴承故障诊断中的应用

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-08 DOI: 10.1088/2632-2153/ad0ab3

Yaochi Tang, Kuohao Li

Abstract Bearings are one of the critical components of any mechanical equipment. They induce most equipment faults, and their health status directly impacts the overall performance of equipment. Therefore, effective bearing fault diagnosis is essential, as it helps maintain the equipment stability, increasing economic benefits through timely maintenance. Currently, most studies focus on extracting fault features, with limited attention to establishing fault thresholds. As a result, these thresholds are challenging to utilize in the automatic monitoring diagnosis of intelligent devices. This study employed the generalized fractal dimensions (GFDs) to effectively extract the feature of time-domain vibration signals of bearings. The optimal fault threshold model was developed using the receiver operating characteristic curve (ROC curve), which served as the baseline of exception judgment. The extracted fault threshold model was verified using two bearing operation experiments. The experimental results revealed different damaged positions and components observed in the two experiments. The same fault threshold model was obtained using the method proposed in this study, and it effectively diagnosed the abnormal states within the signals. This finding confirms the effectiveness of the diagnostic method proposed in this study.

轴承是任何机械设备的关键部件之一。它们是设备故障的主要原因，其健康状况直接影响设备的整体性能。因此，有效的轴承故障诊断是必不可少的，因为它有助于保持设备的稳定性，通过及时维护增加经济效益。目前的研究大多集中在故障特征的提取上，对故障阈值的建立关注较少。因此，这些阈值在智能设备的自动监测诊断中具有挑战性。采用广义分形维数(GFDs)有效提取轴承时域振动信号的特征。利用受试者工作特征曲线(ROC曲线)建立最优故障阈值模型，作为异常判断的基线。通过两次轴承运行实验对提取的故障阈值模型进行了验证。实验结果显示，两个实验中观察到不同的损伤部位和部位。采用本文提出的方法得到了相同的故障阈值模型，有效地诊断了信号中的异常状态。这一发现证实了本研究提出的诊断方法的有效性。

{"title":"A Machine-learning approach to setting optimal thresholds and its application in rolling bearing fault diagnosis","authors":"Yaochi Tang, Kuohao Li","doi":"10.1088/2632-2153/ad0ab3","DOIUrl":"https://doi.org/10.1088/2632-2153/ad0ab3","url":null,"abstract":"Abstract Bearings are one of the critical components of any mechanical equipment. They induce most equipment faults, and their health status directly impacts the overall performance of equipment. Therefore, effective bearing fault diagnosis is essential, as it helps maintain the equipment stability, increasing economic benefits through timely maintenance. Currently, most studies focus on extracting fault features, with limited attention to establishing fault thresholds. As a result, these thresholds are challenging to utilize in the automatic monitoring diagnosis of intelligent devices. This study employed the generalized fractal dimensions (GFDs) to effectively extract the feature of time-domain vibration signals of bearings. The optimal fault threshold model was developed using the receiver operating characteristic curve (ROC curve), which served as the baseline of exception judgment. The extracted fault threshold model was verified using two bearing operation experiments. The experimental results revealed different damaged positions and components observed in the two experiments. The same fault threshold model was obtained using the method proposed in this study, and it effectively diagnosed the abnormal states within the signals. This finding confirms the effectiveness of the diagnostic method proposed in this study.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135340967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Artificial neural networks exploiting point cloud data for fragmented solid objects classification 利用点云数据进行碎片实体分类的人工神经网络

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-06 DOI: 10.1088/2632-2153/ad035e

Alessandro Baiocchi, Stefano Giagu, Christian Napoli, Marco SERRA, Pietro Nardelli, Martina Valleriani

Abstract This paper presents a novel approach for fragmented solid object classification exploiting neural networks based on point clouds. This work is the initial step of a project in collaboration with the Institution of ‘Ente Parco Archeologico del Colosseo’ in Rome, which aims to reconstruct ancient artifacts from their fragments. We built from scratch a synthetic dataset (DS) of fragments of different 3D objects including aging effects. We used this DS to train deep learning models for the task of classifying internal and external fragments. As model architectures, we adopted PointNet and dynamical graph convolutional neural network, which take as input a point cloud representing the spatial geometry of a fragment, and we optimized model performance by adding additional features sensitive to local geometry characteristics. We tested the approach by performing several experiments to check the robustness and generalization capabilities of the models. Finally, we test the models on a real case using a 3D scan of artifacts preserved in different museums, artificially fragmented, obtaining good performance.

摘要提出了一种基于点云的神经网络碎片实体分类新方法。这项工作是与罗马“Ente Parco Archeologico del Colosseo”研究所合作项目的第一步，该项目旨在从碎片中重建古代文物。我们从零开始建立了一个合成数据集(DS)，其中包括不同3D物体的碎片，包括老化效果。我们使用这个DS来训练深度学习模型来完成内部和外部碎片的分类任务。采用PointNet和动态图卷积神经网络作为模型架构，以表示碎片空间几何形状的点云为输入，通过增加对局部几何特征敏感的特征来优化模型性能。我们通过执行几个实验来测试该方法，以检查模型的鲁棒性和泛化能力。最后，我们在实际案例中对不同博物馆保存的文物进行了三维扫描，人工分割，获得了良好的效果。

{"title":"Artificial neural networks exploiting point cloud data for fragmented solid objects classification","authors":"Alessandro Baiocchi, Stefano Giagu, Christian Napoli, Marco SERRA, Pietro Nardelli, Martina Valleriani","doi":"10.1088/2632-2153/ad035e","DOIUrl":"https://doi.org/10.1088/2632-2153/ad035e","url":null,"abstract":"Abstract This paper presents a novel approach for fragmented solid object classification exploiting neural networks based on point clouds. This work is the initial step of a project in collaboration with the Institution of ‘Ente Parco Archeologico del Colosseo’ in Rome, which aims to reconstruct ancient artifacts from their fragments. We built from scratch a synthetic dataset (DS) of fragments of different 3D objects including aging effects. We used this DS to train deep learning models for the task of classifying internal and external fragments. As model architectures, we adopted PointNet and dynamical graph convolutional neural network, which take as input a point cloud representing the spatial geometry of a fragment, and we optimized model performance by adding additional features sensitive to local geometry characteristics. We tested the approach by performing several experiments to check the robustness and generalization capabilities of the models. Finally, we test the models on a real case using a 3D scan of artifacts preserved in different museums, artificially fragmented, obtaining good performance.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"17 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135585228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Graph machine learning framework for depicting wavefunction on interface 用于在界面上描绘波函数的图形机器学习框架

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-11-02 DOI: 10.1088/2632-2153/ad0937

Sheng Chang, Ao Wu, Li Liu, Zifeng Wang, Shurong Pan, Jiangxue Huang, Qijun Huang, Jin He, Hao Wang

Abstract The wavefunction, as the basic hypothesis of quantum mechanics, describes the motion of particles and plays a pivotal role in determining physical properties at the atomic scale. However, its conventional acquisition method, such as density functional theory (DFT), requires a considerable amount of calculation, which brings numerous problems to wide application. Here, we propose an algorithmic framework based on graph neural network (GNN) to machine-learn the wavefunction of electrons. This framework primarily generates atomic features containing information about chemical environment and geometric structure and subsequently constructs a scalable distribution map. For the first time, the visualization of wavefunction of interface is realized by machine learning (ML) methods, bypassing complex calculation and obscure comprehension. In this way, we vividly illustrate quantum mechanics, which can inspire theoretical exploration. As an intriguing case to verify the ability of our method, a novel quantum confinement phenomenon on interfaces based on graphene nanoribbon (GNR) is uncovered. We believe that the versatility of this framework paves the way for swiftly linking quantum physics and atom-level structures.

波函数作为量子力学的基本假设，描述了粒子的运动，在决定原子尺度上的物理性质方面起着举足轻重的作用。然而，传统的获取方法，如密度泛函理论(DFT)，需要大量的计算，这给广泛应用带来了许多问题。在此，我们提出了一个基于图神经网络(GNN)的算法框架来机器学习电子波函数。该框架主要生成包含化学环境和几何结构信息的原子特征，然后构建可伸缩的分布图。首次利用机器学习方法实现了界面波函数的可视化，省去了复杂的计算和晦涩的理解。这样，我们生动地说明了量子力学，可以激发理论探索。作为验证我们方法能力的一个有趣案例，在基于石墨烯纳米带(GNR)的界面上发现了一种新的量子约束现象。我们相信，这个框架的多功能性为快速连接量子物理和原子级结构铺平了道路。

{"title":"Graph machine learning framework for depicting wavefunction on interface","authors":"Sheng Chang, Ao Wu, Li Liu, Zifeng Wang, Shurong Pan, Jiangxue Huang, Qijun Huang, Jin He, Hao Wang","doi":"10.1088/2632-2153/ad0937","DOIUrl":"https://doi.org/10.1088/2632-2153/ad0937","url":null,"abstract":"Abstract The wavefunction, as the basic hypothesis of quantum mechanics, describes the motion of particles and plays a pivotal role in determining physical properties at the atomic scale. However, its conventional acquisition method, such as density functional theory (DFT), requires a considerable amount of calculation, which brings numerous problems to wide application. Here, we propose an algorithmic framework based on graph neural network (GNN) to machine-learn the wavefunction of electrons. This framework primarily generates atomic features containing information about chemical environment and geometric structure and subsequently constructs a scalable distribution map. For the first time, the visualization of wavefunction of interface is realized by machine learning (ML) methods, bypassing complex calculation and obscure comprehension. In this way, we vividly illustrate quantum mechanics, which can inspire theoretical exploration. As an intriguing case to verify the ability of our method, a novel quantum confinement phenomenon on interfaces based on graphene nanoribbon (GNR) is uncovered. We believe that the versatility of this framework paves the way for swiftly linking quantum physics and atom-level structures.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"25 15","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135875561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fast Neural Network Inference on FPGAs for Triggering on Long-Lived Particles at Colliders 基于fpga的长寿命粒子触发快速神经网络推理

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-10-31 DOI: 10.1088/2632-2153/ad087a

Andrea Coccaro, Francesco Armando Di Bello, Stefano Giagu, Lucrezia Rambelli, Nicola Stocchetti

Abstract Experimental particle physics demands a sophisticated trigger and acquisition system capable to eﬃciently retain the collisions of interest for further investigation. Heterogeneous computing with the employment of FPGA cards may emerge as a trending technology for the triggering strategy of the upcoming high-luminosity program of the Large Hadron Collider at CERN. In this context, we present two machine-learning algorithms for selecting events where neutral long-lived particles decay within the detector volume studying their accuracy and inference time when accelerated on commercially available Xilinx FPGA accelerator cards. The inference time is also confronted with a CPU- and GPU-based hardware setup. The proposed new algorithms are proven eﬃcient for the considered benchmark physics scenario and their accuracy is found to not degrade when accelerated on the FPGA cards. The results indicate that all tested architectures ﬁt within the latency requirements of a second-level trigger farm and that exploiting accelerator technologies for realtime processing of particle-physics collisions is a promising research ﬁeld that deserves additional investigations, in particular with machine-learning models with a large number of trainable parameters.

实验粒子物理学需要一个复杂的触发和采集系统，能够有效地保留感兴趣的碰撞，以便进一步研究。采用FPGA卡的异构计算可能会成为欧洲核子研究中心(CERN)即将启动的大型强子对撞机(Large Hadron Collider)高亮度项目触发策略的趋势技术。在这种情况下，我们提出了两种机器学习算法，用于选择中性长寿命粒子在探测器体积内衰变的事件，研究它们在商用Xilinx FPGA加速卡上加速时的准确性和推理时间。推理时间还面临着基于CPU和gpu的硬件设置。所提出的新算法在考虑的基准物理场景中被证明是有效的，并且在FPGA卡上加速时发现其准确性不会降低。结果表明，所有测试的架构都符合二级触发场的延迟要求，并且利用加速器技术实时处理粒子物理碰撞是一个有前途的研究领域，值得进一步研究，特别是具有大量可训练参数的机器学习模型。

{"title":"Fast Neural Network Inference on FPGAs for Triggering on Long-Lived Particles at Colliders","authors":"Andrea Coccaro, Francesco Armando Di Bello, Stefano Giagu, Lucrezia Rambelli, Nicola Stocchetti","doi":"10.1088/2632-2153/ad087a","DOIUrl":"https://doi.org/10.1088/2632-2153/ad087a","url":null,"abstract":"Abstract Experimental particle physics demands a sophisticated trigger and acquisition system capable to eﬃciently retain the collisions of interest for further investigation. Heterogeneous computing with the employment of FPGA cards may emerge as a trending technology for the triggering strategy of the upcoming high-luminosity program of the Large Hadron Collider at CERN. In this context, we present two machine-learning algorithms for selecting events where neutral long-lived particles decay within the detector volume studying their accuracy and inference time when accelerated on commercially available Xilinx FPGA accelerator cards. The inference time is also confronted with a CPU- and GPU-based hardware setup. The proposed new algorithms are proven eﬃcient for the considered benchmark physics scenario and their accuracy is found to not degrade when accelerated on the FPGA cards. The results indicate that all tested architectures ﬁt within the latency requirements of a second-level trigger farm and that exploiting accelerator technologies for realtime processing of particle-physics collisions is a promising research ﬁeld that deserves additional investigations, in particular with machine-learning models with a large number of trainable parameters.&#xD;","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135808988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Estimating truncation effects of quantum bosonic systems using sampling algorithms 用抽样算法估计量子玻色子系统的截断效应

2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology

Pub Date : 2023-10-30 DOI: 10.1088/2632-2153/ad035c

Masanori Hanada, Junyu Liu, Enrico Rinaldi, Masaki Tezuka

Abstract To simulate bosons on a qubit- or qudit-based quantum computer, one has to regularize the theory by truncating infinite-dimensional local Hilbert spaces to finite dimensions. In the search for practical quantum applications, it is important to know how big the truncation errors can be. In general, it is not easy to estimate errors unless we have a good quantum computer. In this paper, we show that traditional sampling methods on classical devices, specifically Markov Chain Monte Carlo, can address this issue for a rather generic class of bosonic systems with a reasonable amount of computational resources available today. As a demonstration, we apply this idea to the scalar field theory on a two-dimensional lattice, with a size that goes beyond what is achievable using exact diagonalization methods. This method can be used to estimate the resources needed for realistic quantum simulations of bosonic theories, and also, to check the validity of the results of the corresponding quantum simulations.

为了在基于量子位或量子位的量子计算机上模拟玻色子，必须通过将无限维局部希尔伯特空间截断为有限维来正则化理论。在寻找实际的量子应用时，知道截断误差有多大是很重要的。一般来说，除非我们有一台好的量子计算机，否则估计误差并不容易。在本文中，我们表明传统的采样方法在经典设备上，特别是马尔可夫链蒙特卡罗，可以解决这个问题，对于一个相当一般的一类玻色子系统，具有合理数量的可用计算资源。作为演示，我们将这个想法应用于二维晶格上的标量场理论，其大小超出了使用精确对角化方法可以实现的范围。这种方法可以用来估计玻色子理论实际量子模拟所需的资源，也可以用来检验相应量子模拟结果的有效性。

引用次数: 0