Title: Koopman-inspired approach for identification of exogenous anomalies in nonstationary time-series data
Authors: Alex Mallen, C. Keller, J. Kutz
Pub Date: 2023-06-01 | DOI: 10.1088/2632-2153/acdd50
Abstract: In many scenarios, it is necessary to monitor a complex system via a time-series of observations and determine when anomalous exogenous events have occurred so that relevant actions can be taken. Determining whether current observations are abnormal is challenging: it requires learning an extrapolative probabilistic model of the dynamics from historical data and using a limited number of current observations to make a classification. We leverage recent advances in long-term probabilistic forecasting, namely Deep Probabilistic Koopman, to build a general method for classifying anomalies in multi-dimensional time-series data. We also show how to utilize models with domain knowledge of the dynamics to reduce type I and type II errors. We demonstrate our proposed method on the important real-world task of global atmospheric pollution monitoring, integrating it with NASA’s Global Earth Observing System Model. The system successfully detects localized anomalies in air quality due to events such as COVID-19 lockdowns and wildfires.
Title: MeGen - generation of gallium metal clusters using reinforcement learning
Authors: Rohit Modee, Ashwini Verma, Kavita Joshi, Deva Priyakumar
Pub Date: 2023-06-01 | DOI: 10.1088/2632-2153/acdc03
Abstract: The generation of low-energy 3D structures of metal clusters depends on the efficiency of the search algorithm and the accuracy of the inter-atomic interaction description. In this work, we formulate the search algorithm as a reinforcement learning (RL) problem. Concisely, we propose a novel actor-critic architecture that generates low-lying isomers of metal clusters at a fraction of the computational cost of conventional methods. Our RL-based search algorithm uses a previously developed DART model as a reward function to describe the inter-atomic interactions and validate predicted structures. Using the DART model as a reward function incentivizes the RL model to generate low-energy structures and helps it generate valid structures. We demonstrate the advantages of our approach over conventional methods for scanning local minima on the potential energy surface. Our approach not only generates isomers of gallium clusters at minimal computational cost but also predicts isomer families that were not discovered through previous density-functional theory (DFT)-based approaches.
Title: Deep learning model with L1 penalty for predicting breast cancer metastasis using gene expression data
Authors: Jaeyoon Kim, Minhyeok Lee, Junhee Seok
Pub Date: 2023-06-01 | DOI: 10.1088/2632-2153/acd987
Abstract: Breast cancer has the highest incidence and death rate among women; moreover, its metastasis to other organs increases the mortality rate. Since several studies have reported gene expression and cancer prognosis to be related, the study of breast cancer metastasis using gene expression is crucial. To this end, a novel deep neural network architecture, the deep learning-based cancer metastasis estimator (DeepCME), is proposed in this paper for predicting breast cancer metastasis. However, overfitting occurs frequently when training deep learning models on gene expression data, because the data contain a large number of genes while the sample size is rather small. To address overfitting, several regularization methods are implemented, such as an L1 penalty, batch normalization, and dropout. To demonstrate the superior performance of our model, area under the curve (AUC) scores are evaluated and compared with five baseline models: logistic regression, support vector classifier (SVC), random forest, decision tree, and k-nearest neighbor. DeepCME achieves the highest average AUC scores in most cross-validation cases, and its average AUC score of 0.754 is approximately 12.9% higher than that of SVC, the second-best model. In addition, the 30 most significant genes related to breast cancer metastasis are identified based on the DeepCME results, and some are discussed in further detail in light of reports from previous medical studies. Considering the high expense involved in measuring the expression of a single gene, the ability to develop cost-effective and time-efficient tests using only a few key genes is valuable. Based on this study, we expect DeepCME to be utilized clinically for predicting breast cancer metastasis and, after further research, to be applied to other types of cancer as well.
Title: Shape sensing of optical fiber Bragg gratings based on deep learning
Authors: Samaneh Manavi Roodsari, Antal Huck-Horváth, Sara Freund, A. Zam, G. Rauter, W. Schade, P. Cattin
Pub Date: 2023-05-30 | DOI: 10.1088/2632-2153/acda10
Abstract: Continuum robots in robot-assisted minimally invasive surgeries provide adequate access to target anatomies that are not directly reachable through small incisions. Achieving precise and reliable shape estimation of such snake-like manipulators necessitates an accurate navigation system that requires no line-of-sight and is immune to electromagnetic noise. Fiber Bragg grating (FBG) shape sensing, particularly eccentric FBG (eFBG) sensing, is a promising and cost-effective solution for this task. However, in eFBG sensors, the spectral intensity of the Bragg wavelengths that carries the strain information can be affected by undesired bending-induced phenomena, making standard characterization techniques less suitable for these sensors. We showed in our previous work that a deep learning model has the potential to extract the strain information from the eFBG sensor’s spectrum and accurately predict its shape. In this paper, we conducted a more thorough investigation to find a suitable architectural design of the deep learning model to further increase shape prediction accuracy. We used the Hyperband algorithm to search for optimal hyperparameters in two steps. First, we limited the search space to layer settings of the network, from which the best-performing configuration was selected. Then, we modified the search space for tuning the training and loss calculation hyperparameters. We also analyzed various data transformations on the network’s input and output variables, as data rescaling can directly influence the model’s performance. Additionally, we performed discriminative training using a Siamese network architecture that employs two convolutional neural networks (CNNs) with identical parameters to learn similarity metrics between the spectra of similar target values. The best-performing network architecture among all evaluated configurations can predict the shape of a 30 cm long sensor with a median tip error of 3.11 mm over a curvature range of 1.4 m⁻¹ to 35.3 m⁻¹.
Title: Estimating Gibbs free energies via isobaric-isothermal flows
Authors: Peter Wirnsberger, Borja Ibarz, G. Papamakarios
Pub Date: 2023-05-22 | DOI: 10.1088/2632-2153/acefa8
Abstract: We present a machine-learning model based on normalizing flows that is trained to sample from the isobaric-isothermal ensemble. In our approach, we approximate the joint distribution of a fully-flexible triclinic simulation box and particle coordinates to achieve a desired internal pressure. This novel extension of flow-based sampling to the isobaric-isothermal ensemble yields direct estimates of Gibbs free energies. We test our NPT-flow on monatomic water in the cubic and hexagonal ice phases and find excellent agreement of Gibbs free energies and other observables compared with established baselines.
Title: scGMM-VGAE: a Gaussian mixture model-based variational graph autoencoder algorithm for clustering single-cell RNA-seq data
Authors: Eric W Lin, Boyuan Liu, L. Lac, Daryl L. X. Fung, C. Leung, P. Hu
Pub Date: 2023-05-22 | DOI: 10.1088/2632-2153/acd7c3
Abstract: Cell type identification using single-cell RNA sequencing (scRNA-seq) data is critical for understanding disease mechanisms and drug discovery. Cell clustering analysis has been widely studied in health research for rare tumor cell detection. In this study, we propose a Gaussian mixture model-based variational graph autoencoder for scRNA-seq data (scGMM-VGAE) that integrates a statistical clustering model into a deep learning algorithm to significantly improve cell clustering performance. The model feeds a cell-cell graph adjacency matrix and a gene feature matrix into a variational graph autoencoder (VGAE) to generate latent data, which are then clustered by the Gaussian mixture model (GMM) module. To optimize the algorithm, a designed loss function is derived by combining parameter estimates from the GMM and the VGAE. We test the proposed method on four publicly available and three simulated datasets, which contain many biological and technical zeros. scGMM-VGAE outperforms four selected baseline methods on three evaluation metrics for cell clustering. By successfully incorporating a GMM into a deep learning VGAE on scRNA-seq data, the proposed method achieves higher accuracy in cell clustering; this improvement has a significant impact on detecting rare cell types in health research. All source code used in this study can be found at https://github.com/ericlin1230/scGMM-VGAE.
Title: Interpretable machine learning model to predict survival days of malignant brain tumor patients
Authors: Snehal Rajput, Rupal A. Kapdi, M. Raval, Mohendra Roy
Pub Date: 2023-05-15 | DOI: 10.1088/2632-2153/acd5a9
Abstract: An artificial intelligence (AI) model’s performance is strongly influenced by its input features; therefore, it is vital to find the optimal feature set. This is all the more crucial for survival prediction in glioblastoma multiforme (GBM), a type of brain tumor. In this study, we identify the best feature set for predicting the survival days (SD) of GBM patients, outranking current state-of-the-art methodologies. The proposed approach is an end-to-end AI model that first segments tumors from healthy brain tissue in patients’ MRI images, extracts features from the segmented results, performs feature selection, and predicts patients’ SD based on the selected features. The extracted features are primarily shape-based, location-based, and radiomics-based; patient metadata is also included as a feature. The selection methods include recursive feature elimination, permutation importance (PI), and correlation analysis between the features. We also examined feature behavior at the local (single sample) and global (all samples) levels. We find that, out of 1265 extracted features, only 29 dominant features play a crucial role in predicting patients’ SD. Among these 29 features, one is metadata (patient age), three are location-based, and the rest are radiomics features. Furthermore, we explain these features using post-hoc interpretability methods to validate the model’s predictions and understand its decisions. Finally, we analyzed the behavioral impact of the top six features on survival prediction, and the findings drawn from the explanations were coherent with the medical domain. We find that after the age of 50 years, the likelihood of survival deteriorates, and survival beyond 80 years is scarce. Likewise, for the location-based features, SD is shorter if the tumor is located in the central or back part of the brain. All of these trends derived from the developed AI model are consistent with medically established facts. The results show an overall 33% improvement in the accuracy of SD prediction compared with the top-performing methods of the BraTS-2020 challenge.
Title: Graph Neural Networks and 3-dimensional topology
Authors: Song Jin Ri, P. Putrov
Pub Date: 2023-05-10 | DOI: 10.1088/2632-2153/acf097
Abstract: We test the efficiency of applying geometric deep learning to problems in low-dimensional topology in a certain simple setting. Specifically, we consider the class of 3-manifolds described by plumbing graphs and use graph neural networks (GNNs) for the problem of deciding whether a pair of graphs gives homeomorphic 3-manifolds. We use supervised learning to train a GNN that answers this question with high accuracy. Moreover, we consider reinforcement learning by a GNN to find a sequence of Neumann moves that relates the pair of graphs if the answer is positive. The setting can be understood as a toy model of the problem of deciding whether a pair of Kirby diagrams gives diffeomorphic 3- or 4-manifolds.
Title: Towards a phenomenological understanding of neural networks: data
Authors: S. Tovey, S. Krippendorf, K. Nikolaou, Daniel Fink
Pub Date: 2023-05-01 | DOI: 10.1088/2632-2153/acf099
Abstract: A theory of neural networks (NNs) built upon collective variables would provide scientists with tools to better understand the learning process at every stage. In this work, we introduce two such variables: the entropy and the trace of the empirical neural tangent kernel (NTK) built on the training data passed to the model. We empirically analyze NN performance in terms of these variables and find that there exists a correlation between the starting entropy, the trace of the NTK, and the generalization of the model computed after training is complete. This framework is then applied to the problem of optimal data selection for the training of NNs. To this end, random network distillation (RND) is used as a means of selecting training data, which is then compared with random selection of data. It is shown that not only does RND select data-sets capable of outperforming random selection, but also that the collective variables associated with the RND data-sets are larger than those of the randomly selected sets. The results of this investigation provide a stable ground from which the selection of data for NN training can be driven by this phenomenological framework.
Title: Closed-loop control of a noisy qubit with reinforcement learning
Authors: Yongcheng Ding, Xi Chen, R. Magdalena-Benedito, J. Martín-Guerrero
Pub Date: 2023-04-25 | DOI: 10.1088/2632-2153/acd048
Abstract: The exotic nature of quantum mechanics differentiates machine learning applications in the quantum realm from classical ones. Stream learning is a powerful approach that can be applied to extract knowledge continuously from quantum systems in a wide range of tasks. In this paper, we propose a deep reinforcement learning method that uses streaming data from a continuously measured qubit in the presence of detuning, dephasing, and relaxation. The model receives streaming quantum information for learning and decision-making, providing instant feedback on the quantum system. We also explore the agent’s adaptability to other quantum noise patterns through transfer learning. Our protocol offers insights into closed-loop quantum control, potentially advancing the development of quantum technologies.