Pub Date: 2024-01-10 | DOI: 10.1088/2632-2153/ad1d05
Jizhen Cai, Clotilde Boust, Alamin Mansouri
Recently, the Convolutional Neural Network (CNN) has gained increasing importance in hyperspectral image classification thanks to its superior performance. However, most previous research has focused on 2D-CNN, and applications of 3D-CNN remain limited due to its complexity, despite its potential to enhance information extraction between adjacent channels of the image. Moreover, 1D-CNN is typically restricted to signal processing, as it ignores the spatial information of hyperspectral images. In this paper, we propose a novel CNN model named ATSFCNN (Attention-based Triple-Stream Fused Convolutional Neural Network) that fuses the features of 1D-CNN, 2D-CNN, and 3D-CNN to consider all the relevant information of the hyperspectral dataset. Our contributions are twofold: first, we propose a strategy to extract and homogenize features from the 1D, 2D, and 3D CNNs; second, we propose a way to fuse these features efficiently. The attention-based fusion integrates features from the three streams, overcoming the limitations of any single stream and yielding higher accuracy and stability in hyperspectral classification. We compared ATSFCNN with other deep learning models, including 1D-CNN, 2D-CNN, 2D-CNN+PCA, 3D-CNN, and 3D-CNN+PCA, and demonstrated its superior performance and robustness. Quantitative evaluation on Overall Accuracy (OA), Average Accuracy (AA), and Kappa Coefficient (κ) confirms the advantage of ATSFCNN: across three remote sensing datasets it achieves the highest Overall Accuracy, at 98.38%, 97.09%, and 96.93% respectively, with corresponding Average Accuracy scores of 98.47%, 95.80%, and 95.80%, and Kappa Coefficient values of 97.41%, 96.14%, and 95.21%.
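The attention-based fusion of the three homogenized feature streams can be illustrated with a minimal numpy sketch. This is not the authors' exact architecture: the score vector `w` is a hypothetical stand-in for the learned attention parameters, and the feature vectors stand in for the homogenized 1D/2D/3D-CNN outputs.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def attention_fuse(f1d, f2d, f3d, w):
    """Weight three homogenized feature vectors by softmax attention
    scores and sum them into one fused representation."""
    streams = np.stack([f1d, f2d, f3d])  # (3, d): one row per CNN stream
    alpha = softmax(streams @ w)         # (3,): attention weights, sum to 1
    return alpha @ streams, alpha        # (d,): attention-weighted fusion

rng = np.random.default_rng(0)
d = 8
f1, f2, f3 = rng.normal(size=(3, d))   # stand-ins for 1D/2D/3D-CNN features
w = rng.normal(size=d)                 # stand-in for learned attention params
fused, alpha = attention_fuse(f1, f2, f3, w)
```

In the full model the attention parameters are learned jointly with the streams; the sketch only shows how softmax scores turn three feature vectors into a single fused one.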
Title: "ATSFCNN: A Novel Attention-based Triple-Stream Fused CNN Model for Hyperspectral Image Classification" (Machine Learning: Science and Technology)
Pub Date: 2024-01-10 | DOI: 10.1088/2632-2153/ad1d06
Yu Zhao, Xiaoyan Shen, Jiadong Chen, Wei Qian, He Ma, Liang Sang
Purpose: Medical image segmentation is essential to image-based disease analysis and has proven significantly helpful for doctors' decision-making. Due to the low contrast of some medical images, accurate segmentation has always been a challenging problem. Our experiments found that UNet with current loss functions cannot capture subtle information in target contours or regions in low-contrast medical images, which is crucial for subsequent disease diagnosis. Methods: We propose a robust loss that incorporates the differences in average radial derivative (ARD), contour length, and region area to help the network achieve more accurate segmentation results. We evaluated the proposed loss function, using UNet as the base segmentation network, against five conventional loss functions on one private and four public medical image datasets. Results: Experimental results show that UNet with the proposed loss function achieves the best segmentation performance, better even than outstanding deep learning models with their original loss functions. Furthermore, three representative datasets were chosen to validate the effectiveness of the proposed δARD loss function with seven different models. Conclusion: The experiments revealed the δARD loss's plug-and-play property and its robustness across multiple models and datasets.
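The structure of such a composite loss can be sketched for binary masks. The exact δARD formulation is not reproduced here, so the shape term is left as a pluggable placeholder, and the length and area terms use crude pixel counts; all weights are invented.

```python
import numpy as np

def area(mask):
    return mask.sum()

def length(mask):
    # crude contour length: count 4-connected boundary transitions
    vertical = np.diff(mask, axis=0) != 0
    horizontal = np.diff(mask, axis=1) != 0
    return vertical.sum() + horizontal.sum()

def composite_loss(pred, gt, w_shape=1.0, w_len=0.1, w_area=0.1, shape_term=None):
    """Composite segmentation loss in the spirit of the paper: a shape
    descriptor difference (placeholder for the ARD term, whose exact
    form is not reproduced here) plus length and area differences."""
    shape_diff = 0.0 if shape_term is None else shape_term(pred, gt)
    return (w_shape * shape_diff
            + w_len * abs(length(pred) - length(gt))
            + w_area * abs(area(pred) - area(gt)))

gt = np.zeros((16, 16), dtype=int); gt[4:12, 4:12] = 1
pred = np.zeros_like(gt); pred[4:12, 4:10] = 1  # under-segmented on the right
```

A matching prediction incurs zero penalty from the length and area terms, while the under-segmented mask is penalized by both, which is the mechanism the paper uses to pull the network toward contour-faithful outputs.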
Title: "δARD loss for low-contrast medical image segmentation"
Pub Date: 2024-01-08 | DOI: 10.1088/2632-2153/ad1c32
Chibuike Chiedozie Ibebuchi
Understanding the spatial patterns of the North Atlantic Oscillation (NAO) is vital for climate science. For this reason, empirical orthogonal function (EOF) analysis is commonly applied to sea-level pressure (SLP) anomaly data in the North Atlantic region. This study evaluated the traditional EOF-based definition of the NAO index against the Autoencoder (AE) neural network-based definition, using the Hurrell NAO Index (Station-Based) as a reference. Specifically, EOF and AE were applied to monthly SLP anomaly data from ERA5 (1950-2022) to derive spatial modes of variability in the North Atlantic region. Both methods produced spatial patterns consistent with the traditional NAO definition, with dipole centers of action between the Icelandic Low and the Azores High. During boreal winter (December to March), when the NAO is most active, the AE-based method achieved a correlation of 0.96 with the reference NAO index, outperforming the EOF-based method's correlation of 0.90. The all-season Adjusted R-squared values were 50% for the AE-based index and 34% for the EOF-based index. Notably, the AE-based index revealed several other non-linear patterns of the NAO, with more than one encoded pattern correlating at least 0.90 with the reference NAO index during boreal winter. These results not only demonstrate the AE's superiority over traditional EOF in representing the station-based index but also uncover previously unexplored complexities in the NAO that are close to the reference temporal pattern. This suggests that AE offers a promising approach for defining climate modes of variability, potentially capturing intricacies that traditional linear methods like EOF might miss.
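The EOF-based index in this comparison is the leading principal component of the anomaly field, which can be computed via SVD. A small synthetic sketch (with an invented rank-1 "NAO-like" signal) shows the idea; the AE variant would replace this linear projection with a learned non-linear encoder.

```python
import numpy as np

def eof_index(slp_anom):
    """First EOF mode of an (n_time, n_grid) anomaly matrix via SVD:
    vt[0] is the leading spatial pattern (EOF1) and u[:, 0] * s[0]
    is its time series (PC1), which serves as the EOF-based index."""
    u, s, vt = np.linalg.svd(slp_anom, full_matrices=False)
    return u[:, 0] * s[0], vt[0]

rng = np.random.default_rng(1)
t = rng.normal(size=200)                     # synthetic index time series
pattern_true = rng.normal(size=50)           # synthetic dipole-like pattern
X = np.outer(t, pattern_true) + 0.1 * rng.normal(size=(200, 50))
idx, pat = eof_index(X)
r = abs(np.corrcoef(idx, t)[0, 1])           # recovery of the planted signal
```

The absolute correlation is used because EOFs are sign-indeterminate; in practice the sign is fixed by convention (e.g. positive loading over the Azores High).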
Title: "Redefining the North Atlantic Oscillation Index Generation using Autoencoder Neural Network"
Pub Date: 2024-01-08 | DOI: 10.1088/2632-2153/ad1c30
Pragyan Banerjee, Shivam Milind Akarte, Prakhar Kumar, Muhammad Shamsuzzaman, Ankit Butola, Krishna Agarwal, Dilip Kumar Prasad, F. Melandsø, A. Habib
Acoustic microscopy is a label-free imaging technology that reveals the surface and interior structure of industrial and biological materials. The acoustic image is created by focusing high-frequency acoustic waves on the object and then detecting the reflected signals. However, the resolution of the acoustic image is limited by the signal-to-noise ratio, the scanning step size, and the frequency of the transducer. In this paper, deep learning-based high-resolution imaging in acoustic microscopy is proposed. To demonstrate a 4× resolution improvement in acoustic images, five distinct models are used: SRGAN, ESRGAN, IMDN, DBPN-RES-MR64-3, and SwinIR. The trained models' performance is assessed by calculating the PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity Index) between the network-predicted and ground-truth images. To prevent over-fitting, transfer learning was incorporated during training. SwinIR achieved average SSIM and PSNR values of 0.95 and 35, respectively. The model was also evaluated on a biological sample (reindeer antler), yielding an SSIM of 0.88 and a PSNR of 32.93. Our framework is relevant to a wide range of industrial applications, including electronics production and material microstructure analysis, as well as biological applications in general.
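PSNR, one of the two reported metrics, has a simple closed form, 10·log10(MAX²/MSE); a minimal sketch (the images and error magnitude below are invented):

```python
import numpy as np

def psnr(pred, gt, data_range=255.0):
    """Peak signal-to-noise ratio between a network output and the
    ground-truth image: 10 * log10(data_range^2 / MSE), in dB."""
    mse = np.mean((pred.astype(float) - gt.astype(float)) ** 2)
    if mse == 0:
        return float("inf")     # identical images
    return 10.0 * np.log10(data_range ** 2 / mse)

gt = np.full((8, 8), 100.0)
noisy = gt + 5.0                # uniform error of 5 gray levels, MSE = 25
val = psnr(noisy, gt)           # 10*log10(255^2 / 25) ≈ 34.15 dB
```

SSIM, the other metric, is structural rather than pixel-wise and is usually taken from a library implementation (e.g. scikit-image) rather than re-derived.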
Title: "High-resolution imaging in acoustic microscopy using deep learning"
Pub Date: 2024-01-08 | DOI: 10.1088/2632-2153/ad1c34
Chibuike Chiedozie Ibebuchi, O. Obarein, Itohan-Osa Abu
Spatial regionalization is instrumental in simplifying the spatial complexity of the climate system. To identify regions of significant climate variability, pattern extraction is often required prior to spatial regionalization with a clustering algorithm. In this study, an autoencoder artificial neural network (AE) was applied to extract the inherent patterns of global temperature data (from 1901 to 2021). Subsequently, Fuzzy C-means clustering was applied to the extracted patterns to classify the global temperature regions. Our analysis compared AE-based and principal component analysis (PCA)-based clustering results to assess consistency. We determined the number of clusters by examining the average percentage decrease in the Fuzzy Partition Coefficient and its 95% confidence interval, seeking a balance between a high Fuzzy Partition Coefficient and avoiding over-segmentation. This approach suggested that, for a more general model, four clusters are reasonable. The Adjusted Rand Index between the AE-based and PCA-based clusters is 0.75, indicating considerable overlap. The observed differences are likely associated with AE's capability to learn and extract complex non-linear patterns; this attribute, for example, enabled the clustering algorithm to accurately detect the Himalayas region as the “third pole” with temperature characteristics similar to the polar regions. Finally, when the analysis period is divided in two (1901-1960 and 1961-2021), the Adjusted Rand Index between the two periods' clusters is 0.96, which suggests that historical climate change has not significantly altered the defined temperature regions. In essence, this study both demonstrates AE's potential to enhance our understanding of climate variability and reveals the stability of the historical temperature regions.
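The Fuzzy Partition Coefficient used here to pick the number of clusters has a simple closed form; a sketch on toy membership matrices (the matrices are invented for illustration):

```python
import numpy as np

def fuzzy_partition_coefficient(U):
    """FPC = (1/N) * sum_ij u_ij^2 for an (N, c) fuzzy membership
    matrix U: 1.0 means perfectly crisp clusters, 1/c means the
    partition is maximally fuzzy."""
    n = U.shape[0]
    return float((U ** 2).sum() / n)

crisp = np.eye(4)[[0, 1, 2, 3, 0, 1]]   # 6 points, each fully in one of 4 clusters
fuzzy = np.full((6, 4), 0.25)           # 6 points, maximally ambiguous memberships
```

Scanning candidate cluster counts and watching how sharply the FPC drops (as the study does, with a confidence interval on the average percentage decrease) is what trades off partition quality against over-segmentation.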
Title: "Application of Autoencoders Artificial Neural Network and Principal Component Analysis for Pattern Extraction and Spatial Regionalization of Global Temperature Data"
Pub Date: 2024-01-08 | DOI: 10.1088/2632-2153/ad1c31
M. Pathan, Suresh M Pradhan, T. P. Selvam, Balvinder Kaur Sapra
In today's data-driven era, machine learning holds a prominent position across many fields. The ability of machine learning algorithms to resolve complex patterns and extract insights from large datasets has solidified their transformative potential in various scientific domains. This paper introduces an innovative application of machine learning in radiation dosimetry: estimating the radiation dose received by occupational workers. The estimate is expressed as personal dose equivalent and relies on thermoluminescence signals emitted by CaSO4:Dy-based personnel monitoring badges. To estimate personal dose equivalent, a three-stage algorithm driven by machine learning models is proposed. The algorithm systematically identifies the photon energy range, calculates the average photon energy, and determines the personal dose equivalent. By applying this approach to the conventional three-element dosimeter, the study overcomes existing limitations and improves the accuracy of dose estimation. The algorithm demonstrates 97.8% classification accuracy in discerning photon energy ranges and achieves a coefficient of determination of 0.988 for estimating average photon energy. Importantly, it also reduces the coefficient of variation of relative deviations by up to 6% for the estimated personal dose equivalent compared to existing algorithms. The study improves accuracy and establishes a new methodology for evaluating radiation exposure of occupational workers using a conventional thermoluminescent dosimeter badge.
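The three-stage flow can be sketched as a pipeline. Every model, threshold, and constant below is an invented placeholder for the trained components; the paper's actual classifier, regressors, and calibration are not reproduced here.

```python
# Hypothetical three-stage structure: stage 1 classifies the photon-energy
# range from ratios of the three TL element signals, stage 2 regresses the
# average photon energy within that range, stage 3 maps signal + energy to
# personal dose equivalent. All numbers are illustrative placeholders.
def classify_energy_range(r12, r13):
    # toy threshold rule standing in for the trained stage-1 classifier
    if r12 > 1.5:
        return "low"
    return "high" if r13 < 0.8 else "medium"

RANGE_MODELS = {  # per-range (slope, intercept) regressors, purely illustrative
    "low":    (40.0, 10.0),
    "medium": (120.0, 30.0),
    "high":   (400.0, 80.0),
}

def estimate_dose(signal, r12, r13, calib=0.01):
    """Run the three stages and return (range label, avg energy, Hp(10))."""
    label = classify_energy_range(r12, r13)                  # stage 1
    slope, intercept = RANGE_MODELS[label]
    avg_energy_kev = slope * r12 + intercept                 # stage 2
    hp10 = calib * signal * (100.0 / avg_energy_kev)         # stage 3
    return label, avg_energy_kev, hp10

label, energy, dose = estimate_dose(900.0, 2.0, 0.5)
```

The point of the staged design is that the energy-range decision conditions the downstream regression, which is what lets a single badge cover photon fields of very different energies.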
Title: "A Multi-Stage Machine Learning Algorithm for Estimating Personal Dose Equivalent using Thermoluminescent Dosimeter"
Pub Date: 2024-01-08 | DOI: 10.1088/2632-2153/ad1c33
Jens Grauer, F. J. Schwarzendahl, Hartmut Löwen, B. Liebchen
Bacteria and other self-propelling microorganisms produce and respond to signaling molecules to communicate with each other (quorum sensing) and to direct their collective behavior. Here, we explore agents (active particles) which communicate with each other to coordinate their collective dynamics to optimize nutrient consumption. Using reinforcement learning and neural networks, we identify three different strategies: a "clustering strategy", where the agents accumulate in regions of high nutrient concentration; a "spreading strategy", where particles stay away from each other to avoid competing for sparse resources; and an "adaptive strategy", where the agents adaptively decide to either follow or stay away from others. Our work exemplifies the idea that machine learning can be used to determine parameters that are evolutionarily optimized in biological systems but often occur as unknown parameters in mathematical models describing their dynamics.
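A toy version of the learned follow-vs-spread decision can be written as tabular reinforcement learning. The states, actions, and rewards below are invented for illustration and are far simpler than the paper's many-particle setting; they only show how a reward signal tied to nutrient availability selects the "adaptive" behavior.

```python
import numpy as np

# Minimal tabular sketch: in a local state ("nutrient-rich" vs
# "nutrient-sparse") an agent learns whether to FOLLOW others or SPREAD
# out. Rewards are invented: following pays off where nutrients are
# dense, spreading where they are sparse.
rng = np.random.default_rng(0)
STATES, ACTIONS = ["rich", "sparse"], ["follow", "spread"]
REWARD = {("rich", "follow"): 1.0, ("rich", "spread"): 0.2,
          ("sparse", "follow"): 0.1, ("sparse", "spread"): 0.8}
Q = np.zeros((2, 2))
alpha, eps = 0.1, 0.2
for _ in range(5000):
    s = rng.integers(2)                                   # random local state
    a = rng.integers(2) if rng.random() < eps else int(Q[s].argmax())
    r = REWARD[(STATES[s], ACTIONS[a])]
    Q[s, a] += alpha * (r - Q[s, a])                      # bandit-style update
policy = {STATES[s]: ACTIONS[int(Q[s].argmax())] for s in range(2)}
```

The learned policy switches behavior with the local nutrient state, which is the essence of the "adaptive strategy" the paper identifies.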
Title: "Optimizing collective behavior of communicating active particles with machine learning"
Sparse-view computed tomography (SVCT) is regarded as a promising technique to accelerate data acquisition and reduce radiation dose. However, in the presence of metallic implants, the lack of sufficient projection data inevitably leaves SVCT reconstructions with severe metal artifacts and streaking artifacts. Previous attempts to solve the simultaneous sparse-view and metal artifact reduction (SVMAR) problem with stand-alone SVCT or metal artifact reduction (MAR) methods suffer from insufficient correction accuracy. To overcome this limitation, we propose a multi-domain deep unrolling network, called Mud-Net, for SVMAR. Specifically, we establish a deep unrolling reconstruction model spanning the sinogram, image, artifact, and coding domains to recover high-quality CT images from under-sampled sinograms corrupted by metallic implants. To train this multi-domain network effectively, we embed multi-domain knowledge into the training process. Comprehensive experiments demonstrate that our method is superior both to existing MAR methods on the full-view MAR task and to previous SVCT methods on the SVMAR task.
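The "deep unrolling" idea behind such networks can be sketched on a toy linear problem: each unrolled layer alternates a data-consistency gradient step with a prior step (here an identity placeholder where the learned image/artifact/coding-domain modules would sit). This is a generic sketch of unrolled reconstruction, not the Mud-Net architecture, and the toy operator below stands in for CT projection.

```python
import numpy as np

def unrolled_recon(A, y, n_iters=500, step=None, denoise=lambda x: x):
    """Unrolled reconstruction sketch: each 'layer' does a gradient step
    on the sinogram-consistency term ||Ax - y||^2 followed by a prior
    step (placeholder for the learned multi-domain modules)."""
    if step is None:
        step = 1.0 / np.linalg.norm(A, 2) ** 2   # safe fixed step size
    x = np.zeros(A.shape[1])
    for _ in range(n_iters):
        x = x - step * A.T @ (A @ x - y)          # data consistency
        x = denoise(x)                            # learned prior would go here
    return x

rng = np.random.default_rng(2)
A = rng.normal(size=(30, 10))                     # toy "projection" operator
x_true = rng.normal(size=10)
y = A @ x_true                                    # toy clean measurements
x_hat = unrolled_recon(A, y)
err = np.linalg.norm(x_hat - x_true) / np.linalg.norm(x_true)
```

In a trained unrolled network the iteration count is fixed and small, and the step sizes and prior modules are learned end-to-end rather than hand-chosen as here.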
Title: "Mud-Net: Multi-domain deep unrolling network for simultaneous sparse-view and metal artifact reduction in computed tomography" (Baoshun Shi, Ke Jiang, Shaolei Zhang, Qiusheng Lian, Yanwei Qin, Yunsong Zhao; published 2024-01-05; DOI: 10.1088/2632-2153/ad1b8e)
Pub Date : 2024-01-03 DOI: 10.1088/2632-2153/ad1a50
Yifan Wang, Yang Li, Jiaqi Wang, Haofeng Lv, Jinshi Guo
During the tracking of moving targets in dynamic scenes, efficiently handling outliers in the optical flow while maintaining robustness across various motion amplitudes is a critical challenge. Existing studies have used thresholding and local-consistency-based approaches to deal with optical flow outliers, but expert-defined thresholds or delineated regions introduce subjectivity, so these methods do not perform consistently under different target motion amplitudes. Other studies have relied on complex statistical-mathematical modelling which, although theoretically sound, requires significant computational resources. To address these problems, this paper proposes a new method that identifies optical flow outliers using a stochastic neighborhood graph combined with the Borda count, reducing the amount of computation while eliminating outliers objectively. The sparse optical flow values are taken as the overall population, with the outlier and inlier values as samples. The method analyzes the dissimilarity between sparse optical flow data points to obtain a dissimilarity matrix, applies a Gaussian function to smooth and reduce the dimensionality of this matrix, and then normalizes the smoothed matrix into a binding matrix, in which the probabilities from each node to all other nodes sum to 1. Stochastic neighborhood graphs are then generated from the binding matrix to obtain the outlier probability of each data point in different neighborhood graphs, and outlier samples are selected according to these probabilities. To avoid the subjectivity of expert-defined thresholds, the outlier probabilities are weighted and ranked to compute a Borda score for each data point, yielding accurate optical flow outliers.
The experimental results show that the proposed method is robust to motions of different amplitudes and to real scenarios, and that its accuracy, precision, and recall for outlier elimination exceed those of current mainstream algorithms.
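The pipeline described above (dissimilarity matrix → Gaussian smoothing → row-stochastic "binding" matrix → sampled neighborhood graphs → Borda aggregation of per-graph outlier rankings) can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's exact algorithm: the functions `binding_matrix` and `borda_outlier_scores`, the use of in-degree as the per-graph outlier signal, and all parameter values are hypothetical choices made for the sketch.

```python
import numpy as np

def binding_matrix(points, sigma=1.0):
    # Pairwise squared dissimilarity -> Gaussian affinity -> row-stochastic
    # matrix whose rows each sum to 1 (the "binding matrix").
    d2 = ((points[:, None, :] - points[None, :, :]) ** 2).sum(-1)
    w = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(w, 0.0)               # no self-neighbours
    return w / w.sum(axis=1, keepdims=True)

def borda_outlier_scores(points, n_graphs=20, k=3, sigma=1.0, seed=0):
    """Sample several stochastic k-neighbour graphs from the binding
    matrix; in each graph, a point is more outlying the less often
    other points pick it as a neighbour (low in-degree). Per-graph
    rankings are aggregated with a Borda count."""
    rng = np.random.default_rng(seed)
    p = binding_matrix(points, sigma)
    n = len(points)
    borda = np.zeros(n)
    for _ in range(n_graphs):
        indeg = np.zeros(n)
        for i in range(n):
            # draw k neighbours of node i with probabilities p[i]
            nbrs = rng.choice(n, size=k, replace=False, p=p[i])
            indeg[nbrs] += 1
        ranks = np.argsort(np.argsort(indeg))  # 0 = least-chosen point
        borda += (n - 1 - ranks)               # higher score = more outlying
    return borda / n_graphs
```

A point far from every cluster receives near-zero affinity, is almost never sampled as a neighbour, and therefore accumulates the highest Borda score across graphs, so no hand-set threshold is needed to single it out.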
Sparse Optical Flow Outliers Elimination Method Based on Borda Stochastic Neighborhood Graph