Neural Computing and Applications最新文献_第6页

Radial basis function neural network training using variable projection and fuzzy means 利用变量投影和模糊手段进行径向基函数神经网络训练

Neural Computing and Applications

Pub Date : 2024-08-23 DOI: 10.1007/s00521-024-10274-3

Despina Karamichailidou, Georgios Gerolymatos, Panagiotis Patrinos, Haralambos Sarimveis, Alex Alexandridis

Radial basis function (RBF) neural network training presents a challenging optimization task, necessitating the utilization of advanced algorithms that can fully train the network so as to produce accurate and computationally efficient models. To achieve this goal, this work introduces a new framework where the original RBF training problem is divided into two simpler subproblems; the linear parameters, namely the network weights, are projected out of the problem using variable projection (VP), thus leaving a reduced functional, which depends only on nonlinear parameters, i.e., the RBF centers. The centers are updated using the Levenberg–Marquardt (LM) algorithm, while the optimal values of the synaptic weights are calculated in each iteration of the LM algorithm using linear regression. The proposed VP-LM scheme is coupled with the fuzzy means (FM) algorithm, which helps to select the number of RBF centers and enhances the overall search procedure, thus resulting to a framework that produces parsimonious models with enhanced accuracy in shorter training times. The proposed training scheme is evaluated on 12 both real-world and synthetic benchmark datasets and tested against various RBF training algorithms, as well as different neural network architectures. The experimental results underscore the effectiveness of the VP-FM algorithm in producing neural network models that outperform those generated by alternative methods in many aspects; to be more specific, the proposed approach achieves very competitive model accuracy, while resulting to smaller network sizes and thus lower complexity, which leads to shorter training times.

径向基函数（RBF）神经网络训练是一项极具挑战性的优化任务，需要利用先进的算法对网络进行全面训练，以生成精确且计算效率高的模型。为了实现这一目标，这项工作引入了一个新的框架，将原始的 RBF 训练问题分为两个更简单的子问题；使用变量投影（VP）将线性参数（即网络权重）投影到问题之外，从而留下一个仅取决于非线性参数（即 RBF 中心）的简化函数。使用 Levenberg-Marquardt (LM) 算法更新中心，而在 LM 算法的每次迭代中使用线性回归计算突触权重的最佳值。所提出的 VP-LM 方案与模糊手段（FM）算法相结合，有助于选择 RBF 中心的数量，并增强整体搜索程序，从而使该框架能在更短的训练时间内生成具有更高精度的简约模型。我们在 12 个真实世界和合成基准数据集上对所提出的训练方案进行了评估，并与各种 RBF 训练算法以及不同的神经网络架构进行了对比测试。实验结果凸显了 VP-FM 算法在生成神经网络模型方面的有效性，这些模型在很多方面都优于其他方法生成的模型；更具体地说，所提出的方法实现了极具竞争力的模型准确性，同时缩小了网络规模，从而降低了复杂性，缩短了训练时间。

{"title":"Radial basis function neural network training using variable projection and fuzzy means","authors":"Despina Karamichailidou, Georgios Gerolymatos, Panagiotis Patrinos, Haralambos Sarimveis, Alex Alexandridis","doi":"10.1007/s00521-024-10274-3","DOIUrl":"https://doi.org/10.1007/s00521-024-10274-3","url":null,"abstract":"Radial basis function (RBF) neural network training presents a challenging optimization task, necessitating the utilization of advanced algorithms that can fully train the network so as to produce accurate and computationally efficient models. To achieve this goal, this work introduces a new framework where the original RBF training problem is divided into two simpler subproblems; the linear parameters, namely the network weights, are projected out of the problem using variable projection (VP), thus leaving a reduced functional, which depends only on nonlinear parameters, i.e., the RBF centers. The centers are updated using the Levenberg–Marquardt (LM) algorithm, while the optimal values of the synaptic weights are calculated in each iteration of the LM algorithm using linear regression. The proposed VP-LM scheme is coupled with the fuzzy means (FM) algorithm, which helps to select the number of RBF centers and enhances the overall search procedure, thus resulting to a framework that produces parsimonious models with enhanced accuracy in shorter training times. The proposed training scheme is evaluated on 12 both real-world and synthetic benchmark datasets and tested against various RBF training algorithms, as well as different neural network architectures. The experimental results underscore the effectiveness of the VP-FM algorithm in producing neural network models that outperform those generated by alternative methods in many aspects; to be more specific, the proposed approach achieves very competitive model accuracy, while resulting to smaller network sizes and thus lower complexity, which leads to shorter training times.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"113 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Siamese neural network-based diagnosis of COVID-19 using chest X-rays 基于连体神经网络的 COVID-19 胸部 X 射线诊断方法

Neural Computing and Applications

Pub Date : 2024-08-23 DOI: 10.1007/s00521-024-10326-8

Engin Tas, Ayca Hatice Atli

Radiological findings play an essential and complementary role in diagnosing Covid-19, assessing its severity, and managing its patients. Artificial intelligence technology based on medical imaging, which has made exciting developments by being applied in many areas, has become an area of interest for the rapid and accurate detection of the disease in the fight against the Covid-19 pandemic. The main difficulty is the inability to obtain a large dataset size with quality and standard images that neural networks need to perform well. Aiming at this problem, this study proposes a Siamese neural network-based deep learning framework for accurate diagnostics of Covid-19 using chest X-ray (CXR) images. The pre-trained VGG16 architecture, based on the transfer learning approach, forms the backbone of the Siamese neural network. The outputs of the backbones are joined together by a merging layer, and then the output passes through a fully connected layer. Based on this structure, category-aware Siamese-based models are produced for each class. The predictions from the models are combined using a voting mechanism to reduce the possibility of misclassification and to make better decisions. The framework was evaluated using a publicly available dataset for the 4-class classification task for Covid-19 pneumonia, lung opacity, normal, and non-Covid-19 viral pneumonia images. The findings reveal the high discrimination ability of the framework, trained using only 10 images per class in less training time, achieving an average test accuracy of 92%. Our framework, which learns a single Siamese-based pairwise model for each class, effectively captures class-specific features. Additionally, it has the potential to deal with data scarcity and long training time problems in multi-class classification tasks.

放射学检查结果在诊断 Covid-19、评估其严重程度和管理患者方面发挥着重要的辅助作用。基于医学影像的人工智能技术已在许多领域得到应用，并取得了令人振奋的发展，在抗击 Covid-19 大流行的斗争中，该技术已成为快速准确检测疾病的一个关注领域。其主要困难在于无法获得神经网络需要的具有高质量和标准图像的大型数据集。针对这一问题，本研究提出了一种基于连体神经网络的深度学习框架，用于利用胸部 X 光（CXR）图像准确诊断 Covid-19。基于迁移学习方法的预训练 VGG16 架构构成了连体神经网络的骨干。骨干层的输出通过合并层连接在一起，然后输出通过全连接层。基于这种结构，可为每个类别生成基于连体神经网络的类别感知模型。模型的预测结果通过投票机制进行组合，以减少误分类的可能性并做出更好的决策。该框架利用公开数据集对 Covid-19 肺炎、肺不张、正常和非 Covid-19 病毒性肺炎图像的 4 类分类任务进行了评估。结果表明，该框架的分辨能力很强，每类只用 10 张图像进行训练，训练时间更短，平均测试准确率达到 92%。我们的框架为每个类别学习一个基于连体的配对模型，能有效捕捉特定类别的特征。此外，它还能解决多类分类任务中数据稀缺和训练时间长的问题。

{"title":"A Siamese neural network-based diagnosis of COVID-19 using chest X-rays","authors":"Engin Tas, Ayca Hatice Atli","doi":"10.1007/s00521-024-10326-8","DOIUrl":"https://doi.org/10.1007/s00521-024-10326-8","url":null,"abstract":"Radiological findings play an essential and complementary role in diagnosing Covid-19, assessing its severity, and managing its patients. Artificial intelligence technology based on medical imaging, which has made exciting developments by being applied in many areas, has become an area of interest for the rapid and accurate detection of the disease in the fight against the Covid-19 pandemic. The main difficulty is the inability to obtain a large dataset size with quality and standard images that neural networks need to perform well. Aiming at this problem, this study proposes a Siamese neural network-based deep learning framework for accurate diagnostics of Covid-19 using chest X-ray (CXR) images. The pre-trained VGG16 architecture, based on the transfer learning approach, forms the backbone of the Siamese neural network. The outputs of the backbones are joined together by a merging layer, and then the output passes through a fully connected layer. Based on this structure, category-aware Siamese-based models are produced for each class. The predictions from the models are combined using a voting mechanism to reduce the possibility of misclassification and to make better decisions. The framework was evaluated using a publicly available dataset for the 4-class classification task for Covid-19 pneumonia, lung opacity, normal, and non-Covid-19 viral pneumonia images. The findings reveal the high discrimination ability of the framework, trained using only 10 images per class in less training time, achieving an average test accuracy of 92%. Our framework, which learns a single Siamese-based pairwise model for each class, effectively captures class-specific features. Additionally, it has the potential to deal with data scarcity and long training time problems in multi-class classification tasks.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Predicting blood transfusions for coronary artery bypass graft patients using deep neural networks and synthetic data 利用深度神经网络和合成数据预测冠状动脉旁路移植患者的输血量

Neural Computing and Applications

Pub Date : 2024-08-23 DOI: 10.1007/s00521-024-10309-9

Hsiao-Tien Tsai, Jichong Wu, Puneet Gupta, Eric R. Heinz, Amir Jafari

Coronary Artery Bypass Graft (CABG) is a common cardiac surgery, but it continues to have many associated risks, including the need for blood transfusions. Previous research has shown that blood transfusion during CABG surgery is associated with an increased risk for infection and mortality. The current study aims to use modern techniques, such as deep neural networks and data synthesis, to develop models that can best predict the need for blood transfusion among CABG patients. Results show that neural networks with synthetic data generated by DataSynthesizer have the best performance. Implications of results and future directions are discussed.

冠状动脉旁路移植术（CABG）是一种常见的心脏手术，但它仍然存在许多相关风险，包括需要输血。以往的研究表明，冠状动脉旁路移植手术期间输血与感染和死亡率风险增加有关。目前的研究旨在利用深度神经网络和数据合成等现代技术，开发出最能预测 CABG 患者输血需求的模型。结果表明，使用 DataSynthesizer 生成的合成数据的神经网络性能最佳。本文讨论了结果的意义和未来的发展方向。

引用次数: 0

Development of a design tool for the horizontal stabilizer of a helicopter using artificial neural networks 利用人工神经网络开发直升机水平稳定器设计工具

Neural Computing and Applications

Pub Date : 2024-08-23 DOI: 10.1007/s00521-024-10204-3

Eren Duzcu, Bora Yıldırım

The design of a helicopter is an intricate and challenging process. Decisions made during the preliminary design phase can significantly impact subsequent design stages, making it crucial to base these decisions on a solid foundation. A range of methods, including hand calculations, finite element analyses, and experimental tests, can be employed to establish the conceptual design parameters. However, these methods often come with the drawbacks of being time-intensive and costly, especially when testing various structures during the early design phase. To address this issue, this study introduces an artificial neural network-based design tool to evaluate the static structural characteristics of a helicopter’s horizontal stabilizer. The tool was built in Python using the Keras library. The required database for the training of the artificial neural network model was established using finite element analyses of the horizontal stabilizer subjected to the aerodynamic load for diverse design variables. The model’s performance was evaluated, and the model’s outputs were compared to the results derived from the finite element analyses. Moreover, the Hammersley sampling methodology was employed to reduce the size of the database without compromising on accuracy. The study also assessed the impact of decreasing the amount of data fed into the network model.

直升机的设计是一个复杂而具有挑战性的过程。在初步设计阶段做出的决定会对后续设计阶段产生重大影响，因此将这些决定建立在坚实的基础上至关重要。可以采用包括手工计算、有限元分析和实验测试在内的一系列方法来确定概念设计参数。然而，这些方法往往存在耗时长、成本高的缺点，尤其是在早期设计阶段测试各种结构时。为解决这一问题，本研究引入了一种基于人工神经网络的设计工具，用于评估直升机水平安定面的静态结构特性。该工具使用 Keras 库在 Python 中构建。通过对水平稳定器在不同设计变量下的气动载荷进行有限元分析，建立了人工神经网络模型训练所需的数据库。对模型的性能进行了评估，并将模型的输出结果与有限元分析得出的结果进行了比较。此外，还采用了哈默斯利取样方法，在不影响精度的情况下减少了数据库的大小。研究还评估了减少输入网络模型的数据量的影响。

{"title":"Development of a design tool for the horizontal stabilizer of a helicopter using artificial neural networks","authors":"Eren Duzcu, Bora Yıldırım","doi":"10.1007/s00521-024-10204-3","DOIUrl":"https://doi.org/10.1007/s00521-024-10204-3","url":null,"abstract":"The design of a helicopter is an intricate and challenging process. Decisions made during the preliminary design phase can significantly impact subsequent design stages, making it crucial to base these decisions on a solid foundation. A range of methods, including hand calculations, finite element analyses, and experimental tests, can be employed to establish the conceptual design parameters. However, these methods often come with the drawbacks of being time-intensive and costly, especially when testing various structures during the early design phase. To address this issue, this study introduces an artificial neural network-based design tool to evaluate the static structural characteristics of a helicopter’s horizontal stabilizer. The tool was built in Python using the Keras library. The required database for the training of the artificial neural network model was established using finite element analyses of the horizontal stabilizer subjected to the aerodynamic load for diverse design variables. The model’s performance was evaluated, and the model’s outputs were compared to the results derived from the finite element analyses. Moreover, the Hammersley sampling methodology was employed to reduce the size of the database without compromising on accuracy. The study also assessed the impact of decreasing the amount of data fed into the network model.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"24 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A self-adaptive arithmetic optimization algorithm with hybrid search modes for 0–1 knapsack problem 针对 0-1 knapsack 问题的具有混合搜索模式的自适应算术优化算法

Neural Computing and Applications

Pub Date : 2024-08-23 DOI: 10.1007/s00521-024-10327-7

Mengdie Lu, Haiyan Lu, Xinyu Hou, Qingyuan Hu

Arithmetic optimization algorithm (AOA) is a recently proposed algorithm inspired by mathematical operations. It has been used to solve a variety of optimization problems due to its simplicity of parameters and ease of implementation. However, it has been found that AOA encounters challenges such as poor exploration and premature convergence. To solve these issues, this paper proposes a self-adaptive AOA with hybrid search modes, named AOAHSM. In this algorithm, two hybrid search modes, i.e., the parallel search mode and the serial search mode, are established by combining AOA and differential evolution (DE) in different ways to enhance the exploration and exploitation abilities, respectively. In the parallel search mode, AOA and DE independently implement on their respective subpopulations to maintain a high distribution of the population. In the serial search mode, DE is embedded into AOA to provide more diversified solutions and thereby help the population jump out of local optima. Then, a self-adaptive conversion strategy is employed to dynamically switch between the two modes so as to achieve a better balance between exploration and exploitation. Additionally, a Levy flight strategy is used to perturb and update the best solution obtained in each iteration to further prevent premature convergence. Lastly, a binary version of AOAHSM is proposed to tackle the 0–1 knapsack problem. The proposed algorithms are evaluated on CEC2019, CEC2020 test functions, two typical engineering design problems and 45 instances of the 0–1 knapsack problem and compared with a number of state-of-the-art meta-heuristic algorithms. The obtained results demonstrate that AOAHSM and its binary version not only significantly outperform the original AOA but also achieve superior performance to the comparison algorithms in most cases.

算术优化算法（AOA）是最近受数学运算启发而提出的一种算法。由于参数简单、易于实现，它已被用于解决各种优化问题。然而，人们发现算术优化算法面临着探索性差和过早收敛等挑战。为了解决这些问题，本文提出了一种具有混合搜索模式的自适应 AOA，命名为 AOAHSM。在该算法中，通过将 AOA 与差分进化（DE）以不同方式结合，建立了两种混合搜索模式，即并行搜索模式和串行搜索模式，以分别增强探索和利用能力。在并行搜索模式下，AOA 和 DE 分别在各自的子种群中独立运行，以保持种群的高度分布。在串行搜索模式中，DE 被嵌入到 AOA 中，以提供更多样化的解决方案，从而帮助种群跳出局部最优。然后，采用自适应转换策略在两种模式之间动态切换，以便在探索和开发之间取得更好的平衡。此外，Levy 飞行策略用于扰动和更新每次迭代中获得的最佳解决方案，以进一步防止过早收敛。最后，还提出了一种二进制版本的 AOAHSM 来解决 0-1 knapsack 问题。我们在 CEC2019、CEC2020 测试功能、两个典型工程设计问题和 45 个 0-1 knapsack 问题实例上对所提出的算法进行了评估，并与一些最先进的元启发式算法进行了比较。结果表明，AOAHSM 及其二进制版本不仅明显优于原始 AOA，而且在大多数情况下都比对比算法性能更优。

{"title":"A self-adaptive arithmetic optimization algorithm with hybrid search modes for 0–1 knapsack problem","authors":"Mengdie Lu, Haiyan Lu, Xinyu Hou, Qingyuan Hu","doi":"10.1007/s00521-024-10327-7","DOIUrl":"https://doi.org/10.1007/s00521-024-10327-7","url":null,"abstract":"Arithmetic optimization algorithm (AOA) is a recently proposed algorithm inspired by mathematical operations. It has been used to solve a variety of optimization problems due to its simplicity of parameters and ease of implementation. However, it has been found that AOA encounters challenges such as poor exploration and premature convergence. To solve these issues, this paper proposes a self-adaptive AOA with hybrid search modes, named AOAHSM. In this algorithm, two hybrid search modes, i.e., the parallel search mode and the serial search mode, are established by combining AOA and differential evolution (DE) in different ways to enhance the exploration and exploitation abilities, respectively. In the parallel search mode, AOA and DE independently implement on their respective subpopulations to maintain a high distribution of the population. In the serial search mode, DE is embedded into AOA to provide more diversified solutions and thereby help the population jump out of local optima. Then, a self-adaptive conversion strategy is employed to dynamically switch between the two modes so as to achieve a better balance between exploration and exploitation. Additionally, a Levy flight strategy is used to perturb and update the best solution obtained in each iteration to further prevent premature convergence. Lastly, a binary version of AOAHSM is proposed to tackle the 0–1 knapsack problem. The proposed algorithms are evaluated on CEC2019, CEC2020 test functions, two typical engineering design problems and 45 instances of the 0–1 knapsack problem and compared with a number of state-of-the-art meta-heuristic algorithms. The obtained results demonstrate that AOAHSM and its binary version not only significantly outperform the original AOA but also achieve superior performance to the comparison algorithms in most cases.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"160 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep ensemble learning for osteoporosis diagnosis from knee X-rays: a preliminary cohort study in Kashmir valley 根据膝关节 X 射线诊断骨质疏松症的深度集合学习：克什米尔山谷的初步队列研究

Neural Computing and Applications

Pub Date : 2024-08-22 DOI: 10.1007/s00521-024-10158-6

Insha Majeed Wani, Sakshi Arora

Osteoporosis (OP) is the most prevalent and common bone disease, especially knee osteoporosis. It significantly disables sufferers all over the world. Although laborious and prone to user variation, manual diagnosis, segmentation, and annotation of knee joints continue to be the preferred way to diagnose OP in clinical procedures. Therefore, many deep learning algorithms, particularly the convolutional neural network (CNN), have been created to increase clinical workflow efficiency to overcome the shortcomings of the widely used method as above. Medical imaging procedures can show hidden structures in a volumetric view, particularly those that generate three-dimensional (3D) pictures like MRI. We created a dataset of 240 pictures from patients who had knee X-rays and skeletal bone mineral density assessments at the same time. Four convolutional neural networks (CNN) models were used to analyse the X-ray images and deep neural networks for clinical covariances to determine the degree of osteoporosis. Additionally, we investigated ensemble models that included each CNN with a clinical covariance. For every network, scores for accuracy and error rate were computed. ResNet and Alexnet displayed the highest levels of accuracy when the CNN models were tested using knee X-rays with normal, low BMD, and osteoporosis. An ensemble of DNN with Alexnet, ResNet, and both ResNet and Alexnet are employed resulting in improved accuracy. The ensemble of best-performing CNN and DNN is proposed to diagnose osteoporosis more accurately. The proposed method has produced a highly accurate osteoporosis diagnosis.

骨质疏松症（OP）是最普遍、最常见的骨病，尤其是膝关节骨质疏松症。骨质疏松症严重影响着全世界的患者。膝关节的人工诊断、分割和标注虽然费力且易受用户差异的影响，但仍是临床程序中诊断骨质疏松症的首选方法。因此，许多深度学习算法，特别是卷积神经网络（CNN）应运而生，以提高临床工作流程的效率，克服上述广泛使用的方法的缺点。医学成像程序可以显示容积视图中的隐藏结构，尤其是像核磁共振成像这样生成三维（3D）图片的程序。我们创建了一个包含 240 张图片的数据集，这些图片来自同时接受膝关节 X 光检查和骨骼骨矿密度评估的患者。我们使用四个卷积神经网络（CNN）模型分析 X 光图像，并使用深度神经网络分析临床协方差，以确定骨质疏松症的程度。此外，我们还研究了包含每个卷积神经网络和临床协方差的集合模型。我们计算了每个网络的准确率和错误率得分。在使用正常、低 BMD 和骨质疏松症的膝关节 X 光片对 CNN 模型进行测试时，ResNet 和 Alexnet 的准确率最高。使用包含 Alexnet、ResNet 以及 ResNet 和 Alexnet 的 DNN 集合提高了准确率。建议使用表现最佳的 CNN 和 DNN 的集合来更准确地诊断骨质疏松症。所提出的方法对骨质疏松症做出了高度准确的诊断。

{"title":"Deep ensemble learning for osteoporosis diagnosis from knee X-rays: a preliminary cohort study in Kashmir valley","authors":"Insha Majeed Wani, Sakshi Arora","doi":"10.1007/s00521-024-10158-6","DOIUrl":"https://doi.org/10.1007/s00521-024-10158-6","url":null,"abstract":"Osteoporosis (OP) is the most prevalent and common bone disease, especially knee osteoporosis. It significantly disables sufferers all over the world. Although laborious and prone to user variation, manual diagnosis, segmentation, and annotation of knee joints continue to be the preferred way to diagnose OP in clinical procedures. Therefore, many deep learning algorithms, particularly the convolutional neural network (CNN), have been created to increase clinical workflow efficiency to overcome the shortcomings of the widely used method as above. Medical imaging procedures can show hidden structures in a volumetric view, particularly those that generate three-dimensional (3D) pictures like MRI. We created a dataset of 240 pictures from patients who had knee X-rays and skeletal bone mineral density assessments at the same time. Four convolutional neural networks (CNN) models were used to analyse the X-ray images and deep neural networks for clinical covariances to determine the degree of osteoporosis. Additionally, we investigated ensemble models that included each CNN with a clinical covariance. For every network, scores for accuracy and error rate were computed. ResNet and Alexnet displayed the highest levels of accuracy when the CNN models were tested using knee X-rays with normal, low BMD, and osteoporosis. An ensemble of DNN with Alexnet, ResNet, and both ResNet and Alexnet are employed resulting in improved accuracy. The ensemble of best-performing CNN and DNN is proposed to diagnose osteoporosis more accurately. The proposed method has produced a highly accurate osteoporosis diagnosis.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

AI-based detection and identification of low-level nuclear waste: a comparative analysis 基于人工智能的低水平核废料检测和识别：比较分析

Neural Computing and Applications

Pub Date : 2024-08-22 DOI: 10.1007/s00521-024-10238-7

Aris Duani Rojas, Leonel Lagos, Himanshu Upadhyay, Jayesh Soni, Nagarajan Prabakar

Ensuring environmental safety and regulatory compliance at Department of Energy (DOE) sites demands an efficient and reliable detection system for low-level nuclear waste (LLW). Unlike existing methods that rely on human effort, this paper explores the integration of computer vision algorithms to automate the identification of such waste across DOE facilities. We evaluate the effectiveness of multiple algorithms in classifying nuclear waste materials and their adaptability to newly emerging LLW. Our research introduces and implements five state-of-the-art computer vision models, each representing a different approach to the problem. Through rigorous experimentation and validation, we evaluate these algorithms based on performance, speed, and adaptability. The results reveal a noteworthy trade-off between detection performance and adaptability. YOLOv7 shows the best performance and requires the highest effort to detect new LLW. Conversely, OWL-ViT has lower performance than YOLOv7 and requires minimal effort to detect new LLW. The inference speed does not strongly correlate with performance or adaptability. These findings offer valuable insights into the strengths and limitations of current computer vision algorithms for LLW detection. Each developed model provides a specialized solution with distinct advantages and disadvantages, empowering DOE stakeholders to select the algorithm that aligns best with their specific needs.

要确保能源部（DOE）场址的环境安全和合规性，就需要一个高效可靠的低放射性核废料（LLW）检测系统。与依赖人力的现有方法不同，本文探讨了计算机视觉算法的集成，以自动识别 DOE 设施中的此类废物。我们评估了多种算法在核废料材料分类方面的有效性，以及它们对新出现的 LLW 的适应性。我们的研究引入并实施了五种最先进的计算机视觉模型，每种模型都代表了解决问题的不同方法。通过严格的实验和验证，我们根据性能、速度和适应性对这些算法进行了评估。结果显示，在检测性能和适应性之间存在值得注意的权衡。YOLOv7 的性能最好，但检测新 LLW 所需的工作量最大。相反，OWL-ViT 的性能比 YOLOv7 低，但检测新 LLW 所需的工作量却最小。推理速度与性能或适应性的关系不大。这些发现为了解当前计算机视觉算法在检测 LLW 方面的优势和局限性提供了宝贵的见解。每个开发的模型都提供了具有明显优缺点的专门解决方案，使 DOE 利益相关者能够选择最符合其特定需求的算法。

{"title":"AI-based detection and identification of low-level nuclear waste: a comparative analysis","authors":"Aris Duani Rojas, Leonel Lagos, Himanshu Upadhyay, Jayesh Soni, Nagarajan Prabakar","doi":"10.1007/s00521-024-10238-7","DOIUrl":"https://doi.org/10.1007/s00521-024-10238-7","url":null,"abstract":"Ensuring environmental safety and regulatory compliance at Department of Energy (DOE) sites demands an efficient and reliable detection system for low-level nuclear waste (LLW). Unlike existing methods that rely on human effort, this paper explores the integration of computer vision algorithms to automate the identification of such waste across DOE facilities. We evaluate the effectiveness of multiple algorithms in classifying nuclear waste materials and their adaptability to newly emerging LLW. Our research introduces and implements five state-of-the-art computer vision models, each representing a different approach to the problem. Through rigorous experimentation and validation, we evaluate these algorithms based on performance, speed, and adaptability. The results reveal a noteworthy trade-off between detection performance and adaptability. YOLOv7 shows the best performance and requires the highest effort to detect new LLW. Conversely, OWL-ViT has lower performance than YOLOv7 and requires minimal effort to detect new LLW. The inference speed does not strongly correlate with performance or adaptability. These findings offer valuable insights into the strengths and limitations of current computer vision algorithms for LLW detection. Each developed model provides a specialized solution with distinct advantages and disadvantages, empowering DOE stakeholders to select the algorithm that aligns best with their specific needs.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing robustness and control performance of voltage source inverters using Kalman filter adaptive observer and ANN-based model predictive controller 利用卡尔曼滤波器自适应观测器和基于 ANN 的模型预测控制器提高电压源变频器的鲁棒性和控制性能

Neural Computing and Applications

Pub Date : 2024-08-22 DOI: 10.1007/s00521-024-10243-w

Sammy Kinga, Tamer F. Megahed, Haruichi Kanaya, Diaa-Eldin A. Mansour

Power electronic converters play a crucial role in integrating distributed generation, renewable energy sources, microgrids, and HVDC transmission networks into the grid. The control technique used in the voltage source inverters (VSI) is essential for handling load variations, system nonlinearity, stability, and fast transient response. This study focuses on improving the robustness and control performance of VSIs by integrating a Kalman filter adaptive observer into a finite control set model predictive control (FCS-MPC), resulting in an improved FCS-MPC strategy (IMPC). The classical FCS-MPC can be affected by inaccuracies due to measurement noise and uncertainties in system models, leading to less accurate predictions and suboptimal control actions. By employing the Kalman filter adaptive observer, real-time estimates of unmeasured variables are provided, compensating for uncertainties, and enhancing control performance. To further enhance flexibility and adaptivity, an artificial neural network (ANN)-based controller is designed. The ANN controller is trained offline using IMPC as baseline thus eliminating the need for online predictions and optimization. The ANN controller directly generates inverter switching configuration states, resulting in high-quality sinusoidal output voltage with low distortions. Comparative analysis is conducted for the classical FCS-MPC, IMPC, support vector machine (SVM), convolutional neural network (CNN), and ANN-based controllers under diverse operating conditions and system parameters. Although it has reduced interpretability, the ANN controller exhibits superior harmonic reduction, outperforming both MPC-based controllers and SVM. Evaluation against CNN-based controls also validates the ANN’s robustness and effectiveness in handling uncertainties, emphasizing its adaptability, efficiency, and practical applicability in power electronic applications.

电力电子变流器在将分布式发电、可再生能源、微电网和高压直流输电网络并入电网方面发挥着至关重要的作用。电压源逆变器（VSI）中使用的控制技术对于处理负载变化、系统非线性、稳定性和快速瞬态响应至关重要。本研究的重点是通过将卡尔曼滤波器自适应观测器集成到有限控制集模型预测控制（FCS-MPC）中，改进 FCS-MPC 策略（IMPC），从而提高 VSI 的鲁棒性和控制性能。传统的 FCS-MPC 可能会受到测量噪声和系统模型不确定性造成的不准确性的影响，导致预测不准确和控制行动不理想。通过采用卡尔曼滤波自适应观测器，可以实时估计未测量的变量，补偿不确定性，提高控制性能。为了进一步提高灵活性和适应性，设计了基于人工神经网络（ANN）的控制器。人工神经网络控制器以 IMPC 为基准进行离线训练，因此无需在线预测和优化。ANN 控制器直接生成逆变器开关配置状态，从而产生低失真、高质量的正弦输出电压。在不同的运行条件和系统参数下，对经典的 FCS-MPC、IMPC、支持向量机（SVM）、卷积神经网络（CNN）和基于 ANN 的控制器进行了比较分析。虽然解释性较差，但 ANN 控制器在减少谐波方面表现出色，优于基于 MPC 的控制器和 SVM。与基于 CNN 的控制器进行的评估还验证了 ANN 在处理不确定性时的鲁棒性和有效性，强调了其在电力电子应用中的适应性、效率和实际应用性。

{"title":"Enhancing robustness and control performance of voltage source inverters using Kalman filter adaptive observer and ANN-based model predictive controller","authors":"Sammy Kinga, Tamer F. Megahed, Haruichi Kanaya, Diaa-Eldin A. Mansour","doi":"10.1007/s00521-024-10243-w","DOIUrl":"https://doi.org/10.1007/s00521-024-10243-w","url":null,"abstract":"Power electronic converters play a crucial role in integrating distributed generation, renewable energy sources, microgrids, and HVDC transmission networks into the grid. The control technique used in the voltage source inverters (VSI) is essential for handling load variations, system nonlinearity, stability, and fast transient response. This study focuses on improving the robustness and control performance of VSIs by integrating a Kalman filter adaptive observer into a finite control set model predictive control (FCS-MPC), resulting in an improved FCS-MPC strategy (IMPC). The classical FCS-MPC can be affected by inaccuracies due to measurement noise and uncertainties in system models, leading to less accurate predictions and suboptimal control actions. By employing the Kalman filter adaptive observer, real-time estimates of unmeasured variables are provided, compensating for uncertainties, and enhancing control performance. To further enhance flexibility and adaptivity, an artificial neural network (ANN)-based controller is designed. The ANN controller is trained offline using IMPC as baseline thus eliminating the need for online predictions and optimization. The ANN controller directly generates inverter switching configuration states, resulting in high-quality sinusoidal output voltage with low distortions. Comparative analysis is conducted for the classical FCS-MPC, IMPC, support vector machine (SVM), convolutional neural network (CNN), and ANN-based controllers under diverse operating conditions and system parameters. Although it has reduced interpretability, the ANN controller exhibits superior harmonic reduction, outperforming both MPC-based controllers and SVM. Evaluation against CNN-based controls also validates the ANN’s robustness and effectiveness in handling uncertainties, emphasizing its adaptability, efficiency, and practical applicability in power electronic applications.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Vision transformers in domain adaptation and domain generalization: a study of robustness 领域适应和领域泛化中的视觉转换器：稳健性研究

Neural Computing and Applications

Pub Date : 2024-08-22 DOI: 10.1007/s00521-024-10353-5

Shadi Alijani, Jamil Fayyad, Homayoun Najjaran

Deep learning models are often evaluated in scenarios where the data distribution is different from those used in the training and validation phases. The discrepancy presents a challenge for accurately predicting the performance of models once deployed on the target distribution. Domain adaptation and generalization are widely recognized as effective strategies for addressing such shifts, thereby ensuring reliable performance. The recent promising results in applying vision transformers in computer vision tasks, coupled with advancements in self-attention mechanisms, have demonstrated their significant potential for robustness and generalization in handling distribution shifts. Motivated by the increased interest from the research community, our paper investigates the deployment of vision transformers in domain adaptation and domain generalization scenarios. For domain adaptation methods, we categorize research into feature-level, instance-level, model-level adaptations, and hybrid approaches, along with other categorizations with respect to diverse strategies for enhancing domain adaptation. Similarly, for domain generalization, we categorize research into multi-domain learning, meta-learning, regularization techniques, and data augmentation strategies. We further classify diverse strategies in research, underscoring the various approaches researchers have taken to address distribution shifts by integrating vision transformers. The inclusion of comprehensive tables summarizing these categories is a distinct feature of our work, offering valuable insights for researchers. These findings highlight the versatility of vision transformers in managing distribution shifts, crucial for real-world applications, especially in critical safety and decision-making scenarios.

深度学习模型经常在数据分布与训练和验证阶段所用数据分布不同的场景中进行评估。这种差异对准确预测模型在目标分布上部署后的性能提出了挑战。领域适应和泛化被广泛认为是解决这种差异的有效策略，从而确保可靠的性能。最近，在计算机视觉任务中应用视觉变换器取得了可喜的成果，再加上自我注意机制的进步，都证明了视觉变换器在处理分布偏移方面具有巨大的鲁棒性和泛化潜力。在研究界日益浓厚的兴趣的推动下，我们的论文研究了视觉变换器在领域适应和领域泛化场景中的应用。对于领域适应方法，我们将研究分为特征级适应、实例级适应、模型级适应和混合方法，并根据增强领域适应的不同策略进行了其他分类。同样，对于领域泛化，我们将研究分为多领域学习、元学习、正则化技术和数据增强策略。我们进一步对研究中的各种策略进行了分类，强调了研究人员通过整合视觉转换器来解决分布偏移问题的各种方法。我们的工作有一个显著特点，就是包含了总结这些类别的综合表格，为研究人员提供了宝贵的见解。这些发现凸显了视觉转换器在管理分布偏移方面的多功能性，这对现实世界的应用至关重要，尤其是在关键的安全和决策场景中。

{"title":"Vision transformers in domain adaptation and domain generalization: a study of robustness","authors":"Shadi Alijani, Jamil Fayyad, Homayoun Najjaran","doi":"10.1007/s00521-024-10353-5","DOIUrl":"https://doi.org/10.1007/s00521-024-10353-5","url":null,"abstract":"Deep learning models are often evaluated in scenarios where the data distribution is different from those used in the training and validation phases. The discrepancy presents a challenge for accurately predicting the performance of models once deployed on the target distribution. Domain adaptation and generalization are widely recognized as effective strategies for addressing such shifts, thereby ensuring reliable performance. The recent promising results in applying vision transformers in computer vision tasks, coupled with advancements in self-attention mechanisms, have demonstrated their significant potential for robustness and generalization in handling distribution shifts. Motivated by the increased interest from the research community, our paper investigates the deployment of vision transformers in domain adaptation and domain generalization scenarios. For domain adaptation methods, we categorize research into feature-level, instance-level, model-level adaptations, and hybrid approaches, along with other categorizations with respect to diverse strategies for enhancing domain adaptation. Similarly, for domain generalization, we categorize research into multi-domain learning, meta-learning, regularization techniques, and data augmentation strategies. We further classify diverse strategies in research, underscoring the various approaches researchers have taken to address distribution shifts by integrating vision transformers. The inclusion of comprehensive tables summarizing these categories is a distinct feature of our work, offering valuable insights for researchers. These findings highlight the versatility of vision transformers in managing distribution shifts, crucial for real-world applications, especially in critical safety and decision-making scenarios.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"1197 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gene pointNet for tumor classification 用于肿瘤分类的基因点网络

Neural Computing and Applications

Pub Date : 2024-08-22 DOI: 10.1007/s00521-024-10307-x

Hao Lu, Mostafa Rezapour, Haseebullah Baha, Muhammad Khalid Khan Niazi, Aarthi Narayanan, Metin Nafi Gurcan

The rising incidence of cancer underscores the imperative for innovative diagnostic and prognostic methodologies. This study delves into the potential of RNA-Seq gene expression data to enhance cancer classification accuracy. Introducing a pioneering approach, we model gene expression data as point clouds, capitalizing on the data's intrinsic properties to bolster classification performance. Utilizing PointNet, a typical technique for processing point cloud data, as our framework's cornerstone, we incorporate inductive biases pertinent to gene expression and pathways. This integration markedly elevates model efficacy, culminating in developing an end-to-end deep learning classifier with an accuracy rate surpassing 99%. Our findings not only illuminate the capabilities of AI-driven models in the realm of oncology but also highlight the criticality of acknowledging biological dataset nuances in model design. This research provides insights into application of deep learning in medical science, setting the stage for further innovation in cancer classification through sophisticated biological data analysis. The source code for our study is accessible at: https://github.com/cialab/GPNet.

癌症发病率的上升凸显了创新诊断和预后方法的必要性。本研究深入探讨了 RNA-Seq 基因表达数据在提高癌症分类准确性方面的潜力。我们采用了一种开创性的方法，将基因表达数据建模为点云，利用数据的内在属性来提高分类性能。利用处理点云数据的典型技术 PointNet 作为框架的基石，我们纳入了与基因表达和通路相关的归纳偏差。这种整合显著提高了模型的功效，最终开发出一种端到端的深度学习分类器，准确率超过 99%。我们的发现不仅阐明了人工智能驱动模型在肿瘤学领域的能力，还强调了在模型设计中承认生物数据集细微差别的重要性。这项研究为深度学习在医学科学中的应用提供了见解，为通过复杂的生物数据分析进一步创新癌症分类奠定了基础。我们研究的源代码请访问：https://github.com/cialab/GPNet。

{"title":"Gene pointNet for tumor classification","authors":"Hao Lu, Mostafa Rezapour, Haseebullah Baha, Muhammad Khalid Khan Niazi, Aarthi Narayanan, Metin Nafi Gurcan","doi":"10.1007/s00521-024-10307-x","DOIUrl":"https://doi.org/10.1007/s00521-024-10307-x","url":null,"abstract":"The rising incidence of cancer underscores the imperative for innovative diagnostic and prognostic methodologies. This study delves into the potential of RNA-Seq gene expression data to enhance cancer classification accuracy. Introducing a pioneering approach, we model gene expression data as point clouds, capitalizing on the data's intrinsic properties to bolster classification performance. Utilizing PointNet, a typical technique for processing point cloud data, as our framework's cornerstone, we incorporate inductive biases pertinent to gene expression and pathways. This integration markedly elevates model efficacy, culminating in developing an end-to-end deep learning classifier with an accuracy rate surpassing 99%. Our findings not only illuminate the capabilities of AI-driven models in the realm of oncology but also highlight the criticality of acknowledging biological dataset nuances in model design. This research provides insights into application of deep learning in medical science, setting the stage for further innovation in cancer classification through sophisticated biological data analysis. The source code for our study is accessible at: https://github.com/cialab/GPNet.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0