首页 > 最新文献

Neural Computing and Applications最新文献

英文 中文
Skin cancer detection with MobileNet-based transfer learning and MixNets for enhanced diagnosis 利用基于移动网络的迁移学习和混合网络检测皮肤癌,提高诊断水平
Pub Date : 2024-08-28 DOI: 10.1007/s00521-024-10227-w
Mohammed Zakariah, Muna Al-Razgan, Taha Alfakih

Skin cancer poses a significant health hazard, necessitating the utilization of advanced diagnostic methodologies to facilitate timely detection, owing to its escalating prevalence in recent years. This paper proposes a novel approach to tackle the issue by introducing a method for detecting skin cancer that uses MixNets to enhance diagnosis and leverages mobile network-based transfer learning. Skin cancer has diverse forms, each distinguishable by its structural attributes, morphological characteristics, texture, and coloration. The pressing demand for accurate and efficient diagnostic instruments has spurred the investigation of novel techniques. The present study utilizes the ISIC dataset, comprising a validation set of 660 images and a training set of 2637 images. Moreover, the research employs a combination of MixNets and mobile network-based transfer learning as its chosen approach. Transfer learning is a technique that leverages preexisting models to enhance the diagnostic capabilities of the proposed system. Integrating MobileNet and MixNets allows for utilizing their respective functionalities, resulting in a dual-model methodology that enhances the comprehensiveness of skin cancer diagnosis. The results demonstrate impressive performance metrics, with MobileNet and MixNets models, and the proposed approach achieves an outstanding accuracy rate of 99.58%. The above findings underscore the efficacy of the dual-model method in effectively discerning between benign and malignant skin lesions. Moreover, the present study aims to examine the potential integration of emerging technologies to enhance the accuracy and practicality of diagnostics within real-world healthcare settings.

皮肤癌严重危害人们的健康,由于近年来发病率不断上升,有必要利用先进的诊断方法来促进及时发现。本文提出了一种新颖的方法来解决这一问题,即采用混合网络来加强诊断,并利用基于移动网络的迁移学习来检测皮肤癌。皮肤癌的形式多种多样,每一种都可以通过其结构属性、形态特征、纹理和颜色加以区分。对准确、高效诊断工具的迫切需求刺激了对新型技术的研究。本研究利用 ISIC 数据集,其中包括 660 幅图像的验证集和 2637 幅图像的训练集。此外,研究还采用了混合网络和基于移动网络的迁移学习相结合的方法。迁移学习是一种利用已有模型来增强拟议系统诊断能力的技术。将 MobileNet 和 MixNets 整合在一起,可以利用它们各自的功能,从而形成一种双模型方法,提高皮肤癌诊断的全面性。结果显示,MobileNet 和 MixNets 模型的性能指标令人印象深刻,拟议方法的准确率高达 99.58%。上述结果表明,双模型方法能有效区分良性和恶性皮肤病变。此外,本研究旨在探讨新兴技术的整合潜力,以提高实际医疗环境中诊断的准确性和实用性。
{"title":"Skin cancer detection with MobileNet-based transfer learning and MixNets for enhanced diagnosis","authors":"Mohammed Zakariah, Muna Al-Razgan, Taha Alfakih","doi":"10.1007/s00521-024-10227-w","DOIUrl":"https://doi.org/10.1007/s00521-024-10227-w","url":null,"abstract":"<p>Skin cancer poses a significant health hazard, necessitating the utilization of advanced diagnostic methodologies to facilitate timely detection, owing to its escalating prevalence in recent years. This paper proposes a novel approach to tackle the issue by introducing a method for detecting skin cancer that uses MixNets to enhance diagnosis and leverages mobile network-based transfer learning. Skin cancer has diverse forms, each distinguishable by its structural attributes, morphological characteristics, texture, and coloration. The pressing demand for accurate and efficient diagnostic instruments has spurred the investigation of novel techniques. The present study utilizes the ISIC dataset, comprising a validation set of 660 images and a training set of 2637 images. Moreover, the research employs a combination of MixNets and mobile network-based transfer learning as its chosen approach. Transfer learning is a technique that leverages preexisting models to enhance the diagnostic capabilities of the proposed system. Integrating MobileNet and MixNets allows for utilizing their respective functionalities, resulting in a dual-model methodology that enhances the comprehensiveness of skin cancer diagnosis. The results demonstrate impressive performance metrics, with MobileNet and MixNets models, and the proposed approach achieves an outstanding accuracy rate of 99.58%. The above findings underscore the efficacy of the dual-model method in effectively discerning between benign and malignant skin lesions. Moreover, the present study aims to examine the potential integration of emerging technologies to enhance the accuracy and practicality of diagnostics within real-world healthcare settings.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empowering global ethereum price prediction with EtherVoyant: a state-of-the-art time series forecasting model 利用 EtherVoyant 增强全球以太坊价格预测能力:最先进的时间序列预测模型
Pub Date : 2024-08-27 DOI: 10.1007/s00521-024-10169-3
Umar Islam, Babar Shah, Abdullah A. Al-Atawi, Gioia Arnone, Mohamed R. Abonazel, Ijaz Ali, Fernando Moreira

Ethereum has emerged as a major platform for decentralized apps and smart contracts with the heightened interest in cryptocurrencies in recent years. Investors and market participants in the cryptocurrency space will find it increasingly important to use reliable price prediction models as Ethereum's popularity grows. To better estimate Ethereum prices around the world, we propose "EtherVoyant," a novel hybrid forecasting model that combines the advantages of ARIMA and SARIMA methods. To improve its forecasting abilities, EtherVoyant uses Ethereum price history to train ARIMA and SARIMA components independently before fusing their predictions. With the help of feature engineering and data preparation, we further improve the model so that it can deal with real-world difficulties like missing values and seasonality in the data. We also investigate hyperparameter optimization for the model's best possible performance. We compare EtherVoyant's forecasts against those of the more conventional ARIMA and SARIMA models to determine its efficacy. By providing more precise and trustworthy price forecasts, our trial results suggest that EtherVoyant is superior to the individual models. The importance of this study resides in the fact that it will lead to the creation of a sophisticated time series forecasting model that will be useful to cryptocurrency investors, traders, and decision-makers. We hope that by making EtherVoyant available on a worldwide scale, we will help advance the field of cryptocurrency analytics and encourage wider adoption of blockchain-based assets.

近年来,随着人们对加密货币的兴趣日益浓厚,以太坊已成为去中心化应用程序和智能合约的主要平台。随着以太坊越来越受欢迎,加密货币领域的投资者和市场参与者会发现使用可靠的价格预测模型越来越重要。为了更好地估算全球以太坊价格,我们提出了 "EtherVoyant",一种结合了 ARIMA 和 SARIMA 方法优点的新型混合预测模型。为了提高预测能力,EtherVoyant 使用以太坊价格历史记录来独立训练 ARIMA 和 SARIMA 组件,然后再融合它们的预测结果。在特征工程和数据准备的帮助下,我们进一步改进了模型,使其能够处理数据中的缺失值和季节性等现实世界中的难题。我们还研究了超参数优化,以尽可能提高模型的性能。我们将 EtherVoyant 的预测与更传统的 ARIMA 和 SARIMA 模型进行比较,以确定其有效性。试验结果表明,EtherVoyant 能提供更精确、更可靠的价格预测,因此优于其他模型。这项研究的重要性在于,它将有助于创建一个复杂的时间序列预测模型,为加密货币投资者、交易商和决策者提供帮助。我们希望,通过在全球范围内提供 EtherVoyant,我们将帮助推动加密货币分析领域的发展,并鼓励更广泛地采用基于区块链的资产。
{"title":"Empowering global ethereum price prediction with EtherVoyant: a state-of-the-art time series forecasting model","authors":"Umar Islam, Babar Shah, Abdullah A. Al-Atawi, Gioia Arnone, Mohamed R. Abonazel, Ijaz Ali, Fernando Moreira","doi":"10.1007/s00521-024-10169-3","DOIUrl":"https://doi.org/10.1007/s00521-024-10169-3","url":null,"abstract":"<p>Ethereum has emerged as a major platform for decentralized apps and smart contracts with the heightened interest in cryptocurrencies in recent years. Investors and market participants in the cryptocurrency space will find it increasingly important to use reliable price prediction models as Ethereum's popularity grows. To better estimate Ethereum prices around the world, we propose \"EtherVoyant,\" a novel hybrid forecasting model that combines the advantages of ARIMA and SARIMA methods. To improve its forecasting abilities, EtherVoyant uses Ethereum price history to train ARIMA and SARIMA components independently before fusing their predictions. With the help of feature engineering and data preparation, we further improve the model so that it can deal with real-world difficulties like missing values and seasonality in the data. We also investigate hyperparameter optimization for the model's best possible performance. We compare EtherVoyant's forecasts against those of the more conventional ARIMA and SARIMA models to determine its efficacy. By providing more precise and trustworthy price forecasts, our trial results suggest that EtherVoyant is superior to the individual models. The importance of this study resides in the fact that it will lead to the creation of a sophisticated time series forecasting model that will be useful to cryptocurrency investors, traders, and decision-makers. We hope that by making EtherVoyant available on a worldwide scale, we will help advance the field of cryptocurrency analytics and encourage wider adoption of blockchain-based assets.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel hyper-heuristic algorithm: an application to automatic voltage regulator 新型超启发式算法:应用于自动电压调节器
Pub Date : 2024-08-27 DOI: 10.1007/s00521-024-10313-z
Yunus Hinislioglu, Ugur Guvenc

This paper presents a novel optimization algorithm called hyper-heuristic fitness-distance balance success-history-based adaptive differential evolution (HH-FDB-SHADE). The hyper-heuristic algorithms have two main structures: a hyper-selection framework and a low-level heuristic (LLH) pool. In the proposed algorithm, the FDB method is preferred as a high-level selection framework to evaluate the LLH pool algorithms. In addition, a total of 10 different strategies is derived from five mutation operators and two crossover methods for using them as the LLH pool. Balancing the exploration and exploitation capability of FDB is the main reason for being the selection framework of the proposed algorithm. The success of the HH-FDB-SHADE algorithm was tested on CEC-17 and CEC-20 benchmark test suits for different dimensional search spaces, and the obtained solutions from the HH-FDB-SHADE were compared to 10 different LLH pool algorithms. In addition, the HH-FDB-SHADE algorithm has been applied to optimize the control parameters of PID, PIDF, FOPID, and PIDD2 in the optimal automatic voltage regulator (AVR) design problem to reveal the improved algorithm's performance more clearly and prove its success in solving engineering problems. The results obtained from the AVR system are compared with five other effective meta-heuristic search algorithms such as the fitness-distance balance Lévy Flight distribution, differential evolution, Harris–Hawks optimization, Barnacles mating optimizer, and Moth–Flame optimization algorithms in the literature. The results of the statistical analyses indicate that HH-FDB-SHADE is the best-ranked algorithm for solving CEC-17 and CEC-20 benchmark problems and gives better results compared to the LLH pool algorithms. Besides, the proposed algorithm is more effective and robust than five other meta-heuristic algorithms in solving optimal AVR design problems.

本文提出了一种新颖的优化算法,称为基于成功历史的超启发式适应性差分进化(HH-FDB-SHADE)。超启发式算法有两个主要结构:超选择框架和低级启发式(LLH)池。在所提出的算法中,首选 FDB 方法作为评估 LLH 池算法的高级选择框架。此外,五种突变算子和两种交叉方法共衍生出 10 种不同的策略,用作 LLH 池。平衡 FDB 的探索和利用能力是提出算法选择框架的主要原因。在不同维度搜索空间的 CEC-17 和 CEC-20 基准测试服上测试了 HH-FDB-SHADE 算法的成功性,并将 HH-FDB-SHADE 算法获得的解与 10 种不同的 LLH 池算法进行了比较。此外,HH-FDB-SHADE 算法还被应用于优化自动电压调节器(AVR)设计问题中的 PID、PIDF、FOPID 和 PIDD2 控制参数,以更清晰地揭示改进算法的性能,证明其在解决工程问题方面的成功。将 AVR 系统得到的结果与其他五种有效的元启发式搜索算法进行了比较,如文献中的适度-距离平衡 Lévy Flight 分布、差分进化、Harris-Hawks 优化、Barnacles 交配优化和 Moth-Flame 优化算法。统计分析结果表明,在解决 CEC-17 和 CEC-20 基准问题时,HH-FDB-SHADE 是排名最好的算法,与 LLH 池算法相比结果更好。此外,与其他五种元启发式算法相比,所提出的算法在解决 AVR 最佳设计问题时更加有效和稳健。
{"title":"A novel hyper-heuristic algorithm: an application to automatic voltage regulator","authors":"Yunus Hinislioglu, Ugur Guvenc","doi":"10.1007/s00521-024-10313-z","DOIUrl":"https://doi.org/10.1007/s00521-024-10313-z","url":null,"abstract":"<p>This paper presents a novel optimization algorithm called hyper-heuristic fitness-distance balance success-history-based adaptive differential evolution (HH-FDB-SHADE). The hyper-heuristic algorithms have two main structures: a hyper-selection framework and a low-level heuristic (LLH) pool. In the proposed algorithm, the FDB method is preferred as a high-level selection framework to evaluate the LLH pool algorithms. In addition, a total of 10 different strategies is derived from five mutation operators and two crossover methods for using them as the LLH pool. Balancing the exploration and exploitation capability of FDB is the main reason for being the selection framework of the proposed algorithm. The success of the HH-FDB-SHADE algorithm was tested on CEC-17 and CEC-20 benchmark test suits for different dimensional search spaces, and the obtained solutions from the HH-FDB-SHADE were compared to 10 different LLH pool algorithms. In addition, the HH-FDB-SHADE algorithm has been applied to optimize the control parameters of PID, PIDF, FOPID, and PIDD<sup>2</sup> in the optimal automatic voltage regulator (AVR) design problem to reveal the improved algorithm's performance more clearly and prove its success in solving engineering problems. The results obtained from the AVR system are compared with five other effective meta-heuristic search algorithms such as the fitness-distance balance Lévy Flight distribution, differential evolution, Harris–Hawks optimization, Barnacles mating optimizer, and Moth–Flame optimization algorithms in the literature. The results of the statistical analyses indicate that HH-FDB-SHADE is the best-ranked algorithm for solving CEC-17 and CEC-20 benchmark problems and gives better results compared to the LLH pool algorithms. Besides, the proposed algorithm is more effective and robust than five other meta-heuristic algorithms in solving optimal AVR design problems.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"567 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating deep learning for accurate gastrointestinal cancer classification: a comprehensive analysis of MSI and MSS patterns using histopathology data 整合深度学习,准确进行胃肠道癌症分类:利用组织病理学数据全面分析 MSI 和 MSS 模式
Pub Date : 2024-08-26 DOI: 10.1007/s00521-024-10287-y
Abeer A. Wafa, Reham M. Essa, Amr A. Abohany, Hanan E. Abdelkader

Early detection of microsatellite instability (MSI) and microsatellite stability (MSS) is crucial in the fight against gastrointestinal (GI) cancer. MSI is a sign of genetic instability often associated with DNA repair mechanism deficiencies, which can cause (GI) cancers. On the other hand, MSS signifies genomic stability in microsatellite regions. Differentiating between these two states is pivotal in clinical decision-making as it provides prognostic and predictive information and treatment strategies. Rapid identification of MSI and MSS enables oncologists to tailor therapies more accurately, potentially saving patients from unnecessary treatments and guiding them toward regimens with the highest likelihood of success. Detecting these microsatellite status markers at an initial stage can improve patient outcomes and quality of life in GI cancer management. Our research paper introduces a cutting-edge method for detecting early GI cancer using deep learning (DL). Our goal is to identify the optimal model for GI cancer detection that surpasses previous works. Our proposed model comprises four stages: data acquisition, image processing, feature extraction, and classification. We use histopathology images from the Cancer Genome Atlas (TCGA) and Kaggle website with some modifications for data acquisition. In the image processing stage, we apply various operations such as color transformation, resizing, normalization, and labeling to prepare the input image for enrollment in our DL models. We present five different DL models, including convolutional neural networks (CNNs), a hybrid of CNNs-simple RNN (recurrent neural network), a hybrid of CNNs with long short-term memory (LSTM) (CNNs-LSTM), a hybrid of CNNs with gated recurrent unit (GRU) (CNNs-GRU), and a hybrid of CNNs-SimpleRNN-LSTM-GRU. Our empirical results demonstrate that CNNs-SimpleRNN-LSTM-GRU outperforms other models in accuracy, specificity, recall, precision, AUC, and F1, achieving an accuracy of 99.90%. Our proposed methodology offers significant improvements in GI cancer detection compared to recent techniques, highlighting the potential of DL-based approaches for histopathology data. We expect our findings to inspire future research in DL-based GI cancer detection.

早期检测微卫星不稳定性(MSI)和微卫星稳定性(MSS)对于抗击胃肠道癌症(GI)至关重要。MSI 是遗传不稳定性的标志,通常与 DNA 修复机制缺陷有关,可导致胃肠道癌症。另一方面,MSS 标志着微卫星区域的基因组稳定性。区分这两种状态在临床决策中至关重要,因为它提供了预后和预测信息以及治疗策略。快速识别 MSI 和 MSS 使肿瘤学家能够更准确地定制治疗方案,从而使患者免于不必要的治疗,并指导他们采用最有可能成功的治疗方案。在消化道癌症治疗的初始阶段检测这些微卫星状态标记物可以改善患者的预后和生活质量。我们的研究论文介绍了一种利用深度学习(DL)检测早期消化道癌症的前沿方法。我们的目标是找出消化道癌症检测的最佳模型,以超越之前的研究成果。我们提出的模型包括四个阶段:数据采集、图像处理、特征提取和分类。我们使用来自癌症基因组图谱(TCGA)和 Kaggle 网站的组织病理学图像,并对其进行了一些修改以获取数据。在图像处理阶段,我们应用各种操作,如颜色转换、大小调整、归一化和标记,以准备输入图像,供我们的 DL 模型使用。我们提出了五种不同的 DL 模型,包括卷积神经网络(CNN)、CNN-简单 RNN(递归神经网络)混合模型、CNN-长短时记忆(LSTM)混合模型(CNNs-LSTM)、CNN-门控递归单元(GRU)混合模型(CNNs-GRU)以及 CNN-SimpleRNN-LSTM-GRU 混合模型。实证结果表明,CNNs-SimpleRNN-LSTM-GRU 在准确度、特异性、召回率、精确度、AUC 和 F1 方面均优于其他模型,准确度达到 99.90%。与最近的技术相比,我们提出的方法在消化道癌症检测方面有显著改进,突出了基于 DL 的方法在组织病理学数据方面的潜力。我们希望我们的研究结果能对未来基于 DL 的消化道癌症检测研究有所启发。
{"title":"Integrating deep learning for accurate gastrointestinal cancer classification: a comprehensive analysis of MSI and MSS patterns using histopathology data","authors":"Abeer A. Wafa, Reham M. Essa, Amr A. Abohany, Hanan E. Abdelkader","doi":"10.1007/s00521-024-10287-y","DOIUrl":"https://doi.org/10.1007/s00521-024-10287-y","url":null,"abstract":"<p>Early detection of microsatellite instability (MSI) and microsatellite stability (MSS) is crucial in the fight against gastrointestinal (GI) cancer. MSI is a sign of genetic instability often associated with DNA repair mechanism deficiencies, which can cause (GI) cancers. On the other hand, MSS signifies genomic stability in microsatellite regions. Differentiating between these two states is pivotal in clinical decision-making as it provides prognostic and predictive information and treatment strategies. Rapid identification of MSI and MSS enables oncologists to tailor therapies more accurately, potentially saving patients from unnecessary treatments and guiding them toward regimens with the highest likelihood of success. Detecting these microsatellite status markers at an initial stage can improve patient outcomes and quality of life in GI cancer management. Our research paper introduces a cutting-edge method for detecting early GI cancer using deep learning (DL). Our goal is to identify the optimal model for GI cancer detection that surpasses previous works. Our proposed model comprises four stages: data acquisition, image processing, feature extraction, and classification. We use histopathology images from the Cancer Genome Atlas (TCGA) and Kaggle website with some modifications for data acquisition. In the image processing stage, we apply various operations such as color transformation, resizing, normalization, and labeling to prepare the input image for enrollment in our DL models. We present five different DL models, including convolutional neural networks (CNNs), a hybrid of CNNs-simple RNN (recurrent neural network), a hybrid of CNNs with long short-term memory (LSTM) (CNNs-LSTM), a hybrid of CNNs with gated recurrent unit (GRU) (CNNs-GRU), and a hybrid of CNNs-SimpleRNN-LSTM-GRU. Our empirical results demonstrate that CNNs-SimpleRNN-LSTM-GRU outperforms other models in accuracy, specificity, recall, precision, AUC, and F1, achieving an accuracy of 99.90%. Our proposed methodology offers significant improvements in GI cancer detection compared to recent techniques, highlighting the potential of DL-based approaches for histopathology data. We expect our findings to inspire future research in DL-based GI cancer detection.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"29 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SuspAct: novel suspicious activity prediction based on deep learning in the real-time environment SuspAct:实时环境中基于深度学习的新型可疑活动预测
Pub Date : 2024-08-26 DOI: 10.1007/s00521-024-10355-3
Sachin Kansal, Akshat Kumar Jain, Moyukh Biswas, Shaurya Bansal, Namay Mahindru, Priya Kansal

In today’s evolving landscape of video surveillance, our study introduces SuspAct, an innovative ensemble model designed to detect suspicious activities in real time swiftly. Leveraging advanced Long-term Recurrent Convolutional Networks (LRCN), SuspAct represents a significant advancement in intelligent surveillance technology. By combining insights from various LRCN models through the Majority Voting ensemble technique, SuspAct enhances its overall robustness, outperforming traditional surveillance methods. Through rigorous experimentation on large-scale datasets, we demonstrate SuspAct’s superiority in proactive crime prevention, showcasing its potential to revolutionize security protocols and contribute substantially to public safety. Our work addresses the challenges posed by the escalating volume of video data and lays a strong foundation for future advancements in intelligent video surveillance technology.

在视频监控不断发展的今天,我们的研究引入了一种创新的集合模型 SuspAct,旨在实时快速地检测可疑活动。利用先进的长期递归卷积网络(LRCN),SuspAct 代表了智能监控技术的重大进步。通过 Majority Voting 集合技术,SuspAct 将各种 LRCN 模型的洞察力结合在一起,增强了其整体鲁棒性,表现优于传统监控方法。通过在大规模数据集上进行严格的实验,我们证明了 SuspAct 在主动预防犯罪方面的优势,展示了其彻底改变安全协议并为公共安全做出重大贡献的潜力。我们的工作解决了视频数据量不断攀升带来的挑战,为未来智能视频监控技术的发展奠定了坚实的基础。
{"title":"SuspAct: novel suspicious activity prediction based on deep learning in the real-time environment","authors":"Sachin Kansal, Akshat Kumar Jain, Moyukh Biswas, Shaurya Bansal, Namay Mahindru, Priya Kansal","doi":"10.1007/s00521-024-10355-3","DOIUrl":"https://doi.org/10.1007/s00521-024-10355-3","url":null,"abstract":"<p>In today’s evolving landscape of video surveillance, our study introduces SuspAct, an innovative ensemble model designed to detect suspicious activities in real time swiftly. Leveraging advanced Long-term Recurrent Convolutional Networks (LRCN), SuspAct represents a significant advancement in intelligent surveillance technology. By combining insights from various LRCN models through the Majority Voting ensemble technique, SuspAct enhances its overall robustness, outperforming traditional surveillance methods. Through rigorous experimentation on large-scale datasets, we demonstrate SuspAct’s superiority in proactive crime prevention, showcasing its potential to revolutionize security protocols and contribute substantially to public safety. Our work addresses the challenges posed by the escalating volume of video data and lays a strong foundation for future advancements in intelligent video surveillance technology.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"28 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optihybrid: a modified firebug swarm optimization algorithm for optimal sizing of hybrid renewable power system Optihybrid:用于优化可再生能源混合发电系统规模的改进型火虫群优化算法
Pub Date : 2024-08-26 DOI: 10.1007/s00521-024-10196-0
Hoda Abd El-Sattar, Salah Kamel, Fatma A. Hashim, Sahar F. Sabbeh

In areas where conventional energy sources are unavailable, alternative energy technologies play a crucial role in generating electricity. These technologies offer various benefits, such as reliable energy supply, environmental sustainability, and employment opportunities in rural regions. This study focuses on the development of a novel optimization algorithm called the modified firebug swarm algorithm (mFSO). Its objective is to determine the optimal size of an integrated renewable power system for supplying electricity to a specific remote site in Dehiba town, located in the eastern province of Tataouine, Tunisia. The proposed configuration for the standalone hybrid system involves PV/biomass/battery, and three objective functions are considered: minimizing the total energy cost (COE), reducing the loss of power supply probability (LPSP), and managing excess energy (EXC). The effectiveness of the modified algorithm is evaluated using various tests, including the Wilcoxon test, boxplot analysis, and the ten benchmark functions of the CEC2020 benchmark. Comparative analysis between the mFSO and widely used algorithms like the original Firebug Swarm Optimization (FSO), Slime Mold Algorithm (SMA), and Seagull Optimization Algorithm (SOA) demonstrates that the proposed mFSO technique is efficient and effective in solving the design problem, surpassing other optimization algorithms.

在缺乏传统能源的地区,替代能源技术在发电方面发挥着至关重要的作用。这些技术可为农村地区带来各种好处,如可靠的能源供应、环境可持续性和就业机会。本研究的重点是开发一种名为 "改良火虫群算法(mFSO)"的新型优化算法。其目标是确定一个综合可再生能源发电系统的最佳规模,以便为突尼斯东部塔塔瓦内省 Dehiba 镇的一个特定偏远地点供电。建议的独立混合系统配置包括光伏/生物质/电池,并考虑了三个目标函数:最小化总能源成本(COE)、降低供电损失概率(LPSP)和管理过剩能源(EXC)。通过各种测试,包括 Wilcoxon 检验、方框图分析和 CEC2020 基准的十个基准函数,对改进算法的有效性进行了评估。mFSO 与广泛使用的算法(如原始 Firebug Swarm Optimization(FSO)、Slime Mold Algorithm(SMA)和 Seagull Optimization Algorithm(SOA))之间的比较分析表明,所提出的 mFSO 技术在解决设计问题方面效率高、效果好,超过了其他优化算法。
{"title":"Optihybrid: a modified firebug swarm optimization algorithm for optimal sizing of hybrid renewable power system","authors":"Hoda Abd El-Sattar, Salah Kamel, Fatma A. Hashim, Sahar F. Sabbeh","doi":"10.1007/s00521-024-10196-0","DOIUrl":"https://doi.org/10.1007/s00521-024-10196-0","url":null,"abstract":"<p>In areas where conventional energy sources are unavailable, alternative energy technologies play a crucial role in generating electricity. These technologies offer various benefits, such as reliable energy supply, environmental sustainability, and employment opportunities in rural regions. This study focuses on the development of a novel optimization algorithm called the modified firebug swarm algorithm (mFSO). Its objective is to determine the optimal size of an integrated renewable power system for supplying electricity to a specific remote site in Dehiba town, located in the eastern province of Tataouine, Tunisia. The proposed configuration for the standalone hybrid system involves PV/biomass/battery, and three objective functions are considered: minimizing the total energy cost (COE), reducing the loss of power supply probability (LPSP), and managing excess energy (EXC). The effectiveness of the modified algorithm is evaluated using various tests, including the Wilcoxon test, boxplot analysis, and the ten benchmark functions of the CEC2020 benchmark. Comparative analysis between the mFSO and widely used algorithms like the original Firebug Swarm Optimization (FSO), Slime Mold Algorithm (SMA), and Seagull Optimization Algorithm (SOA) demonstrates that the proposed mFSO technique is efficient and effective in solving the design problem, surpassing other optimization algorithms.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heart rate variability analysis in controls and epilepsy patients with or without receiving treatment: a clinical review and meta-analysis 对照组和接受或未接受治疗的癫痫患者的心率变异性分析:临床回顾和荟萃分析
Pub Date : 2024-08-25 DOI: 10.1007/s00521-024-10135-z
Muhammad Bilal Shahnawaz, Hassan Dawooda, Uzair Iqbal

The malfunctioning of cardiac autonomic control in epileptic patients develops ventricular tachyarrhythmia and causes sudden unexpected death in epilepsy patients (SUDEP). Various clinical studies investigated the effect of epilepsy on cardiac autonomic control by performing heart rate variability (HRV) analysis; however, results are unclear regarding whether sympathetic, parasympathetic, or both branches of the autonomic nervous system (ANS) are affected in epilepsy and also the impact of anticonvulsant treatment on the ANS. This study follows the systematic protocols to investigate epilepsy and its anticonvulsant treatment on cardiac autonomic control by using linear and nonlinear HRV analysis measures. The electronic databases of PubMed, Embase, and Cochrane Library were used for the collection of studies. Initially, 1475 articles were identified whereas after 2-staged exclusion criteria, 33 studies were selected for execution of the review process and meta-analysis. For meta-analysis, four comparisons were performed (epilepsy patients): (1) controls (healthy subject with no history of epilepsy) versus untreated patients; (2) treated (patients under treatment that have a seizure) versus untreated patients; (3) controls versus treated patients; and (4) refractory versus well-controlled (epilepsy patients that were seizure-free for last 1 year). For treated and untreated patients, there was no significant difference whereas well-controlled patients presented higher values as compared to refractory patients. Meta-analysis was performed for the time-domain, frequency-domain, and nonlinear parameters. Untreated patients in comparison with controls presented significantly lower HF (high-frequency) and LF (low-frequency) values. These LF (g = − 0.9; 95% CI − 1.48 to − 0.37) and HF (g = − 0.69; 95% confidence interval (CI) − 1.24 to − 0.16) values were affirming suppressed both, vagal and sympathetic activity, respectively. Additionally, LF and HF value was increased in most of the studies indicating suppressed vagal tone, while for some studies, their value decreased to indicate suppressed sympathetic activity. No significant difference was observed for the remaining comparisons. Results affirmed the hypothesis that suppressed sympathetic activity affects sympathovagal balance and leads to SUDEP, as the LF value was significantly lower for patients as compared to healthy subjects. The overall effect size and statistical results for LF and HF were significant, showing the research and clinical significance of our study.

癫痫患者的心脏自主神经控制功能失调会导致室性心动过速,并导致癫痫患者意外猝死(SUDEP)。多项临床研究通过心率变异性(HRV)分析调查了癫痫对心脏自律神经控制的影响;然而,关于癫痫患者自律神经系统(ANS)的交感神经、副交感神经或两个分支均受影响,以及抗惊厥治疗对自律神经系统的影响,研究结果尚不明确。本研究采用线性和非线性心率变异分析方法,按照系统规程研究癫痫及其抗惊厥治疗对心脏自律神经控制的影响。本研究使用 PubMed、Embase 和 Cochrane Library 等电子数据库收集研究资料。最初确定了 1475 篇文章,经过两阶段的排除标准后,选择了 33 项研究进行审查和荟萃分析。在荟萃分析中,进行了四项比较(癫痫患者):(1) 对照组(无癫痫病史的健康人)与未接受治疗的患者;(2) 接受治疗的患者(接受治疗但有癫痫发作的患者)与未接受治疗的患者;(3) 对照组与接受治疗的患者;(4) 难治性与控制良好的患者(在过去一年中没有癫痫发作的癫痫患者)。接受治疗和未接受治疗的患者之间没有明显差异,而控制良好的患者与难治性患者相比数值更高。对时域、频域和非线性参数进行了元分析。与对照组相比,未经治疗的患者的高频(HF)和低频(LF)值明显较低。这些低频(g = - 0.9;95% 置信区间(CI)- 1.48 至 - 0.37)和高频(g = - 0.69;95% 置信区间(CI)- 1.24 至 - 0.16)值分别证实迷走神经和交感神经活动受到抑制。此外,大多数研究的低频和高频值增加,表明迷走神经张力受到抑制,而一些研究的低频和高频值降低,表明交感神经活动受到抑制。其余比较未发现明显差异。结果证实了交感神经活动受抑制会影响交感迷走平衡并导致 SUDEP 的假设,因为与健康受试者相比,患者的 LF 值明显较低。低频和高频的总体效应大小和统计结果均具有显著性,这表明我们的研究具有研究和临床意义。
{"title":"Heart rate variability analysis in controls and epilepsy patients with or without receiving treatment: a clinical review and meta-analysis","authors":"Muhammad Bilal Shahnawaz, Hassan Dawooda, Uzair Iqbal","doi":"10.1007/s00521-024-10135-z","DOIUrl":"https://doi.org/10.1007/s00521-024-10135-z","url":null,"abstract":"<p>The malfunctioning of cardiac autonomic control in epileptic patients develops ventricular tachyarrhythmia and causes sudden unexpected death in epilepsy patients (SUDEP). Various clinical studies investigated the effect of epilepsy on cardiac autonomic control by performing heart rate variability (HRV) analysis; however, results are unclear regarding whether sympathetic, parasympathetic, or both branches of the autonomic nervous system (ANS) are affected in epilepsy and also the impact of anticonvulsant treatment on the ANS. This study follows the systematic protocols to investigate epilepsy and its anticonvulsant treatment on cardiac autonomic control by using linear and nonlinear HRV analysis measures. The electronic databases of PubMed, Embase, and Cochrane Library were used for the collection of studies. Initially, 1475 articles were identified whereas after 2-staged exclusion criteria, 33 studies were selected for execution of the review process and meta-analysis. For meta-analysis, four comparisons were performed (epilepsy patients): (1) controls (healthy subject with no history of epilepsy) versus untreated patients; (2) treated (patients under treatment that have a seizure) versus untreated patients; (3) controls versus treated patients; and (4) refractory versus well-controlled (epilepsy patients that were seizure-free for last 1 year). For treated and untreated patients, there was no significant difference whereas well-controlled patients presented higher values as compared to refractory patients. Meta-analysis was performed for the time-domain, frequency-domain, and nonlinear parameters. Untreated patients in comparison with controls presented significantly lower HF (high-frequency) and LF (low-frequency) values. These LF (<i>g</i> = − 0.9; 95% CI − 1.48 to − 0.37) and HF (<i>g</i> = − 0.69; 95% confidence interval (CI) − 1.24 to − 0.16) values were affirming suppressed both, vagal and sympathetic activity, respectively. Additionally, LF and HF value was increased in most of the studies indicating suppressed vagal tone, while for some studies, their value decreased to indicate suppressed sympathetic activity. No significant difference was observed for the remaining comparisons. Results affirmed the hypothesis that suppressed sympathetic activity affects sympathovagal balance and leads to SUDEP, as the LF value was significantly lower for patients as compared to healthy subjects. The overall effect size and statistical results for LF and HF were significant, showing the research and clinical significance of our study.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"2010 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep fusion model for stock market prediction with news headlines and time series data 利用新闻标题和时间序列数据进行股市预测的深度融合模型
Pub Date : 2024-08-24 DOI: 10.1007/s00521-024-10303-1
Pinyu Chen, Zois Boukouvalas, Roberto Corizzo

Time series forecasting models are essential decision support tools in real-world domains. Stock market is a remarkably complex domain, due to its quickly evolving temporal nature, as well as the multiple factors having an impact on stock prices. To date, a number of machine learning-based approaches have been proposed in the literature to tackle stock trend prediction. However, they typically tend to analyze a single data source or modality, or consider multiple modalities in isolation and rely on simple combination strategies, with a potential reduction in their modeling power. In this paper, we propose a multimodal deep fusion model to predict stock trends, leveraging daily stock prices, technical indicators, and sentiment in daily news headlines published by media outlets. The proposed architecture leverages a BERT-based model branch fine-tuned on financial news and a long short-term memory (LSTM) branch that captures relevant temporal patterns in multivariate data, including stock prices and technical indicators. Our experiments on 12 different stock datasets with prices and news headlines demonstrate that our proposed model is more effective than popular baseline approaches, both in terms of accuracy and trading performance in a portfolio analysis simulation, highlighting the positive impact of multimodal deep learning for stock trend prediction.

时间序列预测模型是现实世界中必不可少的决策支持工具。股票市场是一个非常复杂的领域,因为它具有快速变化的时间特性,而且有多种因素对股票价格产生影响。迄今为止,文献中已经提出了许多基于机器学习的方法来解决股票趋势预测问题。然而,这些方法通常倾向于分析单一数据源或模式,或孤立地考虑多种模式,并依赖于简单的组合策略,其建模能力可能会降低。在本文中,我们提出了一种多模态深度融合模型,利用每日股价、技术指标和媒体发布的每日新闻标题中的情绪来预测股票走势。所提出的架构利用了基于 BERT 的模型分支和长短期记忆(LSTM)分支,前者根据财经新闻进行微调,后者捕捉多元数据(包括股票价格和技术指标)中的相关时间模式。我们在包含价格和新闻标题的 12 个不同股票数据集上进行的实验表明,在投资组合分析模拟中,我们提出的模型在准确性和交易性能方面都比流行的基线方法更有效,这凸显了多模态深度学习对股票趋势预测的积极影响。
{"title":"A deep fusion model for stock market prediction with news headlines and time series data","authors":"Pinyu Chen, Zois Boukouvalas, Roberto Corizzo","doi":"10.1007/s00521-024-10303-1","DOIUrl":"https://doi.org/10.1007/s00521-024-10303-1","url":null,"abstract":"<p>Time series forecasting models are essential decision support tools in real-world domains. Stock market is a remarkably complex domain, due to its quickly evolving temporal nature, as well as the multiple factors having an impact on stock prices. To date, a number of machine learning-based approaches have been proposed in the literature to tackle stock trend prediction. However, they typically tend to analyze a single data source or modality, or consider multiple modalities in isolation and rely on simple combination strategies, with a potential reduction in their modeling power. In this paper, we propose a multimodal deep fusion model to predict stock trends, leveraging daily stock prices, technical indicators, and sentiment in daily news headlines published by media outlets. The proposed architecture leverages a BERT-based model branch fine-tuned on financial news and a long short-term memory (LSTM) branch that captures relevant temporal patterns in multivariate data, including stock prices and technical indicators. Our experiments on 12 different stock datasets with prices and news headlines demonstrate that our proposed model is more effective than popular baseline approaches, both in terms of accuracy and trading performance in a portfolio analysis simulation, highlighting the positive impact of multimodal deep learning for stock trend prediction.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"73 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning approaches to intrusion detection in unmanned aerial vehicles (UAVs) 无人驾驶飞行器(UAV)入侵检测的机器学习方法
Pub Date : 2024-08-23 DOI: 10.1007/s00521-024-10306-y
Raghad A. AL-Syouf, Raed M. Bani-Hani, Omar Y. AL-Jarrah

Unmanned Aerial Vehicles (UAVs) have been gaining popularity in various commercial, civilian, and military applications due to their efficiency and cost-effectiveness. However, the increasing demand for UAVs makes them vulnerable to various cyberattacks/intrusions that could have devastating consequences at an individual, organizational, and national level. To mitigate this, prompt detection of such threats is crucial in order to prevent potential damage and ensure safe and secure operations. In this work, we provide an overview of UAV systems’ architecture, security, and privacy requirements. We then analyze potential threats to UAVs, providing an evaluation of countermeasures for UAV-based attacks. We also present a comprehensive and timely exploration of state-of-the-art UAV Intrusion Detection Systems (IDSs), specifically focusing on Machine Learning (ML)-based approaches. We look at the increasing importance of using ML for detecting intrusions in UAVs, which have gained significant attention from both academia and industry. This study also takes a step forward by pointing out and classifying contemporary IDSs based on their detection methods, feature selection techniques, evaluation datasets, and performance metrics. By evaluating existing research, we aim to provide more insight into the issues and limitations of current UAV IDSs. Additionally, we identify research gaps and challenges while suggesting potential future research directions in this domain.

无人驾驶飞行器(UAV)因其高效性和成本效益,在各种商业、民用和军事应用中越来越受欢迎。然而,对无人飞行器日益增长的需求使其容易受到各种网络攻击/入侵,从而对个人、组织和国家层面造成破坏性后果。为缓解这一问题,及时发现此类威胁对防止潜在损害和确保安全运行至关重要。在这项工作中,我们将概述无人机系统的架构、安全性和隐私要求。然后,我们分析了无人机面临的潜在威胁,并对无人机攻击的应对措施进行了评估。我们还对最先进的无人机入侵检测系统(IDS)进行了全面而及时的探讨,尤其侧重于基于机器学习(ML)的方法。我们研究了使用 ML 检测无人机入侵的日益重要的意义,这已得到学术界和工业界的极大关注。本研究还根据检测方法、特征选择技术、评估数据集和性能指标对当代 IDS 进行了指出和分类,从而向前迈出了一步。通过评估现有研究,我们旨在为当前无人机 IDS 的问题和局限性提供更深入的见解。此外,我们还确定了研究差距和挑战,同时提出了该领域未来潜在的研究方向。
{"title":"Machine learning approaches to intrusion detection in unmanned aerial vehicles (UAVs)","authors":"Raghad A. AL-Syouf, Raed M. Bani-Hani, Omar Y. AL-Jarrah","doi":"10.1007/s00521-024-10306-y","DOIUrl":"https://doi.org/10.1007/s00521-024-10306-y","url":null,"abstract":"<p>Unmanned Aerial Vehicles (UAVs) have been gaining popularity in various commercial, civilian, and military applications due to their efficiency and cost-effectiveness. However, the increasing demand for UAVs makes them vulnerable to various cyberattacks/intrusions that could have devastating consequences at an individual, organizational, and national level. To mitigate this, prompt detection of such threats is crucial in order to prevent potential damage and ensure safe and secure operations. In this work, we provide an overview of UAV systems’ architecture, security, and privacy requirements. We then analyze potential threats to UAVs, providing an evaluation of countermeasures for UAV-based attacks. We also present a comprehensive and timely exploration of state-of-the-art UAV Intrusion Detection Systems (IDSs), specifically focusing on Machine Learning (ML)-based approaches. We look at the increasing importance of using ML for detecting intrusions in UAVs, which have gained significant attention from both academia and industry. This study also takes a step forward by pointing out and classifying contemporary IDSs based on their detection methods, feature selection techniques, evaluation datasets, and performance metrics. By evaluating existing research, we aim to provide more insight into the issues and limitations of current UAV IDSs. Additionally, we identify research gaps and challenges while suggesting potential future research directions in this domain.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Radial basis function neural network training using variable projection and fuzzy means 利用变量投影和模糊手段进行径向基函数神经网络训练
Pub Date : 2024-08-23 DOI: 10.1007/s00521-024-10274-3
Despina Karamichailidou, Georgios Gerolymatos, Panagiotis Patrinos, Haralambos Sarimveis, Alex Alexandridis

Radial basis function (RBF) neural network training presents a challenging optimization task, necessitating the utilization of advanced algorithms that can fully train the network so as to produce accurate and computationally efficient models. To achieve this goal, this work introduces a new framework where the original RBF training problem is divided into two simpler subproblems; the linear parameters, namely the network weights, are projected out of the problem using variable projection (VP), thus leaving a reduced functional, which depends only on nonlinear parameters, i.e., the RBF centers. The centers are updated using the Levenberg–Marquardt (LM) algorithm, while the optimal values of the synaptic weights are calculated in each iteration of the LM algorithm using linear regression. The proposed VP-LM scheme is coupled with the fuzzy means (FM) algorithm, which helps to select the number of RBF centers and enhances the overall search procedure, thus resulting to a framework that produces parsimonious models with enhanced accuracy in shorter training times. The proposed training scheme is evaluated on 12 both real-world and synthetic benchmark datasets and tested against various RBF training algorithms, as well as different neural network architectures. The experimental results underscore the effectiveness of the VP-FM algorithm in producing neural network models that outperform those generated by alternative methods in many aspects; to be more specific, the proposed approach achieves very competitive model accuracy, while resulting to smaller network sizes and thus lower complexity, which leads to shorter training times.

径向基函数(RBF)神经网络训练是一项极具挑战性的优化任务,需要利用先进的算法对网络进行全面训练,以生成精确且计算效率高的模型。为了实现这一目标,这项工作引入了一个新的框架,将原始的 RBF 训练问题分为两个更简单的子问题;使用变量投影(VP)将线性参数(即网络权重)投影到问题之外,从而留下一个仅取决于非线性参数(即 RBF 中心)的简化函数。使用 Levenberg-Marquardt (LM) 算法更新中心,而在 LM 算法的每次迭代中使用线性回归计算突触权重的最佳值。所提出的 VP-LM 方案与模糊手段(FM)算法相结合,有助于选择 RBF 中心的数量,并增强整体搜索程序,从而使该框架能在更短的训练时间内生成具有更高精度的简约模型。我们在 12 个真实世界和合成基准数据集上对所提出的训练方案进行了评估,并与各种 RBF 训练算法以及不同的神经网络架构进行了对比测试。实验结果凸显了 VP-FM 算法在生成神经网络模型方面的有效性,这些模型在很多方面都优于其他方法生成的模型;更具体地说,所提出的方法实现了极具竞争力的模型准确性,同时缩小了网络规模,从而降低了复杂性,缩短了训练时间。
{"title":"Radial basis function neural network training using variable projection and fuzzy means","authors":"Despina Karamichailidou, Georgios Gerolymatos, Panagiotis Patrinos, Haralambos Sarimveis, Alex Alexandridis","doi":"10.1007/s00521-024-10274-3","DOIUrl":"https://doi.org/10.1007/s00521-024-10274-3","url":null,"abstract":"<p>Radial basis function (RBF) neural network training presents a challenging optimization task, necessitating the utilization of advanced algorithms that can fully train the network so as to produce accurate and computationally efficient models. To achieve this goal, this work introduces a new framework where the original RBF training problem is divided into two simpler subproblems; the linear parameters, namely the network weights, are projected out of the problem using variable projection (VP), thus leaving a reduced functional, which depends only on nonlinear parameters, i.e., the RBF centers. The centers are updated using the Levenberg–Marquardt (LM) algorithm, while the optimal values of the synaptic weights are calculated in each iteration of the LM algorithm using linear regression. The proposed VP-LM scheme is coupled with the fuzzy means (FM) algorithm, which helps to select the number of RBF centers and enhances the overall search procedure, thus resulting to a framework that produces parsimonious models with enhanced accuracy in shorter training times. The proposed training scheme is evaluated on 12 both real-world and synthetic benchmark datasets and tested against various RBF training algorithms, as well as different neural network architectures. The experimental results underscore the effectiveness of the VP-FM algorithm in producing neural network models that outperform those generated by alternative methods in many aspects; to be more specific, the proposed approach achieves very competitive model accuracy, while resulting to smaller network sizes and thus lower complexity, which leads to shorter training times.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"113 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Neural Computing and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1