Neural Computing and Applications最新文献_第5页

Aspect-based sentiment analysis in Urdu language: resource creation and evaluation 乌尔都语基于方面的情感分析：资源创建与评估

Neural Computing and Applications

Pub Date : 2024-08-28 DOI: 10.1007/s00521-024-10145-x

Amna Altaf, Muhammad Waqas Anwar, Muhammad Hasan Jamal, Usama Ijaz Bajwa, Sadaf Rani

With the advancement in web interactions and increased use of Online Social Networks, sentiment analysis has gained popularity. Topics like sports, health, music, and technology are widely debated on in OSN, especially on twitter. People share their activities, views, and feelings toward different events in their native languages that can be analyzed using sentiment analysis to understand the sentiments of the people toward these events. For English language, studies on sentiment analysis are vastly available. However, very little work exists on sentiment analysis for resource-scarce language like Urdu. For this study, we perform aspect-based sentiment analysis on sports tweets in Urdu language by extracting the following information from a sentence, i.e., aspect terms, aspect term polarity, aspect category, and aspect category polarity, using machine learning and deep learning classifiers. This work is the first effort in aspect-based sentiment analysis for Urdu language using classical machine learning and deep learning approach. Additionally, we also identify implicit aspects from a sentence. Our proposed approach shows classical machine learning approach performed better on the tasks of aspect term polarity, aspect category, and aspect category polarity, while deep learning model outperformed classical machine learning classifiers for the task of aspect term/s.

随着网络互动的发展和在线社交网络使用量的增加，情感分析越来越受欢迎。体育、健康、音乐和技术等话题在在线社交网络，尤其是在 Twitter 上被广泛讨论。人们用自己的母语分享他们的活动、观点和对不同事件的感受，通过情感分析可以了解人们对这些事件的情感。对于英语语言，情感分析的研究成果非常丰富。然而，对于乌尔都语这种资源稀缺的语言，情感分析方面的研究却少之又少。在这项研究中，我们使用机器学习和深度学习分类器从句子中提取以下信息，即方面术语、方面术语极性、方面类别和方面类别极性，从而对乌尔都语的体育推文进行基于方面的情感分析。这是首次使用经典机器学习和深度学习方法对乌尔都语进行基于方面的情感分析。此外，我们还识别了句子中的隐含方面。我们提出的方法表明，经典机器学习方法在方面术语极性、方面类别和方面类别极性任务中表现更佳，而深度学习模型在方面术语任务中的表现优于经典机器学习分类器。

{"title":"Aspect-based sentiment analysis in Urdu language: resource creation and evaluation","authors":"Amna Altaf, Muhammad Waqas Anwar, Muhammad Hasan Jamal, Usama Ijaz Bajwa, Sadaf Rani","doi":"10.1007/s00521-024-10145-x","DOIUrl":"https://doi.org/10.1007/s00521-024-10145-x","url":null,"abstract":"With the advancement in web interactions and increased use of Online Social Networks, sentiment analysis has gained popularity. Topics like sports, health, music, and technology are widely debated on in OSN, especially on twitter. People share their activities, views, and feelings toward different events in their native languages that can be analyzed using sentiment analysis to understand the sentiments of the people toward these events. For English language, studies on sentiment analysis are vastly available. However, very little work exists on sentiment analysis for resource-scarce language like Urdu. For this study, we perform aspect-based sentiment analysis on sports tweets in Urdu language by extracting the following information from a sentence, i.e., aspect terms, aspect term polarity, aspect category, and aspect category polarity, using machine learning and deep learning classifiers. This work is the first effort in aspect-based sentiment analysis for Urdu language using classical machine learning and deep learning approach. Additionally, we also identify implicit aspects from a sentence. Our proposed approach shows classical machine learning approach performed better on the tasks of aspect term polarity, aspect category, and aspect category polarity, while deep learning model outperformed classical machine learning classifiers for the task of aspect term/s.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Skin cancer detection with MobileNet-based transfer learning and MixNets for enhanced diagnosis 利用基于移动网络的迁移学习和混合网络检测皮肤癌，提高诊断水平

Neural Computing and Applications

Pub Date : 2024-08-28 DOI: 10.1007/s00521-024-10227-w

Mohammed Zakariah, Muna Al-Razgan, Taha Alfakih

Skin cancer poses a significant health hazard, necessitating the utilization of advanced diagnostic methodologies to facilitate timely detection, owing to its escalating prevalence in recent years. This paper proposes a novel approach to tackle the issue by introducing a method for detecting skin cancer that uses MixNets to enhance diagnosis and leverages mobile network-based transfer learning. Skin cancer has diverse forms, each distinguishable by its structural attributes, morphological characteristics, texture, and coloration. The pressing demand for accurate and efficient diagnostic instruments has spurred the investigation of novel techniques. The present study utilizes the ISIC dataset, comprising a validation set of 660 images and a training set of 2637 images. Moreover, the research employs a combination of MixNets and mobile network-based transfer learning as its chosen approach. Transfer learning is a technique that leverages preexisting models to enhance the diagnostic capabilities of the proposed system. Integrating MobileNet and MixNets allows for utilizing their respective functionalities, resulting in a dual-model methodology that enhances the comprehensiveness of skin cancer diagnosis. The results demonstrate impressive performance metrics, with MobileNet and MixNets models, and the proposed approach achieves an outstanding accuracy rate of 99.58%. The above findings underscore the efficacy of the dual-model method in effectively discerning between benign and malignant skin lesions. Moreover, the present study aims to examine the potential integration of emerging technologies to enhance the accuracy and practicality of diagnostics within real-world healthcare settings.

皮肤癌严重危害人们的健康，由于近年来发病率不断上升，有必要利用先进的诊断方法来促进及时发现。本文提出了一种新颖的方法来解决这一问题，即采用混合网络来加强诊断，并利用基于移动网络的迁移学习来检测皮肤癌。皮肤癌的形式多种多样，每一种都可以通过其结构属性、形态特征、纹理和颜色加以区分。对准确、高效诊断工具的迫切需求刺激了对新型技术的研究。本研究利用 ISIC 数据集，其中包括 660 幅图像的验证集和 2637 幅图像的训练集。此外，研究还采用了混合网络和基于移动网络的迁移学习相结合的方法。迁移学习是一种利用已有模型来增强拟议系统诊断能力的技术。将 MobileNet 和 MixNets 整合在一起，可以利用它们各自的功能，从而形成一种双模型方法，提高皮肤癌诊断的全面性。结果显示，MobileNet 和 MixNets 模型的性能指标令人印象深刻，拟议方法的准确率高达 99.58%。上述结果表明，双模型方法能有效区分良性和恶性皮肤病变。此外，本研究旨在探讨新兴技术的整合潜力，以提高实际医疗环境中诊断的准确性和实用性。

{"title":"Skin cancer detection with MobileNet-based transfer learning and MixNets for enhanced diagnosis","authors":"Mohammed Zakariah, Muna Al-Razgan, Taha Alfakih","doi":"10.1007/s00521-024-10227-w","DOIUrl":"https://doi.org/10.1007/s00521-024-10227-w","url":null,"abstract":"Skin cancer poses a significant health hazard, necessitating the utilization of advanced diagnostic methodologies to facilitate timely detection, owing to its escalating prevalence in recent years. This paper proposes a novel approach to tackle the issue by introducing a method for detecting skin cancer that uses MixNets to enhance diagnosis and leverages mobile network-based transfer learning. Skin cancer has diverse forms, each distinguishable by its structural attributes, morphological characteristics, texture, and coloration. The pressing demand for accurate and efficient diagnostic instruments has spurred the investigation of novel techniques. The present study utilizes the ISIC dataset, comprising a validation set of 660 images and a training set of 2637 images. Moreover, the research employs a combination of MixNets and mobile network-based transfer learning as its chosen approach. Transfer learning is a technique that leverages preexisting models to enhance the diagnostic capabilities of the proposed system. Integrating MobileNet and MixNets allows for utilizing their respective functionalities, resulting in a dual-model methodology that enhances the comprehensiveness of skin cancer diagnosis. The results demonstrate impressive performance metrics, with MobileNet and MixNets models, and the proposed approach achieves an outstanding accuracy rate of 99.58%. The above findings underscore the efficacy of the dual-model method in effectively discerning between benign and malignant skin lesions. Moreover, the present study aims to examine the potential integration of emerging technologies to enhance the accuracy and practicality of diagnostics within real-world healthcare settings.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An efficient fake account identification in social media networks: Facebook and Instagram using NSGA-II algorithm 社交媒体网络中的高效虚假账户识别：使用 NSGA-II 算法识别 Facebook 和 Instagram 中的虚假账户

Neural Computing and Applications

Pub Date : 2024-08-28 DOI: 10.1007/s00521-024-10350-8

Amine Sallah, El Arbi Abdellaoui Alaoui, Abdelaaziz Hessane, Said Agoujil, Anand Nayyar

The widespread use of online social networks (OSNs) has made them prime targets for cyber attackers, who exploit these platforms for various malicious activities. As a result, a whole industry of black-market services has emerged, selling services based on the sale of fake accounts. Because of the massive rise of OSNs, the number of fraudulent accounts rapidly expands. Hence, this research focuses on detecting fraudulent profiles on Instagram and Facebook and aims to find an optimal subset of features that can effectively differentiate between real and fake accounts. The problem has been formulated as a multiobjective optimization task, aiming to maximize the classification accuracy while minimizing the number of selected features. NSGA-II (non-dominated sorting genetic algorithm II) is employed as the optimization algorithm to explore the trade-offs between these conflicting objectives. In the current study, a novel approach for feature selection using the NSGA-II optimization algorithm to detect fake accounts is proposed. The proposed methodology relies on input data comprising features characterizing the profiles under investigation. The selected features are utilized to train a machine learning model. The model’s performance is evaluated using various metrics, including precision, recall, F1-score, and receiver operating characteristic (ROC) curve. The final prediction model achieved accuracy values ranging from 90 to 99.88%. The results indicated that the model, utilizing features selected by the NSGA-II algorithm, delivered high prediction accuracy while using less than 31% of the total feature space. This efficient feature selection allowed for the precise differentiation between fake and real users, demonstrating the model’s effectiveness with a minimal number of input variables. Furthermore, the results of experiments demonstrate that the proposed approach achieves better performance as compared to other existing approaches. This research paper focuses on explainability, which refers to the ability to understand and interpret the decisions and outcomes of machine learning models.

在线社交网络（OSN）的广泛使用使其成为网络攻击者的主要目标，他们利用这些平台进行各种恶意活动。因此，整个黑市服务行业应运而生，以出售虚假账户为基础销售服务。由于 OSN 的大规模兴起，虚假账户的数量迅速膨胀。因此，本研究的重点是检测 Instagram 和 Facebook 上的虚假资料，旨在找到能有效区分真假账户的最佳特征子集。该问题被表述为一个多目标优化任务，旨在最大限度地提高分类准确率，同时最小化所选特征的数量。采用 NSGA-II（非支配排序遗传算法 II）作为优化算法，以探索这些相互冲突的目标之间的权衡。本研究提出了一种利用 NSGA-II 优化算法进行特征选择以检测假账户的新方法。所提出的方法依赖于输入数据，这些数据包含描述调查对象特征的特征。所选特征用于训练机器学习模型。该模型的性能使用各种指标进行评估，包括精确度、召回率、F1-分数和接收者操作特征曲线（ROC）。最终预测模型的准确率达到了 90% 到 99.88%。结果表明，该模型利用 NSGA-II 算法选择的特征，在使用不到总特征空间 31% 的情况下，实现了较高的预测准确率。这种高效的特征选择可以精确区分虚假用户和真实用户，证明了该模型在输入变量数量极少的情况下的有效性。此外，实验结果表明，与其他现有方法相比，所提出的方法取得了更好的性能。本文的研究重点是可解释性，即理解和解释机器学习模型的决策和结果的能力。

{"title":"An efficient fake account identification in social media networks: Facebook and Instagram using NSGA-II algorithm","authors":"Amine Sallah, El Arbi Abdellaoui Alaoui, Abdelaaziz Hessane, Said Agoujil, Anand Nayyar","doi":"10.1007/s00521-024-10350-8","DOIUrl":"https://doi.org/10.1007/s00521-024-10350-8","url":null,"abstract":"The widespread use of online social networks (OSNs) has made them prime targets for cyber attackers, who exploit these platforms for various malicious activities. As a result, a whole industry of black-market services has emerged, selling services based on the sale of fake accounts. Because of the massive rise of OSNs, the number of fraudulent accounts rapidly expands. Hence, this research focuses on detecting fraudulent profiles on Instagram and Facebook and aims to find an optimal subset of features that can effectively differentiate between real and fake accounts. The problem has been formulated as a multiobjective optimization task, aiming to maximize the classification accuracy while minimizing the number of selected features. NSGA-II (non-dominated sorting genetic algorithm II) is employed as the optimization algorithm to explore the trade-offs between these conflicting objectives. In the current study, a novel approach for feature selection using the NSGA-II optimization algorithm to detect fake accounts is proposed. The proposed methodology relies on input data comprising features characterizing the profiles under investigation. The selected features are utilized to train a machine learning model. The model’s performance is evaluated using various metrics, including precision, recall, F1-score, and receiver operating characteristic (ROC) curve. The final prediction model achieved accuracy values ranging from 90 to 99.88%. The results indicated that the model, utilizing features selected by the NSGA-II algorithm, delivered high prediction accuracy while using less than 31% of the total feature space. This efficient feature selection allowed for the precise differentiation between fake and real users, demonstrating the model’s effectiveness with a minimal number of input variables. Furthermore, the results of experiments demonstrate that the proposed approach achieves better performance as compared to other existing approaches. This research paper focuses on explainability, which refers to the ability to understand and interpret the decisions and outcomes of machine learning models.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Empowering global ethereum price prediction with EtherVoyant: a state-of-the-art time series forecasting model 利用 EtherVoyant 增强全球以太坊价格预测能力：最先进的时间序列预测模型

Neural Computing and Applications

Pub Date : 2024-08-27 DOI: 10.1007/s00521-024-10169-3

Umar Islam, Babar Shah, Abdullah A. Al-Atawi, Gioia Arnone, Mohamed R. Abonazel, Ijaz Ali, Fernando Moreira

Ethereum has emerged as a major platform for decentralized apps and smart contracts with the heightened interest in cryptocurrencies in recent years. Investors and market participants in the cryptocurrency space will find it increasingly important to use reliable price prediction models as Ethereum's popularity grows. To better estimate Ethereum prices around the world, we propose "EtherVoyant," a novel hybrid forecasting model that combines the advantages of ARIMA and SARIMA methods. To improve its forecasting abilities, EtherVoyant uses Ethereum price history to train ARIMA and SARIMA components independently before fusing their predictions. With the help of feature engineering and data preparation, we further improve the model so that it can deal with real-world difficulties like missing values and seasonality in the data. We also investigate hyperparameter optimization for the model's best possible performance. We compare EtherVoyant's forecasts against those of the more conventional ARIMA and SARIMA models to determine its efficacy. By providing more precise and trustworthy price forecasts, our trial results suggest that EtherVoyant is superior to the individual models. The importance of this study resides in the fact that it will lead to the creation of a sophisticated time series forecasting model that will be useful to cryptocurrency investors, traders, and decision-makers. We hope that by making EtherVoyant available on a worldwide scale, we will help advance the field of cryptocurrency analytics and encourage wider adoption of blockchain-based assets.

近年来，随着人们对加密货币的兴趣日益浓厚，以太坊已成为去中心化应用程序和智能合约的主要平台。随着以太坊越来越受欢迎，加密货币领域的投资者和市场参与者会发现使用可靠的价格预测模型越来越重要。为了更好地估算全球以太坊价格，我们提出了 "EtherVoyant"，一种结合了 ARIMA 和 SARIMA 方法优点的新型混合预测模型。为了提高预测能力，EtherVoyant 使用以太坊价格历史记录来独立训练 ARIMA 和 SARIMA 组件，然后再融合它们的预测结果。在特征工程和数据准备的帮助下，我们进一步改进了模型，使其能够处理数据中的缺失值和季节性等现实世界中的难题。我们还研究了超参数优化，以尽可能提高模型的性能。我们将 EtherVoyant 的预测与更传统的 ARIMA 和 SARIMA 模型进行比较，以确定其有效性。试验结果表明，EtherVoyant 能提供更精确、更可靠的价格预测，因此优于其他模型。这项研究的重要性在于，它将有助于创建一个复杂的时间序列预测模型，为加密货币投资者、交易商和决策者提供帮助。我们希望，通过在全球范围内提供 EtherVoyant，我们将帮助推动加密货币分析领域的发展，并鼓励更广泛地采用基于区块链的资产。

{"title":"Empowering global ethereum price prediction with EtherVoyant: a state-of-the-art time series forecasting model","authors":"Umar Islam, Babar Shah, Abdullah A. Al-Atawi, Gioia Arnone, Mohamed R. Abonazel, Ijaz Ali, Fernando Moreira","doi":"10.1007/s00521-024-10169-3","DOIUrl":"https://doi.org/10.1007/s00521-024-10169-3","url":null,"abstract":"Ethereum has emerged as a major platform for decentralized apps and smart contracts with the heightened interest in cryptocurrencies in recent years. Investors and market participants in the cryptocurrency space will find it increasingly important to use reliable price prediction models as Ethereum's popularity grows. To better estimate Ethereum prices around the world, we propose \"EtherVoyant,\" a novel hybrid forecasting model that combines the advantages of ARIMA and SARIMA methods. To improve its forecasting abilities, EtherVoyant uses Ethereum price history to train ARIMA and SARIMA components independently before fusing their predictions. With the help of feature engineering and data preparation, we further improve the model so that it can deal with real-world difficulties like missing values and seasonality in the data. We also investigate hyperparameter optimization for the model's best possible performance. We compare EtherVoyant's forecasts against those of the more conventional ARIMA and SARIMA models to determine its efficacy. By providing more precise and trustworthy price forecasts, our trial results suggest that EtherVoyant is superior to the individual models. The importance of this study resides in the fact that it will lead to the creation of a sophisticated time series forecasting model that will be useful to cryptocurrency investors, traders, and decision-makers. We hope that by making EtherVoyant available on a worldwide scale, we will help advance the field of cryptocurrency analytics and encourage wider adoption of blockchain-based assets.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A novel hyper-heuristic algorithm: an application to automatic voltage regulator 新型超启发式算法：应用于自动电压调节器

Neural Computing and Applications

Pub Date : 2024-08-27 DOI: 10.1007/s00521-024-10313-z

Yunus Hinislioglu, Ugur Guvenc

This paper presents a novel optimization algorithm called hyper-heuristic fitness-distance balance success-history-based adaptive differential evolution (HH-FDB-SHADE). The hyper-heuristic algorithms have two main structures: a hyper-selection framework and a low-level heuristic (LLH) pool. In the proposed algorithm, the FDB method is preferred as a high-level selection framework to evaluate the LLH pool algorithms. In addition, a total of 10 different strategies is derived from five mutation operators and two crossover methods for using them as the LLH pool. Balancing the exploration and exploitation capability of FDB is the main reason for being the selection framework of the proposed algorithm. The success of the HH-FDB-SHADE algorithm was tested on CEC-17 and CEC-20 benchmark test suits for different dimensional search spaces, and the obtained solutions from the HH-FDB-SHADE were compared to 10 different LLH pool algorithms. In addition, the HH-FDB-SHADE algorithm has been applied to optimize the control parameters of PID, PIDF, FOPID, and PIDD² in the optimal automatic voltage regulator (AVR) design problem to reveal the improved algorithm's performance more clearly and prove its success in solving engineering problems. The results obtained from the AVR system are compared with five other effective meta-heuristic search algorithms such as the fitness-distance balance Lévy Flight distribution, differential evolution, Harris–Hawks optimization, Barnacles mating optimizer, and Moth–Flame optimization algorithms in the literature. The results of the statistical analyses indicate that HH-FDB-SHADE is the best-ranked algorithm for solving CEC-17 and CEC-20 benchmark problems and gives better results compared to the LLH pool algorithms. Besides, the proposed algorithm is more effective and robust than five other meta-heuristic algorithms in solving optimal AVR design problems.

本文提出了一种新颖的优化算法，称为基于成功历史的超启发式适应性差分进化（HH-FDB-SHADE）。超启发式算法有两个主要结构：超选择框架和低级启发式（LLH）池。在所提出的算法中，首选 FDB 方法作为评估 LLH 池算法的高级选择框架。此外，五种突变算子和两种交叉方法共衍生出 10 种不同的策略，用作 LLH 池。平衡 FDB 的探索和利用能力是提出算法选择框架的主要原因。在不同维度搜索空间的 CEC-17 和 CEC-20 基准测试服上测试了 HH-FDB-SHADE 算法的成功性，并将 HH-FDB-SHADE 算法获得的解与 10 种不同的 LLH 池算法进行了比较。此外，HH-FDB-SHADE 算法还被应用于优化自动电压调节器（AVR）设计问题中的 PID、PIDF、FOPID 和 PIDD2 控制参数，以更清晰地揭示改进算法的性能，证明其在解决工程问题方面的成功。将 AVR 系统得到的结果与其他五种有效的元启发式搜索算法进行了比较，如文献中的适度-距离平衡 Lévy Flight 分布、差分进化、Harris-Hawks 优化、Barnacles 交配优化和 Moth-Flame 优化算法。统计分析结果表明，在解决 CEC-17 和 CEC-20 基准问题时，HH-FDB-SHADE 是排名最好的算法，与 LLH 池算法相比结果更好。此外，与其他五种元启发式算法相比，所提出的算法在解决 AVR 最佳设计问题时更加有效和稳健。

{"title":"A novel hyper-heuristic algorithm: an application to automatic voltage regulator","authors":"Yunus Hinislioglu, Ugur Guvenc","doi":"10.1007/s00521-024-10313-z","DOIUrl":"https://doi.org/10.1007/s00521-024-10313-z","url":null,"abstract":"This paper presents a novel optimization algorithm called hyper-heuristic fitness-distance balance success-history-based adaptive differential evolution (HH-FDB-SHADE). The hyper-heuristic algorithms have two main structures: a hyper-selection framework and a low-level heuristic (LLH) pool. In the proposed algorithm, the FDB method is preferred as a high-level selection framework to evaluate the LLH pool algorithms. In addition, a total of 10 different strategies is derived from five mutation operators and two crossover methods for using them as the LLH pool. Balancing the exploration and exploitation capability of FDB is the main reason for being the selection framework of the proposed algorithm. The success of the HH-FDB-SHADE algorithm was tested on CEC-17 and CEC-20 benchmark test suits for different dimensional search spaces, and the obtained solutions from the HH-FDB-SHADE were compared to 10 different LLH pool algorithms. In addition, the HH-FDB-SHADE algorithm has been applied to optimize the control parameters of PID, PIDF, FOPID, and PIDD2 in the optimal automatic voltage regulator (AVR) design problem to reveal the improved algorithm's performance more clearly and prove its success in solving engineering problems. The results obtained from the AVR system are compared with five other effective meta-heuristic search algorithms such as the fitness-distance balance Lévy Flight distribution, differential evolution, Harris–Hawks optimization, Barnacles mating optimizer, and Moth–Flame optimization algorithms in the literature. The results of the statistical analyses indicate that HH-FDB-SHADE is the best-ranked algorithm for solving CEC-17 and CEC-20 benchmark problems and gives better results compared to the LLH pool algorithms. Besides, the proposed algorithm is more effective and robust than five other meta-heuristic algorithms in solving optimal AVR design problems.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"567 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Integrating deep learning for accurate gastrointestinal cancer classification: a comprehensive analysis of MSI and MSS patterns using histopathology data 整合深度学习，准确进行胃肠道癌症分类：利用组织病理学数据全面分析 MSI 和 MSS 模式

Neural Computing and Applications

Pub Date : 2024-08-26 DOI: 10.1007/s00521-024-10287-y

Abeer A. Wafa, Reham M. Essa, Amr A. Abohany, Hanan E. Abdelkader

Early detection of microsatellite instability (MSI) and microsatellite stability (MSS) is crucial in the fight against gastrointestinal (GI) cancer. MSI is a sign of genetic instability often associated with DNA repair mechanism deficiencies, which can cause (GI) cancers. On the other hand, MSS signifies genomic stability in microsatellite regions. Differentiating between these two states is pivotal in clinical decision-making as it provides prognostic and predictive information and treatment strategies. Rapid identification of MSI and MSS enables oncologists to tailor therapies more accurately, potentially saving patients from unnecessary treatments and guiding them toward regimens with the highest likelihood of success. Detecting these microsatellite status markers at an initial stage can improve patient outcomes and quality of life in GI cancer management. Our research paper introduces a cutting-edge method for detecting early GI cancer using deep learning (DL). Our goal is to identify the optimal model for GI cancer detection that surpasses previous works. Our proposed model comprises four stages: data acquisition, image processing, feature extraction, and classification. We use histopathology images from the Cancer Genome Atlas (TCGA) and Kaggle website with some modifications for data acquisition. In the image processing stage, we apply various operations such as color transformation, resizing, normalization, and labeling to prepare the input image for enrollment in our DL models. We present five different DL models, including convolutional neural networks (CNNs), a hybrid of CNNs-simple RNN (recurrent neural network), a hybrid of CNNs with long short-term memory (LSTM) (CNNs-LSTM), a hybrid of CNNs with gated recurrent unit (GRU) (CNNs-GRU), and a hybrid of CNNs-SimpleRNN-LSTM-GRU. Our empirical results demonstrate that CNNs-SimpleRNN-LSTM-GRU outperforms other models in accuracy, specificity, recall, precision, AUC, and F1, achieving an accuracy of 99.90%. Our proposed methodology offers significant improvements in GI cancer detection compared to recent techniques, highlighting the potential of DL-based approaches for histopathology data. We expect our findings to inspire future research in DL-based GI cancer detection.

早期检测微卫星不稳定性（MSI）和微卫星稳定性（MSS）对于抗击胃肠道癌症（GI）至关重要。MSI 是遗传不稳定性的标志，通常与 DNA 修复机制缺陷有关，可导致胃肠道癌症。另一方面，MSS 标志着微卫星区域的基因组稳定性。区分这两种状态在临床决策中至关重要，因为它提供了预后和预测信息以及治疗策略。快速识别 MSI 和 MSS 使肿瘤学家能够更准确地定制治疗方案，从而使患者免于不必要的治疗，并指导他们采用最有可能成功的治疗方案。在消化道癌症治疗的初始阶段检测这些微卫星状态标记物可以改善患者的预后和生活质量。我们的研究论文介绍了一种利用深度学习（DL）检测早期消化道癌症的前沿方法。我们的目标是找出消化道癌症检测的最佳模型，以超越之前的研究成果。我们提出的模型包括四个阶段：数据采集、图像处理、特征提取和分类。我们使用来自癌症基因组图谱（TCGA）和 Kaggle 网站的组织病理学图像，并对其进行了一些修改以获取数据。在图像处理阶段，我们应用各种操作，如颜色转换、大小调整、归一化和标记，以准备输入图像，供我们的 DL 模型使用。我们提出了五种不同的 DL 模型，包括卷积神经网络（CNN）、CNN-简单 RNN（递归神经网络）混合模型、CNN-长短时记忆（LSTM）混合模型（CNNs-LSTM）、CNN-门控递归单元（GRU）混合模型（CNNs-GRU）以及 CNN-SimpleRNN-LSTM-GRU 混合模型。实证结果表明，CNNs-SimpleRNN-LSTM-GRU 在准确度、特异性、召回率、精确度、AUC 和 F1 方面均优于其他模型，准确度达到 99.90%。与最近的技术相比，我们提出的方法在消化道癌症检测方面有显著改进，突出了基于 DL 的方法在组织病理学数据方面的潜力。我们希望我们的研究结果能对未来基于 DL 的消化道癌症检测研究有所启发。

{"title":"Integrating deep learning for accurate gastrointestinal cancer classification: a comprehensive analysis of MSI and MSS patterns using histopathology data","authors":"Abeer A. Wafa, Reham M. Essa, Amr A. Abohany, Hanan E. Abdelkader","doi":"10.1007/s00521-024-10287-y","DOIUrl":"https://doi.org/10.1007/s00521-024-10287-y","url":null,"abstract":"Early detection of microsatellite instability (MSI) and microsatellite stability (MSS) is crucial in the fight against gastrointestinal (GI) cancer. MSI is a sign of genetic instability often associated with DNA repair mechanism deficiencies, which can cause (GI) cancers. On the other hand, MSS signifies genomic stability in microsatellite regions. Differentiating between these two states is pivotal in clinical decision-making as it provides prognostic and predictive information and treatment strategies. Rapid identification of MSI and MSS enables oncologists to tailor therapies more accurately, potentially saving patients from unnecessary treatments and guiding them toward regimens with the highest likelihood of success. Detecting these microsatellite status markers at an initial stage can improve patient outcomes and quality of life in GI cancer management. Our research paper introduces a cutting-edge method for detecting early GI cancer using deep learning (DL). Our goal is to identify the optimal model for GI cancer detection that surpasses previous works. Our proposed model comprises four stages: data acquisition, image processing, feature extraction, and classification. We use histopathology images from the Cancer Genome Atlas (TCGA) and Kaggle website with some modifications for data acquisition. In the image processing stage, we apply various operations such as color transformation, resizing, normalization, and labeling to prepare the input image for enrollment in our DL models. We present five different DL models, including convolutional neural networks (CNNs), a hybrid of CNNs-simple RNN (recurrent neural network), a hybrid of CNNs with long short-term memory (LSTM) (CNNs-LSTM), a hybrid of CNNs with gated recurrent unit (GRU) (CNNs-GRU), and a hybrid of CNNs-SimpleRNN-LSTM-GRU. Our empirical results demonstrate that CNNs-SimpleRNN-LSTM-GRU outperforms other models in accuracy, specificity, recall, precision, AUC, and F1, achieving an accuracy of 99.90%. Our proposed methodology offers significant improvements in GI cancer detection compared to recent techniques, highlighting the potential of DL-based approaches for histopathology data. We expect our findings to inspire future research in DL-based GI cancer detection.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"29 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SuspAct: novel suspicious activity prediction based on deep learning in the real-time environment SuspAct：实时环境中基于深度学习的新型可疑活动预测

Neural Computing and Applications

Pub Date : 2024-08-26 DOI: 10.1007/s00521-024-10355-3

Sachin Kansal, Akshat Kumar Jain, Moyukh Biswas, Shaurya Bansal, Namay Mahindru, Priya Kansal

In today’s evolving landscape of video surveillance, our study introduces SuspAct, an innovative ensemble model designed to detect suspicious activities in real time swiftly. Leveraging advanced Long-term Recurrent Convolutional Networks (LRCN), SuspAct represents a significant advancement in intelligent surveillance technology. By combining insights from various LRCN models through the Majority Voting ensemble technique, SuspAct enhances its overall robustness, outperforming traditional surveillance methods. Through rigorous experimentation on large-scale datasets, we demonstrate SuspAct’s superiority in proactive crime prevention, showcasing its potential to revolutionize security protocols and contribute substantially to public safety. Our work addresses the challenges posed by the escalating volume of video data and lays a strong foundation for future advancements in intelligent video surveillance technology.

在视频监控不断发展的今天，我们的研究引入了一种创新的集合模型 SuspAct，旨在实时快速地检测可疑活动。利用先进的长期递归卷积网络（LRCN），SuspAct 代表了智能监控技术的重大进步。通过 Majority Voting 集合技术，SuspAct 将各种 LRCN 模型的洞察力结合在一起，增强了其整体鲁棒性，表现优于传统监控方法。通过在大规模数据集上进行严格的实验，我们证明了 SuspAct 在主动预防犯罪方面的优势，展示了其彻底改变安全协议并为公共安全做出重大贡献的潜力。我们的工作解决了视频数据量不断攀升带来的挑战，为未来智能视频监控技术的发展奠定了坚实的基础。

引用次数: 0

Optihybrid: a modified firebug swarm optimization algorithm for optimal sizing of hybrid renewable power system Optihybrid：用于优化可再生能源混合发电系统规模的改进型火虫群优化算法

Neural Computing and Applications

Pub Date : 2024-08-26 DOI: 10.1007/s00521-024-10196-0

Hoda Abd El-Sattar, Salah Kamel, Fatma A. Hashim, Sahar F. Sabbeh

In areas where conventional energy sources are unavailable, alternative energy technologies play a crucial role in generating electricity. These technologies offer various benefits, such as reliable energy supply, environmental sustainability, and employment opportunities in rural regions. This study focuses on the development of a novel optimization algorithm called the modified firebug swarm algorithm (mFSO). Its objective is to determine the optimal size of an integrated renewable power system for supplying electricity to a specific remote site in Dehiba town, located in the eastern province of Tataouine, Tunisia. The proposed configuration for the standalone hybrid system involves PV/biomass/battery, and three objective functions are considered: minimizing the total energy cost (COE), reducing the loss of power supply probability (LPSP), and managing excess energy (EXC). The effectiveness of the modified algorithm is evaluated using various tests, including the Wilcoxon test, boxplot analysis, and the ten benchmark functions of the CEC2020 benchmark. Comparative analysis between the mFSO and widely used algorithms like the original Firebug Swarm Optimization (FSO), Slime Mold Algorithm (SMA), and Seagull Optimization Algorithm (SOA) demonstrates that the proposed mFSO technique is efficient and effective in solving the design problem, surpassing other optimization algorithms.

在缺乏传统能源的地区，替代能源技术在发电方面发挥着至关重要的作用。这些技术可为农村地区带来各种好处，如可靠的能源供应、环境可持续性和就业机会。本研究的重点是开发一种名为 "改良火虫群算法（mFSO）"的新型优化算法。其目标是确定一个综合可再生能源发电系统的最佳规模，以便为突尼斯东部塔塔瓦内省 Dehiba 镇的一个特定偏远地点供电。建议的独立混合系统配置包括光伏/生物质/电池，并考虑了三个目标函数：最小化总能源成本（COE）、降低供电损失概率（LPSP）和管理过剩能源（EXC）。通过各种测试，包括 Wilcoxon 检验、方框图分析和 CEC2020 基准的十个基准函数，对改进算法的有效性进行了评估。mFSO 与广泛使用的算法（如原始 Firebug Swarm Optimization（FSO）、Slime Mold Algorithm（SMA）和 Seagull Optimization Algorithm（SOA））之间的比较分析表明，所提出的 mFSO 技术在解决设计问题方面效率高、效果好，超过了其他优化算法。

{"title":"Optihybrid: a modified firebug swarm optimization algorithm for optimal sizing of hybrid renewable power system","authors":"Hoda Abd El-Sattar, Salah Kamel, Fatma A. Hashim, Sahar F. Sabbeh","doi":"10.1007/s00521-024-10196-0","DOIUrl":"https://doi.org/10.1007/s00521-024-10196-0","url":null,"abstract":"In areas where conventional energy sources are unavailable, alternative energy technologies play a crucial role in generating electricity. These technologies offer various benefits, such as reliable energy supply, environmental sustainability, and employment opportunities in rural regions. This study focuses on the development of a novel optimization algorithm called the modified firebug swarm algorithm (mFSO). Its objective is to determine the optimal size of an integrated renewable power system for supplying electricity to a specific remote site in Dehiba town, located in the eastern province of Tataouine, Tunisia. The proposed configuration for the standalone hybrid system involves PV/biomass/battery, and three objective functions are considered: minimizing the total energy cost (COE), reducing the loss of power supply probability (LPSP), and managing excess energy (EXC). The effectiveness of the modified algorithm is evaluated using various tests, including the Wilcoxon test, boxplot analysis, and the ten benchmark functions of the CEC2020 benchmark. Comparative analysis between the mFSO and widely used algorithms like the original Firebug Swarm Optimization (FSO), Slime Mold Algorithm (SMA), and Seagull Optimization Algorithm (SOA) demonstrates that the proposed mFSO technique is efficient and effective in solving the design problem, surpassing other optimization algorithms.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Heart rate variability analysis in controls and epilepsy patients with or without receiving treatment: a clinical review and meta-analysis 对照组和接受或未接受治疗的癫痫患者的心率变异性分析：临床回顾和荟萃分析

Neural Computing and Applications

Pub Date : 2024-08-25 DOI: 10.1007/s00521-024-10135-z

Muhammad Bilal Shahnawaz, Hassan Dawooda, Uzair Iqbal

The malfunctioning of cardiac autonomic control in epileptic patients develops ventricular tachyarrhythmia and causes sudden unexpected death in epilepsy patients (SUDEP). Various clinical studies investigated the effect of epilepsy on cardiac autonomic control by performing heart rate variability (HRV) analysis; however, results are unclear regarding whether sympathetic, parasympathetic, or both branches of the autonomic nervous system (ANS) are affected in epilepsy and also the impact of anticonvulsant treatment on the ANS. This study follows the systematic protocols to investigate epilepsy and its anticonvulsant treatment on cardiac autonomic control by using linear and nonlinear HRV analysis measures. The electronic databases of PubMed, Embase, and Cochrane Library were used for the collection of studies. Initially, 1475 articles were identified whereas after 2-staged exclusion criteria, 33 studies were selected for execution of the review process and meta-analysis. For meta-analysis, four comparisons were performed (epilepsy patients): (1) controls (healthy subject with no history of epilepsy) versus untreated patients; (2) treated (patients under treatment that have a seizure) versus untreated patients; (3) controls versus treated patients; and (4) refractory versus well-controlled (epilepsy patients that were seizure-free for last 1 year). For treated and untreated patients, there was no significant difference whereas well-controlled patients presented higher values as compared to refractory patients. Meta-analysis was performed for the time-domain, frequency-domain, and nonlinear parameters. Untreated patients in comparison with controls presented significantly lower HF (high-frequency) and LF (low-frequency) values. These LF (g = − 0.9; 95% CI − 1.48 to − 0.37) and HF (g = − 0.69; 95% confidence interval (CI) − 1.24 to − 0.16) values were affirming suppressed both, vagal and sympathetic activity, respectively. Additionally, LF and HF value was increased in most of the studies indicating suppressed vagal tone, while for some studies, their value decreased to indicate suppressed sympathetic activity. No significant difference was observed for the remaining comparisons. Results affirmed the hypothesis that suppressed sympathetic activity affects sympathovagal balance and leads to SUDEP, as the LF value was significantly lower for patients as compared to healthy subjects. The overall effect size and statistical results for LF and HF were significant, showing the research and clinical significance of our study.

癫痫患者的心脏自主神经控制功能失调会导致室性心动过速，并导致癫痫患者意外猝死（SUDEP）。多项临床研究通过心率变异性（HRV）分析调查了癫痫对心脏自律神经控制的影响；然而，关于癫痫患者自律神经系统（ANS）的交感神经、副交感神经或两个分支均受影响，以及抗惊厥治疗对自律神经系统的影响，研究结果尚不明确。本研究采用线性和非线性心率变异分析方法，按照系统规程研究癫痫及其抗惊厥治疗对心脏自律神经控制的影响。本研究使用 PubMed、Embase 和 Cochrane Library 等电子数据库收集研究资料。最初确定了 1475 篇文章，经过两阶段的排除标准后，选择了 33 项研究进行审查和荟萃分析。在荟萃分析中，进行了四项比较（癫痫患者）：(1) 对照组（无癫痫病史的健康人）与未接受治疗的患者；(2) 接受治疗的患者（接受治疗但有癫痫发作的患者）与未接受治疗的患者；(3) 对照组与接受治疗的患者；(4) 难治性与控制良好的患者（在过去一年中没有癫痫发作的癫痫患者）。接受治疗和未接受治疗的患者之间没有明显差异，而控制良好的患者与难治性患者相比数值更高。对时域、频域和非线性参数进行了元分析。与对照组相比，未经治疗的患者的高频（HF）和低频（LF）值明显较低。这些低频（g = - 0.9；95% 置信区间（CI）- 1.48 至 - 0.37）和高频（g = - 0.69；95% 置信区间（CI）- 1.24 至 - 0.16）值分别证实迷走神经和交感神经活动受到抑制。此外，大多数研究的低频和高频值增加，表明迷走神经张力受到抑制，而一些研究的低频和高频值降低，表明交感神经活动受到抑制。其余比较未发现明显差异。结果证实了交感神经活动受抑制会影响交感迷走平衡并导致 SUDEP 的假设，因为与健康受试者相比，患者的 LF 值明显较低。低频和高频的总体效应大小和统计结果均具有显著性，这表明我们的研究具有研究和临床意义。

{"title":"Heart rate variability analysis in controls and epilepsy patients with or without receiving treatment: a clinical review and meta-analysis","authors":"Muhammad Bilal Shahnawaz, Hassan Dawooda, Uzair Iqbal","doi":"10.1007/s00521-024-10135-z","DOIUrl":"https://doi.org/10.1007/s00521-024-10135-z","url":null,"abstract":"The malfunctioning of cardiac autonomic control in epileptic patients develops ventricular tachyarrhythmia and causes sudden unexpected death in epilepsy patients (SUDEP). Various clinical studies investigated the effect of epilepsy on cardiac autonomic control by performing heart rate variability (HRV) analysis; however, results are unclear regarding whether sympathetic, parasympathetic, or both branches of the autonomic nervous system (ANS) are affected in epilepsy and also the impact of anticonvulsant treatment on the ANS. This study follows the systematic protocols to investigate epilepsy and its anticonvulsant treatment on cardiac autonomic control by using linear and nonlinear HRV analysis measures. The electronic databases of PubMed, Embase, and Cochrane Library were used for the collection of studies. Initially, 1475 articles were identified whereas after 2-staged exclusion criteria, 33 studies were selected for execution of the review process and meta-analysis. For meta-analysis, four comparisons were performed (epilepsy patients): (1) controls (healthy subject with no history of epilepsy) versus untreated patients; (2) treated (patients under treatment that have a seizure) versus untreated patients; (3) controls versus treated patients; and (4) refractory versus well-controlled (epilepsy patients that were seizure-free for last 1 year). For treated and untreated patients, there was no significant difference whereas well-controlled patients presented higher values as compared to refractory patients. Meta-analysis was performed for the time-domain, frequency-domain, and nonlinear parameters. Untreated patients in comparison with controls presented significantly lower HF (high-frequency) and LF (low-frequency) values. These LF (g = − 0.9; 95% CI − 1.48 to − 0.37) and HF (g = − 0.69; 95% confidence interval (CI) − 1.24 to − 0.16) values were affirming suppressed both, vagal and sympathetic activity, respectively. Additionally, LF and HF value was increased in most of the studies indicating suppressed vagal tone, while for some studies, their value decreased to indicate suppressed sympathetic activity. No significant difference was observed for the remaining comparisons. Results affirmed the hypothesis that suppressed sympathetic activity affects sympathovagal balance and leads to SUDEP, as the LF value was significantly lower for patients as compared to healthy subjects. The overall effect size and statistical results for LF and HF were significant, showing the research and clinical significance of our study.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"2010 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A deep fusion model for stock market prediction with news headlines and time series data 利用新闻标题和时间序列数据进行股市预测的深度融合模型

Neural Computing and Applications

Pub Date : 2024-08-24 DOI: 10.1007/s00521-024-10303-1

Pinyu Chen, Zois Boukouvalas, Roberto Corizzo

Time series forecasting models are essential decision support tools in real-world domains. Stock market is a remarkably complex domain, due to its quickly evolving temporal nature, as well as the multiple factors having an impact on stock prices. To date, a number of machine learning-based approaches have been proposed in the literature to tackle stock trend prediction. However, they typically tend to analyze a single data source or modality, or consider multiple modalities in isolation and rely on simple combination strategies, with a potential reduction in their modeling power. In this paper, we propose a multimodal deep fusion model to predict stock trends, leveraging daily stock prices, technical indicators, and sentiment in daily news headlines published by media outlets. The proposed architecture leverages a BERT-based model branch fine-tuned on financial news and a long short-term memory (LSTM) branch that captures relevant temporal patterns in multivariate data, including stock prices and technical indicators. Our experiments on 12 different stock datasets with prices and news headlines demonstrate that our proposed model is more effective than popular baseline approaches, both in terms of accuracy and trading performance in a portfolio analysis simulation, highlighting the positive impact of multimodal deep learning for stock trend prediction.

时间序列预测模型是现实世界中必不可少的决策支持工具。股票市场是一个非常复杂的领域，因为它具有快速变化的时间特性，而且有多种因素对股票价格产生影响。迄今为止，文献中已经提出了许多基于机器学习的方法来解决股票趋势预测问题。然而，这些方法通常倾向于分析单一数据源或模式，或孤立地考虑多种模式，并依赖于简单的组合策略，其建模能力可能会降低。在本文中，我们提出了一种多模态深度融合模型，利用每日股价、技术指标和媒体发布的每日新闻标题中的情绪来预测股票走势。所提出的架构利用了基于 BERT 的模型分支和长短期记忆（LSTM）分支，前者根据财经新闻进行微调，后者捕捉多元数据（包括股票价格和技术指标）中的相关时间模式。我们在包含价格和新闻标题的 12 个不同股票数据集上进行的实验表明，在投资组合分析模拟中，我们提出的模型在准确性和交易性能方面都比流行的基线方法更有效，这凸显了多模态深度学习对股票趋势预测的积极影响。

{"title":"A deep fusion model for stock market prediction with news headlines and time series data","authors":"Pinyu Chen, Zois Boukouvalas, Roberto Corizzo","doi":"10.1007/s00521-024-10303-1","DOIUrl":"https://doi.org/10.1007/s00521-024-10303-1","url":null,"abstract":"Time series forecasting models are essential decision support tools in real-world domains. Stock market is a remarkably complex domain, due to its quickly evolving temporal nature, as well as the multiple factors having an impact on stock prices. To date, a number of machine learning-based approaches have been proposed in the literature to tackle stock trend prediction. However, they typically tend to analyze a single data source or modality, or consider multiple modalities in isolation and rely on simple combination strategies, with a potential reduction in their modeling power. In this paper, we propose a multimodal deep fusion model to predict stock trends, leveraging daily stock prices, technical indicators, and sentiment in daily news headlines published by media outlets. The proposed architecture leverages a BERT-based model branch fine-tuned on financial news and a long short-term memory (LSTM) branch that captures relevant temporal patterns in multivariate data, including stock prices and technical indicators. Our experiments on 12 different stock datasets with prices and news headlines demonstrate that our proposed model is more effective than popular baseline approaches, both in terms of accuracy and trading performance in a portfolio analysis simulation, highlighting the positive impact of multimodal deep learning for stock trend prediction.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"73 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142224559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0