首页 > 最新文献

Intelligent Systems with Applications最新文献

英文 中文
A corporate credit evaluation method considering strong feature privacy with non-private label: A vertical heterogeneous feature fusion approach 一种考虑非私有标签强特征隐私的企业信用评价方法:垂直异构特征融合方法
IF 4.3 Pub Date : 2025-11-13 DOI: 10.1016/j.iswa.2025.200603
Xifeng Ning , Chao Yang , Hailu Sun , Xinyuan Song , Zifan Hu , Yu Feng , Jiawei Li , Yifan Zhu
In modern monitoring and operational management, whether in industrial systems, financial risk control, or infrastructure maintenance, decision-making increasingly relies on integrating heterogeneous data from multiple sources. However, due to data privacy regulations, distributed storage, communication constraints, and sensor failures, it is often difficult to centralize modeling when dealing with high-dimensional, incomplete datasets held by different institutions. Federated learning offers a privacy-preserving joint modeling solution, yet still faces challenges such as high communication overhead, low robustness to participant dropout, and risks of gradient leakage. In certain incomplete-data scenarios, not all data is private—labels such as equipment inspection results, fault reports, or corporate blacklists and whitelists published by authoritative bodies may be public—while feature data remains private and partially missing. To address this, we propose an innovative collaborative modeling framework tailored for incomplete-data monitoring and operations, in which each participant independently trains a model on its private features and exchanges only prediction results rather than gradients. Inspired by collective expert scoring, each “expert” evaluates based on its own data, then shares scores that are integrated into a comprehensive assessment. This approach offers multiple advantages: independent model training for each party, improved efficiency by migrating only prediction results, enhanced security by avoiding gradient transmission, and higher robustness since the failure of one participant does not halt others’ training. We present three variants of this prediction-result fusion method and evaluate them on representative datasets, including enterprise credit risk assessment as a case study, comparing against vertical federated logistic regression. Experimental results validate the effectiveness of the proposed approach, which can be widely applied to diverse monitoring and operational scenarios under incomplete data conditions.
在现代监控和运营管理中,无论是工业系统、金融风险控制还是基础设施维护,决策越来越依赖于对多源异构数据的集成。然而,由于数据隐私法规、分布式存储、通信约束和传感器故障,在处理不同机构持有的高维、不完整数据集时,通常很难集中建模。联邦学习提供了一种保护隐私的联合建模解决方案,但仍然面临着诸如高通信开销、参与者退出的低鲁棒性以及梯度泄漏风险等挑战。在某些数据不完整的场景中,并非所有数据都是私有数据,例如权威机构发布的设备检查结果、故障报告或企业黑名单和白名单可能是公开的,而特征数据仍然是私有的,部分缺失。为了解决这个问题,我们提出了一个创新的协作建模框架,为不完整的数据监测和操作量身定制,其中每个参与者根据其私有特征独立训练模型,并且只交换预测结果而不是梯度。受集体专家评分的启发,每个“专家”根据自己的数据进行评估,然后分享分数,这些分数被整合到一个综合评估中。这种方法具有多种优势:对每一方进行独立的模型训练,通过只迁移预测结果提高效率,通过避免梯度传输增强安全性,并且由于一个参与者的失败不会停止其他参与者的训练,因此具有更高的鲁棒性。我们提出了这种预测-结果融合方法的三种变体,并在代表性数据集上对它们进行了评估,其中包括以企业信用风险评估为例的研究,并与垂直联邦逻辑回归进行了比较。实验结果验证了该方法的有效性,可广泛应用于不完全数据条件下的各种监测和操作场景。
{"title":"A corporate credit evaluation method considering strong feature privacy with non-private label: A vertical heterogeneous feature fusion approach","authors":"Xifeng Ning ,&nbsp;Chao Yang ,&nbsp;Hailu Sun ,&nbsp;Xinyuan Song ,&nbsp;Zifan Hu ,&nbsp;Yu Feng ,&nbsp;Jiawei Li ,&nbsp;Yifan Zhu","doi":"10.1016/j.iswa.2025.200603","DOIUrl":"10.1016/j.iswa.2025.200603","url":null,"abstract":"<div><div>In modern monitoring and operational management, whether in industrial systems, financial risk control, or infrastructure maintenance, decision-making increasingly relies on integrating heterogeneous data from multiple sources. However, due to data privacy regulations, distributed storage, communication constraints, and sensor failures, it is often difficult to centralize modeling when dealing with high-dimensional, incomplete datasets held by different institutions. Federated learning offers a privacy-preserving joint modeling solution, yet still faces challenges such as high communication overhead, low robustness to participant dropout, and risks of gradient leakage. In certain incomplete-data scenarios, not all data is private—labels such as equipment inspection results, fault reports, or corporate blacklists and whitelists published by authoritative bodies may be public—while feature data remains private and partially missing. To address this, we propose an innovative collaborative modeling framework tailored for incomplete-data monitoring and operations, in which each participant independently trains a model on its private features and exchanges only prediction results rather than gradients. Inspired by collective expert scoring, each “expert” evaluates based on its own data, then shares scores that are integrated into a comprehensive assessment. This approach offers multiple advantages: independent model training for each party, improved efficiency by migrating only prediction results, enhanced security by avoiding gradient transmission, and higher robustness since the failure of one participant does not halt others’ training. We present three variants of this prediction-result fusion method and evaluate them on representative datasets, including enterprise credit risk assessment as a case study, comparing against vertical federated logistic regression. Experimental results validate the effectiveness of the proposed approach, which can be widely applied to diverse monitoring and operational scenarios under incomplete data conditions.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200603"},"PeriodicalIF":4.3,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention-based fuzzy neural networks for self-supervised data annotation 基于注意力的模糊神经网络自监督数据标注
IF 4.3 Pub Date : 2025-11-13 DOI: 10.1016/j.iswa.2025.200610
Md Rakibul Islam, Shahina Begum, Mobyen Uddin Ahmed, Shaibal Barua
Annotating vibration data from heavy-duty pumps in the mining industry is highly challenging because it demands domain knowledge, a complex inspection setup, and, in many cases, remains infeasible. A self-supervised data annotation (SSDA) framework is therefore proposed and evaluated on historical data of slurry-pump vibration signals. The framework began with the collection of heterogeneous information, followed by information fusion using an autoencoder. This was then followed by a datafication step for preprocessing and achieving a better representation of features through a feature embedding technique. As a result, redundant information was pushed into an eight-dimensional latent space, achieving a reconstruction loss of 0.0023. Furthermore, Initial data annotation was obtained by combining the Isolation Forest and Kneedle algorithms to locate a data-driven knee or threshold, and it was found to be 0.58 for predicting labels. Partial samples were labeled and considered accurate. Lastly, an attention-based fuzzy neural network (AFNN) is trained on those labels where membership functions convert each latent feature into graded truth values. At the same time, an attention layer highlights the most relevant rules. An iterative self-training loop was implemented to refine the training set and obtain labeled data with higher model confidence. Here, we also tested six baseline models and found AFNN quite impressive. After seven iterations 2780 of 2872 samples were labeled and the remaining 92 are considered uncertain, still need some review from an expert, and the AFNN model confidence was (96.8%). Statistical analysis confirmed that the model predictions were significantly associated with true labels (p<0.05) and not driven by chance.
对采矿行业重型泵的振动数据进行注释是一项极具挑战性的工作,因为它需要领域知识和复杂的检测设置,而且在许多情况下仍然是不可行的。为此,提出了一种自监督数据注释(SSDA)框架,并对浆料泵振动信号历史数据进行了评价。该框架从异构信息的收集开始,然后使用自编码器进行信息融合。接下来是数据预处理步骤,并通过特征嵌入技术实现更好的特征表示。结果,冗余信息被推入八维潜在空间,重构损失为0.0023。此外,结合隔离森林和膝关节算法获得初始数据注释,以定位数据驱动的膝关节或阈值,发现预测标签的概率为0.58。部分样品被标记并被认为是准确的。最后,在这些标签上训练基于注意力的模糊神经网络(AFNN),其中隶属函数将每个潜在特征转换为分级真值。与此同时,注意力层突出了最相关的规则。采用迭代自训练循环对训练集进行细化,得到具有较高模型置信度的标记数据。在这里,我们还测试了六个基线模型,发现AFNN非常令人印象深刻。经过7次迭代,2872个样本中的2780个被标记,剩下的92个被认为是不确定的,仍然需要专家的一些审查,AFNN模型置信度为(96.8%)。统计分析证实,模型预测与真实标签显著相关(p<0.05),并非偶然驱动。
{"title":"Attention-based fuzzy neural networks for self-supervised data annotation","authors":"Md Rakibul Islam,&nbsp;Shahina Begum,&nbsp;Mobyen Uddin Ahmed,&nbsp;Shaibal Barua","doi":"10.1016/j.iswa.2025.200610","DOIUrl":"10.1016/j.iswa.2025.200610","url":null,"abstract":"<div><div>Annotating vibration data from heavy-duty pumps in the mining industry is highly challenging because it demands domain knowledge, a complex inspection setup, and, in many cases, remains infeasible. A self-supervised data annotation (SSDA) framework is therefore proposed and evaluated on historical data of slurry-pump vibration signals. The framework began with the collection of heterogeneous information, followed by information fusion using an autoencoder. This was then followed by a datafication step for preprocessing and achieving a better representation of features through a feature embedding technique. As a result, redundant information was pushed into an eight-dimensional latent space, achieving a reconstruction loss of 0.0023. Furthermore, Initial data annotation was obtained by combining the Isolation Forest and Kneedle algorithms to locate a data-driven knee or threshold, and it was found to be 0.58 for predicting labels. Partial samples were labeled and considered accurate. Lastly, an attention-based fuzzy neural network (AFNN) is trained on those labels where membership functions convert each latent feature into graded truth values. At the same time, an attention layer highlights the most relevant rules. An iterative self-training loop was implemented to refine the training set and obtain labeled data with higher model confidence. Here, we also tested six baseline models and found AFNN quite impressive. After seven iterations 2780 of 2872 samples were labeled and the remaining 92 are considered uncertain, still need some review from an expert, and the AFNN model confidence was (96.8%). Statistical analysis confirmed that the model predictions were significantly associated with true labels (<span><math><mrow><mi>p</mi><mo>&lt;</mo><mn>0</mn><mo>.</mo><mn>05</mn></mrow></math></span>) and not driven by chance.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200610"},"PeriodicalIF":4.3,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing printing processes with MCTS 使用MCTS优化打印过程
IF 4.3 Pub Date : 2025-11-10 DOI: 10.1016/j.iswa.2025.200602
Kadri Kukk , Ants Torim , Erki Eessaar , Tarmo Kadak
The printing industry benefits from digitalizing workflows such as customer quoting. Intelligent printing process planning is essential to determine the near-optimal price for automated quoting. This paper addresses the automation of sheet imposition, a critical and computationally intensive step in optimizing the printing process that belongs to the general class of cutting and packing problems. We propose a simple recursive sheet imposition representation as the basis for our algorithms. The Brute Force algorithm for optimizing sheet imposition guarantees the cheapest solution but is computationally infeasible for complex tasks. As alternatives, we investigate heuristic algorithms, specifically Monte Carlo Tree Search (MCTS) and Simulated Annealing (SA). Our findings show that while Brute Force is prohibitively slow, MCTS strikes a robust balance between computational performance and solution quality, consistently finding solutions within a 5% margin of optimal price. Although SA can occasionally find superior solutions, MCTS provides a more reliable and efficient approach by consistently delivering results close to the optimal price.
印刷行业受益于数字化工作流程,如客户报价。智能印刷工艺规划对于确定近乎最优的自动报价价格至关重要。本文讨论了纸张拼版的自动化,这是优化印刷过程的一个关键和计算密集的步骤,属于一般的切割和包装问题。我们提出了一个简单的递归拼版表示作为我们算法的基础。蛮力算法用于优化板材拼装保证了最便宜的解决方案,但计算上不可行的复杂任务。作为替代方案,我们研究了启发式算法,特别是蒙特卡罗树搜索(MCTS)和模拟退火(SA)。我们的研究结果表明,虽然蛮力算法速度非常慢,但MCTS在计算性能和解决方案质量之间取得了良好的平衡,始终在最优价格的5%范围内找到解决方案。虽然SA偶尔可以找到更好的解决方案,但MCTS提供了一种更可靠、更有效的方法,它始终如一地提供接近最优价格的结果。
{"title":"Optimizing printing processes with MCTS","authors":"Kadri Kukk ,&nbsp;Ants Torim ,&nbsp;Erki Eessaar ,&nbsp;Tarmo Kadak","doi":"10.1016/j.iswa.2025.200602","DOIUrl":"10.1016/j.iswa.2025.200602","url":null,"abstract":"<div><div>The printing industry benefits from digitalizing workflows such as customer quoting. Intelligent printing process planning is essential to determine the near-optimal price for automated quoting. This paper addresses the automation of sheet imposition, a critical and computationally intensive step in optimizing the printing process that belongs to the general class of cutting and packing problems. We propose a simple recursive sheet imposition representation as the basis for our algorithms. The Brute Force algorithm for optimizing sheet imposition guarantees the cheapest solution but is computationally infeasible for complex tasks. As alternatives, we investigate heuristic algorithms, specifically Monte Carlo Tree Search (MCTS) and Simulated Annealing (SA). Our findings show that while Brute Force is prohibitively slow, MCTS strikes a robust balance between computational performance and solution quality, consistently finding solutions within a 5% margin of optimal price. Although SA can occasionally find superior solutions, MCTS provides a more reliable and efficient approach by consistently delivering results close to the optimal price.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200602"},"PeriodicalIF":4.3,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Online Grooming detection employing Context Determination and Message-Level Analysis 采用上下文确定和消息级分析的增强在线修饰检测
IF 4.3 Pub Date : 2025-11-10 DOI: 10.1016/j.iswa.2025.200607
Jake Street, Isibor Kennedy Ihianle, Funminiyi Olajide, Ahmad Lotfi
Online Grooming (OG) is a prevalent threat facing predominately children online, with groomers using deceptive methods to prey on the vulnerability of children on social media/messaging platforms. These attacks can have severe psychological and physical impacts, including a tendency towards revictimization. Current technical measures are inadequate, especially with the advent of end-to-end encryption which hampers message monitoring. Existing solutions focus on the signature analysis of child abuse media, which does not effectively address real-time OG detection. This paper proposes that OG attacks are complex, requiring the identification of specific communication patterns between adults and children alongside identifying other insights (e.g. Sexual language) to make an accurate determination. It introduces a novel approach leveraging advanced models such as BERT and RoBERTa for Message-Level Analysis and a Context Determination approach for classifying actor interactions, between adults attempting to groom children and honeypot children actors. This approach included the introduction of Actor Significance Thresholds and Message Significance Thresholds to make these determinations. The proposed method aims to enhance accuracy and robustness in detecting OG by considering the dynamic and multi-faceted nature of these attacks. Cross-dataset experiments evaluate the robustness and versatility of our approach. This paper’s contributions include improved detection methodologies and the potential for application in various scenarios, addressing gaps in current literature and practices.
在线诱骗(OG)是儿童在线面临的普遍威胁,诱骗者利用社交媒体/消息平台上儿童的脆弱性进行欺骗。这些攻击可能造成严重的心理和身体影响,包括再次受害的倾向。目前的技术措施是不够的,特别是端到端加密的出现阻碍了消息监控。现有的解决方案侧重于对儿童虐待媒体的特征分析,这并不能有效地解决实时OG检测问题。本文提出OG攻击是复杂的,需要识别成人和儿童之间的特定通信模式以及识别其他见解(例如性语言)以做出准确的判断。它引入了一种新颖的方法,利用BERT和RoBERTa等高级模型进行消息级分析,并采用上下文确定方法对演员之间的交互进行分类,成人试图培养儿童和蜜罐儿童演员之间的交互。该方法包括引入参与者显著性阈值和消息显著性阈值来做出这些决定。该方法考虑了网络攻击的动态性和多面性,提高了网络攻击检测的准确性和鲁棒性。跨数据集实验评估了我们方法的鲁棒性和通用性。本文的贡献包括改进的检测方法和在各种情况下应用的潜力,解决了当前文献和实践中的差距。
{"title":"Enhanced Online Grooming detection employing Context Determination and Message-Level Analysis","authors":"Jake Street,&nbsp;Isibor Kennedy Ihianle,&nbsp;Funminiyi Olajide,&nbsp;Ahmad Lotfi","doi":"10.1016/j.iswa.2025.200607","DOIUrl":"10.1016/j.iswa.2025.200607","url":null,"abstract":"<div><div>Online Grooming (OG) is a prevalent threat facing predominately children online, with groomers using deceptive methods to prey on the vulnerability of children on social media/messaging platforms. These attacks can have severe psychological and physical impacts, including a tendency towards revictimization. Current technical measures are inadequate, especially with the advent of end-to-end encryption which hampers message monitoring. Existing solutions focus on the signature analysis of child abuse media, which does not effectively address real-time OG detection. This paper proposes that OG attacks are complex, requiring the identification of specific communication patterns between adults and children alongside identifying other insights (e.g. Sexual language) to make an accurate determination. It introduces a novel approach leveraging advanced models such as BERT and RoBERTa for Message-Level Analysis and a Context Determination approach for classifying actor interactions, between adults attempting to groom children and honeypot children actors. This approach included the introduction of Actor Significance Thresholds and Message Significance Thresholds to make these determinations. The proposed method aims to enhance accuracy and robustness in detecting OG by considering the dynamic and multi-faceted nature of these attacks. Cross-dataset experiments evaluate the robustness and versatility of our approach. This paper’s contributions include improved detection methodologies and the potential for application in various scenarios, addressing gaps in current literature and practices.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200607"},"PeriodicalIF":4.3,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic SLAM: A comprehensive survey of methods and applications 语义SLAM:方法和应用的综合调查
IF 4.3 Pub Date : 2025-11-10 DOI: 10.1016/j.iswa.2025.200591
Houssein Kanso , Abhilasha Singh , Etaf El Zarif , Nooruldeen Almohammed , Jinane Mounsef , Noel Maalouf , Bilal Arain
This paper surveys the different approaches in semantic Simultaneous Localization and Mapping (SLAM), exploring how the incorporation of semantic information has enhanced performance in both indoor and outdoor settings, while highlighting key advancements in the field. It also identifies existing gaps and proposes potential directions for future improvements to address these issues. We provide a detailed review of the fundamentals of semantic SLAM, illustrating how incorporating semantic data enhances scene understanding and mapping accuracy. The paper presents semantic SLAM methods and core techniques that contribute to improved robustness and precision in mapping. A comprehensive overview of commonly used datasets for evaluating semantic SLAM systems is provided, along with a discussion of performance metrics used to assess their efficiency and accuracy. To demonstrate the reliability of semantic SLAM methodologies, we reproduce selected results from existing studies offering insights into the reproducibility of these approaches. The paper also addresses key challenges such as real-time processing, dynamic scene adaptation, and scalability while highlighting future research directions. Unlike prior surveys, this paper uniquely combines (i) a systematic taxonomy of semantic SLAM approaches across different sensing modalities and environments, (ii) a comparative review of datasets and evaluation metrics, and (iii) a reproducibility study of selected methods. To our knowledge, this is the first survey that integrates methods, datasets, evaluation practices, and application insights into a single comprehensive review, thereby offering a unified reference for researchers and practitioners. In conclusion, this review underscores the vital role of semantic SLAM in driving advancements in autonomous systems and intelligent navigation by analyzing recent developments, validating findings, and highlighting future research directions.
本文综述了语义同步定位和映射(SLAM)的不同方法,探讨了语义信息的结合如何在室内和室外环境中提高性能,同时强调了该领域的关键进展。它还确定了现有的差距,并提出了解决这些问题的未来改进的潜在方向。我们对语义SLAM的基本原理进行了详细的回顾,说明了结合语义数据如何增强场景理解和映射精度。本文提出了语义SLAM方法和核心技术,有助于提高映射的鲁棒性和精度。本文全面概述了用于评估语义SLAM系统的常用数据集,并讨论了用于评估其效率和准确性的性能指标。为了证明语义SLAM方法的可靠性,我们重现了从现有研究中选出的结果,为这些方法的可重复性提供了见解。本文还讨论了实时处理、动态场景适应和可扩展性等关键挑战,并指出了未来的研究方向。与之前的调查不同,本文独特地结合了(i)跨不同传感模式和环境的语义SLAM方法的系统分类,(ii)数据集和评估指标的比较回顾,以及(iii)所选方法的可重复性研究。据我们所知,这是第一次将方法、数据集、评估实践和应用见解整合到一个综合综述中的调查,从而为研究人员和从业者提供了统一的参考。总之,本文通过分析最近的发展、验证研究结果和强调未来的研究方向,强调了语义SLAM在推动自主系统和智能导航进步中的重要作用。
{"title":"Semantic SLAM: A comprehensive survey of methods and applications","authors":"Houssein Kanso ,&nbsp;Abhilasha Singh ,&nbsp;Etaf El Zarif ,&nbsp;Nooruldeen Almohammed ,&nbsp;Jinane Mounsef ,&nbsp;Noel Maalouf ,&nbsp;Bilal Arain","doi":"10.1016/j.iswa.2025.200591","DOIUrl":"10.1016/j.iswa.2025.200591","url":null,"abstract":"<div><div>This paper surveys the different approaches in semantic Simultaneous Localization and Mapping (SLAM), exploring how the incorporation of semantic information has enhanced performance in both indoor and outdoor settings, while highlighting key advancements in the field. It also identifies existing gaps and proposes potential directions for future improvements to address these issues. We provide a detailed review of the fundamentals of semantic SLAM, illustrating how incorporating semantic data enhances scene understanding and mapping accuracy. The paper presents semantic SLAM methods and core techniques that contribute to improved robustness and precision in mapping. A comprehensive overview of commonly used datasets for evaluating semantic SLAM systems is provided, along with a discussion of performance metrics used to assess their efficiency and accuracy. To demonstrate the reliability of semantic SLAM methodologies, we reproduce selected results from existing studies offering insights into the reproducibility of these approaches. The paper also addresses key challenges such as real-time processing, dynamic scene adaptation, and scalability while highlighting future research directions. Unlike prior surveys, this paper uniquely combines (i) a systematic taxonomy of semantic SLAM approaches across different sensing modalities and environments, (ii) a comparative review of datasets and evaluation metrics, and (iii) a reproducibility study of selected methods. To our knowledge, this is the first survey that integrates methods, datasets, evaluation practices, and application insights into a single comprehensive review, thereby offering a unified reference for researchers and practitioners. In conclusion, this review underscores the vital role of semantic SLAM in driving advancements in autonomous systems and intelligent navigation by analyzing recent developments, validating findings, and highlighting future research directions.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200591"},"PeriodicalIF":4.3,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced radiology report: Leveraging image enhancement and multi-label transfer learning with attention-based text generation 增强放射学报告:利用图像增强和多标签迁移学习与基于注意力的文本生成
IF 4.3 Pub Date : 2025-11-08 DOI: 10.1016/j.iswa.2025.200605
Hilya Tsaniya , Chastine Fatichah , Nanik Suciati , Takashi Obi , Joong-sun Lee
Current research in radiology report generation tend to overlook the utilization of abnormalities depicted in medical images. This study introduces a novel radiology report generator that integrates a multi-label learning approach for predicting abnormality tags and employs transformer models for generating reports. Additionally, the research explores contrast-based image enhancement to mitigate noise in medical images, evaluating its impact on model performance. The multi-label learning is trained on a dataset with 180 abnormality labels and the features used as initial weights for MIMICCXR, as a visual feature extractor.Imbalance handling and ensemble methods are employed to optimize multi-label model performance for abnormality tag prediction. Multi-head attention, in conjunction with GPT-2, facilitates context building for medical report generation, utilizing BERT embeddings for text feature extraction. Evaluation metrics demonstrate that the proposed model achieves superior performance in both multi-label prediction accuracy 77 % and text generation, showing an increase in similarity 28 % in average compared to the baseline model. These findings suggest that leveraging transfer learning with an ensemble classifier, combined with a transformer for context building and decoding, effectively utilizes visual and text features. Furthermore, the incorporation of image enhancement techniques significantly impacts model performance.
目前在放射学报告生成方面的研究往往忽视了对医学图像中所描述的异常的利用。本研究介绍了一种新的放射学报告生成器,它集成了多标签学习方法来预测异常标签,并使用变压器模型来生成报告。此外,研究探讨了基于对比度的图像增强来减轻医学图像中的噪声,评估其对模型性能的影响。多标签学习在具有180个异常标签的数据集上进行训练,这些特征用作MIMICCXR的初始权重,作为视觉特征提取器。采用不平衡处理和集成方法优化多标签模型的性能,用于异常标签预测。多头注意力与GPT-2结合,促进了医学报告生成的上下文构建,利用BERT嵌入进行文本特征提取。评估指标表明,所提出的模型在多标签预测准确率77%和文本生成方面都取得了优异的性能,与基线模型相比,相似度平均提高了28%。这些发现表明,利用集成分类器的迁移学习,结合上下文构建和解码的转换器,可以有效地利用视觉和文本特征。此外,图像增强技术的结合显著影响了模型的性能。
{"title":"Enhanced radiology report: Leveraging image enhancement and multi-label transfer learning with attention-based text generation","authors":"Hilya Tsaniya ,&nbsp;Chastine Fatichah ,&nbsp;Nanik Suciati ,&nbsp;Takashi Obi ,&nbsp;Joong-sun Lee","doi":"10.1016/j.iswa.2025.200605","DOIUrl":"10.1016/j.iswa.2025.200605","url":null,"abstract":"<div><div>Current research in radiology report generation tend to overlook the utilization of abnormalities depicted in medical images. This study introduces a novel radiology report generator that integrates a multi-label learning approach for predicting abnormality tags and employs transformer models for generating reports. Additionally, the research explores contrast-based image enhancement to mitigate noise in medical images, evaluating its impact on model performance. The multi-label learning is trained on a dataset with 180 abnormality labels and the features used as initial weights for MIMIC<img>CXR, as a visual feature extractor.Imbalance handling and ensemble methods are employed to optimize multi-label model performance for abnormality tag prediction. Multi-head attention, in conjunction with GPT-2, facilitates context building for medical report generation, utilizing BERT embeddings for text feature extraction. Evaluation metrics demonstrate that the proposed model achieves superior performance in both multi-label prediction accuracy 77 % and text generation, showing an increase in similarity 28 % in average compared to the baseline model. These findings suggest that leveraging transfer learning with an ensemble classifier, combined with a transformer for context building and decoding, effectively utilizes visual and text features. Furthermore, the incorporation of image enhancement techniques significantly impacts model performance.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200605"},"PeriodicalIF":4.3,"publicationDate":"2025-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated learning using quality-based aggregation method for brain tumour segmentation on multimodality medical images 基于质量的聚合方法的联邦学习在多模态医学图像上的脑肿瘤分割
IF 4.3 Pub Date : 2025-11-08 DOI: 10.1016/j.iswa.2025.200601
Rim El Badaoui , Ester Bonmati , Vasileios Argyriou , Barbara Villarini
Deep learning for medical imaging has shown great potential in improving patient outcomes due to its high accuracy in disease diagnosis. However, a major challenge preventing the widespread adoption of such models in clinical settings is data accessibility, which conflicts with the General Data Protection Regulation (GDPR) in a traditional centralised training environment. Hence, to address this issue, Federated Learning (FL) was introduced as a decentralised alternative that enables collaborative model training among data owners without sharing any private data. Despite its significance in healthcare, limited research has explored FL for medical imaging, particularly in multimodal brain tumour segmentation, due to challenges such as data heterogeneity.
In this study, we present Federated E-CATBraTS, an advanced federated deep learning model derived from the existing E-CATBraTS framework. This model is designed to segment brain tumours from multimodal magnetic resonance imaging (MRI) while preserving data privacy. Our framework introduces a novel aggregation method, DaQAvg, which optimally combines model weights based on data size and quality, demonstrating resilience against corrupted medical images.
We evaluated the performance of Federated E-CATBraTS using two publicly available datasets: UPenn-GBM and UCSF-PDGM, including a degraded version of the latter to assess the efficacy of our aggregation method. The results indicate a 6% overall improvement over traditional centralised approaches. Furthermore, we conducted a comprehensive comparison against state-of-the-art FL aggregation algorithms, including FedAVG, FedProx and FedNova. While FedNova demonstrated the highest overall DSC, DaQAvg demonstrated superior robustness to noisy conditions, showcasing its specific advantage in maintaining performance with variable data quality, a critical aspect in medical imaging.
医学成像的深度学习由于其在疾病诊断中的高准确性,在改善患者预后方面显示出巨大的潜力。然而,阻碍此类模型在临床环境中广泛采用的主要挑战是数据可访问性,这与传统集中式培训环境中的通用数据保护条例(GDPR)相冲突。因此,为了解决这个问题,联邦学习(FL)作为一种分散的替代方案被引入,它可以在数据所有者之间进行协作模型训练,而无需共享任何私有数据。尽管它在医疗保健方面具有重要意义,但由于数据异质性等挑战,有限的研究探索了FL用于医学成像,特别是在多模态脑肿瘤分割方面。在本研究中,我们提出了联邦E-CATBraTS,这是一种源自现有E-CATBraTS框架的高级联邦深度学习模型。该模型旨在从多模态磁共振成像(MRI)中分割脑肿瘤,同时保护数据隐私。我们的框架引入了一种新的聚合方法DaQAvg,该方法基于数据大小和质量优化地组合了模型权重,展示了对损坏医学图像的弹性。我们使用两个公开可用的数据集来评估联邦e - catbrat的性能:UPenn-GBM和UCSF-PDGM,包括后者的降级版本来评估我们的聚合方法的有效性。结果表明,与传统的集中式方法相比,总体改善了6%。此外,我们还与最先进的FL聚合算法(包括FedAVG、FedProx和FedNova)进行了全面比较。FedNova表现出最高的总体DSC, DaQAvg表现出对噪声条件的卓越鲁棒性,展示了其在保持可变数据质量方面的特定优势,这是医学成像的一个关键方面。
{"title":"Federated learning using quality-based aggregation method for brain tumour segmentation on multimodality medical images","authors":"Rim El Badaoui ,&nbsp;Ester Bonmati ,&nbsp;Vasileios Argyriou ,&nbsp;Barbara Villarini","doi":"10.1016/j.iswa.2025.200601","DOIUrl":"10.1016/j.iswa.2025.200601","url":null,"abstract":"<div><div>Deep learning for medical imaging has shown great potential in improving patient outcomes due to its high accuracy in disease diagnosis. However, a major challenge preventing the widespread adoption of such models in clinical settings is data accessibility, which conflicts with the General Data Protection Regulation (GDPR) in a traditional centralised training environment. Hence, to address this issue, Federated Learning (FL) was introduced as a decentralised alternative that enables collaborative model training among data owners without sharing any private data. Despite its significance in healthcare, limited research has explored FL for medical imaging, particularly in multimodal brain tumour segmentation, due to challenges such as data heterogeneity.</div><div>In this study, we present Federated E-CATBraTS, an advanced federated deep learning model derived from the existing E-CATBraTS framework. This model is designed to segment brain tumours from multimodal magnetic resonance imaging (MRI) while preserving data privacy. Our framework introduces a novel aggregation method, DaQAvg, which optimally combines model weights based on data size and quality, demonstrating resilience against corrupted medical images.</div><div>We evaluated the performance of Federated E-CATBraTS using two publicly available datasets: UPenn-GBM and UCSF-PDGM, including a degraded version of the latter to assess the efficacy of our aggregation method. The results indicate a 6% overall improvement over traditional centralised approaches. Furthermore, we conducted a comprehensive comparison against state-of-the-art FL aggregation algorithms, including FedAVG, FedProx and FedNova. While FedNova demonstrated the highest overall DSC, DaQAvg demonstrated superior robustness to noisy conditions, showcasing its specific advantage in maintaining performance with variable data quality, a critical aspect in medical imaging.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200601"},"PeriodicalIF":4.3,"publicationDate":"2025-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond algorithms: Artificial intelligence driven talent identification with human insight 超越算法:人工智能驱动的人才识别与人类的洞察力
IF 4.3 Pub Date : 2025-11-07 DOI: 10.1016/j.iswa.2025.200604
Tiago Jacob Fernandes França , José Henrique Pereira São Mamede , João Manuel Pereira Barroso , Vítor Manuel Pereira Duarte dos Santos
The rapid evolution of Artificial Intelligence (AI) is reshaping Human Resource Management (HRM), with growing interest in its role in talent identification. While AI has demonstrated effectiveness in analysing structured data, its limitations in assessing qualitative attributes such as creativity, adaptability, and emotional intelligence remain underexplored. This study addresses these gaps through an exploratory mixed-methods design, combining a global survey (n = 240) with semi-structured interviews of HR professionals. Quantitative analysis highlights patterns of association between key competencies, while qualitative findings provide contextual insights into perceptions of fairness, bias, and cultural resistance. The results suggest that AI can complement, but not replace, human judgement, supporting a Hybrid Evaluative Model that integrates algorithmic efficiency with human interpretation. The study contributes rare empirical evidence to a nascent field, highlights the ethical imperatives of bias mitigation and transparency, and underscores the importance of cultural context (collectivist versus individualist orientations) in shaping the acceptance and effectiveness of AI-enabled HR practices. These findings offer practical guidance for organisations and advance theory-building at the intersection of AI and HRM.
人工智能(AI)的快速发展正在重塑人力资源管理(HRM),人们对其在人才识别中的作用越来越感兴趣。虽然人工智能在分析结构化数据方面已经证明了有效性,但它在评估创造力、适应性和情商等定性属性方面的局限性仍未得到充分探索。本研究通过探索性混合方法设计,将全球调查(n = 240)与人力资源专业人员的半结构化访谈相结合,解决了这些差距。定量分析强调了关键能力之间的关联模式,而定性研究结果提供了对公平、偏见和文化阻力感知的背景见解。结果表明,人工智能可以补充而不是取代人类的判断,支持将算法效率与人类解释相结合的混合评估模型。该研究为这一新兴领域提供了罕见的经验证据,强调了减少偏见和透明度的伦理必要性,并强调了文化背景(集体主义与个人主义取向)在塑造人工智能人力资源实践的接受度和有效性方面的重要性。这些发现为组织提供了实践指导,并推进了人工智能和人力资源管理交叉领域的理论建设。
{"title":"Beyond algorithms: Artificial intelligence driven talent identification with human insight","authors":"Tiago Jacob Fernandes França ,&nbsp;José Henrique Pereira São Mamede ,&nbsp;João Manuel Pereira Barroso ,&nbsp;Vítor Manuel Pereira Duarte dos Santos","doi":"10.1016/j.iswa.2025.200604","DOIUrl":"10.1016/j.iswa.2025.200604","url":null,"abstract":"<div><div>The rapid evolution of Artificial Intelligence (AI) is reshaping Human Resource Management (HRM), with growing interest in its role in talent identification. While AI has demonstrated effectiveness in analysing structured data, its limitations in assessing qualitative attributes such as creativity, adaptability, and emotional intelligence remain underexplored. This study addresses these gaps through an exploratory mixed-methods design, combining a global survey (<em>n</em> = 240) with semi-structured interviews of HR professionals. Quantitative analysis highlights patterns of association between key competencies, while qualitative findings provide contextual insights into perceptions of fairness, bias, and cultural resistance. The results suggest that AI can complement, but not replace, human judgement, supporting a Hybrid Evaluative Model that integrates algorithmic efficiency with human interpretation. The study contributes rare empirical evidence to a nascent field, highlights the ethical imperatives of bias mitigation and transparency, and underscores the importance of cultural context (collectivist versus individualist orientations) in shaping the acceptance and effectiveness of AI-enabled HR practices. These findings offer practical guidance for organisations and advance theory-building at the intersection of AI and HRM.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200604"},"PeriodicalIF":4.3,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alert correlation for intelligent threat detection and response 警报关联智能威胁检测和响应
IF 4.3 Pub Date : 2025-11-07 DOI: 10.1016/j.iswa.2025.200606
Bronagh Lanigan , Zeinab Rezaeifar , Federico Cruciani , Michael Milliken , Jordan Vincent , Samuel Moore , Muhammad Aaqib , Alan Mills , Pushpinder K. Chouhan , Alfie Beard , Chris D. Nugent , Luke Chen , Alex Healing
With the increasing diversity of IoT devices, keeping IT systems secure is becoming increasingly difficult. Attackers exploit vulnerabilities within the system in order to access sensitive information, typically reaching their objective through several steps. Current Intrusion Detection Systems (IDSs) focus on low-level alerts, and tend to produce a high rate of false positives. This type of information alone is insufficient for the detection of sophisticated attack scenarios such Advanced Persistent Threats (APTs). Consequently, correlation techniques have recently been introduced to correlate alerts and reconstruct attack scenarios, however, various attack scenarios exist, with diverse characteristics. Also, different steps of the APTs scenarios may have their own characteristics. Therefore, finding a proper method that covers all cases remains a challenge. Moreover, after detecting APTs, how the system should respond to these attacks to avoid sabotage to the system remains a challenge. Thus, in this paper, first for detection of the attacks, we classify different cases, and then, a method based on different characteristics of attack patterns is proposed to detect APT scenarios. The proposed method consists of two main phases: APT detection and the intelligent hybrid response framework. In APT detection phase, similar alerts are aggregated and attack graphs are generated based on a similarity matrix. These graphs, combined with third party API data enable alert correlation and APT scenario detection. Entity graphs are then created to visualise host behaviour, and alert graphs are analysed to detect APT scenarios. In the response phase, attack graphs produced from the correlation inform the hybrid response framework, integrating knowledge and data-driven components that facilitate automated or recommended mitigation. The approach was evaluated on the ZeekData24 dataset. Obtained precision and recall on the malicious traffic was observed to be 96.65% and 87.04% respectively. The results show that our approach can effectively filter false positive alerts with a reduction of the data going from 10,063 alerts daily to 586 meta-alerts, pruned to 48 attack graphs and finally reduced to 20 suspicious attack graphs.
随着物联网设备的日益多样化,保持IT系统的安全变得越来越困难。攻击者利用系统中的漏洞来访问敏感信息,通常通过几个步骤来达到他们的目标。当前的入侵检测系统(ids)侧重于低级警报,容易产生高误报率。这种类型的信息本身不足以检测复杂的攻击场景,例如高级持续威胁(apt)。因此,最近引入了相关技术来关联警报和重建攻击场景,然而,存在各种攻击场景,具有不同的特征。此外,apt场景的不同步骤可能有自己的特点。因此,找到一种适用于所有情况的合适方法仍然是一项挑战。此外,在检测到apt之后,系统应该如何响应这些攻击以避免对系统的破坏仍然是一个挑战。因此,本文首先对攻击进行检测,对不同的案例进行分类,然后提出一种基于攻击模式不同特征的APT场景检测方法。该方法包括两个主要阶段:APT检测和智能混合响应框架。在APT检测阶段,基于相似矩阵聚合相似警报并生成攻击图。这些图表与第三方API数据相结合,可以实现警报关联和APT场景检测。然后创建实体图来可视化主机行为,并分析警报图以检测APT场景。在响应阶段,根据相关性生成的攻击图为混合响应框架提供信息,整合知识和数据驱动组件,促进自动化或推荐的缓解措施。该方法在ZeekData24数据集上进行了评估。对恶意流量的检测准确率和召回率分别为96.65%和87.04%。结果表明,我们的方法可以有效地过滤假阳性警报,将数据从每天10,063个警报减少到586个元警报,修剪到48个攻击图,最终减少到20个可疑攻击图。
{"title":"Alert correlation for intelligent threat detection and response","authors":"Bronagh Lanigan ,&nbsp;Zeinab Rezaeifar ,&nbsp;Federico Cruciani ,&nbsp;Michael Milliken ,&nbsp;Jordan Vincent ,&nbsp;Samuel Moore ,&nbsp;Muhammad Aaqib ,&nbsp;Alan Mills ,&nbsp;Pushpinder K. Chouhan ,&nbsp;Alfie Beard ,&nbsp;Chris D. Nugent ,&nbsp;Luke Chen ,&nbsp;Alex Healing","doi":"10.1016/j.iswa.2025.200606","DOIUrl":"10.1016/j.iswa.2025.200606","url":null,"abstract":"<div><div>With the increasing diversity of IoT devices, keeping IT systems secure is becoming increasingly difficult. Attackers exploit vulnerabilities within the system in order to access sensitive information, typically reaching their objective through several steps. Current Intrusion Detection Systems (IDSs) focus on low-level alerts, and tend to produce a high rate of false positives. This type of information alone is insufficient for the detection of sophisticated attack scenarios such Advanced Persistent Threats (APTs). Consequently, correlation techniques have recently been introduced to correlate alerts and reconstruct attack scenarios, however, various attack scenarios exist, with diverse characteristics. Also, different steps of the APTs scenarios may have their own characteristics. Therefore, finding a proper method that covers all cases remains a challenge. Moreover, after detecting APTs, how the system should respond to these attacks to avoid sabotage to the system remains a challenge. Thus, in this paper, first for detection of the attacks, we classify different cases, and then, a method based on different characteristics of attack patterns is proposed to detect APT scenarios. The proposed method consists of two main phases: APT detection and the intelligent hybrid response framework. In APT detection phase, similar alerts are aggregated and attack graphs are generated based on a similarity matrix. These graphs, combined with third party API data enable alert correlation and APT scenario detection. Entity graphs are then created to visualise host behaviour, and alert graphs are analysed to detect APT scenarios. In the response phase, attack graphs produced from the correlation inform the hybrid response framework, integrating knowledge and data-driven components that facilitate automated or recommended mitigation. The approach was evaluated on the ZeekData24 dataset. Obtained precision and recall on the malicious traffic was observed to be 96.65% and 87.04% respectively. The results show that our approach can effectively filter false positive alerts with a reduction of the data going from 10,063 alerts daily to 586 meta-alerts, pruned to 48 attack graphs and finally reduced to 20 suspicious attack graphs.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200606"},"PeriodicalIF":4.3,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145520065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sentiment analysis: From rule-based lexicons to large language models 情感分析:从基于规则的词汇到大型语言模型
IF 4.3 Pub Date : 2025-11-07 DOI: 10.1016/j.iswa.2025.200599
Maikel Leon
This study provides a comprehensive review of two decades of research in opinion mining and sentiment analysis, addressing the fragmentation of prior work across methodologies, application domains, and data sources. The evolution of the field is traced from pre-1990 rule-based systems to lexicon heuristics, statistical learning, machine learning, deep learning, and the current wave of transformer-driven, multimodal, and generative models. Applications are examined across marketing, finance, politics, and social media, with emphasis on how methodological innovations have improved accuracy and enabled broader adoption. Best practices – including transformer fine-tuning, prompt engineering, zero-shot and few-shot learning, multimodal fusion, and domain adaptation – are analyzed to distill evidence-based guidelines for researchers and practitioners. The synthesis shows how sentiment analysis has shaped critical areas, including brand management, investor decision-making, political discourse, and online user engagement. Findings highlight the effectiveness of transformer-based approaches, particularly when combined with domain adaptation and prompt engineering, in delivering state-of-the-art performance. Beyond methodological and applied insights, the study identifies promising directions for future research, including real-time customer journey analytics, explainability in generative AI, robustness across multiple languages, ethical implications, and sustainability considerations. By consolidating dispersed knowledge into a unified account, this review provides both historical grounding and a structured roadmap that advances theoretical understanding and informs managerial practice.
本研究对二十年来在意见挖掘和情感分析方面的研究进行了全面的回顾,解决了以前在方法、应用领域和数据源方面工作的碎片化问题。该领域的发展可以追溯到1990年以前基于规则的系统,到词汇启发式、统计学习、机器学习、深度学习,以及当前的变压器驱动、多模态和生成模型。应用程序将在营销、金融、政治和社交媒体领域进行审查,重点是方法创新如何提高准确性并使其得到更广泛的采用。本文分析了最佳实践——包括变压器微调、快速工程、零采样和少采样学习、多模态融合和领域适应——为研究人员和实践者提炼出基于证据的指导方针。这份综合报告显示了情感分析是如何影响关键领域的,包括品牌管理、投资者决策、政治话语和在线用户参与。研究结果强调了基于变压器的方法的有效性,特别是当与领域适应和快速工程相结合时,在提供最先进的性能方面。除了方法论和应用见解之外,该研究还确定了未来研究的有希望的方向,包括实时客户旅程分析、生成式人工智能的可解释性、跨多种语言的稳健性、伦理影响和可持续性考虑。通过将分散的知识整合成一个统一的账户,本综述提供了历史基础和结构化的路线图,以推进理论理解并为管理实践提供信息。
{"title":"Sentiment analysis: From rule-based lexicons to large language models","authors":"Maikel Leon","doi":"10.1016/j.iswa.2025.200599","DOIUrl":"10.1016/j.iswa.2025.200599","url":null,"abstract":"<div><div>This study provides a comprehensive review of two decades of research in opinion mining and sentiment analysis, addressing the fragmentation of prior work across methodologies, application domains, and data sources. The evolution of the field is traced from pre-1990 rule-based systems to lexicon heuristics, statistical learning, machine learning, deep learning, and the current wave of transformer-driven, multimodal, and generative models. Applications are examined across marketing, finance, politics, and social media, with emphasis on how methodological innovations have improved accuracy and enabled broader adoption. Best practices – including transformer fine-tuning, prompt engineering, zero-shot and few-shot learning, multimodal fusion, and domain adaptation – are analyzed to distill evidence-based guidelines for researchers and practitioners. The synthesis shows how sentiment analysis has shaped critical areas, including brand management, investor decision-making, political discourse, and online user engagement. Findings highlight the effectiveness of transformer-based approaches, particularly when combined with domain adaptation and prompt engineering, in delivering state-of-the-art performance. Beyond methodological and applied insights, the study identifies promising directions for future research, including real-time customer journey analytics, explainability in generative AI, robustness across multiple languages, ethical implications, and sustainability considerations. By consolidating dispersed knowledge into a unified account, this review provides both historical grounding and a structured roadmap that advances theoretical understanding and informs managerial practice.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"28 ","pages":"Article 200599"},"PeriodicalIF":4.3,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145465827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligent Systems with Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1