首页 > 最新文献

Computational Intelligence最新文献

英文 中文
U-VQVAE-CTLesionNet: A Generalized Deep Learning Framework for Multi-Organ Lesion Detection and Segmentation in Medical Imaging U-VQVAE-CTLesionNet:医学影像中多器官病变检测与分割的广义深度学习框架
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1111/coin.70168
Alok Kumar, N. Mahendran

Lesion detection and segmentation are essential yet complex tasks in medical image analysis due to the substantial variability in lesion shape, size, contrast, and anatomical location across different organs. Existing deep learning methods often lack adaptability, as they are typically designed for specific organs or imaging modalities, leading to limited generalization when applied to diverse datasets. To address this limitation, this study introduces a unified and generalizable framework capable of accurate multi-organ lesion detection, localization, and segmentation across heterogeneous medical imaging data. The proposed U-VQVAE-CTLesionNet integrates a U-Net–based encoder–decoder architecture for spatial feature extraction with a Vector Quantized Variational Autoencoder (VQVAE) module that discretizes latent features through a learnable codebook, enabling the network to capture intricate texture and intensity variations while preserving structural consistency. A Bounding Box Regression (BBR) component is incorporated for lesion localization, followed by a GrabCut-based refinement step that iteratively adjusts lesion boundaries using Gaussian Mixture Model estimation and graph-cut optimization. The framework is further supported by a comprehensive preprocessing pipeline involving intensity normalization, Hounsfield Unit windowing, and affine transformations to standardize image quality and enhance model robustness across modalities. Comprehensive experiments conducted on multiple publicly available and locally curated datasets encompassing lung and kidney lesions validated the accuracy and stability of the proposed approach. For lung CT detection, the model achieved 98.8% accuracy, 98.0% precision, 97.03% recall, and a 97.51% F1-score, while kidney CT detection attained 99.1% accuracy, 99.0% precision, 98.8% recall, and a 98.9% F1-score. Segmentation performance yielded Dice coefficients of 96.5% for lung and 97.8% for kidney, with corresponding IoU values of 93.2% and 95.1%, and Hausdorff Distances of 2.8 mm for lung and 2.3 mm for kidney, respectively. Ablation studies further confirmed that the inclusion of preprocessing, quantization, BBR, and GrabCut modules improved segmentation accuracy by approximately 2%–3% compared to configurations without these components. These results demonstrate that U-VQVAE-CTLesionNet provides a robust, organ-agnostic framework for precise lesion analysis and establishes a solid foundation for future expert-assisted clinical integration.

由于不同器官的病变形状、大小、对比度和解剖位置的巨大差异,病变检测和分割是医学图像分析中必不可少但又复杂的任务。现有的深度学习方法往往缺乏适应性,因为它们通常是为特定的器官或成像模式设计的,导致在应用于不同的数据集时泛化有限。为了解决这一限制,本研究引入了一个统一的、可推广的框架,能够在异构医学成像数据中准确地检测、定位和分割多器官病变。提出的U-VQVAE-CTLesionNet集成了基于u - net的编码器-解码器架构,用于空间特征提取和矢量量化变分自编码器(VQVAE)模块,该模块通过可学习的码本离散潜在特征,使网络能够捕获复杂的纹理和强度变化,同时保持结构一致性。结合边界盒回归(BBR)组件进行病灶定位,然后采用基于grabcut的细化步骤,使用高斯混合模型估计和图切优化迭代调整病灶边界。该框架还得到了全面的预处理管道的进一步支持,包括强度归一化、霍斯菲尔德单元窗口和仿射变换,以标准化图像质量并增强模型跨模态的鲁棒性。在包含肺和肾脏病变的多个公开可用和本地管理的数据集上进行的综合实验验证了所提出方法的准确性和稳定性。对于肺部CT检测,该模型准确率为98.8%,精密度为98.0%,召回率为97.03%,f1评分为97.51%;肾脏CT检测准确率为99.1%,精密度为99.0%,召回率为98.8%,f1评分为98.9%。肺和肾的Dice系数分别为96.5%和97.8%,IoU值分别为93.2%和95.1%,肺和肾的Hausdorff距离分别为2.8 mm和2.3 mm。消融研究进一步证实,与没有这些组件的配置相比,包含预处理、量化、BBR和GrabCut模块的分割精度提高了约2%-3%。这些结果表明,u - vqvee - ctlesionnet为精确的病变分析提供了一个强大的、器官不可知的框架,并为未来专家辅助的临床整合奠定了坚实的基础。
{"title":"U-VQVAE-CTLesionNet: A Generalized Deep Learning Framework for Multi-Organ Lesion Detection and Segmentation in Medical Imaging","authors":"Alok Kumar,&nbsp;N. Mahendran","doi":"10.1111/coin.70168","DOIUrl":"https://doi.org/10.1111/coin.70168","url":null,"abstract":"<div>\u0000 \u0000 <p>Lesion detection and segmentation are essential yet complex tasks in medical image analysis due to the substantial variability in lesion shape, size, contrast, and anatomical location across different organs. Existing deep learning methods often lack adaptability, as they are typically designed for specific organs or imaging modalities, leading to limited generalization when applied to diverse datasets. To address this limitation, this study introduces a unified and generalizable framework capable of accurate multi-organ lesion detection, localization, and segmentation across heterogeneous medical imaging data. The proposed U-VQVAE-CTLesionNet integrates a U-Net–based encoder–decoder architecture for spatial feature extraction with a Vector Quantized Variational Autoencoder (VQVAE) module that discretizes latent features through a learnable codebook, enabling the network to capture intricate texture and intensity variations while preserving structural consistency. A Bounding Box Regression (BBR) component is incorporated for lesion localization, followed by a GrabCut-based refinement step that iteratively adjusts lesion boundaries using Gaussian Mixture Model estimation and graph-cut optimization. The framework is further supported by a comprehensive preprocessing pipeline involving intensity normalization, Hounsfield Unit windowing, and affine transformations to standardize image quality and enhance model robustness across modalities. Comprehensive experiments conducted on multiple publicly available and locally curated datasets encompassing lung and kidney lesions validated the accuracy and stability of the proposed approach. For lung CT detection, the model achieved 98.8% accuracy, 98.0% precision, 97.03% recall, and a 97.51% F1-score, while kidney CT detection attained 99.1% accuracy, 99.0% precision, 98.8% recall, and a 98.9% F1-score. Segmentation performance yielded Dice coefficients of 96.5% for lung and 97.8% for kidney, with corresponding IoU values of 93.2% and 95.1%, and Hausdorff Distances of 2.8 mm for lung and 2.3 mm for kidney, respectively. Ablation studies further confirmed that the inclusion of preprocessing, quantization, BBR, and GrabCut modules improved segmentation accuracy by approximately 2%–3% compared to configurations without these components. These results demonstrate that U-VQVAE-CTLesionNet provides a robust, organ-agnostic framework for precise lesion analysis and establishes a solid foundation for future expert-assisted clinical integration.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"42 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145964003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-Smart Classroom for English Translation: An Adaptive HMM-Based Framework 基于自适应hmm的英语翻译人工智能课堂
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-05 DOI: 10.1111/coin.70170
Xibo Chen, Haize Hu

This study develops an AI-enhanced smart classroom framework to overcome the limitations of traditional English translation instruction, particularly in addressing the theory-practice gap. We designed a three-phase intelligent teaching system incorporating Hidden Markov Models (HMM) for: (1) adaptive pre-class preparation, (2) immersive virtual translation scenarios, and (3) automated post-class assessment. A 6-week controlled experiment (N = 100) compared this approach with traditional instruction using quantitative metrics including engagement levels, translation accuracy, proficiency rates, and satisfaction surveys. The experimental group showed statistically significant improvements (p < 0.05): 12.7% higher engagement (d = 1.21), 8.3% better culture-specific translation accuracy, 6.9% faster proficiency attainment, and 89.2% satisfaction rate (vs. 82.1% control). HMM analysis effectively tracked learning progression and identified competency gaps. The study demonstrates HMM's effectiveness for modeling translation competence development and validates AI-enhanced instruction as a viable solution for translation education. The implemented framework offers a replicable model for intelligent language learning systems.

本研究开发了一个人工智能增强的智能课堂框架,以克服传统英语翻译教学的局限性,特别是在解决理论与实践的差距方面。我们设计了一个包含隐马尔可夫模型(HMM)的三阶段智能教学系统:(1)自适应课前准备,(2)沉浸式虚拟翻译场景,(3)课后自动评估。一项为期6周的对照实验(N = 100)将这种方法与传统教学方法进行了定量指标比较,包括参与度、翻译准确性、熟练率和满意度调查。实验组表现出统计学上显著的改善(p < 0.05):参与度提高了12.7% (d = 1.21),特定文化的翻译准确率提高了8.3%,熟练程度提高了6.9%,满意度提高了89.2%(对照组为82.1%)。HMM分析有效地跟踪了学习进度并确定了能力差距。该研究证明了HMM对翻译能力发展建模的有效性,并验证了人工智能增强教学是翻译教育的可行解决方案。实现的框架为智能语言学习系统提供了一个可复制的模型。
{"title":"AI-Smart Classroom for English Translation: An Adaptive HMM-Based Framework","authors":"Xibo Chen,&nbsp;Haize Hu","doi":"10.1111/coin.70170","DOIUrl":"https://doi.org/10.1111/coin.70170","url":null,"abstract":"<div>\u0000 \u0000 <p>This study develops an AI-enhanced smart classroom framework to overcome the limitations of traditional English translation instruction, particularly in addressing the theory-practice gap. We designed a three-phase intelligent teaching system incorporating Hidden Markov Models (HMM) for: (1) adaptive pre-class preparation, (2) immersive virtual translation scenarios, and (3) automated post-class assessment. A 6-week controlled experiment (<i>N</i> = 100) compared this approach with traditional instruction using quantitative metrics including engagement levels, translation accuracy, proficiency rates, and satisfaction surveys. The experimental group showed statistically significant improvements (<i>p</i> &lt; 0.05): 12.7% higher engagement (<i>d</i> = 1.21), 8.3% better culture-specific translation accuracy, 6.9% faster proficiency attainment, and 89.2% satisfaction rate (vs. 82.1% control). HMM analysis effectively tracked learning progression and identified competency gaps. The study demonstrates HMM's effectiveness for modeling translation competence development and validates AI-enhanced instruction as a viable solution for translation education. The implemented framework offers a replicable model for intelligent language learning systems.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"42 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145964004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Stiffness Control for Series Elastic Actuators in Robotic Systems Using Dynamic Systems 基于动态系统的机器人系统串联弹性作动器自适应刚度控制
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-26 DOI: 10.1111/coin.70163
Kunlin Guo, Zhenyu Lu, Zhiwei Tan

In traditional control methods, Series Elastic Actuator (SEA) joint manipulators are limited by their hardware and can only perform tasks with low stiffness requirements. To address this issue, we propose a stiffness adjustment control strategy for SEA manipulators based on Dynamic Systems (DS) with an adaptive stiffness control strategy, which could achieve higher manipulator end stiffness comprehensively considering the overall orientation of the manipulator and control gains. By incorporating a rotation matrix with the Dynamic Movement Primitives (DMPs) method, we enhanced the generalization capability of DS in complex tasks. Compared to traditional DS-based control strategies, the control strategy proposed in this paper has achieved better control effects in the experiments on the SEA manipulator, and adaptively adjusts the interaction stiffness under different postures to achieve the task goals. Furthermore, experiments on the mobile manipulator have also verified the universality of the control strategy proposed in this paper.

在传统的控制方法中,串联弹性作动器(SEA)关节机械臂受到硬件的限制,只能执行对刚度要求较低的任务。针对这一问题,提出了一种基于动力学系统(DS)的SEA机械臂刚度调整控制策略,该策略采用自适应刚度控制策略,综合考虑机械臂的整体姿态和控制增益,实现更高的末端刚度。通过将旋转矩阵与动态运动原语(DMPs)方法相结合,增强了DS在复杂任务中的泛化能力。与传统的基于dcs的控制策略相比,本文提出的控制策略在SEA机械臂的实验中取得了更好的控制效果,并自适应调整不同姿态下的交互刚度来实现任务目标。此外,在移动机械手上的实验也验证了本文提出的控制策略的通用性。
{"title":"Adaptive Stiffness Control for Series Elastic Actuators in Robotic Systems Using Dynamic Systems","authors":"Kunlin Guo,&nbsp;Zhenyu Lu,&nbsp;Zhiwei Tan","doi":"10.1111/coin.70163","DOIUrl":"https://doi.org/10.1111/coin.70163","url":null,"abstract":"<div>\u0000 \u0000 <p>In traditional control methods, Series Elastic Actuator (SEA) joint manipulators are limited by their hardware and can only perform tasks with low stiffness requirements. To address this issue, we propose a stiffness adjustment control strategy for SEA manipulators based on Dynamic Systems (DS) with an adaptive stiffness control strategy, which could achieve higher manipulator end stiffness comprehensively considering the overall orientation of the manipulator and control gains. By incorporating a rotation matrix with the Dynamic Movement Primitives (DMPs) method, we enhanced the generalization capability of DS in complex tasks. Compared to traditional DS-based control strategies, the control strategy proposed in this paper has achieved better control effects in the experiments on the SEA manipulator, and adaptively adjusts the interaction stiffness under different postures to achieve the task goals. Furthermore, experiments on the mobile manipulator have also verified the universality of the control strategy proposed in this paper.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"42 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145887529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Personalized Persuasive Recommender System: A Framework and a Machine Learning-Based Implementation 个性化说服性推荐系统:框架和基于机器学习的实现
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-23 DOI: 10.1111/coin.70161
Alslaity Alaa, Thomas Tran

Since the emergence of Recommender Systems (RS), most of the research has focused on improving the accuracy of a recommender system. However, the literature has demonstrated increasing evidence that it is vital for a recommender system to focus not only on the accuracy of the provided recommendations but also on other factors that influence the acceptance of recommendations and the extent to which these recommendations are convincing or persuasive. Consequently, there becomes a need for new research paradigms to help improve the capabilities of recommender systems, which goes beyond recommendation accuracy. One of the recently emerged research directions that consider this need fosters the idea of adopting human-related theories from the social sciences domain, such as persuasiveness of social communication. In this context, however, a challenging, non-trivial, and not fully explored issue that arises is: how to integrate human-related theories into a recommender system to increase user's acceptance? This paper aims to address this issue by providing a reference architecture framework to adapt and integrate persuasion features as a substantial characteristic of recommender systems. The proposed framework, named Personalized Persuasive RS ( PerPer ), adopts concepts from the social sciences literature, namely personality traits and persuasion principles. This paper also introduces a machine learning-based implementation of PerPer. In particular, it adapts the Learning Automata concepts to support learning capabilities. PerPer is evaluated using a user study where we implemented a prototype of a movie RS. The user study involved three parts, namely, the Conventional Recommender System (CRS) and two variants of PerPer that we called the General Reinforcement Approach (PerPer-GRA) and the Boosted Reinforcement Approach (PerPer-BRA). The analysis of the results obtained from 44 participants shows that PerPer was able to enhance users' acceptance of the recommendations in comparison to CRS. The results also show that the PerPer-BRA outperforms the PerPer-GRA in terms of accelerating the convergence of the best persuasion method while maintaining improvement in users' acceptance.

自推荐系统(RS)出现以来,大多数研究都集中在提高推荐系统的准确性上。然而,越来越多的证据表明,推荐系统不仅要关注所提供推荐的准确性,还要关注影响推荐接受程度的其他因素,以及这些推荐的说服力或说服力的程度,这一点至关重要。因此,需要新的研究范式来帮助提高推荐系统的能力,而不仅仅是推荐的准确性。考虑到这种需求,最近出现的一个研究方向促进了采用社会科学领域中与人类相关的理论的想法,例如社会沟通的说服力。然而,在这种背景下,一个具有挑战性的、重要的、未被充分探索的问题出现了:如何将与人类相关的理论整合到推荐系统中,以提高用户的接受度?本文旨在通过提供一个参考架构框架来解决这个问题,以适应和整合说服特征作为推荐系统的一个重要特征。提出的框架,名为个性化说服RS (PerPer),采用了社会科学文献中的概念,即人格特质和说服原则。本文还介绍了基于机器学习的PerPer实现。特别是,它采用了学习自动机的概念来支持学习能力。PerPer是通过用户研究来评估的,其中我们实现了一个电影RS的原型。用户研究包括三个部分,即传统推荐系统(CRS)和PerPer的两个变体,我们称之为一般强化方法(PerPer- gra)和增强强化方法(PerPer- bra)。对44个参与者的分析结果表明,与CRS相比,PerPer能够提高用户对建议的接受度。结果还表明,在加速最佳说服方法的收敛同时保持用户接受度的提高方面,PerPer-BRA优于PerPer-GRA。
{"title":"Personalized Persuasive Recommender System: A Framework and a Machine Learning-Based Implementation","authors":"Alslaity Alaa,&nbsp;Thomas Tran","doi":"10.1111/coin.70161","DOIUrl":"https://doi.org/10.1111/coin.70161","url":null,"abstract":"<p>Since the emergence of Recommender Systems (RS), most of the research has focused on improving the accuracy of a recommender system. However, the literature has demonstrated increasing evidence that it is vital for a recommender system to focus not only on the accuracy of the provided recommendations but also on other factors that influence the acceptance of recommendations and the extent to which these recommendations are convincing or persuasive. Consequently, there becomes a need for new research paradigms to help improve the capabilities of recommender systems, which goes beyond recommendation accuracy. One of the recently emerged research directions that consider this need fosters the idea of adopting human-related theories from the social sciences domain, such as persuasiveness of social communication. In this context, however, a challenging, non-trivial, and not fully explored issue that arises is: <i>how to integrate human-related theories into a recommender system to increase user's acceptance</i>? This paper aims to address this issue by providing a reference architecture framework to adapt and integrate persuasion features as a substantial characteristic of recommender systems. The proposed framework, named <i>\u0000 <b>Per</b>sonalized <b>Per</b>suasive</i> RS (<b>\u0000 <i>PerPer</i>\u0000 </b>), adopts concepts from the social sciences literature, namely personality traits and persuasion principles. This paper also introduces a machine learning-based implementation of <i>PerPer</i>. In particular, it adapts the Learning Automata concepts to support learning capabilities. <i>PerPer</i> is evaluated using a user study where we implemented a prototype of a movie RS. The user study involved three parts, namely, the Conventional Recommender System (CRS) and two variants of PerPer that we called the General Reinforcement Approach (PerPer-GRA) and the Boosted Reinforcement Approach (PerPer-BRA). The analysis of the results obtained from 44 participants shows that <i>PerPer</i> was able to enhance users' acceptance of the recommendations in comparison to CRS. The results also show that the PerPer-BRA outperforms the PerPer-GRA in terms of accelerating the convergence of the best persuasion method while maintaining improvement in users' acceptance.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"42 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/coin.70161","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145848298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unlocking the Potential of Context: A Contextual Neural Collaborative Filtering Framework for Rating Prediction 解锁上下文的潜力:一个用于评级预测的上下文神经协同过滤框架
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-19 DOI: 10.1111/coin.70162
Rajesh Garapati, Manomita Chakraborty

The exponential growth of online multimedia content across various platforms has created an urgent need for robust assistive technologies to manage the overwhelming volume of information. Consequently, considerable efforts have been dedicated to developing sophisticated multimedia recommendation systems, with Neural Collaborative Filtering (NCF) emerging as a prevalent methodology. However, the conventional NCF method exhibits significant limitations, particularly in integrating contextual data and effectively handling sparse and imbalanced datasets. To address these limitations, this research introduces the Contextual Neural Collaborative Filtering (C-NCF) method, which enhances the NCF framework by incorporating contextual data to enrich the learning process of user-item interactions. The primary objective of this method is to improve rating prediction accuracy, a crucial factor in generating more effective recommendations. A key innovation of the C-NCF method lies in its interaction mechanism, where user ratings of items are evaluated under diverse contextual conditions, assigning varying importance to each contextual factor. Extensive testing on three real-world datasets demonstrated that the C-NCF method outperforms existing advanced techniques. Empirical findings demonstrate that the C-NCF method achieved an average error reduction of 36.43% in Mean Absolute Error and 36.60% in Root Mean Squared Error compared to traditional collaborative filtering, matrix factorization, and context-aware models, significantly enhancing recommendation quality. These insights open promising avenues for further exploration in the field of context-aware recommender systems.

跨各种平台的在线多媒体内容呈指数级增长,迫切需要强大的辅助技术来管理大量的信息。因此,大量的努力致力于开发复杂的多媒体推荐系统,神经协同过滤(NCF)成为一种流行的方法。然而,传统的NCF方法存在明显的局限性,特别是在整合上下文数据和有效处理稀疏和不平衡数据集方面。为了解决这些限制,本研究引入了上下文神经协同过滤(C-NCF)方法,该方法通过结合上下文数据来增强NCF框架,以丰富用户-项目交互的学习过程。该方法的主要目标是提高评级预测的准确性,这是生成更有效推荐的关键因素。C-NCF方法的一个关键创新在于它的交互机制,在不同的语境条件下评估用户对项目的评分,为每个语境因素分配不同的重要性。在三个真实数据集上的广泛测试表明,C-NCF方法优于现有的先进技术。实证结果表明,与传统的协同过滤、矩阵分解和上下文感知模型相比,C-NCF方法在平均绝对误差(Mean Absolute error)和均方根误差(Root Mean Squared error)上平均降低了36.43%和36.60%,显著提高了推荐质量。这些见解为上下文感知推荐系统领域的进一步探索开辟了有希望的途径。
{"title":"Unlocking the Potential of Context: A Contextual Neural Collaborative Filtering Framework for Rating Prediction","authors":"Rajesh Garapati,&nbsp;Manomita Chakraborty","doi":"10.1111/coin.70162","DOIUrl":"https://doi.org/10.1111/coin.70162","url":null,"abstract":"<div>\u0000 \u0000 <p>The exponential growth of online multimedia content across various platforms has created an urgent need for robust assistive technologies to manage the overwhelming volume of information. Consequently, considerable efforts have been dedicated to developing sophisticated multimedia recommendation systems, with Neural Collaborative Filtering (NCF) emerging as a prevalent methodology. However, the conventional NCF method exhibits significant limitations, particularly in integrating contextual data and effectively handling sparse and imbalanced datasets. To address these limitations, this research introduces the Contextual Neural Collaborative Filtering (C-NCF) method, which enhances the NCF framework by incorporating contextual data to enrich the learning process of user-item interactions. The primary objective of this method is to improve rating prediction accuracy, a crucial factor in generating more effective recommendations. A key innovation of the C-NCF method lies in its interaction mechanism, where user ratings of items are evaluated under diverse contextual conditions, assigning varying importance to each contextual factor. Extensive testing on three real-world datasets demonstrated that the C-NCF method outperforms existing advanced techniques. Empirical findings demonstrate that the C-NCF method achieved an average error reduction of 36.43% in Mean Absolute Error and 36.60% in Root Mean Squared Error compared to traditional collaborative filtering, matrix factorization, and context-aware models, significantly enhancing recommendation quality. These insights open promising avenues for further exploration in the field of context-aware recommender systems.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145824831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Ensemble Machine Learning Classifier With Deep Learning Framework for COVID-19 Detection and Severity Classification 基于深度学习框架的集成机器学习分类器用于COVID-19检测和严重性分类
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-10 DOI: 10.1111/coin.70160
Sangram Sanjayrao Dandge, Pon Harshavardhanan

The influence of the COVID-19 pandemic on respiratory health is noteworthy, which tends to exacerbate continuous challenges in global healthcare systems. Chest X-ray (CXR) analysis is considered a significant tool for evaluating the impact of COVID-19 on the lungs. Anyhow, the prevailing analysis approaches frequently seem to be highly complex or cannot activate diagnosis and classify the disease precisely. To overcome these challenges, a novel multimodal deep learning framework that integrates advanced feature extraction, optimization, and classification approaches is proposed for enhanced detection of COVID-19 and severity assessment. The proposed framework introduces a novel hybrid spatial attention assisted extended version of the LSTM and GRU model (HAS-E-LSTM-GRU) that integrates long and short-term temporal feature extraction and attention approach. The prolonged framework maximizes the number of hidden units, whereas the learning and complex patterns obtained from CXR images represent a key novelty of the proposed model. The extracted features are enhanced further using Fourier, Wavelet, Convolutional, Gabor, and Cosine transforms. It is pursued by a feature selection approach using Grey Wolf Optimizer (GWO) to determine the most significant features with higher interclass variance. For the detection of disease, an ensemble classifier integrating Naïve Bayes (NB), k Nearest Neighbors (KNN), Deep Forest (DF), and Support Vector Machine (SVM) is used to determine the COVID-19 presence or absence. Finally, the disease severity is evaluated using a self-attention-based 1D convolutional neural network (CNN) that classifies the diverse cases into mild, moderate, and severe. In association with the standard deep learning methodologies, the proposed architecture obtains a detection accuracy of 98.51% which illustrates the capability in the case of real-time clinical applications.

COVID-19大流行对呼吸系统健康的影响值得注意,这往往会加剧全球卫生保健系统的持续挑战。胸部x射线(CXR)分析被认为是评估COVID-19对肺部影响的重要工具。然而,目前流行的分析方法往往显得非常复杂,或不能激活诊断和准确分类疾病。为了克服这些挑战,提出了一种新的多模态深度学习框架,该框架集成了先进的特征提取、优化和分类方法,以增强COVID-19的检测和严重性评估。该框架引入了一种新的LSTM和GRU模型的混合空间注意辅助扩展版本(HAS-E-LSTM-GRU),该模型集成了长、短期时间特征提取和注意方法。延长的框架最大化了隐藏单元的数量,而从CXR图像中获得的学习和复杂模式代表了所提出模型的关键新颖性。提取的特征使用傅里叶、小波、卷积、Gabor和余弦变换进一步增强。采用灰狼优化器(GWO)的特征选择方法来确定具有较高类间方差的最显著特征。对于疾病检测,使用Naïve贝叶斯(NB), k近邻(KNN),深度森林(DF)和支持向量机(SVM)的集成分类器来确定COVID-19是否存在。最后,使用基于自我注意的1D卷积神经网络(CNN)评估疾病严重程度,该网络将各种病例分为轻度、中度和重度。与标准的深度学习方法相结合,所提出的架构获得了98.51%的检测精度,这说明了在实时临床应用情况下的能力。
{"title":"An Ensemble Machine Learning Classifier With Deep Learning Framework for COVID-19 Detection and Severity Classification","authors":"Sangram Sanjayrao Dandge,&nbsp;Pon Harshavardhanan","doi":"10.1111/coin.70160","DOIUrl":"https://doi.org/10.1111/coin.70160","url":null,"abstract":"<div>\u0000 \u0000 <p>The influence of the COVID-19 pandemic on respiratory health is noteworthy, which tends to exacerbate continuous challenges in global healthcare systems. Chest X-ray (CXR) analysis is considered a significant tool for evaluating the impact of COVID-19 on the lungs. Anyhow, the prevailing analysis approaches frequently seem to be highly complex or cannot activate diagnosis and classify the disease precisely. To overcome these challenges, a novel multimodal deep learning framework that integrates advanced feature extraction, optimization, and classification approaches is proposed for enhanced detection of COVID-19 and severity assessment. The proposed framework introduces a novel hybrid spatial attention assisted extended version of the LSTM and GRU model (HAS-E-LSTM-GRU) that integrates long and short-term temporal feature extraction and attention approach. The prolonged framework maximizes the number of hidden units, whereas the learning and complex patterns obtained from CXR images represent a key novelty of the proposed model. The extracted features are enhanced further using Fourier, Wavelet, Convolutional, Gabor, and Cosine transforms. It is pursued by a feature selection approach using Grey Wolf Optimizer (GWO) to determine the most significant features with higher interclass variance. For the detection of disease, an ensemble classifier integrating Naïve Bayes (NB), k Nearest Neighbors (KNN), Deep Forest (DF), and Support Vector Machine (SVM) is used to determine the COVID-19 presence or absence. Finally, the disease severity is evaluated using a self-attention-based 1D convolutional neural network (CNN) that classifies the diverse cases into mild, moderate, and severe. In association with the standard deep learning methodologies, the proposed architecture obtains a detection accuracy of 98.51% which illustrates the capability in the case of real-time clinical applications.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145750749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Linguistic Explanations of Deep Models in Detecting Autism and Schizophrenia 自闭症和精神分裂症深层模型的语言学解释
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-10 DOI: 10.1111/coin.70158
Aleksander Wawer, Małgorzata Krawczyk, Justyna Sarzyńska-Wawer, Bartosz Żuk

Automated detection of mental conditions from text is increasingly important due to the rise of digital communication and limited access to clinicians, but the methods often suffer from insufficient explainability. In this study, we propose a novel framework to utilize linguistic dimensions to explain the attribution scores (Integrated Gradients) of tokens in a deep neural network (HerBERT) trained to detect autism and schizophrenia from textual data. In our approach, the scores are mapped to syntactic and psycholinguistic variables, which are then used to train explainable models. The feature importance derived from these models is thereby linked to linguistic dimensions, enhancing interpretability. Our results emphasize the role of syntactic clues, such as grammatical gender and case, in language processing. We also demonstrate that the deep model's attribution scores, when mapped to linguistic features, correlate with selected clinical tests. We propose that our framework can enhance the analysis and interpretation of deep neural networks used for detecting psychiatric disorders. Our approach advances explainable artificial intelligence in mental health by providing researchers with deeper insights into the linguistic cues driving deep neural model predictions.

由于数字通信的兴起和临床医生的限制,从文本中自动检测精神状况变得越来越重要,但这些方法往往存在可解释性不足的问题。在这项研究中,我们提出了一个新的框架,利用语言维度来解释深度神经网络(HerBERT)中标记的归因分数(集成梯度),该网络被训练用于从文本数据中检测自闭症和精神分裂症。在我们的方法中,分数被映射到句法和心理语言变量,然后用于训练可解释的模型。因此,从这些模型中得出的特征重要性与语言维度相关联,从而增强了可解释性。我们的研究结果强调了句法线索在语言处理中的作用,如语法性别和格。我们还证明,深度模型的归因分数,当映射到语言特征时,与选定的临床测试相关。我们建议我们的框架可以增强用于检测精神疾病的深度神经网络的分析和解释。我们的方法通过为研究人员提供对驱动深度神经模型预测的语言线索的更深入了解,推动了心理健康领域可解释的人工智能。
{"title":"Linguistic Explanations of Deep Models in Detecting Autism and Schizophrenia","authors":"Aleksander Wawer,&nbsp;Małgorzata Krawczyk,&nbsp;Justyna Sarzyńska-Wawer,&nbsp;Bartosz Żuk","doi":"10.1111/coin.70158","DOIUrl":"https://doi.org/10.1111/coin.70158","url":null,"abstract":"<div>\u0000 \u0000 <p>Automated detection of mental conditions from text is increasingly important due to the rise of digital communication and limited access to clinicians, but the methods often suffer from insufficient explainability. In this study, we propose a novel framework to utilize linguistic dimensions to explain the attribution scores (Integrated Gradients) of tokens in a deep neural network (HerBERT) trained to detect autism and schizophrenia from textual data. In our approach, the scores are mapped to syntactic and psycholinguistic variables, which are then used to train explainable models. The feature importance derived from these models is thereby linked to linguistic dimensions, enhancing interpretability. Our results emphasize the role of syntactic clues, such as grammatical gender and case, in language processing. We also demonstrate that the deep model's attribution scores, when mapped to linguistic features, correlate with selected clinical tests. We propose that our framework can enhance the analysis and interpretation of deep neural networks used for detecting psychiatric disorders. Our approach advances explainable artificial intelligence in mental health by providing researchers with deeper insights into the linguistic cues driving deep neural model predictions.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145750831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Multimodal Deep Learning Framework for Conversational AI: Integrating Vision, Text, and Speech With Knowledge-Augmented Attention Mechanisms 对话式人工智能的新型多模态深度学习框架:将视觉、文本和语音与知识增强注意机制相结合
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-05 DOI: 10.1111/coin.70159
T. Vaikunta Pai, M. Manjula Mallya, Pramod Vishnu Naik, Virgil Popescu, Ramona Birau, Amir Karbassi Yazdi

Existing conversational AI models lack effective multimodal fusion, leading to poor contextual understanding and factual inconsistencies. This research proposes a novel multimodal deep learning framework that integrates vision (CNNs and Vision Transformers), text (Transformers like BERT and GPT-4), and speech (RNNs and Self-Supervised Learning) using an attention-based fusion mechanism. The model enhances sarcasm detection and sentiment analysis through contrastive learning, while graph neural networks (GNNs) ensure knowledge-augmented reasoning, reducing hallucinations in AI-generated responses. Reinforcement learning optimizes dialogue generation for more natural and contextually accurate interactions. Experimental evaluations on MELD, CMU-MOSEI, and AVSD datasets demonstrate a 9%–12% improvement in classification accuracy, 15% higher sarcasm detection efficiency, and a 23% reduction in factual inconsistencies compared to state-of-the-art models like GPT-4, CLIP, and Flamingo. Extensive benchmarking using precision-recall analysis and response fluency metrics validates the robustness of the proposed framework in multimodal sentiment classification and contextual alignment. This research lays the foundation for next-generation multimodal conversational AI, with potential applications in real-world domains such as virtual assistants, healthcare diagnostics, and financial intelligence systems.

现有的会话AI模型缺乏有效的多模态融合,导致上下文理解差和事实不一致。本研究提出了一种新的多模态深度学习框架,该框架使用基于注意力的融合机制集成了视觉(cnn和视觉转换器)、文本(BERT和GPT-4等转换器)和语音(rnn和自监督学习)。该模型通过对比学习增强了讽刺检测和情感分析,而图神经网络(gnn)确保了知识增强推理,减少了人工智能生成的反应中的幻觉。强化学习优化了对话生成,以实现更自然和上下文准确的交互。在MELD、CMU-MOSEI和AVSD数据集上的实验评估表明,与GPT-4、CLIP和Flamingo等最先进的模型相比,MELD、CMU-MOSEI和AVSD数据集的分类准确率提高了9%-12%,讽刺检测效率提高了15%,事实不一致性降低了23%。广泛的基准测试使用精确召回分析和响应流畅性指标验证了所提出的框架在多模态情感分类和上下文对齐中的鲁棒性。这项研究为下一代多模态会话人工智能奠定了基础,在虚拟助理、医疗诊断和金融智能系统等现实世界领域具有潜在的应用前景。
{"title":"A Novel Multimodal Deep Learning Framework for Conversational AI: Integrating Vision, Text, and Speech With Knowledge-Augmented Attention Mechanisms","authors":"T. Vaikunta Pai,&nbsp;M. Manjula Mallya,&nbsp;Pramod Vishnu Naik,&nbsp;Virgil Popescu,&nbsp;Ramona Birau,&nbsp;Amir Karbassi Yazdi","doi":"10.1111/coin.70159","DOIUrl":"https://doi.org/10.1111/coin.70159","url":null,"abstract":"<div>\u0000 \u0000 <p>Existing conversational AI models lack effective multimodal fusion, leading to poor contextual understanding and factual inconsistencies. This research proposes a novel multimodal deep learning framework that integrates vision (CNNs and Vision Transformers), text (Transformers like BERT and GPT-4), and speech (RNNs and Self-Supervised Learning) using an attention-based fusion mechanism. The model enhances sarcasm detection and sentiment analysis through contrastive learning, while graph neural networks (GNNs) ensure knowledge-augmented reasoning, reducing hallucinations in AI-generated responses. Reinforcement learning optimizes dialogue generation for more natural and contextually accurate interactions. Experimental evaluations on MELD, CMU-MOSEI, and AVSD datasets demonstrate a 9%–12% improvement in classification accuracy, 15% higher sarcasm detection efficiency, and a 23% reduction in factual inconsistencies compared to state-of-the-art models like GPT-4, CLIP, and Flamingo. Extensive benchmarking using precision-recall analysis and response fluency metrics validates the robustness of the proposed framework in multimodal sentiment classification and contextual alignment. This research lays the foundation for next-generation multimodal conversational AI, with potential applications in real-world domains such as virtual assistants, healthcare diagnostics, and financial intelligence systems.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145695152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combined Deep Learning Framework With Selective Features for Botnet Attack Detection in Internet of Things 结合深度学习框架和选择性特征的物联网僵尸网络攻击检测
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-04 DOI: 10.1111/coin.70157
V. Ashok Kumar, G. R. Kanagachidambaresan

Botnet attacks pose a substantial risk to Internet of Things (IoT) settings by using interconnected nodes for malicious intentions, including infiltrating networks or initiating Distributed Denial of Service (DDoS) attacks. To overwhelm systems and cause service interruptions and security breaches, these attacks take advantage of compromised IoT devices. Given the growing prevalence of IoT devices and their associated vulnerabilities, an effective botnet attack detection model is crucial for safeguarding network integrity. To address this critical issue, a new botnet attack detection approach based on the Improved Bi-LSTM-LinkNet model (IBLLNet) is proposed, which is specifically implemented for IoT settings. Preprocessing, feature extraction, feature selection, and detection are the four main stages of this methodology. To solve class imbalance, the preprocessing phase makes use of the Enhanced Synthetic Minority Over-sampling Technique (ESMOTE) method, which produces more representative synthetic data than the traditional SMOTE algorithm. During feature extraction, the proposed approach captures diverse network behaviors through raw, statistical, and entropy-based features. This hybrid approach utilizes Correlation-based Feature Selection (CFS), Recursive Feature Elimination (RFE), and Ridge (L2 regularization) methods, and incorporates an improved selection process where feature weights are assigned based on their predicted significance. In the detection phase, the IBLLNet model integrates an Improved Bidirectional Long Short-Term Memory (IBi-LSTM) and LinkNet models. Here, the IBi-LSTM model incorporates advanced layers like Bi-LSTM, Attentive Context Normalization (ACN), and Artificial Neural Network (ANN) layers to increase detection efficiency and accuracy. The effectiveness of this strategy is confirmed by thorough tests against a number of performance metrics and comparisons with current strategies, indicating its resilience in addressing security risks associated with botnet assaults in Internet of Things environments.

僵尸网络攻击通过利用互联节点进行恶意攻击,包括渗透网络或发起分布式拒绝服务(DDoS)攻击,对物联网(IoT)设置构成重大风险。为了压倒系统并导致服务中断和安全漏洞,这些攻击利用了受损的物联网设备。鉴于物联网设备及其相关漏洞的日益普及,有效的僵尸网络攻击检测模型对于保护网络完整性至关重要。为了解决这一关键问题,提出了一种新的基于改进的Bi-LSTM-LinkNet模型(IBLLNet)的僵尸网络攻击检测方法,该方法专门针对物联网设置实现。预处理、特征提取、特征选择和检测是该方法的四个主要阶段。为了解决类不平衡问题,预处理阶段使用了增强型合成少数派过采样技术(ESMOTE)方法,该方法产生的合成数据比传统的SMOTE算法更具代表性。在特征提取过程中,提出的方法通过原始的、统计的和基于熵的特征来捕获不同的网络行为。这种混合方法利用了基于关联的特征选择(CFS)、递归特征消除(RFE)和Ridge (L2正则化)方法,并结合了一种改进的选择过程,其中基于特征的预测重要性分配特征权重。在检测阶段,IBLLNet模型集成了IBi-LSTM (Improved Bidirectional Long - Short-Term Memory)和LinkNet模型。在这里,ibm - lstm模型结合了Bi-LSTM、细心上下文归一化(attention Context Normalization, ACN)和人工神经网络(Artificial Neural Network, ANN)等高级层,以提高检测效率和准确性。通过对多项性能指标的全面测试和与当前策略的比较,证实了该策略的有效性,表明其在解决物联网环境中与僵尸网络攻击相关的安全风险方面具有弹性。
{"title":"Combined Deep Learning Framework With Selective Features for Botnet Attack Detection in Internet of Things","authors":"V. Ashok Kumar,&nbsp;G. R. Kanagachidambaresan","doi":"10.1111/coin.70157","DOIUrl":"https://doi.org/10.1111/coin.70157","url":null,"abstract":"<div>\u0000 \u0000 <p>Botnet attacks pose a substantial risk to Internet of Things (IoT) settings by using interconnected nodes for malicious intentions, including infiltrating networks or initiating Distributed Denial of Service (DDoS) attacks. To overwhelm systems and cause service interruptions and security breaches, these attacks take advantage of compromised IoT devices. Given the growing prevalence of IoT devices and their associated vulnerabilities, an effective botnet attack detection model is crucial for safeguarding network integrity. To address this critical issue, a new botnet attack detection approach based on the Improved Bi-LSTM-LinkNet model (IBLLNet) is proposed, which is specifically implemented for IoT settings. Preprocessing, feature extraction, feature selection, and detection are the four main stages of this methodology. To solve class imbalance, the preprocessing phase makes use of the Enhanced Synthetic Minority Over-sampling Technique (ESMOTE) method, which produces more representative synthetic data than the traditional SMOTE algorithm. During feature extraction, the proposed approach captures diverse network behaviors through raw, statistical, and entropy-based features. This hybrid approach utilizes Correlation-based Feature Selection (CFS), Recursive Feature Elimination (RFE), and Ridge (L2 regularization) methods, and incorporates an improved selection process where feature weights are assigned based on their predicted significance. In the detection phase, the IBLLNet model integrates an Improved Bidirectional Long Short-Term Memory (IBi-LSTM) and LinkNet models. Here, the IBi-LSTM model incorporates advanced layers like Bi-LSTM, Attentive Context Normalization (ACN), and Artificial Neural Network (ANN) layers to increase detection efficiency and accuracy. The effectiveness of this strategy is confirmed by thorough tests against a number of performance metrics and comparisons with current strategies, indicating its resilience in addressing security risks associated with botnet assaults in Internet of Things environments.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145695140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Deep Learning Models for Detecting Ambiguities in Software Requirements: Harnessing BERT, Random Search, and Bi-LSTM 优化深度学习模型以检测软件需求中的歧义:利用BERT、随机搜索和Bi-LSTM
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-02 DOI: 10.1111/coin.70156
Younes Abdeahad, Esmaeil Kheirkhah, Mahdi Kabootari, Yalda Kheirkhah, Reza Sheibani

Requirements engineering is one of the most crucial parts of the lifecycle of software engineering. Many programs fail annually due to deficiencies in requirements engineering. Requirements engineering documents are written in natural languages, which can lead to ambiguities. The presence of ambiguity in natural language causes misunderstandings. Accurate and timely identification of these requirements is vital for the development process. However, manual classification is time-consuming and necessitates automation. Today, with the rapid advancement of technology, machine learning and deep learning are being used to detect these ambiguities in requirement specification documents. The BERT word embedding technique and the Bi-LSTM algorithm were used in this research. We have used meta-heuristic algorithms to choose the best value of hyperparameters of our deep learning algorithm. The publicly available Fault-Prone SRS dataset was utilized to train the models. This dataset was also used to evaluate the performance of the proposed algorithm in terms of F1-score, accuracy, and other statistical metrics. The BERT-BiLSTM model outperformed other models in classifying and detecting ambiguities in requirement specification documents, achieving an F1-score and higher than 81% accuracy.

需求工程是软件工程生命周期中最关键的部分之一。由于需求工程方面的缺陷,许多项目每年都会失败。需求工程文档是用自然语言编写的,这可能导致歧义。自然语言中歧义的存在会引起误解。准确和及时地识别这些需求对开发过程至关重要。然而,手工分类是费时的,需要自动化。今天,随着技术的快速发展,机器学习和深度学习被用于检测需求规范文档中的这些模糊性。本研究采用BERT词嵌入技术和Bi-LSTM算法。我们使用元启发式算法来选择深度学习算法的超参数的最佳值。利用公开可用的Fault-Prone SRS数据集来训练模型。该数据集还用于根据f1分数、准确性和其他统计指标评估所提出算法的性能。BERT-BiLSTM模型在需求规范文档中的歧义分类和检测方面优于其他模型,达到了f1分,准确率高于81%。
{"title":"Optimizing Deep Learning Models for Detecting Ambiguities in Software Requirements: Harnessing BERT, Random Search, and Bi-LSTM","authors":"Younes Abdeahad,&nbsp;Esmaeil Kheirkhah,&nbsp;Mahdi Kabootari,&nbsp;Yalda Kheirkhah,&nbsp;Reza Sheibani","doi":"10.1111/coin.70156","DOIUrl":"https://doi.org/10.1111/coin.70156","url":null,"abstract":"<div>\u0000 \u0000 <p>Requirements engineering is one of the most crucial parts of the lifecycle of software engineering. Many programs fail annually due to deficiencies in requirements engineering. Requirements engineering documents are written in natural languages, which can lead to ambiguities. The presence of ambiguity in natural language causes misunderstandings. Accurate and timely identification of these requirements is vital for the development process. However, manual classification is time-consuming and necessitates automation. Today, with the rapid advancement of technology, machine learning and deep learning are being used to detect these ambiguities in requirement specification documents. The BERT word embedding technique and the Bi-LSTM algorithm were used in this research. We have used meta-heuristic algorithms to choose the best value of hyperparameters of our deep learning algorithm. The publicly available Fault-Prone SRS dataset was utilized to train the models. This dataset was also used to evaluate the performance of the proposed algorithm in terms of F1-score, accuracy, and other statistical metrics. The BERT-BiLSTM model outperformed other models in classifying and detecting ambiguities in requirement specification documents, achieving an F1-score and higher than 81% accuracy.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145695167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1