首页 > 最新文献

Computational Intelligence最新文献

英文 中文
Age-Related Macular Degeneration Detection Using OCT Image: Deep Hybrid Classifier With Improved PCA-Based Selective Feature Set 基于OCT图像的年龄相关性黄斑变性检测:基于改进pca的选择性特征集的深度混合分类器
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-10 DOI: 10.1111/coin.70142
Aravapalli Sri Chaitanya, R. Arunkumar, Lakshmana Phaneendra Maguluri

Age-related macular degeneration (AMD) is a progressive retinal disease that can lead to vision loss if not diagnosed early. Accurate and timely detection is essential for effective treatment. This study presents a comprehensive deep learning-based framework for the automated detection of AMD using optical coherence tomography (OCT) images. The proposed method integrates advanced preprocessing, segmentation, feature extraction, and classification techniques. An improved Wiener filter is applied to enhance image quality by reducing noise while preserving edge details. A U-Net architecture is then used for accurate segmentation of retinal regions. Subsequently, diverse features such as histogram of oriented gradients (HOG), gray-level co-occurrence matrix (GLCM), scale-invariant feature transform (SIFT), VGG16, statistical features, and improved median ternary pattern (MTP) are extracted from segmented images. Then, an enhanced principal component analysis (PCA) with feature scaling is used for the optimal selection of features to improve classification robustness. Finally, a hybrid model combining deep convolutional neural networks (DCNNs) and AlexNet is proposed for AMD classification. Moreover, the proposed model is evaluated against traditional methods and achieves superior performance, with an accuracy of 0.958 and F-measure of 0.964. Experimental results confirm the model's effectiveness and reliability, highlighting its potential for clinical application in early AMD detection and diagnosis.

年龄相关性黄斑变性(AMD)是一种进行性视网膜疾病,如果不及早诊断,可导致视力丧失。准确和及时的检测对于有效治疗至关重要。本研究提出了一个全面的基于深度学习的框架,用于使用光学相干断层扫描(OCT)图像自动检测AMD。该方法集成了先进的预处理、分割、特征提取和分类技术。改进的维纳滤波器通过降低噪声来提高图像质量,同时保留边缘细节。然后使用U-Net架构对视网膜区域进行精确分割。随后,从分割图像中提取定向梯度直方图(HOG)、灰度共生矩阵(GLCM)、尺度不变特征变换(SIFT)、VGG16、统计特征和改进中值三元模式(MTP)等多种特征。然后,采用特征缩放的增强主成分分析(PCA)对特征进行优化选择,提高分类的鲁棒性。最后,提出了一种结合深度卷积神经网络(DCNNs)和AlexNet的混合模型用于AMD分类。并与传统方法进行了对比,结果表明,该模型的准确率为0.958,F-measure值为0.964。实验结果证实了该模型的有效性和可靠性,突出了其在AMD早期检测和诊断中的临床应用潜力。
{"title":"Age-Related Macular Degeneration Detection Using OCT Image: Deep Hybrid Classifier With Improved PCA-Based Selective Feature Set","authors":"Aravapalli Sri Chaitanya,&nbsp;R. Arunkumar,&nbsp;Lakshmana Phaneendra Maguluri","doi":"10.1111/coin.70142","DOIUrl":"https://doi.org/10.1111/coin.70142","url":null,"abstract":"<div>\u0000 \u0000 <p>Age-related macular degeneration (AMD) is a progressive retinal disease that can lead to vision loss if not diagnosed early. Accurate and timely detection is essential for effective treatment. This study presents a comprehensive deep learning-based framework for the automated detection of AMD using optical coherence tomography (OCT) images. The proposed method integrates advanced preprocessing, segmentation, feature extraction, and classification techniques. An improved Wiener filter is applied to enhance image quality by reducing noise while preserving edge details. A U-Net architecture is then used for accurate segmentation of retinal regions. Subsequently, diverse features such as histogram of oriented gradients (HOG), gray-level co-occurrence matrix (GLCM), scale-invariant feature transform (SIFT), VGG16, statistical features, and improved median ternary pattern (MTP) are extracted from segmented images. Then, an enhanced principal component analysis (PCA) with feature scaling is used for the optimal selection of features to improve classification robustness. Finally, a hybrid model combining deep convolutional neural networks (DCNNs) and AlexNet is proposed for AMD classification. Moreover, the proposed model is evaluated against traditional methods and achieves superior performance, with an accuracy of 0.958 and <i>F</i>-measure of 0.964. Experimental results confirm the model's effectiveness and reliability, highlighting its potential for clinical application in early AMD detection and diagnosis.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145521510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Precision-Weighted Federated Learning 精确加权联邦学习
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-04 DOI: 10.1111/coin.70150
Jonatan Reyes, Lisa Di Jorio, Cecile Low-Kam, Marta Kersten-Oertel
<p>Federated learning (FL) using the federated averaging (FedAvg) algorithm has shown great advantages for large-scale applications that rely on collaborative learning, especially when the training data is either unbalanced or inaccessible due to privacy constraints. We hypothesize that FedAvg underestimates the full extent of heterogeneity of data when the aggregation is performed. We propose <i>Precision-Weighted Federated Learning (PW)</i> a novel algorithm that takes into account the second raw moment (uncentered variance) of the stochastic gradient when computing the weighted average of the parameters of independent models trained in a FL setting. With PW, we address the communication and statistical challenges for the training of distributed models with private data and provide an alternate averaging scheme that leverages the heterogeneity of the data when it has a large diversity of features in its composition. Our method was evaluated using three standard image classification datasets (MNIST, Fashion-MNIST, and CIFAR) under two different data partitioning strategies: independent and identically distributed (IID), and nonidentical and nonindependent (non-IID). These experiments were designed to measure the performance and efficiency of our method in resource-constrained environments, such as mobile and IoT devices. The experimental results demonstrate that we can obtain a good balance between computational efficiency and convergence rates with PW. Our performance evaluations show <span></span><math> <semantics> <mrow> <mn>9</mn> <mo>%</mo> </mrow> <annotation>$$ 9% $$</annotation> </semantics></math> better predictions with MNIST, <span></span><math> <semantics> <mrow> <mn>18</mn> <mo>%</mo> </mrow> <annotation>$$ 18% $$</annotation> </semantics></math> with Fashion-MNIST, and <span></span><math> <semantics> <mrow> <mn>5</mn> <mo>%</mo> </mrow> <annotation>$$ 5% $$</annotation> </semantics></math> with CIFAR-10 in the non-IID setting. Further reliability evaluations ratify the stability in our method by reaching a 99% reliability index with IID partitions and 96% with non-IID partitions. In addition, we obtained a <span></span><math> <semantics> <mrow> <mn>20</mn> <mo>×</mo> </mrow> <annotation>$$ 20times $$</annotation> </semantics></math> speedup on Fashion-MNIST with only 10 clients and up to <span></span><math> <semantics> <mrow> <mn>37</mn> <mo>×</mo>
使用联邦平均(fedag)算法的联邦学习(FL)对于依赖协作学习的大规模应用程序显示出巨大的优势,特别是当训练数据不平衡或由于隐私约束而无法访问时。我们假设fedag在进行聚合时低估了数据异质性的全部程度。我们提出了一种新的精确加权联邦学习(PW)算法,该算法在计算FL设置中训练的独立模型参数的加权平均值时考虑了随机梯度的第二个原始矩(非中心方差)。通过PW,我们解决了使用私有数据训练分布式模型的通信和统计挑战,并提供了一种替代的平均方案,当数据在其组成中具有大量多样性的特征时,该方案利用了数据的异质性。使用三个标准图像分类数据集(MNIST, Fashion-MNIST和CIFAR)在两种不同的数据分区策略下对我们的方法进行了评估:独立和同分布(IID)和非相同和非独立(non-IID)。这些实验旨在衡量我们的方法在资源受限环境(如移动和物联网设备)中的性能和效率。实验结果表明,采用PW可以很好地平衡计算效率和收敛速度。我们的绩效评估显示9 % $$ 9% $$ better predictions with MNIST, 18 % $$ 18% $$ with Fashion-MNIST, and 5 % $$ 5% $$ with CIFAR-10 in the non-IID setting. Further reliability evaluations ratify the stability in our method by reaching a 99% reliability index with IID partitions and 96% with non-IID partitions. In addition, we obtained a 20 × $$ 20times $$ speedup on Fashion-MNIST with only 10 clients and up to 37 × $$ 37times $$ with 100 clients participating in the aggregation concurrently per communication round. Overall, PW demonstrates improved stability and accuracy with increasing batch sizes, and it benefits significantly from lower learning rates and longer local training, compared to FedAvg and FedProx. The results indicate that PW is an effective and faster alternative approach for aggregating model updates derived from private data, especially in domains where data is highly heterogeneous.
{"title":"Precision-Weighted Federated Learning","authors":"Jonatan Reyes,&nbsp;Lisa Di Jorio,&nbsp;Cecile Low-Kam,&nbsp;Marta Kersten-Oertel","doi":"10.1111/coin.70150","DOIUrl":"https://doi.org/10.1111/coin.70150","url":null,"abstract":"&lt;p&gt;Federated learning (FL) using the federated averaging (FedAvg) algorithm has shown great advantages for large-scale applications that rely on collaborative learning, especially when the training data is either unbalanced or inaccessible due to privacy constraints. We hypothesize that FedAvg underestimates the full extent of heterogeneity of data when the aggregation is performed. We propose &lt;i&gt;Precision-Weighted Federated Learning (PW)&lt;/i&gt; a novel algorithm that takes into account the second raw moment (uncentered variance) of the stochastic gradient when computing the weighted average of the parameters of independent models trained in a FL setting. With PW, we address the communication and statistical challenges for the training of distributed models with private data and provide an alternate averaging scheme that leverages the heterogeneity of the data when it has a large diversity of features in its composition. Our method was evaluated using three standard image classification datasets (MNIST, Fashion-MNIST, and CIFAR) under two different data partitioning strategies: independent and identically distributed (IID), and nonidentical and nonindependent (non-IID). These experiments were designed to measure the performance and efficiency of our method in resource-constrained environments, such as mobile and IoT devices. The experimental results demonstrate that we can obtain a good balance between computational efficiency and convergence rates with PW. Our performance evaluations show &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;9&lt;/mn&gt;\u0000 &lt;mo&gt;%&lt;/mo&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ 9% $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; better predictions with MNIST, &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;18&lt;/mn&gt;\u0000 &lt;mo&gt;%&lt;/mo&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ 18% $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; with Fashion-MNIST, and &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;5&lt;/mn&gt;\u0000 &lt;mo&gt;%&lt;/mo&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ 5% $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; with CIFAR-10 in the non-IID setting. Further reliability evaluations ratify the stability in our method by reaching a 99% reliability index with IID partitions and 96% with non-IID partitions. In addition, we obtained a &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;20&lt;/mn&gt;\u0000 &lt;mo&gt;×&lt;/mo&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ 20times $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; speedup on Fashion-MNIST with only 10 clients and up to &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;37&lt;/mn&gt;\u0000 &lt;mo&gt;×&lt;/mo&gt;\u0000 ","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/coin.70150","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DStaL: Multilevel Fusion Classifier Based Analysis of Multimodal Sentiments Using Deep Learning Models 基于多层融合分类器的深度学习多模态情感分析
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-03 DOI: 10.1111/coin.70154
Nasheet Tarik, Ashish Jadhav

Multimodal data can more vividly and strangely represent consumers' feelings and sentiments than single-modal content. Users are moving beyond traditional text-based content on social media and are now more commonly incorporating images and text to convey their experiences and articulate their opinions. Traditional text-based techniques have given way to multimodal sentiment analysis, which poses a more complex challenge. This work attempts to address the challenge of sentiment analysis in image-text posts by presenting a refined recognition method that efficiently leverages textual and visual content information. Initially, a single dataset can be created by combining the MVSA-Single and MVSA-Multiple datasets. It is gathered to improve data quality by pre-processing the data with a Gaussian bilateral filter (GBF) for picture features and lemmatization, stemming, and stop word removal for text characteristics. The normalized term-inverse document frequency (NorTID) model is used to extract text features. The Convolutional VGG-16 (ConV-16) model extracts image characteristics. The Dense Stacked Long Short-Term Memory Network (DStaL) model is used to independently analyze the collected multimodal information. The obtained features are fused together, and the sentiments are effectively classified using the Multilevel Fusion Classifier (MFuse) model. The study illustrates the enhanced efficacy of the suggested strategy by comparing the results to those of standard approaches using several performance measures. For the MVSA-Single and MVSA-Multiple datasets, the findings show a 97.3% accuracy, 94.8% precision, 94.8% recall, 97.3% specificity, 92.1% Matthews Correlation Coefficient (MCC), 36.3% Root Mean Squared Error (RMSE), and 0.07% Mean Absolute Error (MAE).

与单模态内容相比,多模态数据更能生动、奇特地表达消费者的感受和情绪。用户正在超越传统的基于文本的社交媒体内容,现在更普遍地结合图像和文本来传达他们的体验和表达他们的观点。传统的基于文本的情感分析技术已经让位于多模态情感分析,这给情感分析带来了更加复杂的挑战。这项工作试图通过提出一种有效利用文本和视觉内容信息的精细识别方法来解决图像-文本帖子中情感分析的挑战。最初,可以通过组合MVSA-Single和MVSA-Multiple数据集来创建单个数据集。利用高斯双边滤波器(GBF)对图像特征进行预处理,对文本特征进行词序化、词干提取和停止词去除,从而提高数据质量。使用归一化项逆文档频率(NorTID)模型提取文本特征。卷积VGG-16 (convl -16)模型提取图像特征。采用密集堆叠长短期记忆网络(DStaL)模型对采集到的多模态信息进行独立分析。将得到的特征融合在一起,使用多层融合分类器(Multilevel Fusion Classifier, MFuse)模型对情感进行有效分类。该研究通过将结果与使用几种绩效指标的标准方法进行比较,说明了所建议策略的增强功效。对于MVSA-Single和MVSA-Multiple数据集,结果显示准确率为97.3%,精密度为94.8%,召回率为94.8%,特异性为97.3%,马修斯相关系数(MCC)为92.1%,均方根误差(RMSE)为36.3%,平均绝对误差(MAE)为0.07%。
{"title":"DStaL: Multilevel Fusion Classifier Based Analysis of Multimodal Sentiments Using Deep Learning Models","authors":"Nasheet Tarik,&nbsp;Ashish Jadhav","doi":"10.1111/coin.70154","DOIUrl":"https://doi.org/10.1111/coin.70154","url":null,"abstract":"<div>\u0000 \u0000 <p>Multimodal data can more vividly and strangely represent consumers' feelings and sentiments than single-modal content. Users are moving beyond traditional text-based content on social media and are now more commonly incorporating images and text to convey their experiences and articulate their opinions. Traditional text-based techniques have given way to multimodal sentiment analysis, which poses a more complex challenge. This work attempts to address the challenge of sentiment analysis in image-text posts by presenting a refined recognition method that efficiently leverages textual and visual content information. Initially, a single dataset can be created by combining the MVSA-Single and MVSA-Multiple datasets. It is gathered to improve data quality by pre-processing the data with a Gaussian bilateral filter (GBF) for picture features and lemmatization, stemming, and stop word removal for text characteristics. The normalized term-inverse document frequency (NorTID) model is used to extract text features. The Convolutional VGG-16 (ConV-16) model extracts image characteristics. The Dense Stacked Long Short-Term Memory Network (DStaL) model is used to independently analyze the collected multimodal information. The obtained features are fused together, and the sentiments are effectively classified using the Multilevel Fusion Classifier (MFuse) model. The study illustrates the enhanced efficacy of the suggested strategy by comparing the results to those of standard approaches using several performance measures. For the MVSA-Single and MVSA-Multiple datasets, the findings show a 97.3% accuracy, 94.8% precision, 94.8% recall, 97.3% specificity, 92.1% Matthews Correlation Coefficient (MCC), 36.3% Root Mean Squared Error (RMSE), and 0.07% Mean Absolute Error (MAE).</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Survey on AI Ethics: A Socio-Technical Perspective 人工智能伦理调查:社会技术视角
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-02 DOI: 10.1111/coin.70149
Dave Mbiazi, Meghana Bhange, Maryam Babaei, Ivaxi Sheth, Patrik Kenfack, Samira Ebrahimi Kahou

The past decade has observed a significant advancement in AI, with deep learning-based models being deployed in diverse scenarios, including safety-critical applications. As these AI systems become deeply embedded in our societal infrastructure, the repercussions of their decisions and actions have significant consequences, making the ethical implications of AI deployment highly relevant and essential. The ethical concerns associated with AI are multifaceted, including challenging issues of fairness, privacy and data protection, responsibility and accountability, safety and robustness, transparency and explainability, and environmental impact. These principles together form the foundations of ethical AI considerations that concern every stakeholder in the AI system lifecycle. In light of the present ethical and future x-risk concerns, governments have shown increasing interest in establishing guidelines for the ethical deployment of AI. This work unifies the current and future ethical concerns of deploying AI into society. While we acknowledge and appreciate the technical surveys for each of the ethical principles concerned, in this paper, we aim to provide a comprehensive overview that not only addresses each principle from a technical point of view but also discusses them from a social perspective.

过去十年,人工智能取得了重大进展,基于深度学习的模型被部署在各种场景中,包括安全关键型应用程序。随着这些人工智能系统深深嵌入到我们的社会基础设施中,它们的决策和行动的影响会产生重大后果,这使得人工智能部署的伦理影响高度相关且至关重要。与人工智能相关的伦理问题是多方面的,包括公平性、隐私和数据保护、责任和问责制、安全性和稳健性、透明度和可解释性以及环境影响等具有挑战性的问题。这些原则共同构成了人工智能伦理考虑的基础,涉及人工智能系统生命周期中的每个利益相关者。鉴于目前的道德和未来的x风险问题,各国政府越来越有兴趣为人工智能的道德部署制定指导方针。这项工作统一了将人工智能部署到社会中的当前和未来的伦理问题。虽然我们承认并赞赏每个有关伦理原则的技术调查,但在本文中,我们的目标是提供一个全面的概述,不仅从技术角度解决每个原则,而且从社会角度讨论它们。
{"title":"Survey on AI Ethics: A Socio-Technical Perspective","authors":"Dave Mbiazi,&nbsp;Meghana Bhange,&nbsp;Maryam Babaei,&nbsp;Ivaxi Sheth,&nbsp;Patrik Kenfack,&nbsp;Samira Ebrahimi Kahou","doi":"10.1111/coin.70149","DOIUrl":"https://doi.org/10.1111/coin.70149","url":null,"abstract":"<p>The past decade has observed a significant advancement in AI, with deep learning-based models being deployed in diverse scenarios, including safety-critical applications. As these AI systems become deeply embedded in our societal infrastructure, the repercussions of their decisions and actions have significant consequences, making the ethical implications of AI deployment highly relevant and essential. The ethical concerns associated with AI are multifaceted, including challenging issues of fairness, privacy and data protection, responsibility and accountability, safety and robustness, transparency and explainability, and environmental impact. These principles together form the foundations of ethical AI considerations that concern every stakeholder in the AI system lifecycle. In light of the present ethical and future x-risk concerns, governments have shown increasing interest in establishing guidelines for the ethical deployment of AI. This work unifies the current and future ethical concerns of deploying AI into society. While we acknowledge and appreciate the technical surveys for each of the ethical principles concerned, in this paper, we aim to provide a comprehensive overview that not only addresses each principle from a technical point of view but also discusses them from a social perspective.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/coin.70149","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145469389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Manipulator Collaboration Based on Topology Transformation 基于拓扑变换的多机械臂协作
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-28 DOI: 10.1111/coin.70151
Liangxiong Chen, Wenxin Mu, Mei Liu

For solving the problem of dynamic task topology change in multi-manipulator cooperative system, a distributed control scheme based on topology transformation is proposed. In this scheme, a topological transformation mechanism is introduced so that the multi-manipulator cooperative system dynamically adapts to the multi-stage tasks with topological changes. To facilitate the solution of the problem, the distributed scheme is converted into an optimization problem. In addition, a gradient neural network solver with an activation function is designed, which greatly improves the convergence speed of the solver. Theoretical analysis is given, which confirms the convergence of the solver. Furthermore, the effectiveness of the proposed scheme is demonstrated.

针对多机械手协作系统中任务拓扑动态变化的问题,提出了一种基于拓扑变换的分布式控制方案。该方案引入了拓扑转换机制,使多机械臂协作系统能够动态适应拓扑变化的多阶段任务。为了便于求解,将分布式方案转化为优化问题。此外,设计了带激活函数的梯度神经网络求解器,大大提高了求解器的收敛速度。理论分析证实了该算法的收敛性。进一步验证了该方案的有效性。
{"title":"Multi-Manipulator Collaboration Based on Topology Transformation","authors":"Liangxiong Chen,&nbsp;Wenxin Mu,&nbsp;Mei Liu","doi":"10.1111/coin.70151","DOIUrl":"https://doi.org/10.1111/coin.70151","url":null,"abstract":"<div>\u0000 \u0000 <p>For solving the problem of dynamic task topology change in multi-manipulator cooperative system, a distributed control scheme based on topology transformation is proposed. In this scheme, a topological transformation mechanism is introduced so that the multi-manipulator cooperative system dynamically adapts to the multi-stage tasks with topological changes. To facilitate the solution of the problem, the distributed scheme is converted into an optimization problem. In addition, a gradient neural network solver with an activation function is designed, which greatly improves the convergence speed of the solver. Theoretical analysis is given, which confirms the convergence of the solver. Furthermore, the effectiveness of the proposed scheme is demonstrated.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Blockchain-Integrated Deep Learning Framework for Securing Smart Healthcare Communication Networks 一种用于保护智能医疗通信网络的新型区块链集成深度学习框架
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-27 DOI: 10.1111/coin.70132
Rohit Malik, Sakshi Dua, Preety Shoran, Kamal Upreti

With the rapid expansion of intelligent medical equipment and their interconnectedness through the Internet of Things (IoT), addressing safety issues in the communicating system has become increasingly critical. A learning mechanism is proposed for an intelligent healthcare-based communication system that uses blockchain for secure network communication and incorporates a data evaluation layer based on cloud which actively segregates and ranks transactions into three main categories: Good, Moderate, and Malware. Fog servers are utilized to route the communicating nodes via Rician and Rayleigh channels. The learning mechanism employs a deep neural network to instruct and classify categories, thereby improving the blockchain layer's decision-making process. This paper introduces several significant contributions, such as the development of a secure blockchain framework for user authentication and a protected digital ledger for communication. Additionally, it incorporates a cloud-driven data analysis layer combined with a neural network to improve training accuracy and category classification. The developed algorithm surpassed the existing works in terms of quality of service (QoS) parameters with low latency, bit error rate (BER), higher signal to inference plus noise ratio (SINR), packet delivery ratio (PDR), true detection rate (TDR), false detection rate (FDR), and throughput. Also, a thorough comparison of consensus mechanisms like practical Byzantine fault tolerance (pBFT), proof of work (PoW), Raft, and Paxos is done to ensure which consensus helps optimize the proposed system in terms of security and fault tolerance with low latency and energy-efficient operations. It also establishes a secure and efficient communication network for smart healthcare, aimed at enhancing the overall quality of life for individuals.

随着智能医疗设备的快速扩展及其通过物联网(IoT)的互联,解决通信系统中的安全问题变得越来越重要。提出了一种基于智能医疗保健通信系统的学习机制,该系统使用区块链进行安全网络通信,并结合基于云的数据评估层,主动将事务划分为良好、中等和恶意三大类。雾服务器被用来通过瑞利和瑞利信道路由通信节点。学习机制采用深度神经网络对类别进行指导和分类,从而改善区块链层的决策过程。本文介绍了几个重要的贡献,例如开发了用于用户身份验证的安全区块链框架和用于通信的受保护的数字分类账。此外,它还结合了云驱动的数据分析层和神经网络,以提高训练精度和类别分类。该算法在低时延、低误码率(BER)、高信噪比(SINR)、高包投递比(PDR)、真检测率(TDR)、假检测率(FDR)、吞吐量等服务质量(QoS)参数方面超越了现有的工作。此外,还对实际的拜占庭容错(pBFT)、工作量证明(PoW)、Raft和Paxos等共识机制进行了全面的比较,以确保哪种共识有助于在安全性和容错方面优化所提议的系统,同时实现低延迟和节能操作。它还为智能医疗建立了一个安全高效的通信网络,旨在提高个人的整体生活质量。
{"title":"A Novel Blockchain-Integrated Deep Learning Framework for Securing Smart Healthcare Communication Networks","authors":"Rohit Malik,&nbsp;Sakshi Dua,&nbsp;Preety Shoran,&nbsp;Kamal Upreti","doi":"10.1111/coin.70132","DOIUrl":"https://doi.org/10.1111/coin.70132","url":null,"abstract":"<div>\u0000 \u0000 <p>With the rapid expansion of intelligent medical equipment and their interconnectedness through the Internet of Things (IoT), addressing safety issues in the communicating system has become increasingly critical. A learning mechanism is proposed for an intelligent healthcare-based communication system that uses blockchain for secure network communication and incorporates a data evaluation layer based on cloud which actively segregates and ranks transactions into three main categories: Good, Moderate, and Malware. Fog servers are utilized to route the communicating nodes via Rician and Rayleigh channels. The learning mechanism employs a deep neural network to instruct and classify categories, thereby improving the blockchain layer's decision-making process. This paper introduces several significant contributions, such as the development of a secure blockchain framework for user authentication and a protected digital ledger for communication. Additionally, it incorporates a cloud-driven data analysis layer combined with a neural network to improve training accuracy and category classification. The developed algorithm surpassed the existing works in terms of quality of service (QoS) parameters with low latency, bit error rate (BER), higher signal to inference plus noise ratio (SINR), packet delivery ratio (PDR), true detection rate (TDR), false detection rate (FDR), and throughput. Also, a thorough comparison of consensus mechanisms like practical Byzantine fault tolerance (pBFT), proof of work (PoW), Raft, and Paxos is done to ensure which consensus helps optimize the proposed system in terms of security and fault tolerance with low latency and energy-efficient operations. It also establishes a secure and efficient communication network for smart healthcare, aimed at enhancing the overall quality of life for individuals.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MTT: Multi-Tier Transformer for Early Diagnosis of Autism Spectrum Disorder and Language Disorder in Infants 多层变压器早期诊断婴儿自闭症谱系障碍和语言障碍
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-23 DOI: 10.1111/coin.70131
Jingyu Liu, Honghua Wang, Jialing Wang, Han Wu, Yifei Xu, Tao Zhang, Guangfeng Zhou

Current diagnosis of autism spectrum disorder (ASD) and developmental language/reading disorder (DLD/DD) relies predominantly on subjective behavioral assessments, this underscoring the urgent need for objective biomarkers to enable early intervention. This study proposes a Multi-Tier Transformer (MTT) model for early identification of ASD and DLD/DD using resting-state EEG baseline power values. To address severe class imbalance, we augmented the dataset using the SMOTE method. The MTT architecture integrates a Feature-Embedding layer, a Feature-Attention mechanism that dynamically weights multi-spectral inputs, and a dual-attention encoding block comprising both self-attention and cross-attention to enhance contextual representation learning from limited samples. Transfer learning was further employed to improve robustness by pre-training on augmented data and fine-tuning on original samples. Evaluated on clinical infant EEG data, the proposed MTT achieved an accuracy of 0.91 (95% CI: 0.89–0.93), recall of 0.89, and AUC of 0.97, significantly outperforming the state-of-the-art FT-transformer (p = 0.00091). The results indicate that MTT provides a robust and interpretable deep learning tool for auxiliary diagnosis of neurodevelopmental disorders in infancy.

目前自闭症谱系障碍(ASD)和发展性语言/阅读障碍(DLD/DD)的诊断主要依赖于主观行为评估,这强调了迫切需要客观的生物标志物来实现早期干预。本研究提出了一种多层变压器(Multi-Tier Transformer, MTT)模型,利用静息状态脑电图基线功率值来早期识别ASD和DLD/DD。为了解决严重的类不平衡问题,我们使用SMOTE方法增强了数据集。MTT架构集成了一个特征嵌入层、一个动态加权多光谱输入的特征注意机制和一个包含自注意和交叉注意的双注意编码块,以增强有限样本的上下文表示学习。通过对增强数据的预训练和对原始样本的微调,进一步利用迁移学习来提高鲁棒性。对临床婴儿脑电图数据进行评估,MTT的准确率为0.91 (95% CI: 0.89 - 0.93),召回率为0.89,AUC为0.97,显著优于最先进的FT-transformer (p = 0.00091)。结果表明,MTT为婴幼儿神经发育障碍的辅助诊断提供了一个强大的、可解释的深度学习工具。
{"title":"MTT: Multi-Tier Transformer for Early Diagnosis of Autism Spectrum Disorder and Language Disorder in Infants","authors":"Jingyu Liu,&nbsp;Honghua Wang,&nbsp;Jialing Wang,&nbsp;Han Wu,&nbsp;Yifei Xu,&nbsp;Tao Zhang,&nbsp;Guangfeng Zhou","doi":"10.1111/coin.70131","DOIUrl":"https://doi.org/10.1111/coin.70131","url":null,"abstract":"<div>\u0000 \u0000 <p>Current diagnosis of autism spectrum disorder (ASD) and developmental language/reading disorder (DLD/DD) relies predominantly on subjective behavioral assessments, this underscoring the urgent need for objective biomarkers to enable early intervention. This study proposes a Multi-Tier Transformer (MTT) model for early identification of ASD and DLD/DD using resting-state EEG baseline power values. To address severe class imbalance, we augmented the dataset using the SMOTE method. The MTT architecture integrates a Feature-Embedding layer, a Feature-Attention mechanism that dynamically weights multi-spectral inputs, and a dual-attention encoding block comprising both self-attention and cross-attention to enhance contextual representation learning from limited samples. Transfer learning was further employed to improve robustness by pre-training on augmented data and fine-tuning on original samples. Evaluated on clinical infant EEG data, the proposed MTT achieved an accuracy of 0.91 (95% CI: 0.89–0.93), recall of 0.89, and AUC of 0.97, significantly outperforming the state-of-the-art FT-transformer (<i>p</i> = 0.00091). The results indicate that MTT provides a robust and interpretable deep learning tool for auxiliary diagnosis of neurodevelopmental disorders in infancy.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145367041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Merging Subgroup Information to Supplement Personal Information for Personalized Federated Learning Through Similar Client Grouping 合并子组信息补充个人信息,通过相似客户分组实现个性化联合学习
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-22 DOI: 10.1111/coin.70135
Xuan Cai, Wenan Zhou

Personalized federated learning represents a pivotal strategy for addressing the challenges posed by statistical heterogeneity in federated learning. Clients optimize their models by leveraging information from other clients through a global model. However, data heterogeneity constrains the generalization capacity of the global model, thereby degrading the feature representation capability of local client models, especially in clients with limited data. In response to this challenge, we propose the Federal Merging Subgroup Information (FedMSI) method to augment personalized information in personalized federated learning. On the server side, FedMSI employs model clustering to identify subgroups of clients with similar personalized data distributions. It then aggregates cluster center models within each subgroup and transmits them to clients for use in the subsequent round of assisted training. On the client side, FedMSI introduces a local optimization objective that incorporates the cluster center model, enabling the extraction of informative knowledge to enhance local training. Experiments demonstrate the effectiveness of FedMSI across different datasets, data heterogeneity levels, and data sizes. Ablation experiments further confirm the effectiveness of the design of the local optimization objective. Compared to state-of-the-art methods, FedMSI achieves a 13.16% improvement in scalability performance accuracy.

个性化联邦学习是解决联邦学习中统计异质性带来的挑战的关键策略。客户通过全局模型利用来自其他客户的信息来优化他们的模型。然而,数据异构性限制了全局模型的泛化能力,从而降低了局部客户端模型的特征表示能力,特别是在数据有限的客户端中。针对这一挑战,我们提出了联邦合并子组信息(FedMSI)方法来增强个性化联邦学习中的个性化信息。在服务器端,FedMSI使用模型聚类来识别具有类似个性化数据分布的客户端子组。然后,它将每个子组中的集群中心模型聚合起来,并将它们传输给客户,以便在后续的辅助培训中使用。在客户端,FedMSI引入了一个包含集群中心模型的局部优化目标,从而能够提取信息知识以增强局部训练。实验证明了FedMSI在不同数据集、数据异质性水平和数据大小上的有效性。烧蚀实验进一步验证了局部优化目标设计的有效性。与最先进的方法相比,FedMSI在可伸缩性性能精度方面提高了13.16%。
{"title":"Merging Subgroup Information to Supplement Personal Information for Personalized Federated Learning Through Similar Client Grouping","authors":"Xuan Cai,&nbsp;Wenan Zhou","doi":"10.1111/coin.70135","DOIUrl":"https://doi.org/10.1111/coin.70135","url":null,"abstract":"<div>\u0000 \u0000 <p>Personalized federated learning represents a pivotal strategy for addressing the challenges posed by statistical heterogeneity in federated learning. Clients optimize their models by leveraging information from other clients through a global model. However, data heterogeneity constrains the generalization capacity of the global model, thereby degrading the feature representation capability of local client models, especially in clients with limited data. In response to this challenge, we propose the Federal Merging Subgroup Information (FedMSI) method to augment personalized information in personalized federated learning. On the server side, FedMSI employs model clustering to identify subgroups of clients with similar personalized data distributions. It then aggregates cluster center models within each subgroup and transmits them to clients for use in the subsequent round of assisted training. On the client side, FedMSI introduces a local optimization objective that incorporates the cluster center model, enabling the extraction of informative knowledge to enhance local training. Experiments demonstrate the effectiveness of FedMSI across different datasets, data heterogeneity levels, and data sizes. Ablation experiments further confirm the effectiveness of the design of the local optimization objective. Compared to state-of-the-art methods, FedMSI achieves a 13.16% improvement in scalability performance accuracy.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145341754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling of Nonlinear Oscillator System via Double Loop Radial Basis Function Neural Networks With Adaptive Radius and Lattices 基于自适应半径和格的双环径向基函数神经网络的非线性振子系统建模
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-22 DOI: 10.1111/coin.70153
Guo Luo, Yong Liu, Youcun Fang, Choujun Zhan, Bencong Jiang, Zhipeng Zhou

As modelling of nonlinear oscillator systems plays an important part in science and engineering fields, a double loop Radial Basis Function Neural Network (RBFNN) with adaptive radius and lattices is proposed for handling this issue. In this design, a large enough lattice arranged to cover all of the trajectories is taken as the mapping center of the RBFNN at the initial condition. The number of lattices will be dynamically adjusted, and those lattices far from the trajectories will be removed. Applying Taylor expansion in local space, the activated radius factor can be separated from the Gaussian function. In order to guarantee that the modelling scheme has the characteristic of fast convergence, the error power function is utilized to minimize the gain parameter of the error differential equation. In the double loop structure, the updated equation of weights and activated radius can be determined by the Lyapunov function, which can guarantee that the weights and the activated radius will converge to the neighborhood of their true value and the tracking error of state trajectories will converge to the neighborhood of zero. In order to show the effectiveness and superiority of the double loop RBFNN proposed in this paper, Helmholtz–Duffing and Vanderpol–Duffing are used as the testing objects of the nonlinear oscillator system while comparing with Deterministic Learning.

由于非线性振子系统的建模在科学和工程领域具有重要意义,提出了一种具有自适应半径和格的双环径向基函数神经网络(RBFNN)来处理这一问题。在本设计中,在初始条件下,选取一个足够大的栅格作为RBFNN的映射中心,栅格的排列足以覆盖所有轨迹。网格的数量将被动态调整,远离轨迹的网格将被移除。利用局部空间中的泰勒展开,可以将激活半径因子从高斯函数中分离出来。为了保证建模方案具有快速收敛的特点,利用误差幂函数最小化误差微分方程的增益参数。在双环结构中,利用Lyapunov函数确定权值和激活半径的更新方程,保证权值和激活半径收敛于其真值的邻域,状态轨迹的跟踪误差收敛于零邻域。为了证明本文提出的双环RBFNN的有效性和优越性,采用Helmholtz-Duffing和Vanderpol-Duffing作为非线性振子系统的测试对象,并与确定性学习进行比较。
{"title":"Modelling of Nonlinear Oscillator System via Double Loop Radial Basis Function Neural Networks With Adaptive Radius and Lattices","authors":"Guo Luo,&nbsp;Yong Liu,&nbsp;Youcun Fang,&nbsp;Choujun Zhan,&nbsp;Bencong Jiang,&nbsp;Zhipeng Zhou","doi":"10.1111/coin.70153","DOIUrl":"https://doi.org/10.1111/coin.70153","url":null,"abstract":"<div>\u0000 \u0000 <p>As modelling of nonlinear oscillator systems plays an important part in science and engineering fields, a double loop Radial Basis Function Neural Network (RBFNN) with adaptive radius and lattices is proposed for handling this issue. In this design, a large enough lattice arranged to cover all of the trajectories is taken as the mapping center of the RBFNN at the initial condition. The number of lattices will be dynamically adjusted, and those lattices far from the trajectories will be removed. Applying Taylor expansion in local space, the activated radius factor can be separated from the Gaussian function. In order to guarantee that the modelling scheme has the characteristic of fast convergence, the error power function is utilized to minimize the gain parameter of the error differential equation. In the double loop structure, the updated equation of weights and activated radius can be determined by the Lyapunov function, which can guarantee that the weights and the activated radius will converge to the neighborhood of their true value and the tracking error of state trajectories will converge to the neighborhood of zero. In order to show the effectiveness and superiority of the double loop RBFNN proposed in this paper, Helmholtz–Duffing and Vanderpol–Duffing are used as the testing objects of the nonlinear oscillator system while comparing with Deterministic Learning.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 6","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145341752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Adaptive LLM Instructions Optimization for Aspect-Based Sentiment Analysis by Incorporating Emotion-Oriented In-Contexts 基于情绪导向的情境情感分析的自适应LLM指令优化
IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-21 DOI: 10.1111/coin.70129
Weiqiang Jin, Junli Wang, Yang Gao, Bohang Shi, Ningwei Wang, Biao Zhao, Guang Yang

Aspect-based Sentiment Analysis (ABSA) is a vital NLP task that identifies sentiment towards specific entities or aspect terms within a text. Recently, large language models (LLMs) have shown impressive capabilities in semantic comprehension and logical inference. However, LLM hallucinations pose challenges in accurately determining sentiment polarity for aspect terms, leading to performance issues. Moreover, current ABSA methods often fail to fully leverage the vast prior knowledge embedded within LLMs, resulting in suboptimal classification outcomes for specific aspects. Inspired by these challenges, we propose the BYD-OBS-ABSA framework—‘Beyond Simple Observations, Embracing Comprehensive Contextual Insights’ for ABSA tasks. This framework leverages unique in-context constraints, backgrounds, and analogical reasoning to address LLM hallucinations and uses self-adaptive bootstrap instructions optimization to enhance LLM predictions. BYD-OBS-ABSA integrates various in-context augmentation strategies, including emotion-oriented backgrounds, constraints, and analogical reasoning. BYD-OBS-ABSA further improves initial LLM instructions through adaptive iterative optimization using a random search bootstrap algorithm, maximizing the benefits of LLM prompting. Extensive zero/few-shot experiments with GPT-3.5-turbo across six public datasets validate the effectiveness and robustness of our framework, even surpassing human judgment in certain scenarios.

基于方面的情感分析(ABSA)是一项重要的NLP任务,用于识别对文本中特定实体或方面术语的情感。近年来,大型语言模型(llm)在语义理解和逻辑推理方面表现出了令人印象深刻的能力。然而,LLM幻觉在准确确定方面术语的情感极性方面提出了挑战,从而导致性能问题。此外,目前的ABSA方法往往不能充分利用llm中嵌入的大量先验知识,导致特定方面的分类结果不理想。受到这些挑战的启发,我们提出了针对ABSA任务的BYD-OBS-ABSA框架——“超越简单观察,拥抱全面的情境洞察”。该框架利用独特的上下文约束、背景和类比推理来解决LLM幻觉,并使用自适应引导指令优化来增强LLM预测。BYD-OBS-ABSA集成了各种情境增强策略,包括情感导向背景、约束和类比推理。BYD-OBS-ABSA通过使用随机搜索自举算法的自适应迭代优化进一步改进初始LLM指令,最大化LLM提示的好处。在六个公共数据集上使用gpt -3.5 turbo进行了大量零/少射实验,验证了我们框架的有效性和鲁棒性,甚至在某些情况下超越了人类的判断。
{"title":"Self-Adaptive LLM Instructions Optimization for Aspect-Based Sentiment Analysis by Incorporating Emotion-Oriented In-Contexts","authors":"Weiqiang Jin,&nbsp;Junli Wang,&nbsp;Yang Gao,&nbsp;Bohang Shi,&nbsp;Ningwei Wang,&nbsp;Biao Zhao,&nbsp;Guang Yang","doi":"10.1111/coin.70129","DOIUrl":"https://doi.org/10.1111/coin.70129","url":null,"abstract":"<div>\u0000 \u0000 <p>Aspect-based Sentiment Analysis (ABSA) is a vital NLP task that identifies sentiment towards specific entities or aspect terms within a text. Recently, large language models (LLMs) have shown impressive capabilities in semantic comprehension and logical inference. However, LLM hallucinations pose challenges in accurately determining sentiment polarity for aspect terms, leading to performance issues. Moreover, current ABSA methods often fail to fully leverage the vast prior knowledge embedded within LLMs, resulting in suboptimal classification outcomes for specific aspects. Inspired by these challenges, we propose the BYD-OBS-ABSA framework—‘Beyond Simple Observations, Embracing Comprehensive Contextual Insights’ for ABSA tasks. This framework leverages unique in-context constraints, backgrounds, and analogical reasoning to address LLM hallucinations and uses self-adaptive bootstrap instructions optimization to enhance LLM predictions. BYD-OBS-ABSA integrates various in-context augmentation strategies, including emotion-oriented backgrounds, constraints, and analogical reasoning. BYD-OBS-ABSA further improves initial LLM instructions through adaptive iterative optimization using a random search bootstrap algorithm, maximizing the benefits of LLM prompting. Extensive zero/few-shot experiments with <span>GPT-3.5-turbo</span> across six public datasets validate the effectiveness and robustness of our framework, even surpassing human judgment in certain scenarios.</p>\u0000 </div>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"41 5","pages":""},"PeriodicalIF":1.7,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1