首页 > 最新文献

IEEE Transactions on Emerging Topics in Computational Intelligence最新文献

英文 中文
The Goldilocks Principle: Achieving Just Right Boundary Fidelity for Long-Tailed Classification 金发姑娘原则:为长尾分类实现恰到好处的边界保真度
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-04-14 DOI: 10.1109/TETCI.2025.3551950
Faizanuddin Ansari;Abhranta Panigrahi;Swagatam Das
This study addresses the challenges of learning from long-tailed class imbalances in deep neural networks, particularly for image recognition. Long-tailed class imbalances occur when a dataset's class distribution is highly skewed, with a few head classes containing many instances and numerous tail classes having fewer instances. This imbalance becomes problematic when traditional classification methods, especially deep learning models, prioritize accuracy in the more frequent classes, neglecting the less common ones. Furthermore, these methods struggle to maintain consistent boundary fidelity—decision boundaries that are sharp enough to distinguish classes yet smooth enough to generalize well. Hard boundaries, often caused by overfitting tail classes, amplify intra-class variations, while overly soft boundaries blur distinctions between classes, reducing classification accuracy. We propose a dual-branch network with a shared feature extractor to overcome these challenges. This network uses instance and median samplers for head and medium classes and a reverse sampler for tail classes. Additionally, we implement a specialized loss function as a feature regularizer to reduce the model's sensitivity to irrelevant intra-class variations during classification. This loss function dynamically modulates feature representation alignment, promoting cohesive intra-class structures and clear inter-class separations. To achieve this, our framework incorporates two key components: Dual-Branch Sampler-Guided Mixup (DBSGM) and Adaptive Class-Aware Feature Regularizer (ACFR), which work together to balance class representation and refine decision boundaries. Integrating DBSGM and ACFR during training helps shape decision boundaries that align with class semantics. To ensure class boundaries are appropriately defined, we propose the temperature-adaptive supervised contrastive loss (TASCL) within the ACFR module, achieving the right balance between smoothness and sharpness. Our single-stage, end-to-end framework demonstrates significant performance improvements, offering a promising solution to the challenges of long-tailed class imbalances in deep learning.
本研究解决了深度神经网络中从长尾类不平衡中学习的挑战,特别是在图像识别方面。当数据集的类分布高度倾斜时,就会出现长尾类不平衡,一些头部类包含许多实例,而许多尾部类具有更少的实例。当传统的分类方法,特别是深度学习模型,优先考虑更频繁的类别的准确性,而忽略了不太常见的类别时,这种不平衡就会出现问题。此外,这些方法很难保持一致的边界保真度——决策边界足够清晰,可以区分类,但又足够平滑,可以很好地进行泛化。硬边界通常由尾类的过度拟合引起,它放大了类内的变化,而过于软的边界模糊了类之间的区别,降低了分类的准确性。我们提出了一个带有共享特征提取器的双分支网络来克服这些挑战。该网络使用实例和中位数采样器对头部和中等类别和反向采样器对尾部类别。此外,我们实现了一个专门的损失函数作为特征正则化器,以降低模型在分类过程中对不相关的类内变化的敏感性。这个损失函数动态调节特征表示对齐,促进类内结构的内聚和类间分离。为了实现这一点,我们的框架包含了两个关键组件:双分支采样器引导的混合(DBSGM)和自适应类感知特征正则化(ACFR),它们一起工作以平衡类表示并细化决策边界。在训练期间集成DBSGM和ACFR有助于形成与类语义一致的决策边界。为了确保类边界的适当定义,我们在ACFR模块中提出了温度自适应监督对比损失(TASCL),在平滑和锐利之间实现了适当的平衡。我们的单阶段端到端框架展示了显著的性能改进,为深度学习中长尾类失衡的挑战提供了一个有希望的解决方案。
{"title":"The Goldilocks Principle: Achieving Just Right Boundary Fidelity for Long-Tailed Classification","authors":"Faizanuddin Ansari;Abhranta Panigrahi;Swagatam Das","doi":"10.1109/TETCI.2025.3551950","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3551950","url":null,"abstract":"This study addresses the challenges of learning from long-tailed class imbalances in deep neural networks, particularly for image recognition. Long-tailed class imbalances occur when a dataset's class distribution is highly skewed, with a few head classes containing many instances and numerous tail classes having fewer instances. This imbalance becomes problematic when traditional classification methods, especially deep learning models, prioritize accuracy in the more frequent classes, neglecting the less common ones. Furthermore, these methods struggle to maintain consistent boundary fidelity—decision boundaries that are sharp enough to distinguish classes yet smooth enough to generalize well. Hard boundaries, often caused by overfitting tail classes, amplify intra-class variations, while overly soft boundaries blur distinctions between classes, reducing classification accuracy. We propose a dual-branch network with a shared feature extractor to overcome these challenges. This network uses instance and median samplers for head and medium classes and a reverse sampler for tail classes. Additionally, we implement a specialized loss function as a feature regularizer to reduce the model's sensitivity to irrelevant intra-class variations during classification. This loss function dynamically modulates feature representation alignment, promoting cohesive intra-class structures and clear inter-class separations. To achieve this, our framework incorporates two key components: Dual-Branch Sampler-Guided Mixup (DBSGM) and Adaptive Class-Aware Feature Regularizer (ACFR), which work together to balance class representation and refine decision boundaries. Integrating DBSGM and ACFR during training helps shape decision boundaries that align with class semantics. To ensure class boundaries are appropriately defined, we propose the temperature-adaptive supervised contrastive loss (TASCL) within the ACFR module, achieving the right balance between smoothness and sharpness. Our single-stage, end-to-end framework demonstrates significant performance improvements, offering a promising solution to the challenges of long-tailed class imbalances in deep learning.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 5","pages":"3650-3664"},"PeriodicalIF":5.3,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145128547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smart Energy Hub Frequency Control-Based Machine Learning 智能能源集线器频率控制的机器学习
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-04-10 DOI: 10.1109/TETCI.2025.3551991
Burak Yildirim;Meysam Gheisarnejad;Mohammad Hassan Khooban
The increasing variety of energy conversion units and storage equipment connected to the multi-energy system, along with the uncertain factors posed by distributed wind and photovoltaic power generation, have made the energy flow structure of the system more complex. This complexity has created significant challenges for the frequency regulation of traditional energy hub systems. One of the characteristics of a microgrid (MG) is the use of combined heat and power (CHP) systems to generate both electrical and thermal energy at the same time. This can boost the system's dependability, efficiency, and economic performance. As a CHP's ramping capability makes it a useful tool for monitoring and controlling the MG's frequency, it will be employed in this research to achieve this goal. The complexity of the system's dynamics and set tasks throughout the course of the performance period necessitates advanced control structures for the MG with CHP systems. To address the challenges of controlling in MG with CHP systems, this research introduces a novel control structure based on deep reinforcement learning and single input interval type-2 fuzzy fractional-order proportional integral (SIT2-FFOPI) for this system. The SIT2-FFOPI serves as the main controller, with its fundamental parameters established through the utilization of the Improved Salp Swarm Algorithm (ISSA) optimization technique. An adaptive deep deterministic policy gradient (DDPG)-based actor-critic system has been developed to enhance the main controller's learning potential, thereby enabling it to more effectively address control challenges in the isolated MG. The efficacy of the suggested approach in real-time was evaluated through simulations carried out utilizing an OPAL-RT-based Hardware-in-the-Loop (HiL) configuration. As a result of this study, it was determined that the proposed controller for load disturbance, renewable energy sources (RES) power changes, and contingency circumstances in MG outperforms other controllers in terms of performance.
接入多能系统的能量转换单元和存储设备种类越来越多,加上分布式风电和光伏发电带来的不确定性因素,使得多能系统的能量流结构更加复杂。这种复杂性给传统能源枢纽系统的频率调节带来了重大挑战。微电网(MG)的特点之一是使用热电联产(CHP)系统同时产生电能和热能。这可以提高系统的可靠性、效率和经济性能。由于CHP的爬坡能力使其成为监测和控制MG频率的有用工具,因此将在本研究中使用它来实现这一目标。在整个运行过程中,系统的动力学和任务设置的复杂性需要先进的热电联产系统控制结构。为了解决热电联产系统在MG控制中的挑战,本研究引入了一种基于深度强化学习和单输入区间2型模糊分数阶比例积分(SIT2-FFOPI)的新型控制结构。SIT2-FFOPI作为主控制器,利用改进的Salp群算法(ISSA)优化技术建立基本参数。开发了一种基于自适应深度确定性策略梯度(DDPG)的actor-critic系统,以增强主控制器的学习潜力,从而使其能够更有效地解决孤立MG中的控制挑战。通过利用基于opal - rt的硬件在环(HiL)配置进行的仿真,实时评估了所建议方法的有效性。研究结果表明,针对负载扰动、可再生能源(RES)功率变化和MG中的突发情况,所提出的控制器在性能上优于其他控制器。
{"title":"Smart Energy Hub Frequency Control-Based Machine Learning","authors":"Burak Yildirim;Meysam Gheisarnejad;Mohammad Hassan Khooban","doi":"10.1109/TETCI.2025.3551991","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3551991","url":null,"abstract":"The increasing variety of energy conversion units and storage equipment connected to the multi-energy system, along with the uncertain factors posed by distributed wind and photovoltaic power generation, have made the energy flow structure of the system more complex. This complexity has created significant challenges for the frequency regulation of traditional energy hub systems. One of the characteristics of a microgrid (MG) is the use of combined heat and power (CHP) systems to generate both electrical and thermal energy at the same time. This can boost the system's dependability, efficiency, and economic performance. As a CHP's ramping capability makes it a useful tool for monitoring and controlling the MG's frequency, it will be employed in this research to achieve this goal. The complexity of the system's dynamics and set tasks throughout the course of the performance period necessitates advanced control structures for the MG with CHP systems. To address the challenges of controlling in MG with CHP systems, this research introduces a novel control structure based on deep reinforcement learning and single input interval type-2 fuzzy fractional-order proportional integral (SIT2-FFOPI) for this system. The SIT2-FFOPI serves as the main controller, with its fundamental parameters established through the utilization of the Improved Salp Swarm Algorithm (ISSA) optimization technique. An adaptive deep deterministic policy gradient (DDPG)-based actor-critic system has been developed to enhance the main controller's learning potential, thereby enabling it to more effectively address control challenges in the isolated MG. The efficacy of the suggested approach in real-time was evaluated through simulations carried out utilizing an OPAL-RT-based Hardware-in-the-Loop (HiL) configuration. As a result of this study, it was determined that the proposed controller for load disturbance, renewable energy sources (RES) power changes, and contingency circumstances in MG outperforms other controllers in terms of performance.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 5","pages":"3638-3649"},"PeriodicalIF":5.3,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145128551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey of Multimodal Fake News Detection: A Cross-Modal Interaction Perspective 多模态假新闻检测研究:跨模态交互视角
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-04-09 DOI: 10.1109/TETCI.2025.3543389
Xianghua Li;Jiao Qiao;Shu Yin;Lianwei Wu;Chao Gao;Zhen Wang;Xuelong Li
The growth of social media platforms has made it easier for fake news to spread, which poses a significant threat to authoritative news outlets, politics, and public health. Manual verification of the massive amount of online information has proven to be a daunting task, which has led to the growing interest in automatic fake news detection. Some methods that rely on news text, images, external knowledge, social contexts, or propagation graphs have demonstrated good performance. In contrast to earlier studies that focused solely on the unimodal news textual information, recent works have integrated multimodal features from various granularities, such as words, visual semantic regions, and multimodal entities, to more effectively leverage news content and align with human reading habits. However, a comprehensive review of Multimodal Fake News Detection (MFND) is still lacking, prompting our aim to complement this topic. Specifically, we present a systematic taxonomy from the perspective of cross-modal interactions. We categorize existing methods into the data-based, entity-based, and knowledge-based approaches. Connections between various works are detailed when outlining representative papers. Additionally, we introduce prevalent multimodal learning methods, present accessible MFND datasets and evaluation metrics, and analyze current research results. Finally, the promising future research directions are discussed.
社交媒体平台的发展使假新闻更容易传播,这对权威新闻媒体、政治和公共卫生构成了重大威胁。事实证明,对大量在线信息进行人工验证是一项艰巨的任务,这导致人们对自动检测假新闻的兴趣日益浓厚。一些依赖于新闻文本、图像、外部知识、社会背景或传播图的方法表现出了良好的性能。与早期研究仅关注单模态新闻文本信息不同,最近的研究整合了来自不同粒度的多模态特征,如单词、视觉语义区域和多模态实体,以更有效地利用新闻内容并与人类阅读习惯保持一致。然而,对多模式假新闻检测(MFND)的全面审查仍然缺乏,这促使我们的目标是补充这一主题。具体来说,我们从跨模态相互作用的角度提出了一个系统的分类法。我们将现有的方法分为基于数据、基于实体和基于知识的方法。在概述代表性论文时,详细介绍了各作品之间的联系。此外,我们介绍了流行的多模态学习方法,提供了可访问的MFND数据集和评估指标,并分析了当前的研究成果。最后,对未来的研究方向进行了展望。
{"title":"A Survey of Multimodal Fake News Detection: A Cross-Modal Interaction Perspective","authors":"Xianghua Li;Jiao Qiao;Shu Yin;Lianwei Wu;Chao Gao;Zhen Wang;Xuelong Li","doi":"10.1109/TETCI.2025.3543389","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3543389","url":null,"abstract":"The growth of social media platforms has made it easier for fake news to spread, which poses a significant threat to authoritative news outlets, politics, and public health. Manual verification of the massive amount of online information has proven to be a daunting task, which has led to the growing interest in automatic fake news detection. Some methods that rely on news text, images, external knowledge, social contexts, or propagation graphs have demonstrated good performance. In contrast to earlier studies that focused solely on the unimodal news textual information, recent works have integrated multimodal features from various granularities, such as words, visual semantic regions, and multimodal entities, to more effectively leverage news content and align with human reading habits. However, a comprehensive review of Multimodal Fake News Detection (MFND) is still lacking, prompting our aim to complement this topic. Specifically, we present a systematic taxonomy from the perspective of cross-modal interactions. We categorize existing methods into the data-based, entity-based, and knowledge-based approaches. Connections between various works are detailed when outlining representative papers. Additionally, we introduce prevalent multimodal learning methods, present accessible MFND datasets and evaluation metrics, and analyze current research results. Finally, the promising future research directions are discussed.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 4","pages":"2658-2675"},"PeriodicalIF":5.3,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144687742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SCGAN: Sampling and Clustering-Based Neural Architecture Search for GANs SCGAN:基于采样和聚类的gan神经结构搜索
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-31 DOI: 10.1109/TETCI.2025.3547611
Qingling Zhu;Yeming Yang;Songbai Liu;Qiuzhen Lin;Kay Chen Tan
The evolutionary neural architecture search for generative adversarial networks (GANs) has demonstrated promising performance for generating high-quality images. However, two challenges persist, including the long search times and unstable search results. To alleviate these problems, this paper proposes a sampling and clustering-based neural architecture search algorithm for GANs, named SCGAN, which can significantly improve searching efficiency and enhance generation quality. Two improved strategies are proposed in SCGAN. First, a constraint sampling strategy is designed to limit the parameter capacity of architectures, which calculates their architecture size and discards those exceeding a reasonable parameter threshold. Second, a clustering selection strategy is applied in each architecture iteration, which integrates a decomposition selection mechanism and a hierarchical clustering mechanism to further improve search stability. Extensive experiments on the CIFAR-10 and STL-10 datasets demonstrated that SCGAN only requires 0.4 GPU days to find a promising GAN architecture in a vast search space including approximately 10$^{15}$ networks. Our best-found GAN outperformed those obtained by other neural architecture search methods with performance metric results (IS = 9.68$pm$ 0.06, FID = 5.54) on CIFAR-10 and (IS = 12.12$pm$ 0.13, FID = 12.54) on STL-10.
生成对抗网络(GANs)的进化神经结构搜索在生成高质量图像方面表现出了良好的性能。然而,仍然存在两个挑战,包括较长的搜索时间和不稳定的搜索结果。为了解决这些问题,本文提出了一种基于采样和聚类的gan神经结构搜索算法SCGAN,该算法可以显著提高搜索效率和生成质量。在SCGAN中提出了两种改进策略。首先,设计了一种约束采样策略来限制体系结构的参数容量,该策略计算体系结构的大小,并丢弃超过合理参数阈值的结构。其次,在每次架构迭代中采用聚类选择策略,该策略集成了分解选择机制和分层聚类机制,进一步提高了搜索稳定性;在CIFAR-10和STL-10数据集上的大量实验表明,SCGAN只需要0.4 GPU天就可以在包括大约10美元^{15}美元网络的巨大搜索空间中找到一个有前途的GAN架构。我们发现的最佳GAN在CIFAR-10和STL-10上的性能指标结果(IS = 9.68$pm$ 0.06, FID = 5.54)和(IS = 12.12$pm$ 0.13, FID = 12.54)优于其他神经结构搜索方法获得的GAN。
{"title":"SCGAN: Sampling and Clustering-Based Neural Architecture Search for GANs","authors":"Qingling Zhu;Yeming Yang;Songbai Liu;Qiuzhen Lin;Kay Chen Tan","doi":"10.1109/TETCI.2025.3547611","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3547611","url":null,"abstract":"The evolutionary neural architecture search for generative adversarial networks (GANs) has demonstrated promising performance for generating high-quality images. However, two challenges persist, including the long search times and unstable search results. To alleviate these problems, this paper proposes a sampling and clustering-based neural architecture search algorithm for GANs, named SCGAN, which can significantly improve searching efficiency and enhance generation quality. Two improved strategies are proposed in SCGAN. First, a constraint sampling strategy is designed to limit the parameter capacity of architectures, which calculates their architecture size and discards those exceeding a reasonable parameter threshold. Second, a clustering selection strategy is applied in each architecture iteration, which integrates a decomposition selection mechanism and a hierarchical clustering mechanism to further improve search stability. Extensive experiments on the CIFAR-10 and STL-10 datasets demonstrated that SCGAN only requires 0.4 GPU days to find a promising GAN architecture in a vast search space including approximately 10<inline-formula><tex-math>$^{15}$</tex-math></inline-formula> networks. Our best-found GAN outperformed those obtained by other neural architecture search methods with performance metric results (IS = 9.68<inline-formula><tex-math>$pm$</tex-math></inline-formula> 0.06, FID = 5.54) on CIFAR-10 and (IS = 12.12<inline-formula><tex-math>$pm$</tex-math></inline-formula> 0.13, FID = 12.54) on STL-10.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 5","pages":"3626-3637"},"PeriodicalIF":5.3,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145128515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MSPredictor: A Multi-Scale Dynamic Graph Neural Network for Multivariate Time Series Prediction mpredictor:一个多尺度动态图神经网络,用于多变量时间序列预测
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-31 DOI: 10.1109/TETCI.2025.3548719
Jiashan Wan;Na Xia;Gongwen Li;Jingyang Li;Jinhua Wu;Xulei Pan;Mengqi Lian
In the field of multivariate time series prediction, capturing the dynamic relationships and complex cyclical patterns between sequences is key to improving prediction accuracy. To address this challenge, our paper introduces MSPredictor, a multi-scale dynamic graph neural network model, which uses Fast Fourier Transform for multi-scale decoupling in the frequency domain and employs Kolmogorov-Arnold Networks for multi-scale fusion, effectively extracting significant cyclical patterns. By decomposing the original series across different scales, MSPredictor accurately models complex cyclical patterns. To enhance the model's transparency and interpretability, we introduced the ClarityLens explanatory strategy, which employs visualization techniques to make the prediction process more transparent. Specifically, it displays the adjacency matrices learned at different scales, intuitively showing the dynamic correlations between series. We also visualized the proportion of different periods in the prediction results and the specific forecasting performance at each time scale. Extensive testing on multiple real-world datasets has demonstrated that the MSPredictor significantly outperforms existing benchmarks, validating its practicality and high transparency.
在多变量时间序列预测中,序列之间的动态关系和复杂的周期模式是提高预测精度的关键。为了解决这一挑战,本文引入了一种多尺度动态图神经网络模型MSPredictor,该模型在频域使用快速傅里叶变换进行多尺度解耦,并使用Kolmogorov-Arnold网络进行多尺度融合,有效地提取了重要的周期模式。通过在不同尺度上分解原始序列,MSPredictor可以准确地模拟复杂的周期模式。为了提高模型的透明度和可解释性,我们引入了ClarityLens解释策略,该策略采用可视化技术使预测过程更加透明。具体来说,它显示了在不同尺度下学习到的邻接矩阵,直观地显示了序列之间的动态相关性。我们还可视化了不同时期在预测结果中所占的比例以及每个时间尺度下的具体预测效果。在多个真实数据集上的广泛测试表明,MSPredictor显著优于现有的基准测试,验证了其实用性和高透明度。
{"title":"MSPredictor: A Multi-Scale Dynamic Graph Neural Network for Multivariate Time Series Prediction","authors":"Jiashan Wan;Na Xia;Gongwen Li;Jingyang Li;Jinhua Wu;Xulei Pan;Mengqi Lian","doi":"10.1109/TETCI.2025.3548719","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3548719","url":null,"abstract":"In the field of multivariate time series prediction, capturing the dynamic relationships and complex cyclical patterns between sequences is key to improving prediction accuracy. To address this challenge, our paper introduces MSPredictor, a multi-scale dynamic graph neural network model, which uses Fast Fourier Transform for multi-scale decoupling in the frequency domain and employs Kolmogorov-Arnold Networks for multi-scale fusion, effectively extracting significant cyclical patterns. By decomposing the original series across different scales, MSPredictor accurately models complex cyclical patterns. To enhance the model's transparency and interpretability, we introduced the ClarityLens explanatory strategy, which employs visualization techniques to make the prediction process more transparent. Specifically, it displays the adjacency matrices learned at different scales, intuitively showing the dynamic correlations between series. We also visualized the proportion of different periods in the prediction results and the specific forecasting performance at each time scale. Extensive testing on multiple real-world datasets has demonstrated that the MSPredictor significantly outperforms existing benchmarks, validating its practicality and high transparency.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 6","pages":"4243-4257"},"PeriodicalIF":5.3,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145584598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prompt-Based Out-of-Distribution Intent Detection 基于提示的分布外意图检测
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-27 DOI: 10.1109/TETCI.2024.3372440
Rudolf Chow;Albert Y. S. Lam
Recent rapid advances in pre-trained language models, such as BERT and GPT, in natural language processing (NLP) have greatly improved the efficacy of text classifiers, easily surpassing human level performance in standard datasets like GLUE. However, most of these standard tasks implicitly assume a closed-world situation, where all testing data are supposed to lie in the same scope or distribution of the training data. Out-of-distribution (OOD) detection is the task of detecting when an input data point lies beyond the scope of the seen training set. This is becoming increasingly important as NLP agents, such as chatbots or virtual assistants, have been being deployed ubiquitously in our daily lives, thus attracting more attention from the research community to make it more accurate and robust at the same time. Recent work can be broadly categorized into two orthogonal approaches – data generative/augmentative methods and threshold/boundary learning. In this work, we follow the former and propose a method for the task based on prompting, which is known for its zero and few-shot capabilities. Generating synthetic outliers in terms of prompts allows the model to more efficiently learn OOD samples than the existing methods. Testing on nine different settings across three standard datasets used for OOD detection, our method with adaptive decision boundary is able to achieve competitive or superior performances compared with the current state-of-the-art in all cases. We also provide extensive analysis on each dataset as well as perform comprehensive ablation studies on each component of our model.
最近在自然语言处理(NLP)中,预训练语言模型(如BERT和GPT)的快速发展极大地提高了文本分类器的效率,在GLUE等标准数据集上很容易超过人类水平的性能。然而,这些标准任务中的大多数都隐含地假设了一个封闭世界的情况,其中所有测试数据都应该位于训练数据的相同范围或分布中。out -distribution (OOD)检测是一种检测输入数据点是否在已知训练集范围之外的任务。随着聊天机器人或虚拟助手等NLP代理在我们的日常生活中无处不在,这一点变得越来越重要,从而吸引了研究界的更多关注,同时使其更加准确和健壮。最近的工作可以大致分为两种正交方法-数据生成/增强方法和阈值/边界学习。在这项工作中,我们遵循前者并提出了一种基于提示的任务方法,该方法以其零和少射能力而闻名。根据提示生成合成异常值使模型比现有方法更有效地学习OOD样本。在用于OOD检测的三个标准数据集的九种不同设置上进行测试,我们的方法具有自适应决策边界,与当前最先进的方法相比,在所有情况下都能够获得具有竞争力或更好的性能。我们还对每个数据集进行广泛的分析,并对我们模型的每个组成部分进行全面的消融研究。
{"title":"Prompt-Based Out-of-Distribution Intent Detection","authors":"Rudolf Chow;Albert Y. S. Lam","doi":"10.1109/TETCI.2024.3372440","DOIUrl":"https://doi.org/10.1109/TETCI.2024.3372440","url":null,"abstract":"Recent rapid advances in pre-trained language models, such as BERT and GPT, in natural language processing (NLP) have greatly improved the efficacy of text classifiers, easily surpassing human level performance in standard datasets like GLUE. However, most of these standard tasks implicitly assume a closed-world situation, where all testing data are supposed to lie in the same scope or distribution of the training data. Out-of-distribution (OOD) detection is the task of detecting when an input data point lies beyond the scope of the seen training set. This is becoming increasingly important as NLP agents, such as chatbots or virtual assistants, have been being deployed ubiquitously in our daily lives, thus attracting more attention from the research community to make it more accurate and robust at the same time. Recent work can be broadly categorized into two orthogonal approaches – data generative/augmentative methods and threshold/boundary learning. In this work, we follow the former and propose a method for the task based on prompting, which is known for its zero and few-shot capabilities. Generating synthetic outliers in terms of prompts allows the model to more efficiently learn OOD samples than the existing methods. Testing on nine different settings across three standard datasets used for OOD detection, our method with adaptive decision boundary is able to achieve competitive or superior performances compared with the current state-of-the-art in all cases. We also provide extensive analysis on each dataset as well as perform comprehensive ablation studies on each component of our model.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 3","pages":"2371-2382"},"PeriodicalIF":5.3,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144148162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Computational Intelligence Society Information IEEE计算智能学会信息
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-27 DOI: 10.1109/TETCI.2025.3568244
{"title":"IEEE Computational Intelligence Society Information","authors":"","doi":"10.1109/TETCI.2025.3568244","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3568244","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 3","pages":"C4-C4"},"PeriodicalIF":5.3,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11016138","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144148134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Emerging Topics in Computational Intelligence Publication Information IEEE计算智能新兴主题汇刊
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-27 DOI: 10.1109/TETCI.2025.3568240
{"title":"IEEE Transactions on Emerging Topics in Computational Intelligence Publication Information","authors":"","doi":"10.1109/TETCI.2025.3568240","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3568240","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 3","pages":"C2-C2"},"PeriodicalIF":5.3,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11016149","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144148123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AdptGL-CA: Adaptive Global-Local Metric Fusion With Contrastive Attention for Few-Shot Learning AdptGL-CA:基于对比注意的自适应全局-局部度量融合算法
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-27 DOI: 10.1109/TETCI.2025.3550529
Zhiying Song;Pengfei Wang;Xiaokang Wang;Nenggan Zheng
Few-shot learning (FSL) aims to learn novel concepts with very limited labeled data. The popular FSL methods typically rely on metric learning to measure image similarity in a learned feature space. However, existing approaches often overlook the synergy between the similarity metric and feature representation, and fail to fully exploit the combination of global and local features for effective similarity measurement. In this work, we propose a novel FSL method, AdptGL-CA, which adaptively uses global and local features to boost the discrimination capability of similarity metric, while improving feature representation and generalization through attention mechanism and contrastive learning, respectively. Specifically, we design a learnable adaptive fusion strategy that uses global similarity to represent task-specific status to adaptively determine the fusion weight of local similarity, thus effectively fusing the dual similarities for better classification. Besides, the salient parts of features are highlighted using channel and spatial attentions to improve feature representation while adjusting the importance of local descriptors. As the input to the similarity metric, these more informative features further boost its discriminative ability. Moreover, a contrastive learning loss is introduced to overcome the potential overfit to base classes and learn more generic features. Additionally, we extend the PAC-Bayes-Bernstein bound to FSL setting, introducing a theoretically grounded measure for assessing generalization. Theoretical analysis validates the generalization improvement of AdptGL-CA. Comprehensive experiments indicate that AdptGL-CA achieves competitive performance with few extra parameters on multiple standard and fine-grained few-shot benchmarks, showing the effectiveness.
少射学习(FSL)旨在通过非常有限的标记数据学习新概念。流行的FSL方法通常依赖于度量学习来测量学习到的特征空间中的图像相似性。然而,现有的方法往往忽略了相似度度量与特征表示之间的协同作用,未能充分利用全局特征与局部特征的结合来进行有效的相似度度量。在本文中,我们提出了一种新的FSL方法,AdptGL-CA,该方法自适应地使用全局和局部特征来增强相似性度量的识别能力,同时分别通过注意机制和对比学习来提高特征表征和泛化能力。具体而言,我们设计了一种可学习的自适应融合策略,以全局相似度表示任务特定状态,自适应确定局部相似度的融合权重,从而有效地融合双相似度,以获得更好的分类效果。此外,在调整局部描述符的重要性的同时,利用通道和空间关注来突出特征的显著部分,以提高特征的表示。作为相似性度量的输入,这些信息量更大的特征进一步增强了相似性度量的判别能力。此外,引入了对比学习损失来克服潜在的对基类的过拟合,并学习更多的通用特征。此外,我们将PAC-Bayes-Bernstein绑定扩展到FSL设置,引入了一个理论上有根据的评估泛化的度量。理论分析验证了AdptGL-CA的泛化改进。综合实验表明,AdptGL-CA在多标准和细粒度的少次基准测试中,以较少的额外参数获得了具有竞争力的性能,显示了有效性。
{"title":"AdptGL-CA: Adaptive Global-Local Metric Fusion With Contrastive Attention for Few-Shot Learning","authors":"Zhiying Song;Pengfei Wang;Xiaokang Wang;Nenggan Zheng","doi":"10.1109/TETCI.2025.3550529","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3550529","url":null,"abstract":"Few-shot learning (FSL) aims to learn novel concepts with very limited labeled data. The popular FSL methods typically rely on metric learning to measure image similarity in a learned feature space. However, existing approaches often overlook the synergy between the similarity metric and feature representation, and fail to fully exploit the combination of global and local features for effective similarity measurement. In this work, we propose a novel FSL method, AdptGL-CA, which adaptively uses global and local features to boost the discrimination capability of similarity metric, while improving feature representation and generalization through attention mechanism and contrastive learning, respectively. Specifically, we design a learnable adaptive fusion strategy that uses global similarity to represent task-specific status to adaptively determine the fusion weight of local similarity, thus effectively fusing the dual similarities for better classification. Besides, the salient parts of features are highlighted using channel and spatial attentions to improve feature representation while adjusting the importance of local descriptors. As the input to the similarity metric, these more informative features further boost its discriminative ability. Moreover, a contrastive learning loss is introduced to overcome the potential overfit to base classes and learn more generic features. Additionally, we extend the PAC-Bayes-Bernstein bound to FSL setting, introducing a theoretically grounded measure for assessing generalization. Theoretical analysis validates the generalization improvement of AdptGL-CA. Comprehensive experiments indicate that AdptGL-CA achieves competitive performance with few extra parameters on multiple standard and fine-grained few-shot benchmarks, showing the effectiveness.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 5","pages":"3598-3613"},"PeriodicalIF":5.3,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145128457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors IEEE计算智能信息新主题汇刊
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-27 DOI: 10.1109/TETCI.2025.3568242
{"title":"IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors","authors":"","doi":"10.1109/TETCI.2025.3568242","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3568242","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 3","pages":"C3-C3"},"PeriodicalIF":5.3,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11016135","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144148081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Emerging Topics in Computational Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1