首页 > 最新文献

IEEE Transactions on Emerging Topics in Computational Intelligence最新文献

英文 中文
Trustworthy Federated Fine-Tuning for Industrial Chains Demand Forecasting 面向产业链需求预测的可信联邦微调
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-13 DOI: 10.1109/TETCI.2025.3537941
Guoquan Huang;Guanyu Lin;Li Ning;Yicheng Xu;Chee Peng Lim;Yong Zhang
Demand forecasting is crucial for the robust development of industrial chains, given the direct impact of consumer market volatility on production planning. However, in the intricate industrial chain environment, limited accessible data from independent production entities poses challenges in achieving high performances and precise predictions for future demand. Centralized training using machine learning modeling on data from multiple production entities is a potential solution, yet issues like consumer privacy, industry competition, and data security hinder practical machine learning implementation. This research introduces an innovative distributed learning approach, utilizing privacy-preserving federated learning techniques to enhance time-series demand forecasting for multiple entities pertaining to industrial chains. Our approach involves several key steps, including federated learning among entities in the industrial chain on a blockchain platform, ensuring the trustworthiness of the computation process and results. Leveraging Pre-training Models (PTMs) facilitates federated fine-tuning among production entities, addressing model heterogeneity and minimizing privacy breach risks. A comprehensive comparison study on various federated learning demand forecasting models on data from two real-world industry chains demonstrates the superior performance and enhanced security of our developed approach.
考虑到消费市场波动对生产计划的直接影响,需求预测对产业链的强劲发展至关重要。然而,在复杂的产业链环境中,来自独立生产实体的有限可访问数据对实现高性能和对未来需求的精确预测构成了挑战。使用机器学习建模对来自多个生产实体的数据进行集中训练是一种潜在的解决方案,但消费者隐私、行业竞争和数据安全等问题阻碍了实际机器学习的实施。本研究引入了一种创新的分布式学习方法,利用保护隐私的联邦学习技术来增强与产业链相关的多个实体的时间序列需求预测。我们的方法涉及几个关键步骤,包括区块链平台上产业链中实体之间的联合学习,确保计算过程和结果的可信度。利用预训练模型(ptm)促进了生产实体之间的联合微调,解决了模型的异构性,并将隐私泄露风险降至最低。对来自两个真实产业链数据的各种联邦学习需求预测模型的综合比较研究表明,我们开发的方法具有优越的性能和增强的安全性。
{"title":"Trustworthy Federated Fine-Tuning for Industrial Chains Demand Forecasting","authors":"Guoquan Huang;Guanyu Lin;Li Ning;Yicheng Xu;Chee Peng Lim;Yong Zhang","doi":"10.1109/TETCI.2025.3537941","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3537941","url":null,"abstract":"Demand forecasting is crucial for the robust development of industrial chains, given the direct impact of consumer market volatility on production planning. However, in the intricate industrial chain environment, limited accessible data from independent production entities poses challenges in achieving high performances and precise predictions for future demand. Centralized training using machine learning modeling on data from multiple production entities is a potential solution, yet issues like consumer privacy, industry competition, and data security hinder practical machine learning implementation. This research introduces an innovative distributed learning approach, utilizing privacy-preserving federated learning techniques to enhance time-series demand forecasting for multiple entities pertaining to industrial chains. Our approach involves several key steps, including federated learning among entities in the industrial chain on a blockchain platform, ensuring the trustworthiness of the computation process and results. Leveraging Pre-training Models (PTMs) facilitates federated fine-tuning among production entities, addressing model heterogeneity and minimizing privacy breach risks. A comprehensive comparison study on various federated learning demand forecasting models on data from two real-world industry chains demonstrates the superior performance and enhanced security of our developed approach.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1441-1453"},"PeriodicalIF":5.3,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143706688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Motion Expressions Guided Video Segmentation via Effective Motion Information Mining 基于有效运动信息挖掘的运动表达式指导视频分割
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-13 DOI: 10.1109/TETCI.2025.3537936
Ge Li;Hanqing Sun;Aiping Yang;Jiale Cao;Yanwei Pang
Motion expressions guided video segmentation is aimed to segment objects in videos according to the given language descriptions about object motion. To accurately segment moving objects across frames, it is important to capture motion information of objects within the entire video. However, the existing method fails to encode object motion information accurately. In this paper, we propose an effective motion information mining framework to improve motion expressions guided video segmentation, named EMIM. It consists of two novel modules, including a hierarchical motion aggregation module and a box-level positional encoding module. Specifically, the hierarchical motion aggregation module is aimed to capture local and global temporal information of objects within a video. To achieve this goal, we introduce local-window self-attention and selective state space models for short-term and long-term feature aggregation. Inspired by that the spatial changes of objects can effectively reflect the object motion across frames, the box-level positional encoding module integrates object spatial information into object embeddings. With two proposed modules, our proposed method can capture object spatial changes with temporal evolution. We conduct the extensive experiments on motion expressions guided video segmentation dataset MeViS to reveal the advantages of our EMIM. Our proposed EMIM achieves a $ mathcal {J & F}$ score of 42.2%, outperforming the prior approach, LMPM, by 5.0%.
运动表达式引导视频分割的目的是根据给定的物体运动语言描述对视频中的物体进行分割。为了准确地分割跨帧的运动物体,捕获整个视频中物体的运动信息是很重要的。然而,现有的方法不能准确地编码物体运动信息。在本文中,我们提出了一个有效的运动信息挖掘框架来改进运动表达式引导的视频分割,称为EMIM。它由两个新颖的模块组成,包括层次运动聚合模块和盒级位置编码模块。具体而言,分层运动聚合模块旨在捕获视频中对象的局部和全局时间信息。为了实现这一目标,我们引入了局部窗口自关注和选择状态空间模型,用于短期和长期特征聚合。受物体空间变化能有效反映物体跨帧运动的启发,盒级位置编码模块将物体空间信息整合到物体嵌入中。该方法采用两个模块,可以捕捉到随时间变化的物体空间变化。我们在运动表达式引导的视频分割数据集mei上进行了大量的实验,以揭示我们的EMIM的优势。我们提出的EMIM实现了42.2%的$ mathcal {J & F}$得分,比之前的方法LMPM高出5.0%。
{"title":"Motion Expressions Guided Video Segmentation via Effective Motion Information Mining","authors":"Ge Li;Hanqing Sun;Aiping Yang;Jiale Cao;Yanwei Pang","doi":"10.1109/TETCI.2025.3537936","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3537936","url":null,"abstract":"Motion expressions guided video segmentation is aimed to segment objects in videos according to the given language descriptions about object motion. To accurately segment moving objects across frames, it is important to capture motion information of objects within the entire video. However, the existing method fails to encode object motion information accurately. In this paper, we propose an effective motion information mining framework to improve motion expressions guided video segmentation, named EMIM. It consists of two novel modules, including a hierarchical motion aggregation module and a box-level positional encoding module. Specifically, the hierarchical motion aggregation module is aimed to capture local and global temporal information of objects within a video. To achieve this goal, we introduce local-window self-attention and selective state space models for short-term and long-term feature aggregation. Inspired by that the spatial changes of objects can effectively reflect the object motion across frames, the box-level positional encoding module integrates object spatial information into object embeddings. With two proposed modules, our proposed method can capture object spatial changes with temporal evolution. We conduct the extensive experiments on motion expressions guided video segmentation dataset MeViS to reveal the advantages of our EMIM. Our proposed EMIM achieves a <inline-formula><tex-math>$ mathcal {J &amp; F}$</tex-math></inline-formula> score of 42.2%, outperforming the prior approach, LMPM, by 5.0%.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 5","pages":"3712-3718"},"PeriodicalIF":5.3,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Adversarial Frontier: Quantifying Robustness via Adversarial Hypervolume 探索对抗性前沿:通过对抗性超容量量化稳健性
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-13 DOI: 10.1109/TETCI.2025.3535656
Ping Guo;Cheng Gong;Xi Lin;Zhiyuan Yang;Qingfu Zhang
The escalating threat of adversarial attacks on deep learning models, particularly in security-critical fields, has highlighted the need for robust deep learning systems. Conventional evaluation methods of their robustness rely on adversarial accuracy, which measures the model performance under a specific perturbation intensity. However, this singular metric does not fully encapsulate the overall resilience of a model against varying degrees of perturbation. To address this issue, we propose a new metric termed as the adversarial hypervolume for assessing the robustness of deep learning models comprehensively over a range of perturbation intensities from a multi-objective optimization standpoint. This metric allows for an in-depth comparison of defense mechanisms and recognizes the trivial improvements in robustness brought by less potent defensive strategies. We adopt a novel training algorithm to enhance adversarial robustness uniformly across various perturbation intensities, instead of only optimizing adversarial accuracy. Our experiments validate the effectiveness of the adversarial hypervolume metric in robustness evaluation, demonstrating its ability to reveal subtle differences in robustness that adversarial accuracy overlooks.
深度学习模型的对抗性攻击威胁不断升级,特别是在安全关键领域,这凸显了对强大深度学习系统的需求。传统的鲁棒性评估方法依赖于对抗精度,它衡量模型在特定扰动强度下的性能。然而,这个单一度量并不能完全概括模型对不同程度扰动的整体弹性。为了解决这个问题,我们提出了一种新的度量,称为对抗超体积,用于从多目标优化的角度全面评估深度学习模型在一系列扰动强度上的鲁棒性。这个度量允许对防御机制进行深入的比较,并认识到由较弱的防御策略带来的鲁棒性的微不足道的改进。我们采用了一种新的训练算法来增强对抗鲁棒性,而不仅仅是优化对抗精度。我们的实验验证了对抗性超大体积度量在鲁棒性评估中的有效性,证明了它能够揭示对抗性准确性忽略的鲁棒性的细微差异。
{"title":"Exploring the Adversarial Frontier: Quantifying Robustness via Adversarial Hypervolume","authors":"Ping Guo;Cheng Gong;Xi Lin;Zhiyuan Yang;Qingfu Zhang","doi":"10.1109/TETCI.2025.3535656","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3535656","url":null,"abstract":"The escalating threat of adversarial attacks on deep learning models, particularly in security-critical fields, has highlighted the need for robust deep learning systems. Conventional evaluation methods of their robustness rely on adversarial accuracy, which measures the model performance under a specific perturbation intensity. However, this singular metric does not fully encapsulate the overall resilience of a model against varying degrees of perturbation. To address this issue, we propose a new metric termed as the adversarial hypervolume for assessing the robustness of deep learning models comprehensively over a range of perturbation intensities from a multi-objective optimization standpoint. This metric allows for an in-depth comparison of defense mechanisms and recognizes the trivial improvements in robustness brought by less potent defensive strategies. We adopt a novel training algorithm to enhance adversarial robustness uniformly across various perturbation intensities, instead of only optimizing adversarial accuracy. Our experiments validate the effectiveness of the adversarial hypervolume metric in robustness evaluation, demonstrating its ability to reveal subtle differences in robustness that adversarial accuracy overlooks.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1367-1378"},"PeriodicalIF":5.3,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Class Discriminative Knowledge Distillation 类判别知识蒸馏
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-29 DOI: 10.1109/TETCI.2025.3529896
Shuoxi Zhang;Hanpeng Liu;Yuyi Wang;Kun He;Jun Lin;Yang Zeng
Knowledge distillation aims to transfer knowledge from a large teacher model to a lightweight student model, enabling the student to achieve performance comparable to the teacher. Existing methods explore various strategies for distillation, including soft logits, intermediate features, and even class-aware logits. Class-aware distillation, in particular, treats the columns of logit matrices as class representations, capturing potential relationships among instances within a batch. However, we argue that representing class embeddings solely as column vectors may not fully capture their inherent properties. In this study, we revisit class-aware knowledge distillation and propose that effective transfer of class-level knowledge requires two regularization strategies: separability and orthogonality. Additionally, we introduce an asymmetric architecture design to further enhance the transfer of class-level knowledge. Together, these components form a new methodology, Class Discriminative Knowledge Distillation (CD-KD). Empirical results demonstrate that CD-KD significantly outperforms several state-of-the-art logit-based and feature-based methods across diverse visual classification tasks, highlighting its effectiveness and robustness.
知识蒸馏的目的是将知识从一个庞大的教师模型转移到一个轻量级的学生模型,使学生达到与教师相当的表现。现有的方法探索了各种蒸馏策略,包括软逻辑、中间特征,甚至是类感知逻辑。特别是类感知蒸馏,它将logit矩阵的列视为类表示,捕获批处理中实例之间的潜在关系。然而,我们认为仅将类嵌入表示为列向量可能无法完全捕获其固有属性。在本研究中,我们重新审视了类感知知识的提炼,并提出类级知识的有效转移需要两种正则化策略:可分性和正交性。此外,我们引入了非对称架构设计,以进一步增强类级知识的转移。总之,这些组成部分形成了一种新的方法,类判别知识蒸馏(CD-KD)。实证结果表明,在不同的视觉分类任务中,CD-KD显著优于几种最先进的基于逻辑和基于特征的方法,突出了其有效性和鲁棒性。
{"title":"Class Discriminative Knowledge Distillation","authors":"Shuoxi Zhang;Hanpeng Liu;Yuyi Wang;Kun He;Jun Lin;Yang Zeng","doi":"10.1109/TETCI.2025.3529896","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3529896","url":null,"abstract":"Knowledge distillation aims to transfer knowledge from a large teacher model to a lightweight student model, enabling the student to achieve performance comparable to the teacher. Existing methods explore various strategies for distillation, including soft logits, intermediate features, and even class-aware logits. Class-aware distillation, in particular, treats the columns of logit matrices as class representations, capturing potential relationships among instances within a batch. However, we argue that representing class embeddings solely as column vectors may not fully capture their inherent properties. In this study, we revisit class-aware knowledge distillation and propose that effective transfer of class-level knowledge requires two regularization strategies: <italic>separability</i> and <italic>orthogonality</i>. Additionally, we introduce an asymmetric architecture design to further enhance the transfer of class-level knowledge. Together, these components form a new methodology, Class Discriminative Knowledge Distillation (CD-KD). Empirical results demonstrate that CD-KD significantly outperforms several state-of-the-art logit-based and feature-based methods across diverse visual classification tasks, highlighting its effectiveness and robustness.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1340-1351"},"PeriodicalIF":5.3,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IDET: Iterative Difference-Enhanced Transformers for High-Quality Change Detection 用于高质量变化检测的迭代差分增强变压器
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-28 DOI: 10.1109/TETCI.2025.3529893
Qing Guo;Ruofei Wang;Rui Huang;Renjie Wan;Shuifa Sun;Yuxiang Zhang
Change detection (CD) is a crucial task in various real-world applications, aiming to identify regions of change between two images captured at different times. However, existing approaches mainly focus on designing advanced network architectures that map feature differences to change maps, overlooking the impact of feature difference quality. In this paper, we approach CD from a different perspective by exploring how to optimize feature differences to effectively highlight changes and suppress background regions. To achieve this, we propose a novel module called the iterative difference-enhanced transformers (IDET). IDET consists of three transformers: two for extracting long-range information from the bi-temporal images, and one for enhancing the feature difference. Unlike previous transformers, the third transformer utilizes the outputs of the first two transformers to guide iterative and dynamic enhancement of the feature difference. To further enhance refinement, we introduce the multi-scale IDET-based change detection approach, which utilizes multi-scale representations of the images to refine the feature difference at multiple scales. Additionally, we propose a coarse-to-fine fusion strategy to combine all refinements. Our final CD method surpasses nine state-of-the-art methods on six large-scale datasets across different application scenarios. This highlights the significance of feature difference enhancement and demonstrates the effectiveness of IDET. Furthermore, we demonstrate that our IDET can be seamlessly integrated into other existing CD methods, resulting in a substantial improvement in detection accuracy.
变化检测(CD)是各种实际应用中的一项关键任务,旨在识别在不同时间捕获的两幅图像之间的变化区域。然而,现有方法主要侧重于设计高级网络架构,通过映射特征差异来改变地图,忽略了特征差异质量的影响。在本文中,我们从不同的角度来研究CD,探索如何优化特征差异来有效地突出变化和抑制背景区域。为了实现这一目标,我们提出了一种新的模块,称为迭代差分增强变压器(IDET)。IDET由三个变换组成:两个用于提取双时相图像的远程信息,一个用于增强特征差异。与之前的变压器不同,第三个变压器利用前两个变压器的输出来指导特征差的迭代和动态增强。为了进一步提高精细化,我们引入了基于多尺度idet的变化检测方法,该方法利用图像的多尺度表示来细化多尺度的特征差异。此外,我们提出了一个从粗到精的融合策略来结合所有的精细化。我们最终的CD方法在不同应用场景的六个大规模数据集上超过了九种最先进的方法。这凸显了特征差异增强的重要性,也证明了IDET的有效性。此外,我们证明我们的IDET可以无缝集成到其他现有的CD方法中,从而大大提高了检测精度。
{"title":"IDET: Iterative Difference-Enhanced Transformers for High-Quality Change Detection","authors":"Qing Guo;Ruofei Wang;Rui Huang;Renjie Wan;Shuifa Sun;Yuxiang Zhang","doi":"10.1109/TETCI.2025.3529893","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3529893","url":null,"abstract":"Change detection (CD) is a crucial task in various real-world applications, aiming to identify regions of change between two images captured at different times. However, existing approaches mainly focus on designing advanced network architectures that map feature differences to change maps, overlooking the impact of feature difference quality. In this paper, we approach CD from a different perspective by exploring <italic>how to optimize feature differences to effectively highlight changes and suppress background regions</i>. To achieve this, we propose a novel module called the iterative difference-enhanced transformers (IDET). IDET consists of three transformers: two for extracting long-range information from the bi-temporal images, and one for enhancing the feature difference. Unlike previous transformers, the third transformer utilizes the outputs of the first two transformers to guide iterative and dynamic enhancement of the feature difference. To further enhance refinement, we introduce the multi-scale IDET-based change detection approach, which utilizes multi-scale representations of the images to refine the feature difference at multiple scales. Additionally, we propose a coarse-to-fine fusion strategy to combine all refinements. Our final CD method surpasses nine state-of-the-art methods on six large-scale datasets across different application scenarios. This highlights the significance of feature difference enhancement and demonstrates the effectiveness of IDET. Furthermore, we demonstrate that our IDET can be seamlessly integrated into other existing CD methods, resulting in a substantial improvement in detection accuracy.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1093-1106"},"PeriodicalIF":5.3,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MENTOR: Guiding Hierarchical Reinforcement Learning With Human Feedback and Dynamic Distance Constraint 指导层次强化学习与人的反馈和动态距离约束
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-28 DOI: 10.1109/TETCI.2025.3529902
Xinglin Zhou;Yifu Yuan;Shaofu Yang;Jianye Hao
Hierarchical reinforcement learning (HRL) provides a promising solution for complex tasks with sparse rewards of agents, which uses a hierarchical framework that divides tasks into subgoals and completes them sequentially. However, current methods struggle to find suitable subgoals for ensuring a stable learning process. To address the issue, we propose a general hierarchical reinforcement learning framework incorporating human feedback and dynamic distance constraints, termed MENTOR, which acts as a “mentor”. Specifically, human feedback is incorporated into high-level policy learning to find better subgoals. Furthermore, we propose the Dynamic Distance Constraint (DDC) mechanism dynamically adjusting the space of optional subgoals, such that MENTOR can generate subgoals matching the low-level policy learning process from easy to hard. As a result, the learning efficiency can be improved. As for low-level policy, a dual policy is designed for exploration-exploitation decoupling to stabilize the training process. Extensive experiments demonstrate that MENTOR uses a small amount of human feedback to achieve significant improvement in complex tasks with sparse rewards.
分层强化学习(HRL)为具有稀疏代理奖励的复杂任务提供了一种有前途的解决方案,它使用分层框架将任务划分为子目标并依次完成它们。然而,目前的方法很难找到合适的子目标来确保稳定的学习过程。为了解决这个问题,我们提出了一个包含人类反馈和动态距离约束的通用分层强化学习框架,称为MENTOR,它充当“导师”。具体来说,人类反馈被纳入高层次的政策学习,以找到更好的子目标。在此基础上,提出了动态距离约束(DDC)机制,动态调整可选子目标的空间,使MENTOR从易到难生成与低级策略学习过程相匹配的子目标。这样可以提高学习效率。在低级策略方面,设计了双重策略进行探索-开发解耦,以稳定训练过程。大量的实验表明,MENTOR使用少量的人工反馈,在奖励稀疏的复杂任务中取得了显著的改进。
{"title":"MENTOR: Guiding Hierarchical Reinforcement Learning With Human Feedback and Dynamic Distance Constraint","authors":"Xinglin Zhou;Yifu Yuan;Shaofu Yang;Jianye Hao","doi":"10.1109/TETCI.2025.3529902","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3529902","url":null,"abstract":"Hierarchical reinforcement learning (HRL) provides a promising solution for complex tasks with sparse rewards of agents, which uses a hierarchical framework that divides tasks into subgoals and completes them sequentially. However, current methods struggle to find suitable subgoals for ensuring a stable learning process. To address the issue, we propose a general hierarchical reinforcement learning framework incorporating human feedback and dynamic distance constraints, termed <bold>MENTOR</b>, which acts as a “<italic>mentor</i>”. Specifically, human feedback is incorporated into high-level policy learning to find better subgoals. Furthermore, we propose the Dynamic Distance Constraint (DDC) mechanism dynamically adjusting the space of optional subgoals, such that MENTOR can generate subgoals matching the low-level policy learning process from easy to hard. As a result, the learning efficiency can be improved. As for low-level policy, a dual policy is designed for exploration-exploitation decoupling to stabilize the training process. Extensive experiments demonstrate that MENTOR uses a small amount of human feedback to achieve significant improvement in complex tasks with sparse rewards.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1292-1306"},"PeriodicalIF":5.3,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Many-Objective Diversity-Guided Differential Evolution Algorithm for Multi-Label Feature Selection in High-Dimensional Datasets 高维数据集多标签特征选择的多目标多样性导向差分进化算法
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-27 DOI: 10.1109/TETCI.2025.3529840
Emrah Hancer;Bing Xue;Mengjie Zhang
Multi-label classification (MLC) is crucial as it allows for a more nuanced and realistic representation of complex real-world scenarios, where instances may belong to multiple categories simultaneously, providing a comprehensive understanding of the data. Effective feature selection in MLC is paramount as it cannot only enhance model efficiency and interpretability but also mitigate the curse of dimensionality, ensuring more accurate and streamlined predictions for complex, multi-label data. Despite the proven efficacy of evolutionary computation (EC) techniques in enhancing feature selection for multi-label datasets, research on feature selection in MLC remains sparse in the domain of multi- and many-objective optimization. This paper proposes a many-objective differential evolution algorithm called MODivDE for feature selection in high-dimensional MLC tasks. The MODivDE algorithm involves multiple improvements and innovations in quality indicator-based selection, logic-based search strategy, and diversity-based archive update. The results demonstrate the exceptional performance of the MODivDE algorithm across a diverse range of high-dimensional datasets, surpassing recently introduced many-objective and conventional multi-label feature selection algorithms. The advancements in MODivDE collectively contribute to significantly improved accuracy, efficiency, and interpretability compared to state-of-the-art methods in the realm of multi-label feature selection.
多标签分类(MLC)至关重要,因为它允许对复杂的现实场景进行更细致和更现实的表示,其中实例可能同时属于多个类别,从而提供对数据的全面理解。在MLC中,有效的特征选择是至关重要的,因为它不仅可以提高模型效率和可解释性,还可以减轻维度的诅咒,确保对复杂的多标签数据进行更准确和精简的预测。尽管进化计算(EC)技术在增强多标签数据集的特征选择方面已经被证明是有效的,但在多目标优化领域,关于多标签数据集特征选择的研究仍然很少。针对高维MLC任务的特征选择问题,提出了一种多目标差分进化算法MODivDE。MODivDE算法在基于质量指标的选择、基于逻辑的搜索策略和基于多样性的存档更新等方面进行了多项改进和创新。结果表明,MODivDE算法在各种高维数据集上的卓越性能,超过了最近引入的多目标和传统的多标签特征选择算法。与多标签特征选择领域中最先进的方法相比,MODivDE的进步共同有助于显著提高准确性、效率和可解释性。
{"title":"A Many-Objective Diversity-Guided Differential Evolution Algorithm for Multi-Label Feature Selection in High-Dimensional Datasets","authors":"Emrah Hancer;Bing Xue;Mengjie Zhang","doi":"10.1109/TETCI.2025.3529840","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3529840","url":null,"abstract":"Multi-label classification (MLC) is crucial as it allows for a more nuanced and realistic representation of complex real-world scenarios, where instances may belong to multiple categories simultaneously, providing a comprehensive understanding of the data. Effective feature selection in MLC is paramount as it cannot only enhance model efficiency and interpretability but also mitigate the curse of dimensionality, ensuring more accurate and streamlined predictions for complex, multi-label data. Despite the proven efficacy of evolutionary computation (EC) techniques in enhancing feature selection for multi-label datasets, research on feature selection in MLC remains sparse in the domain of multi- and many-objective optimization. This paper proposes a many-objective differential evolution algorithm called MODivDE for feature selection in high-dimensional MLC tasks. The MODivDE algorithm involves multiple improvements and innovations in quality indicator-based selection, logic-based search strategy, and diversity-based archive update. The results demonstrate the exceptional performance of the MODivDE algorithm across a diverse range of high-dimensional datasets, surpassing recently introduced many-objective and conventional multi-label feature selection algorithms. The advancements in MODivDE collectively contribute to significantly improved accuracy, efficiency, and interpretability compared to state-of-the-art methods in the realm of multi-label feature selection.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"1226-1237"},"PeriodicalIF":5.3,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aspect-Aware Graph Interaction Attention Network for Aspect Category Sentiment Analysis 面向方面类别情感分析的方面感知图交互注意网络
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-27 DOI: 10.1109/TETCI.2025.3526285
Pengfei Yu;Jingjing Gu;Dechang Pi;Qiang Zhou;Qiuhong Wang
This paper explores an implicit Aspect Category Sentiment Analysis task, which aims to determine the sentiment polarities of given aspect categories in social reviews. Currently, most researchers focus more on explicit aspect and rarely work on implicit aspect. Meanwhile, due to the semantic complexity of natural language, it is difficult for existing methods to retrieve such implicit semantics in sentences. To this end, we propose a novel framework, the Aspect-aware Graph Interaction Attention Network (AGIAN), which concentrates on aspect-related information implicitly in sentences and identifies its corresponding sentiment polarity. Specifically, first, we introduce an aspect-aware graph to represent potential associations between the implicit aspect category and the sentence. Then, we utilize two types of graph neural networks to extract rich relational semantics. Finally, we design a graph interaction mechanism to integrate sentiment features specific to the aspect category for sentiment classification. We evaluate the performance of the proposed framework on six publicly available benchmark datasets. Extensive experiments demonstrate that, compared to some competitive baseline methods, AGIAN can effectively improve accuracy and achieve state-of-the-art performance on the F1-score.
本文探讨了一个隐式方面类别情感分析任务,旨在确定社会评论中给定方面类别的情感极性。目前的研究大多集中在外显方面,对内隐方面的研究较少。同时,由于自然语言的语义复杂性,现有的方法很难检索到句子中的隐含语义。为此,我们提出了一个新的框架——方面感知图交互注意网络(AGIAN),它集中于句子中隐含的方面相关信息,并识别其相应的情感极性。具体来说,首先,我们引入了一个方面感知图来表示隐含方面类别和句子之间的潜在关联。然后,利用两种类型的图神经网络提取丰富的关系语义。最后,我们设计了一种图形交互机制来整合面向方面类别的情感特征进行情感分类。我们在六个公开可用的基准数据集上评估了所提出框架的性能。大量的实验表明,与一些有竞争力的基线方法相比,AGIAN可以有效地提高准确性,并在f1得分上达到最先进的水平。
{"title":"Aspect-Aware Graph Interaction Attention Network for Aspect Category Sentiment Analysis","authors":"Pengfei Yu;Jingjing Gu;Dechang Pi;Qiang Zhou;Qiuhong Wang","doi":"10.1109/TETCI.2025.3526285","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3526285","url":null,"abstract":"This paper explores an implicit Aspect Category Sentiment Analysis task, which aims to determine the sentiment polarities of given aspect categories in social reviews. Currently, most researchers focus more on explicit aspect and rarely work on implicit aspect. Meanwhile, due to the semantic complexity of natural language, it is difficult for existing methods to retrieve such implicit semantics in sentences. To this end, we propose a novel framework, the Aspect-aware Graph Interaction Attention Network (AGIAN), which concentrates on aspect-related information implicitly in sentences and identifies its corresponding sentiment polarity. Specifically, first, we introduce an aspect-aware graph to represent potential associations between the implicit aspect category and the sentence. Then, we utilize two types of graph neural networks to extract rich relational semantics. Finally, we design a graph interaction mechanism to integrate sentiment features specific to the aspect category for sentiment classification. We evaluate the performance of the proposed framework on six publicly available benchmark datasets. Extensive experiments demonstrate that, compared to some competitive baseline methods, AGIAN can effectively improve accuracy and achieve state-of-the-art performance on the F1-score.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 4","pages":"3122-3135"},"PeriodicalIF":5.3,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144687778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Computational Intelligence Society Information IEEE计算智能学会信息
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-23 DOI: 10.1109/TETCI.2025.3529608
{"title":"IEEE Computational Intelligence Society Information","authors":"","doi":"10.1109/TETCI.2025.3529608","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3529608","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 1","pages":"C3-C3"},"PeriodicalIF":5.3,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10850899","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143360995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors IEEE计算智能信息新主题汇刊
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-01-23 DOI: 10.1109/TETCI.2025.3529610
{"title":"IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors","authors":"","doi":"10.1109/TETCI.2025.3529610","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3529610","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 1","pages":"C4-C4"},"PeriodicalIF":5.3,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10850888","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143361033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Emerging Topics in Computational Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1