首页 > 最新文献

IEEE Transactions on Cybernetics最新文献

英文 中文
VTFusion: A Vision-Text Multimodal Fusion Network for Few-Shot Anomaly Detection. VTFusion:一种用于少量异常检测的视觉-文本多模态融合网络。
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-21 DOI: 10.1109/tcyb.2026.3651630
Yuxin Jiang,Yunkang Cao,Yuqi Cheng,Yiheng Zhang,Weiming Shen
Few-shot anomaly detection (FSAD) has emerged as a critical paradigm for identifying irregularities using scarce normal references. While recent methods have integrated textual semantics to complement visual data, they predominantly rely on features pretrained on natural scenes, thereby neglecting the granular, domain-specific semantics essential for industrial inspection. Furthermore, prevalent fusion strategies often resort to superficial concatenation, failing to address the inherent semantic misalignment between visual and textual modalities, which compromises robustness against cross-modal interference. To bridge these gaps, this study proposes VTFusion, a vision-text multimodal fusion framework tailored for FSAD. The framework rests on two core designs. First, adaptive feature extractors for both image and text modalities are introduced to learn task-specific representations, bridging the domain gap between pretrained models and industrial data; this is further augmented by generating diverse synthetic anomalies to enhance feature discriminability. Second, a dedicated multimodal prediction fusion module is developed, comprising a fusion block that facilitates rich cross-modal information exchange and a segmentation network that generates refined pixel-level anomaly maps under multimodal guidance. VTFusion significantly advances FSAD performance, achieving image-level area under the receiver operating characteristics (AUROCs) of 96.8% and 86.2% in the 2-shot scenario on the MVTec AD and VisA datasets, respectively. Furthermore, VTFusion achieves an AUPRO of 93.5% on a real-world dataset of industrial automotive plastic parts introduced in this article, further demonstrating its practical applicability in demanding industrial scenarios.
少量异常检测(FSAD)已成为利用稀缺的正常参考来识别异常的关键范例。虽然最近的方法集成了文本语义来补充视觉数据,但它们主要依赖于对自然场景进行预训练的特征,从而忽略了工业检测所必需的颗粒化、特定领域的语义。此外,普遍的融合策略往往诉诸于表面的连接,未能解决视觉和文本模式之间固有的语义不一致,这损害了抗跨模态干扰的鲁棒性。为了弥补这些差距,本研究提出了VTFusion,这是一种为FSAD量身定制的视觉-文本多模态融合框架。该框架基于两个核心设计。首先,引入图像和文本模式的自适应特征提取器来学习特定于任务的表示,弥合了预训练模型和工业数据之间的领域差距;通过生成不同的合成异常来增强特征的可辨别性,进一步增强了这一点。其次,开发了专用的多模态预测融合模块,包括促进丰富的跨模态信息交换的融合块和在多模态指导下生成精细像素级异常图的分割网络。VTFusion显著提高了FSAD性能,在MVTec AD和VisA数据集的2次拍摄场景下,接收器工作特性(auroc)下的图像级面积分别达到96.8%和86.2%。此外,VTFusion在本文介绍的工业汽车塑料部件的真实数据集上实现了93.5%的AUPRO,进一步证明了其在苛刻的工业场景中的实际适用性。
{"title":"VTFusion: A Vision-Text Multimodal Fusion Network for Few-Shot Anomaly Detection.","authors":"Yuxin Jiang,Yunkang Cao,Yuqi Cheng,Yiheng Zhang,Weiming Shen","doi":"10.1109/tcyb.2026.3651630","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651630","url":null,"abstract":"Few-shot anomaly detection (FSAD) has emerged as a critical paradigm for identifying irregularities using scarce normal references. While recent methods have integrated textual semantics to complement visual data, they predominantly rely on features pretrained on natural scenes, thereby neglecting the granular, domain-specific semantics essential for industrial inspection. Furthermore, prevalent fusion strategies often resort to superficial concatenation, failing to address the inherent semantic misalignment between visual and textual modalities, which compromises robustness against cross-modal interference. To bridge these gaps, this study proposes VTFusion, a vision-text multimodal fusion framework tailored for FSAD. The framework rests on two core designs. First, adaptive feature extractors for both image and text modalities are introduced to learn task-specific representations, bridging the domain gap between pretrained models and industrial data; this is further augmented by generating diverse synthetic anomalies to enhance feature discriminability. Second, a dedicated multimodal prediction fusion module is developed, comprising a fusion block that facilitates rich cross-modal information exchange and a segmentation network that generates refined pixel-level anomaly maps under multimodal guidance. VTFusion significantly advances FSAD performance, achieving image-level area under the receiver operating characteristics (AUROCs) of 96.8% and 86.2% in the 2-shot scenario on the MVTec AD and VisA datasets, respectively. Furthermore, VTFusion achieves an AUPRO of 93.5% on a real-world dataset of industrial automotive plastic parts introduced in this article, further demonstrating its practical applicability in demanding industrial scenarios.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"263 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Filter Design and Optimization Framework for Enhancing Transient and Steady-State Tracking in Repetitive-Control Systems. 一种新的增强重复控制系统暂态和稳态跟踪的滤波器设计和优化框架。
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-21 DOI: 10.1109/tcyb.2026.3651712
Manli Zhang,Chengda Lu,Yibing Wang,Shengnan Tian,Min Wu,Makoto Iwasaki
A low-pass filter is essential for stabilizing strictly proper repetitive-control systems, but it inevitably degrades steady-state tracking accuracy due to gain attenuation and phase lag. This article presents a new filter design and optimization method that improves both transient response and steady-state accuracy in continuous-time repetitive-control systems. First, the gain and phase characteristics of conventional filter-based repetitive controllers are rigorously analyzed to reveal the relationship between filter parameters and tracking performance. Based on this analysis, a new filter structure is designed to precisely compensate for gain attenuation and phase delay, specifically at the fundamental frequency, by minimizing the error term without increasing the filter bandwidth. A guideline for selecting the filter parameters for varying periodic trajectories is also provided. In addition, according to the one-to-one mapping between control and learning behaviors and their respective gains, dual performance indices are constructed to account for tracking error and control effort across multiple learning cycles. A multiobjective optimization framework is then developed to directly tune these gains subject to stability constraints, achieving an optimal balance between rapid transient convergence and control energy efficiency. Experimental results validate the effectiveness and superiority of the design.
低通滤波器对于稳定严格适当的重复控制系统是必不可少的,但由于增益衰减和相位滞后,它不可避免地降低了稳态跟踪精度。本文提出了一种新的滤波器设计和优化方法,可以提高连续时间重复控制系统的暂态响应和稳态精度。首先,严格分析了传统的基于滤波器的重复控制器的增益和相位特性,揭示了滤波器参数与跟踪性能之间的关系。基于此分析,设计了一种新的滤波器结构,通过最小化误差项而不增加滤波器带宽,精确补偿增益衰减和相位延迟,特别是在基频处。本文还提供了针对不同周期轨迹选择滤波器参数的指导原则。此外,根据控制行为和学习行为之间的一对一映射关系及其各自的收益,构建了双性能指标来考虑多个学习周期的跟踪误差和控制努力。然后开发了一个多目标优化框架,直接调整这些增益受稳定性约束,实现快速瞬态收敛和控制能量效率之间的最佳平衡。实验结果验证了该设计的有效性和优越性。
{"title":"A New Filter Design and Optimization Framework for Enhancing Transient and Steady-State Tracking in Repetitive-Control Systems.","authors":"Manli Zhang,Chengda Lu,Yibing Wang,Shengnan Tian,Min Wu,Makoto Iwasaki","doi":"10.1109/tcyb.2026.3651712","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651712","url":null,"abstract":"A low-pass filter is essential for stabilizing strictly proper repetitive-control systems, but it inevitably degrades steady-state tracking accuracy due to gain attenuation and phase lag. This article presents a new filter design and optimization method that improves both transient response and steady-state accuracy in continuous-time repetitive-control systems. First, the gain and phase characteristics of conventional filter-based repetitive controllers are rigorously analyzed to reveal the relationship between filter parameters and tracking performance. Based on this analysis, a new filter structure is designed to precisely compensate for gain attenuation and phase delay, specifically at the fundamental frequency, by minimizing the error term without increasing the filter bandwidth. A guideline for selecting the filter parameters for varying periodic trajectories is also provided. In addition, according to the one-to-one mapping between control and learning behaviors and their respective gains, dual performance indices are constructed to account for tracking error and control effort across multiple learning cycles. A multiobjective optimization framework is then developed to directly tune these gains subject to stability constraints, achieving an optimal balance between rapid transient convergence and control energy efficiency. Experimental results validate the effectiveness and superiority of the design.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"35 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual Event-Triggered Polynomial Dynamic Output Control for Positive Fuzzy Systems via an IT2 Membership Function Relaxation Method. 基于IT2隶属函数松弛法的双事件触发多项式正模糊系统动态输出控制。
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-21 DOI: 10.1109/tcyb.2026.3651581
Xiaoxiao Wang,Zhiyong Bao,Xiaomiao Li,Hak-Keung Lam,Ziwei Wang
The co-design problem of dual event-triggered (DET) mechanism and polynomial dynamic output-feedback (PDOF) controller is investigated for positive polynomial fuzzy systems (PPFSs) with uncertainty and disturbance constraints. Specifically, a 1-norm DET mechanism compatible with the positivity of PPFSs is proposed to asynchronously update measurement outputs and PDOF control signals. However, synthesizing this DET-PDOF controller proves challenging due to the coupling of multiple unknown PDOF controller gain matrices within the positivity and stability conditions, which results in complex nonconvex terms. By introducing auxiliary variables and constraints, sufficient conditions for DET-PDOF controller solution are given to ensure both the $L_{1}$ -gain performance and strict positivity of PPFSs with uncertainty and disturbance. Moreover, existing stability analysis results that ignore membership functions (MFs) tend to be conservative, implying that the obtained DET-PDOF controller is effective only within a limited triggered threshold range, leading to worse transmission performance. Therefore, a multivariate optimization method based on an improved genetic algorithm (IGA), which accounts for the system states and PDOF controller variables, is developed to substantially expand the admissible DET threshold range while effectively suppressing dual-triggering frequencies. Finally, a numerical example and a two-linked tank system with parameter uncertainty are provided to validate the feasibility of the proposed scheme.
针对具有不确定性和干扰约束的正多项式模糊系统,研究了双事件触发(DET)机制与多项式动态输出反馈(PDOF)控制器的协同设计问题。具体而言,提出了一种与ppfs正性兼容的1范数DET机制,用于异步更新测量输出和PDOF控制信号。然而,由于在正稳定性条件下存在多个未知的PDOF控制器增益矩阵的耦合,从而导致了复杂的非凸项,因此,这种dt -PDOF控制器的合成具有挑战性。通过引入辅助变量和约束条件,给出了dt - pdof控制器解的充分条件,以保证具有不确定性和干扰的ppfs的L_ bb_0 $增益性能和严格正性。此外,现有的稳定性分析结果忽略了隶属函数(MFs),趋于保守,这意味着得到的dt - pdof控制器仅在有限的触发阈值范围内有效,导致传输性能变差。因此,提出了一种基于改进遗传算法(IGA)的多元优化方法,该方法考虑了系统状态和PDOF控制器变量,在有效抑制双触发频率的同时,大大扩大了可接受的DET阈值范围。最后,通过一个参数不确定的双连杆储罐系统的数值算例验证了该方法的可行性。
{"title":"Dual Event-Triggered Polynomial Dynamic Output Control for Positive Fuzzy Systems via an IT2 Membership Function Relaxation Method.","authors":"Xiaoxiao Wang,Zhiyong Bao,Xiaomiao Li,Hak-Keung Lam,Ziwei Wang","doi":"10.1109/tcyb.2026.3651581","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651581","url":null,"abstract":"The co-design problem of dual event-triggered (DET) mechanism and polynomial dynamic output-feedback (PDOF) controller is investigated for positive polynomial fuzzy systems (PPFSs) with uncertainty and disturbance constraints. Specifically, a 1-norm DET mechanism compatible with the positivity of PPFSs is proposed to asynchronously update measurement outputs and PDOF control signals. However, synthesizing this DET-PDOF controller proves challenging due to the coupling of multiple unknown PDOF controller gain matrices within the positivity and stability conditions, which results in complex nonconvex terms. By introducing auxiliary variables and constraints, sufficient conditions for DET-PDOF controller solution are given to ensure both the $L_{1}$ -gain performance and strict positivity of PPFSs with uncertainty and disturbance. Moreover, existing stability analysis results that ignore membership functions (MFs) tend to be conservative, implying that the obtained DET-PDOF controller is effective only within a limited triggered threshold range, leading to worse transmission performance. Therefore, a multivariate optimization method based on an improved genetic algorithm (IGA), which accounts for the system states and PDOF controller variables, is developed to substantially expand the admissible DET threshold range while effectively suppressing dual-triggering frequencies. Finally, a numerical example and a two-linked tank system with parameter uncertainty are provided to validate the feasibility of the proposed scheme.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"29 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146015300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolutionary Optimization-Based Design of LQG Controllers in Quantum Coherent Feedback 基于进化优化的量子相干反馈LQG控制器设计
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-19 DOI: 10.1109/tcyb.2026.3651242
Chunxiang Song, Yanan Liu, Guofeng Zhang, Huadong Mo, Daoyi Dong
{"title":"Evolutionary Optimization-Based Design of LQG Controllers in Quantum Coherent Feedback","authors":"Chunxiang Song, Yanan Liu, Guofeng Zhang, Huadong Mo, Daoyi Dong","doi":"10.1109/tcyb.2026.3651242","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651242","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"61 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146001268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decentralized Impulsive Control for Nonlinear Interconnected Systems Based on Dynamic Event-Triggered Mechanism 基于动态事件触发机制的非线性互联系统分散脉冲控制
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-19 DOI: 10.1109/tcyb.2026.3651462
Weihao Pan, Xianfu Zhang, Lu Liu, Zhiyu Duan
{"title":"Decentralized Impulsive Control for Nonlinear Interconnected Systems Based on Dynamic Event-Triggered Mechanism","authors":"Weihao Pan, Xianfu Zhang, Lu Liu, Zhiyu Duan","doi":"10.1109/tcyb.2026.3651462","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651462","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"27 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146001267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nontargeted Delay Attacks on Blockchain P2P Network: Feasibility and Financial Implications b区块链P2P网络的非目标延迟攻击:可行性和财务意义
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-19 DOI: 10.1109/tcyb.2025.3645481
Liyi Zeng, Wei Xu, Zhaoquan Gu, Yanchun Zhang
{"title":"Nontargeted Delay Attacks on Blockchain P2P Network: Feasibility and Financial Implications","authors":"Liyi Zeng, Wei Xu, Zhaoquan Gu, Yanchun Zhang","doi":"10.1109/tcyb.2025.3645481","DOIUrl":"https://doi.org/10.1109/tcyb.2025.3645481","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"100 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146001271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Differentially Private Consensus of Two-Time-Scale Multiagent Systems 双时间尺度多智能体系统的差分私有一致性
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-19 DOI: 10.1109/tcyb.2026.3651697
Lei Ma, Zhiwei Lu, Ying Zhang, Chunyu Yang, Guoqing Wang, Xinkai Chen
{"title":"Differentially Private Consensus of Two-Time-Scale Multiagent Systems","authors":"Lei Ma, Zhiwei Lu, Ying Zhang, Chunyu Yang, Guoqing Wang, Xinkai Chen","doi":"10.1109/tcyb.2026.3651697","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3651697","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"85 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146001269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extended Dissipative Analysis for Uncertain Delayed Genetic Regulatory Networks via Interval Type-2 T-S Fuzzy Framework 区间2型T-S模糊框架下不确定延迟遗传调控网络的扩展耗散分析
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-19 DOI: 10.1109/tcyb.2026.3652172
Menglu Zhu, Yi Zeng, Yankui Shi, Ligang Wu, Hak-Keung Lam
{"title":"Extended Dissipative Analysis for Uncertain Delayed Genetic Regulatory Networks via Interval Type-2 T-S Fuzzy Framework","authors":"Menglu Zhu, Yi Zeng, Yankui Shi, Ligang Wu, Hak-Keung Lam","doi":"10.1109/tcyb.2026.3652172","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3652172","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"5 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146001270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Q -Learning Approach to Finite-Horizon H ∞ Tracking With Partial Observation 部分观测有限视界H∞跟踪的Q学习方法
IF 11.8 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-19 DOI: 10.1109/tcyb.2026.3652143
Mingxiang Liu, Qianqian Cai, Wei Meng, Dandan Li, Minyue Fu
{"title":"Q -Learning Approach to Finite-Horizon H ∞ Tracking With Partial Observation","authors":"Mingxiang Liu, Qianqian Cai, Wei Meng, Dandan Li, Minyue Fu","doi":"10.1109/tcyb.2026.3652143","DOIUrl":"https://doi.org/10.1109/tcyb.2026.3652143","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"2 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146001318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection. LASFNet:用于多模态目标检测的轻量级注意力引导自调制特征融合网络。
IF 10.5 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-16 DOI: 10.1109/TCYB.2025.3650459
Lei Hao, Lina Xu, Chang Liu, Yanni Dong

Effective deep feature extraction via feature-level fusion is crucial for multimodal object detection. However, previous studies often involve complex training processes that integrate modality-specific features by stacking multiple feature-level fusion units, leading to significant computational overhead. To address this issue, we propose a lightweight attention-guided self-modulation feature fusion network (LASFNet). The LASFNet adopts a single feature-level fusion unit to enable high-performance detection, thereby simplifying the training process. The attention-guided self-modulation feature fusion (ASFF) module in the model adaptively adjusts the responses of fused features at both global and local levels, promoting comprehensive and enriched feature generation. Additionally, a lightweight feature attention transformation module (FATM) is designed at the neck of LASFNet to enhance the focus on fused features and minimize information loss. Extensive experiments on three representative datasets demonstrate that our approach achieves a favorable efficiency-accuracy tradeoff. Compared to state-of-the-art methods, LASFNet reduced the number of parameters and computational cost by as much as 90% and 85%, respectively, while improving detection accuracy mean average precision (mAP) by 1%-3%. The code will be open-sourced at https://github.com/leileilei2000/LASFNet.

通过特征级融合进行有效的深度特征提取是多模态目标检测的关键。然而,先前的研究通常涉及复杂的训练过程,通过堆叠多个特征级融合单元来集成特定于模态的特征,从而导致显著的计算开销。为了解决这个问题,我们提出了一个轻量级的注意力引导自调制特征融合网络(LASFNet)。LASFNet采用单个特征级融合单元实现高性能检测,从而简化了训练过程。模型中的注意引导自调制特征融合(attention-guided self-modulation feature fusion, ASFF)模块可自适应调整融合特征在全局和局部层面的响应,促进特征生成的全面和丰富。此外,在LASFNet的颈部设计了一个轻量级的特征注意转换模块(FATM),增强了对融合特征的关注,减少了信息丢失。在三个代表性数据集上进行的大量实验表明,我们的方法实现了良好的效率-精度权衡。与最先进的方法相比,LASFNet将参数数量和计算成本分别减少了90%和85%,同时将检测精度平均精度(mAP)提高了1%-3%。代码将在https://github.com/leileilei2000/LASFNet上开源。
{"title":"LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection.","authors":"Lei Hao, Lina Xu, Chang Liu, Yanni Dong","doi":"10.1109/TCYB.2025.3650459","DOIUrl":"https://doi.org/10.1109/TCYB.2025.3650459","url":null,"abstract":"<p><p>Effective deep feature extraction via feature-level fusion is crucial for multimodal object detection. However, previous studies often involve complex training processes that integrate modality-specific features by stacking multiple feature-level fusion units, leading to significant computational overhead. To address this issue, we propose a lightweight attention-guided self-modulation feature fusion network (LASFNet). The LASFNet adopts a single feature-level fusion unit to enable high-performance detection, thereby simplifying the training process. The attention-guided self-modulation feature fusion (ASFF) module in the model adaptively adjusts the responses of fused features at both global and local levels, promoting comprehensive and enriched feature generation. Additionally, a lightweight feature attention transformation module (FATM) is designed at the neck of LASFNet to enhance the focus on fused features and minimize information loss. Extensive experiments on three representative datasets demonstrate that our approach achieves a favorable efficiency-accuracy tradeoff. Compared to state-of-the-art methods, LASFNet reduced the number of parameters and computational cost by as much as 90% and 85%, respectively, while improving detection accuracy mean average precision (mAP) by 1%-3%. The code will be open-sourced at https://github.com/leileilei2000/LASFNet.</p>","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"PP ","pages":""},"PeriodicalIF":10.5,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145989039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Cybernetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1