首页 > 最新文献

IEEE Signal Processing Letters最新文献

英文 中文
Extended Node-Specific Distributed Generalized Sidelobe Canceler for Outdoor Wireless Acoustic Sensor Networks 面向室外无线声传感器网络的扩展节点特定分布式广义旁瓣对消器
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-15 DOI: 10.1109/LSP.2025.3644315
Shiqin Li;Jing Hu;Zhao Zhao;Zhiyong Xu
In distributed sound source enhancement (SSE) tasks using microphone array nodes, state-of-the-art node-specific distributed generalized sidelobe canceler (NS-DGSC) algorithm has achieved remarkable performance for simultaneously enhancing multiple desired sources. However, its assumption of an equal number of nodes and sources usually does not hold in outdoor applications. This letter proposes an extended NS-DGSC (ENS-DGSC) algorithm to tackle this issue. A correlation check module is introduced to handle scenarios where nodes outnumber or match sources. Furthermore, a temporal alignment module using two different strategies is designed to address time delays among nodes. Evaluations reveal that the proposed ENS-DGSC not only retains advantages of the NS-DGSC, but also provides superior enhancement performance with more nodes than sources.
在使用麦克风阵列节点的分布式声源增强(SSE)任务中,最先进的节点特定分布式广义旁瓣消除(NS-DGSC)算法在同时增强多个期望声源方面取得了显著的性能。然而,其假设的相等数量的节点和源通常不适用于户外应用。本文提出了一种扩展的NS-DGSC (ENS-DGSC)算法来解决这个问题。引入相关性检查模块来处理节点数量超过或匹配源的场景。此外,还设计了使用两种不同策略的时间对齐模块来解决节点间的时间延迟问题。评估结果表明,所提出的NS-DGSC不仅保留了NS-DGSC的优点,而且在节点多于源的情况下具有优越的增强性能。
{"title":"Extended Node-Specific Distributed Generalized Sidelobe Canceler for Outdoor Wireless Acoustic Sensor Networks","authors":"Shiqin Li;Jing Hu;Zhao Zhao;Zhiyong Xu","doi":"10.1109/LSP.2025.3644315","DOIUrl":"https://doi.org/10.1109/LSP.2025.3644315","url":null,"abstract":"In distributed sound source enhancement (SSE) tasks using microphone array nodes, state-of-the-art node-specific distributed generalized sidelobe canceler (NS-DGSC) algorithm has achieved remarkable performance for simultaneously enhancing multiple desired sources. However, its assumption of an equal number of nodes and sources usually does not hold in outdoor applications. This letter proposes an extended NS-DGSC (ENS-DGSC) algorithm to tackle this issue. A correlation check module is introduced to handle scenarios where nodes outnumber or match sources. Furthermore, a temporal alignment module using two different strategies is designed to address time delays among nodes. Evaluations reveal that the proposed ENS-DGSC not only retains advantages of the NS-DGSC, but also provides superior enhancement performance with more nodes than sources.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"306-310"},"PeriodicalIF":3.9,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explicit-Implicit Prompt Injection and Semantic-Guided Latent LoRA for Vision-Language Tracking 用于视觉语言跟踪的显隐提示注入和语义引导的潜在LoRA
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-12 DOI: 10.1109/LSP.2025.3643354
Jiapeng Zhang;Ying Wei;Yongfeng Li;Gang Yang;Qiaohong Hao
Prompt-based learning has shown promise in visual-language tracking (VLT), yet existing methods often rely on either explicit or implicit prompting alone, limiting fine-grained cross-modal alignment. Moreover, Low-Rank Adaptation (LoRA) -based fine-tuning in prior work typically focuses on visual-only adaptation, overlooking language semantics. To address these issues, we propose a unified VLT framework that integrates Explicit-Implicit Prompt Injection (EIPI) and Semantic-Guided Latent LoRA (SGLL). EIPI introduces semantic prompts to facilitate robust and context-sensitive target modeling through two pathways. The explicit prompts are constructed by interact between multi-modal target representations with the search region, while implicit prompts are learned from linguistic features via a lightweight bottleneck network. Then, SGLL extends standard LoRA by introducing learnable queries in the latent space, allowing residual modulation based on language-visual semantics without retraining the full model. This dual design yields a parameter-efficient tracker with strong cross-modal adaptability. Extensive experiments show our method outperforms prior prompt-based approaches while maintaining high efficiency.
基于提示的学习在视觉语言跟踪(VLT)中显示出前景,但现有的方法通常仅依赖于显式或隐式提示,限制了细粒度的跨模态对齐。此外,先前基于低秩自适应(LoRA)的微调通常只关注视觉自适应,而忽略了语言语义。为了解决这些问题,我们提出了一个统一的VLT框架,该框架集成了显式-隐式提示注入(EIPI)和语义引导的潜在LoRA (SGLL)。EIPI引入语义提示,通过两种途径促进健壮的和上下文敏感的目标建模。显式提示通过多模态目标表示与搜索区域之间的交互构建,而隐式提示通过轻量级瓶颈网络从语言特征中学习。然后,SGLL通过在潜在空间中引入可学习的查询来扩展标准的LoRA,允许基于语言视觉语义的残差调制,而无需重新训练整个模型。这种双重设计产生了具有强跨模态适应性的参数高效跟踪器。大量的实验表明,我们的方法在保持高效率的同时优于先前的基于提示的方法。
{"title":"Explicit-Implicit Prompt Injection and Semantic-Guided Latent LoRA for Vision-Language Tracking","authors":"Jiapeng Zhang;Ying Wei;Yongfeng Li;Gang Yang;Qiaohong Hao","doi":"10.1109/LSP.2025.3643354","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643354","url":null,"abstract":"Prompt-based learning has shown promise in visual-language tracking (VLT), yet existing methods often rely on either explicit or implicit prompting alone, limiting fine-grained cross-modal alignment. Moreover, Low-Rank Adaptation (LoRA) -based fine-tuning in prior work typically focuses on visual-only adaptation, overlooking language semantics. To address these issues, we propose a unified VLT framework that integrates Explicit-Implicit Prompt Injection (EIPI) and Semantic-Guided Latent LoRA (SGLL). EIPI introduces semantic prompts to facilitate robust and context-sensitive target modeling through two pathways. The explicit prompts are constructed by interact between multi-modal target representations with the search region, while implicit prompts are learned from linguistic features via a lightweight bottleneck network. Then, SGLL extends standard LoRA by introducing learnable queries in the latent space, allowing residual modulation based on language-visual semantics without retraining the full model. This dual design yields a parameter-efficient tracker with strong cross-modal adaptability. Extensive experiments show our method outperforms prior prompt-based approaches while maintaining high efficiency.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"376-380"},"PeriodicalIF":3.9,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shallow Neural Network Training via Atomic Norms and Semidefinite Programming 基于原子规范和半定规划的浅层神经网络训练
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-12 DOI: 10.1109/LSP.2025.3643361
Andrew J. Christensen;Ananya Sen Gupta
Neural networks have achieved remarkable results across numerous scientific domains because of their ability to uncover complex patterns. However, despite their effectiveness, these networks rely on heuristic training of highly non-convex objective functions, limiting theoretical understanding and practical reliability. Recent work has shown that shallow neural networks with scalar outputs can be formulated as convex optimization problems, bridging empirical success with theory. In this work, we build upon this framework for vector-valued outputs, introducing a convex formulation for two-layer ReLU networks based on an atomic norm and expressible as a semidefinite program (SDP). This yields a principled convex relaxation of multi-output networks that is both expressive and tractable. We validate the approach using standard SDP solvers, demonstrating its feasibility. These results extend convex neural network training beyond scalar outputs and provide a foundation for scalable, robust alternatives to current heuristic deep learning methods. Our method achieved a 7.3% increase in classification accuracy compared to a baseline convex multi-output network.
神经网络在许多科学领域取得了显著的成果,因为它们能够发现复杂的模式。然而,尽管它们很有效,但这些网络依赖于高度非凸目标函数的启发式训练,限制了理论理解和实际可靠性。最近的工作表明,具有标量输出的浅层神经网络可以表述为凸优化问题,将经验成功与理论联系起来。在这项工作中,我们在这个向量值输出框架的基础上,引入了一个基于原子范数和可表示为半确定程序(SDP)的双层ReLU网络的凸公式。这产生了多输出网络的原则性凸松弛,既具有表现力又易于处理。我们使用标准的SDP求解器验证了该方法,证明了其可行性。这些结果将凸神经网络训练扩展到标量输出之外,并为当前启发式深度学习方法的可扩展、鲁棒替代方案提供了基础。与基线凸多输出网络相比,我们的方法在分类精度上提高了7.3%。
{"title":"Shallow Neural Network Training via Atomic Norms and Semidefinite Programming","authors":"Andrew J. Christensen;Ananya Sen Gupta","doi":"10.1109/LSP.2025.3643361","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643361","url":null,"abstract":"Neural networks have achieved remarkable results across numerous scientific domains because of their ability to uncover complex patterns. However, despite their effectiveness, these networks rely on heuristic training of highly non-convex objective functions, limiting theoretical understanding and practical reliability. Recent work has shown that shallow neural networks with scalar outputs can be formulated as convex optimization problems, bridging empirical success with theory. In this work, we build upon this framework for vector-valued outputs, introducing a convex formulation for two-layer ReLU networks based on an atomic norm and expressible as a semidefinite program (SDP). This yields a principled convex relaxation of multi-output networks that is both expressive and tractable. We validate the approach using standard SDP solvers, demonstrating its feasibility. These results extend convex neural network training beyond scalar outputs and provide a foundation for scalable, robust alternatives to current heuristic deep learning methods. Our method achieved a 7.3% increase in classification accuracy compared to a baseline convex multi-output network.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"321-325"},"PeriodicalIF":3.9,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simple Self-Organizing Map With Vision Transformers 简单的自组织地图与视觉变压器
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-12 DOI: 10.1109/LSP.2025.3643388
Alan Luo;Kaiwen Yuan
Vision Transformers (ViTs) have demonstrated exceptional performance in various vision tasks. However, they tend to underperform on smaller datasets due to their inherent lack of inductive biases. Current approaches address this limitation implicitly—often by pairing ViTs with pretext tasks or by distilling knowledge from convolutional neural networks (CNNs) to strengthen the prior. In contrast, Self-Organizing Maps (SOMs), a widely adopted self-supervised framework, are inherently structured to preserve topology and spatial organization, making them a promising candidate to directly address the limitations of ViTs in limited or small training datasets. Despite this potential, equipping SOMs with modern deep learning architectures remains largely unexplored. In this study, we conduct a novel exploration on how Vision Transformers (ViTs) and Self-Organizing Maps (SOMs) can empower each other, aiming to bridge this critical research gap. Our findings demonstrate that these architectures can synergistically enhance each other, leading to significantly improved performance in both unsupervised and supervised tasks.
视觉变压器(ViTs)在各种视觉任务中表现出优异的性能。然而,由于它们固有的缺乏归纳偏差,它们往往在较小的数据集上表现不佳。目前的方法直接解决了这一限制,通常是通过将vit与借口任务配对,或者从卷积神经网络(cnn)中提取知识来增强先验。相比之下,自组织地图(SOMs)是一种广泛采用的自监督框架,其固有的结构可以保持拓扑和空间组织,使其成为直接解决vit在有限或小型训练数据集中的局限性的有希望的候选。尽管有这种潜力,但为som配备现代深度学习架构在很大程度上仍未被探索。在这项研究中,我们对视觉变形器(ViTs)和自组织地图(SOMs)如何相互授权进行了新颖的探索,旨在弥合这一关键的研究差距。我们的研究结果表明,这些架构可以协同增强彼此,从而在无监督和有监督任务中显著提高性能。
{"title":"Simple Self-Organizing Map With Vision Transformers","authors":"Alan Luo;Kaiwen Yuan","doi":"10.1109/LSP.2025.3643388","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643388","url":null,"abstract":"Vision Transformers (ViTs) have demonstrated exceptional performance in various vision tasks. However, they tend to underperform on smaller datasets due to their inherent lack of inductive biases. Current approaches address this limitation implicitly—often by pairing ViTs with pretext tasks or by distilling knowledge from convolutional neural networks (CNNs) to strengthen the prior. In contrast, Self-Organizing Maps (SOMs), a widely adopted self-supervised framework, are inherently structured to preserve topology and spatial organization, making them a promising candidate to directly address the limitations of ViTs in limited or small training datasets. Despite this potential, equipping SOMs with modern deep learning architectures remains largely unexplored. In this study, we conduct a novel exploration on how Vision Transformers (ViTs) and Self-Organizing Maps (SOMs) can empower each other, aiming to bridge this critical research gap. Our findings demonstrate that these architectures can synergistically enhance each other, leading to significantly improved performance in both unsupervised and supervised tasks.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"331-335"},"PeriodicalIF":3.9,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonintrusive Watermarking for CycleGAN CycleGAN的非侵入式水印
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-12 DOI: 10.1109/LSP.2025.3643348
Yebin Zheng;Haonan An;Guang Hua;Yongming Chen;Zhiping Lin
Generative adversarial networks (GANs) are a set of powerful generative models, among which CycleGAN, featuring the unique cycle-consistency loss, has gained special popularity. However, this unique structure and the cycle-consistency loss make watermarking CycleGAN particularly challenging, rendering existing deep neural network (DNN) watermarking methods, whether model-agnostic or GAN-specific, inapplicable. Meanwhile, existing DNN watermarking methods are intrusive in nature, requiring direct or indirect modification of model parameters for watermark embedding, which raises fidelity concerns. To solve the above problems, we propose the first nonintrusive and robust watermarking method for CycleGAN. We empirically show that without modifying the CycleGAN model, a user-defined watermark image can still be extracted from model outputs using a dedicated watermark decoder. Extensive experimental results verify that while achieving the so-called absolute fidelity, the proposed method is robust to various attacks, from image post-processing to model stealing.
生成式对抗网络(Generative adversarial networks, GANs)是一组功能强大的生成模型,其中CycleGAN以其独特的循环一致性损失特性而受到特别的关注。然而,这种独特的结构和周期一致性损失使得水印CycleGAN特别具有挑战性,使得现有的深度神经网络(DNN)水印方法,无论是模型无关的还是gan特定的,都不适用。同时,现有深度神经网络水印方法具有侵入性,需要直接或间接修改模型参数进行水印嵌入,存在保真度问题。为了解决上述问题,我们提出了CycleGAN的第一种非侵入式鲁棒水印方法。我们的经验表明,在不修改CycleGAN模型的情况下,使用专用的水印解码器仍然可以从模型输出中提取自定义的水印图像。大量的实验结果证明,在实现所谓的绝对保真度的同时,该方法对从图像后处理到模型窃取的各种攻击都具有鲁棒性。
{"title":"Nonintrusive Watermarking for CycleGAN","authors":"Yebin Zheng;Haonan An;Guang Hua;Yongming Chen;Zhiping Lin","doi":"10.1109/LSP.2025.3643348","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643348","url":null,"abstract":"Generative adversarial networks (GANs) are a set of powerful generative models, among which CycleGAN, featuring the unique cycle-consistency loss, has gained special popularity. However, this unique structure and the cycle-consistency loss make watermarking CycleGAN particularly challenging, rendering existing deep neural network (DNN) watermarking methods, whether model-agnostic or GAN-specific, inapplicable. Meanwhile, existing DNN watermarking methods are intrusive in nature, requiring direct or indirect modification of model parameters for watermark embedding, which raises fidelity concerns. To solve the above problems, we propose the first nonintrusive and robust watermarking method for CycleGAN. We empirically show that without modifying the CycleGAN model, a user-defined watermark image can still be extracted from model outputs using a dedicated watermark decoder. Extensive experimental results verify that while achieving the so-called absolute fidelity, the proposed method is robust to various attacks, from image post-processing to model stealing.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"256-260"},"PeriodicalIF":3.9,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Concentration Inequalities for Semidefinite Least Squares Based on Data 基于数据的半定最小二乘集中不等式
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-11 DOI: 10.1109/LSP.2025.3643385
Filippo Fabiani;Andrea Simonetto
We study data-driven least squares (LS) problems with semidefinite (SD) constraints and derive finite-sample guarantees on the spectrum of their optimal solutions when these constraints are relaxed. In particular, we provide a high confidence bound allowing one to solve a simpler program in place of the full SDLS problem, while ensuring that the eigenvalues of the resulting solution are $varepsilon$-close of those enforced by the SD constraints. The developed certificate, which consistently shrinks as the number of data increases, turns out to be easy-to-compute, distribution-free, and only requires independent and identically distributed samples. Moreover, when the SDLS is used to learn an unknown quadratic function, we establish bounds on the error between a gradient descent iterate minimizing the surrogate cost obtained with no SD constraints and the true minimizer.
本文研究了具有半定约束的数据驱动最小二乘问题,并推导了这些约束松弛时其最优解谱的有限样本保证。特别是,我们提供了一个高置信度界,允许人们解决一个更简单的程序来代替完整的SDLS问题,同时确保最终解决方案的特征值与SD约束所强制的特征值接近。开发的证书随着数据数量的增加而不断缩小,结果证明它易于计算、不受分布限制,并且只需要独立且相同分布的样本。此外,当SDLS用于学习未知的二次函数时,我们建立了在没有SD约束的情况下梯度下降迭代最小化代理代价与真正的最小化器之间的误差界限。
{"title":"Concentration Inequalities for Semidefinite Least Squares Based on Data","authors":"Filippo Fabiani;Andrea Simonetto","doi":"10.1109/LSP.2025.3643385","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643385","url":null,"abstract":"We study data-driven least squares (LS) problems with semidefinite (SD) constraints and derive finite-sample guarantees on the spectrum of their optimal solutions when these constraints are relaxed. In particular, we provide a high confidence bound allowing one to solve a simpler program in place of the full SDLS problem, while ensuring that the eigenvalues of the resulting solution are <inline-formula><tex-math>$varepsilon$</tex-math></inline-formula>-close of those enforced by the SD constraints. The developed certificate, which consistently shrinks as the number of data increases, turns out to be easy-to-compute, distribution-free, and only requires independent and identically distributed samples. Moreover, when the SDLS is used to learn an unknown quadratic function, we establish bounds on the error between a gradient descent iterate minimizing the surrogate cost obtained with no SD constraints and the true minimizer.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"326-330"},"PeriodicalIF":3.9,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LoRA Patching: Exposing the Fragility of Proactive Defenses Against Deepfakes LoRA补丁:暴露针对深度伪造的主动防御的脆弱性
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-11 DOI: 10.1109/LSP.2025.3643352
Zuomin Qu;Yimao Guo;Qianyue Hu;Wei Lu
Deepfakes pose significant societal risks, motivating the development of proactive defenses that embed adversarial perturbations in facial images to prevent manipulation. However, in this paper, we show that these preemptive defenses often lack robustness and reliability. We propose a novel approach, Low-Rank Adaptation (LoRA) patching, which injects a plug-and-play LoRA patch into Deepfake generators to bypass state-of-the-art defenses. A learnable gating mechanism adaptively controls the effect of the LoRA patch and prevents gradient explosions during fine-tuning. We also introduce a Multi-Modal Feature Alignment (MMFA) loss, encouraging the features of adversarial outputs to align with those of the desired outputs at the semantic level. Beyond bypassing, we present defensive LoRA patching, embedding visible warnings in the outputs as a complementary solution to mitigate this newly identified security vulnerability. With only 1,000 facial examples and a single epoch of fine-tuning, LoRA patching successfully defeats multiple proactive defenses. These results reveal a critical weakness in current paradigms and underscore the need for more robust Deepfake defense strategies.
深度造假构成了重大的社会风险,推动了主动防御的发展,在面部图像中嵌入对抗性扰动,以防止操纵。然而,在本文中,我们表明这些先发制人的防御往往缺乏鲁棒性和可靠性。我们提出了一种新颖的方法,低秩适应(LoRA)补丁,它将即插即用的LoRA补丁注入Deepfake生成器中,以绕过最先进的防御。一种可学习的门控机制自适应地控制LoRA贴片的效果,并防止微调过程中的梯度爆炸。我们还引入了多模态特征对齐(MMFA)损失,鼓励对抗性输出的特征在语义层面上与期望输出的特征对齐。除了绕过之外,我们还提供了防御性的LoRA补丁,在输出中嵌入可见的警告,作为缓解新发现的安全漏洞的补充解决方案。只有1000个面部样本和一个微调时期,LoRA补丁成功地击败了多个主动防御。这些结果揭示了当前范式的一个关键弱点,并强调了对更强大的Deepfake防御策略的需求。
{"title":"LoRA Patching: Exposing the Fragility of Proactive Defenses Against Deepfakes","authors":"Zuomin Qu;Yimao Guo;Qianyue Hu;Wei Lu","doi":"10.1109/LSP.2025.3643352","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643352","url":null,"abstract":"Deepfakes pose significant societal risks, motivating the development of proactive defenses that embed adversarial perturbations in facial images to prevent manipulation. However, in this paper, we show that these preemptive defenses often lack robustness and reliability. We propose a novel approach, Low-Rank Adaptation (LoRA) patching, which injects a plug-and-play LoRA patch into Deepfake generators to bypass state-of-the-art defenses. A learnable gating mechanism adaptively controls the effect of the LoRA patch and prevents gradient explosions during fine-tuning. We also introduce a Multi-Modal Feature Alignment (MMFA) loss, encouraging the features of adversarial outputs to align with those of the desired outputs at the semantic level. Beyond bypassing, we present defensive LoRA patching, embedding visible warnings in the outputs as a complementary solution to mitigate this newly identified security vulnerability. With only 1,000 facial examples and a single epoch of fine-tuning, LoRA patching successfully defeats multiple proactive defenses. These results reveal a critical weakness in current paradigms and underscore the need for more robust Deepfake defense strategies.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"286-290"},"PeriodicalIF":3.9,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learnable Time-Frequency Transform and Ridge Separation 可学习时频变换和脊分离
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-11 DOI: 10.1109/LSP.2025.3643359
Pingping Pan;Yunjian Zhang;You Li;Renzhong Guo
Time-frequency analysis (TFA) and ridge separation of non-stationary signals have long been research topics in signal processing. They are mutually dependent: informative time-frequency representations (TFRs) enable reliable ridge estimation, while accurate ridges refine TFRs by outlining component-wise time-frequency (TF) trajectories. However, the uncertainty principle limits TF resolution and ridge discriminability, and existing ridge tracking or optimization-based methods rely on empirical tuning and degrade with weak or closely spaced components, highlighting the need for a more robust and unified solution. This letter proposes a unified network that jointly performs TFA and ridge separation. It features a knowledge-guided short-time transform module for extracting discriminative TF features, coupled with an instance segmentation module with learnable queries that interacts with the extracted TF features to achieve ridge separation. This knowledge- and data-integrated framework enables fine-grained TFR construction and high-accuracy ridge separation, while eliminating manual parameter tuning and enhancing adaptability. Finally, experiments on simulated and real-world data validate its effectiveness.
非平稳信号的时频分析和脊分离一直是信号处理领域的研究课题。它们是相互依赖的:信息性时频表示(TFRs)能够实现可靠的脊估计,而精确的脊通过概述组件的时频(TF)轨迹来改进TFRs。然而,不确定性原理限制了TF分辨率和山脊可分辨性,现有的山脊跟踪或基于优化的方法依赖于经验调谐,并且在弱或紧密间隔的分量下会退化,这突出了对更健壮和统一的解决方案的需求。这封信提出了一个统一的网络,共同执行TFA和岭分离。它具有知识引导的短时变换模块,用于提取判别性TF特征,以及具有可学习查询的实例分割模块,该模块与提取的TF特征交互,实现脊分离。这种知识和数据集成框架实现了细粒度TFR构建和高精度脊分离,同时消除了手动参数调整并增强了适应性。最后,通过仿真和实际数据验证了该方法的有效性。
{"title":"Learnable Time-Frequency Transform and Ridge Separation","authors":"Pingping Pan;Yunjian Zhang;You Li;Renzhong Guo","doi":"10.1109/LSP.2025.3643359","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643359","url":null,"abstract":"Time-frequency analysis (TFA) and ridge separation of non-stationary signals have long been research topics in signal processing. They are mutually dependent: informative time-frequency representations (TFRs) enable reliable ridge estimation, while accurate ridges refine TFRs by outlining component-wise time-frequency (TF) trajectories. However, the uncertainty principle limits TF resolution and ridge discriminability, and existing ridge tracking or optimization-based methods rely on empirical tuning and degrade with weak or closely spaced components, highlighting the need for a more robust and unified solution. This letter proposes a unified network that jointly performs TFA and ridge separation. It features a knowledge-guided short-time transform module for extracting discriminative TF features, coupled with an instance segmentation module with learnable queries that interacts with the extracted TF features to achieve ridge separation. This knowledge- and data-integrated framework enables fine-grained TFR construction and high-accuracy ridge separation, while eliminating manual parameter tuning and enhancing adaptability. Finally, experiments on simulated and real-world data validate its effectiveness.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"296-300"},"PeriodicalIF":3.9,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inter-Trial Coherence Reveals Enhanced Synchrony During Mantra Listening 在听咒语的过程中,试验间的一致性揭示了增强的同步
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-11 DOI: 10.1109/LSP.2025.3643357
Daisy Das;Nabamita Deb;Saswati Sanyal Choudhury
Pregnancy is often a time of increased stress and anxiety, both psychologically and physically. More and more research is being done on the calming effects of relaxation techniques on the mother's brain. Passively listening to meditative mantras is one such method. This study investigates neuronal phase synchronization in three distinct cognitive states in pregnant women with their eyes closed: resting state (RS), mantra listening (M), and after mantra listening (AM). 32 scalp electrodes and a 128 Hz sampling rate were used to collect EEG data from 43 pregnant subjects. There were two 2-minute trials in each state. To assess the temporal synchrony of brain oscillations, the Inter-Trial Coherence (ITC), a phase-locking metric that quantifies the stability of neural phase over multiple trials, was computed. Bandpass filtering followed by Hilbert transform was used to assess ITC across the Theta (4–8 Hz), Alpha (8–13 Hz), and Beta (13–30 Hz) frequency bands. The Mantra condition had the greatest mean ITC, according to the results: 0.9117 (Theta), 0.8891 (Alpha), and 0.8083 (Beta). Conversely, the After-Mantra condition displayed moderate ITC levels of 0.6582 (Theta), 0.6510 (Alpha), and 0.6437 (Beta), whereas the Resting State produced 0.6392 (Theta), 0.6381 (Alpha), and 0.6368 (Beta). According to these results, passive mantra listening improves brain phase synchrony, especially in the lower frequency bands, and could be a useful non-invasive method of meditative pregnant relaxation.
怀孕通常是心理上和身体上压力和焦虑增加的时期。越来越多的研究是关于放松技术对母亲大脑的镇静作用。被动地听冥想咒语就是这样一种方法。本文研究了闭眼孕妇在静息状态(RS)、静听状态(M)和静听状态(AM)三种不同认知状态下的神经相同步。采用32个头皮电极,128 Hz采样率采集43例孕妇的脑电图数据。每个州有两次两分钟的试验。为了评估大脑振荡的时间同步性,计算了试验间相干性(ITC),这是一种锁相度量,用于量化多次试验中神经相位的稳定性。采用希尔伯特变换后的带通滤波来评估Theta (4-8 Hz)、Alpha (8-13 Hz)和Beta (13-30 Hz)频段的ITC。根据结果,咒语条件具有最大的平均ITC: 0.9117 (Theta), 0.8891 (Alpha)和0.8083 (Beta)。相反,咒语后状态显示出适度的ITC水平,为0.6582 (Theta), 0.6510 (Alpha)和0.6437 (Beta),而静息状态产生0.6392 (Theta), 0.6381 (Alpha)和0.6368 (Beta)。根据这些结果,被动听咒语可以改善大脑相同步,特别是在较低的频段,可能是一种有用的非侵入性怀孕冥想放松方法。
{"title":"Inter-Trial Coherence Reveals Enhanced Synchrony During Mantra Listening","authors":"Daisy Das;Nabamita Deb;Saswati Sanyal Choudhury","doi":"10.1109/LSP.2025.3643357","DOIUrl":"https://doi.org/10.1109/LSP.2025.3643357","url":null,"abstract":"Pregnancy is often a time of increased stress and anxiety, both psychologically and physically. More and more research is being done on the calming effects of relaxation techniques on the mother's brain. Passively listening to meditative mantras is one such method. This study investigates neuronal phase synchronization in three distinct cognitive states in pregnant women with their eyes closed: resting state (RS), mantra listening (M), and after mantra listening (AM). 32 scalp electrodes and a 128 Hz sampling rate were used to collect EEG data from 43 pregnant subjects. There were two 2-minute trials in each state. To assess the temporal synchrony of brain oscillations, the Inter-Trial Coherence (ITC), a phase-locking metric that quantifies the stability of neural phase over multiple trials, was computed. Bandpass filtering followed by Hilbert transform was used to assess ITC across the Theta (4–8 Hz), Alpha (8–13 Hz), and Beta (13–30 Hz) frequency bands. The Mantra condition had the greatest mean ITC, according to the results: 0.9117 (Theta), 0.8891 (Alpha), and 0.8083 (Beta). Conversely, the After-Mantra condition displayed moderate ITC levels of 0.6582 (Theta), 0.6510 (Alpha), and 0.6437 (Beta), whereas the Resting State produced 0.6392 (Theta), 0.6381 (Alpha), and 0.6368 (Beta). According to these results, passive mantra listening improves brain phase synchrony, especially in the lower frequency bands, and could be a useful non-invasive method of meditative pregnant relaxation.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"291-295"},"PeriodicalIF":3.9,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
P-Norm Based Fractional-Order Robust Subband Adaptive Filtering Algorithm for Impulsive Noise and Noisy Input 基于p范数的分数阶鲁棒子带自适应脉冲噪声和噪声输入滤波算法
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-10 DOI: 10.1109/LSP.2025.3642765
Jianhong Ye;Haiquan Zhao;Yi Peng
Building upon the mean $p$-power error (MPE) criterion, the normalized subband $p$-norm (NSPN) algorithm demonstrates superior robustness in $alpha$-stable noise environments ($1< alpha leq 2$) through effective utilization of low-order moment hidden in robust loss functions. Nevertheless, its performance degrades significantly when processing noise input or additive noise characterized by $alpha$-stable processes ($0< alpha leq 1$). To overcome these limitations, we propose a novel fractional-order NSPN (FoNSPN) algorithm that incorporates the fractional-order stochastic gradient descent (FoSGD) method into the MPE framework. Additionally, this paper also analyzes the convergence range of its step-size, the theoretical domain of values for the fractional-order $beta$, and establishes the theoretical steady-state mean square deviation (MSD) model. Simulations conducted in diverse impulsive noise environments confirm the superiority of the proposed FoNSPN algorithm against existing state-of-the-art algorithms.
基于平均$p$ -功率误差(MPE)准则,归一化子带$p$ -范数(NSPN)算法通过有效利用隐藏在鲁棒损失函数中的低阶矩,在$alpha$ -稳定噪声环境($1< alpha leq 2$)中表现出优越的鲁棒性。然而,当处理噪声输入或以$alpha$ -稳定过程($0< alpha leq 1$)为特征的加性噪声时,其性能会显著下降。为了克服这些限制,我们提出了一种新的分数阶NSPN (FoNSPN)算法,该算法将分数阶随机梯度下降(FoSGD)方法整合到MPE框架中。此外,本文还分析了其步长的收敛范围,分数阶的理论值域$beta$,并建立了理论稳态均方差(MSD)模型。在各种脉冲噪声环境下进行的仿真验证了所提出的FoNSPN算法相对于现有最先进算法的优越性。
{"title":"P-Norm Based Fractional-Order Robust Subband Adaptive Filtering Algorithm for Impulsive Noise and Noisy Input","authors":"Jianhong Ye;Haiquan Zhao;Yi Peng","doi":"10.1109/LSP.2025.3642765","DOIUrl":"https://doi.org/10.1109/LSP.2025.3642765","url":null,"abstract":"Building upon the mean <inline-formula><tex-math>$p$</tex-math></inline-formula>-power error (MPE) criterion, the normalized subband <inline-formula><tex-math>$p$</tex-math></inline-formula>-norm (NSPN) algorithm demonstrates superior robustness in <inline-formula><tex-math>$alpha$</tex-math></inline-formula>-stable noise environments (<inline-formula><tex-math>$1&lt; alpha leq 2$</tex-math></inline-formula>) through effective utilization of low-order moment hidden in robust loss functions. Nevertheless, its performance degrades significantly when processing noise input or additive noise characterized by <inline-formula><tex-math>$alpha$</tex-math></inline-formula>-stable processes (<inline-formula><tex-math>$0&lt; alpha leq 1$</tex-math></inline-formula>). To overcome these limitations, we propose a novel fractional-order NSPN (FoNSPN) algorithm that incorporates the fractional-order stochastic gradient descent (FoSGD) method into the MPE framework. Additionally, this paper also analyzes the convergence range of its step-size, the theoretical domain of values for the fractional-order <inline-formula><tex-math>$beta$</tex-math></inline-formula>, and establishes the theoretical steady-state mean square deviation (MSD) model. Simulations conducted in diverse impulsive noise environments confirm the superiority of the proposed FoNSPN algorithm against existing state-of-the-art algorithms.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"281-285"},"PeriodicalIF":3.9,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145830893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Signal Processing Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1