首页 > 最新文献

IEEE Signal Processing Letters最新文献

英文 中文
A Low-Complexity Bit-Embedding Decoder for Non-Binary LDPC Codes in $mathbb {F}_{2^{q}}$ $mathbb {F}_{2^{q}}$ 中非二进制 LDPC 编码的低复杂度比特嵌入式解码器
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-27 DOI: 10.1109/LSP.2024.3469776
Xiaomeng Guo;Yongchao Wang
In this letter, a new decoder based on a bit-embedding technique for non-binary low-density parity-check (LDPC) codes over Galois fields of characteristic two ($mathbb {F}_{2^{q}}$) is presented. The main content of this work is summarized in the following: first, we derive the equivalent binary codeword and binary parity-check matrix for non-binary LDPC codes in $mathbb {F}_{2^{q}}$; second, a customized belief propagation algorithm utilizing a bit-embedding technique is proposed based on the formulated binary-check matrix; in the end, simulations results demonstrate that the proposed approach enjoys significant improvement on computational complexity, but almost similar error-correction performance in comparison with the state-of-the-art non-binary LDPC decoders.
在这封信中,我们介绍了一种基于比特嵌入技术的新型解码器,它适用于特征二伽罗瓦场上的非二进制低密度奇偶校验(LDPC)码($mathbb {F}_{2^{q}}$ )。这项工作的主要内容概括如下:首先,我们推导出了 $mathbb {F}_{2^{q}}$ 中非二进制 LDPC 码的等效二进制码字和二进制奇偶校验矩阵;其次,基于所制定的二进制校验矩阵,提出了一种利用比特嵌入技术的定制信念传播算法;最后,仿真结果表明,与最先进的非二进制 LDPC 解码器相比,所提出的方法在计算复杂性上有显著改善,但纠错性能几乎相似。
{"title":"A Low-Complexity Bit-Embedding Decoder for Non-Binary LDPC Codes in $mathbb {F}_{2^{q}}$","authors":"Xiaomeng Guo;Yongchao Wang","doi":"10.1109/LSP.2024.3469776","DOIUrl":"https://doi.org/10.1109/LSP.2024.3469776","url":null,"abstract":"In this letter, a new decoder based on a bit-embedding technique for non-binary low-density parity-check (LDPC) codes over Galois fields of characteristic two (\u0000<inline-formula><tex-math>$mathbb {F}_{2^{q}}$</tex-math></inline-formula>\u0000) is presented. The main content of this work is summarized in the following: first, we derive the equivalent binary codeword and binary parity-check matrix for non-binary LDPC codes in \u0000<inline-formula><tex-math>$mathbb {F}_{2^{q}}$</tex-math></inline-formula>\u0000; second, a customized belief propagation algorithm utilizing a bit-embedding technique is proposed based on the formulated binary-check matrix; in the end, simulations results demonstrate that the proposed approach enjoys significant improvement on computational complexity, but almost similar error-correction performance in comparison with the state-of-the-art non-binary LDPC decoders.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142452741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coalitional Game-Theoretic Paradigm for Power Allocation in Distributed Antenna Systems 分布式天线系统功率分配的联盟博弈论范式
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-25 DOI: 10.1109/LSP.2024.3468989
Cheng Qi;Junwei Xie;Haowei Zhang;Weijian Liu;Weike Feng
This letter introduces a novel coalitional game-theoretic power allocation (CGTPA) paradigm, tailored for resource management for communication and netted radar systems. Taking the power allocation for enhancing downlink throughput in a distributed antenna system (DAS) as a clear-cut example, theoretical expositions, and experimental simulations are presented accordingly. As an optimal decision-making method grounded in behavioral rules, the CGTPA method distinguishes itself from the methods yielded by conventional convex optimization and heuristic methods. It evolves around the idea that cooperation among antennas can be modeled as a coalitional game. Then the Sharpley value-based power allocation ensures a Pareto solution for both optimality and fairness. Furthermore, it is mathematically proven that the throughput consistently improves through iterative application of the allocation rule. Mathematical analysis and simulation results validate the effectiveness of the proposed method.
这篇文章介绍了一种新颖的联盟博弈理论功率分配(CGTPA)范式,专门用于通信和网状雷达系统的资源管理。本文以提高分布式天线系统(DAS)下行链路吞吐量的功率分配为例,进行了相应的理论阐述和实验模拟。作为一种基于行为规则的优化决策方法,CGTPA 方法有别于传统的凸优化和启发式方法。它围绕着天线之间的合作可以被模拟为联盟博弈的思想而发展。然后,基于 Sharpley 值的功率分配确保了帕累托解决方案的最优性和公平性。此外,数学证明,通过迭代应用分配规则,吞吐量会持续提高。数学分析和仿真结果验证了所提方法的有效性。
{"title":"Coalitional Game-Theoretic Paradigm for Power Allocation in Distributed Antenna Systems","authors":"Cheng Qi;Junwei Xie;Haowei Zhang;Weijian Liu;Weike Feng","doi":"10.1109/LSP.2024.3468989","DOIUrl":"https://doi.org/10.1109/LSP.2024.3468989","url":null,"abstract":"This letter introduces a novel coalitional game-theoretic power allocation (CGTPA) paradigm, tailored for resource management for communication and netted radar systems. Taking the power allocation for enhancing downlink throughput in a distributed antenna system (DAS) as a clear-cut example, theoretical expositions, and experimental simulations are presented accordingly. As an optimal decision-making method grounded in behavioral rules, the CGTPA method distinguishes itself from the methods yielded by conventional convex optimization and heuristic methods. It evolves around the idea that cooperation among antennas can be modeled as a coalitional game. Then the Sharpley value-based power allocation ensures a Pareto solution for both optimality and fairness. Furthermore, it is mathematically proven that the throughput consistently improves through iterative application of the allocation rule. Mathematical analysis and simulation results validate the effectiveness of the proposed method.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Image-Text Matching by Integrating Word Sense Disambiguation 通过整合词义消歧改进图像-文本匹配
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-24 DOI: 10.1109/LSP.2024.3466992
Xiao Pu;Ping Yang;Lin Yuan;Xinbo Gao
This letter presents a novel approach to enhance image-text matching by incorporating word sense disambiguation (WSD) within the text encoder. Our method explicitly models the senses of potentially ambiguous words, refining the semantic understanding between images and text. We introduce a sense-aware mechanism for image-text alignment by integrating a lightweight WSD component into the matching framework, optimizing both tasks simultaneously. Our WSD module operates on extensive word contexts, leveraging the power of graph attention networks (GAT), and distills knowledge from a substantially larger pre-trained WSD model through multi-task learning. Our experiments demonstrate the effectiveness of augmenting original word embeddings with sense representations derived from our WSD approach. We systematically evaluate our method against several baselines and state-of-the-art approaches on two widely-used image-text matching benchmarks: MS-COCO and Flickr30K. The results illustrate significant improvements in matching accuracy, highlighting the efficacy of our proposed approach.
这封信提出了一种新方法,通过在文本编码器中加入词义消歧(WSD)来增强图像与文本的匹配。我们的方法明确地对可能存在歧义的单词进行词义建模,从而完善图像和文本之间的语义理解。通过将轻量级 WSD 组件集成到匹配框架中,我们引入了一种感知机制来实现图像与文本的对齐,同时优化这两项任务。我们的 WSD 模块利用图注意网络(GAT)的强大功能,在广泛的单词上下文中运行,并通过多任务学习,从规模更大的预训练 WSD 模型中提炼知识。我们的实验证明了用从 WSD 方法中获得的意义表征来增强原始词嵌入的有效性。我们在两个广泛使用的图像-文本匹配基准上系统地评估了我们的方法与几种基准和最先进的方法的对比情况:MS-COCO 和 Flickr30K。结果表明,我们的方法显著提高了匹配准确率,凸显了我们提出的方法的功效。
{"title":"Improving Image-Text Matching by Integrating Word Sense Disambiguation","authors":"Xiao Pu;Ping Yang;Lin Yuan;Xinbo Gao","doi":"10.1109/LSP.2024.3466992","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466992","url":null,"abstract":"This letter presents a novel approach to enhance image-text matching by incorporating word sense disambiguation (WSD) within the text encoder. Our method explicitly models the senses of potentially ambiguous words, refining the semantic understanding between images and text. We introduce a sense-aware mechanism for image-text alignment by integrating a lightweight WSD component into the matching framework, optimizing both tasks simultaneously. Our WSD module operates on extensive word contexts, leveraging the power of graph attention networks (GAT), and distills knowledge from a substantially larger pre-trained WSD model through multi-task learning. Our experiments demonstrate the effectiveness of augmenting original word embeddings with sense representations derived from our WSD approach. We systematically evaluate our method against several baselines and state-of-the-art approaches on two widely-used image-text matching benchmarks: MS-COCO and Flickr30K. The results illustrate significant improvements in matching accuracy, highlighting the efficacy of our proposed approach.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Initial Phase Coding With Two-Dimensional Local Low Sidelobes for Suppression of Active Forwarding Signals 利用二维局部低边沿抑制主动转发信号的初始相位编码
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-24 DOI: 10.1109/LSP.2024.3466996
Qianlan Huang;Sergey Heister;Hongqi Fan;Huaitie Xiao
The electromagnetic environment in modern battlefields is highly complex, particularly due to active forwarding signals based on digital radio frequency memory (DRFM), which significantly affects radar performance. Active forwarding signals generate jammers in the radar receiver.A jammer with a negative range offset needs to reach the radar with an inter-pulse delay. In addition, the target may also introduce range ambiguity in some cases. This paper establishes a general signal model for inter-pulse initial phase coding and proposes a two-dimensional local low sidelobe initial phase coding algorithm. This algorithm optimizes both the slow-time and Doppler dimensions to make local low sidelobes in both dimensional. The designed initial phase sequence ensures the Doppler resolution of the target while exhibiting locally low sidelobes in the Doppler responses of jammers.Simulation results demonstrate that this algorithm effectively enhances the radar's resistance to jammers and improves target detection performance. Furthermore, it operates independently of prior detection information and maintains low computational complexity.
现代战场的电磁环境非常复杂,尤其是基于数字射频存储器(DRFM)的有源转发信号会严重影响雷达性能。主动转发信号会在雷达接收器中产生干扰信号。具有负范围偏移的干扰信号需要经过脉冲间延迟才能到达雷达。此外,在某些情况下,目标还可能带来范围模糊。本文建立了脉冲间初始相位编码的一般信号模型,并提出了一种二维局部低侧叶初始相位编码算法。该算法同时优化了慢时维度和多普勒维度,从而在两个维度上都实现了局部低边瓣。仿真结果表明,该算法有效增强了雷达对干扰器的抗干扰能力,提高了目标探测性能。仿真结果表明,这种算法有效地增强了雷达对干扰器的抗干扰能力,提高了目标探测性能。此外,它的运行不受先前探测信息的影响,计算复杂度也很低。
{"title":"Initial Phase Coding With Two-Dimensional Local Low Sidelobes for Suppression of Active Forwarding Signals","authors":"Qianlan Huang;Sergey Heister;Hongqi Fan;Huaitie Xiao","doi":"10.1109/LSP.2024.3466996","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466996","url":null,"abstract":"The electromagnetic environment in modern battlefields is highly complex, particularly due to active forwarding signals based on digital radio frequency memory (DRFM), which significantly affects radar performance. Active forwarding signals generate jammers in the radar receiver.A jammer with a negative range offset needs to reach the radar with an inter-pulse delay. In addition, the target may also introduce range ambiguity in some cases. This paper establishes a general signal model for inter-pulse initial phase coding and proposes a two-dimensional local low sidelobe initial phase coding algorithm. This algorithm optimizes both the slow-time and Doppler dimensions to make local low sidelobes in both dimensional. The designed initial phase sequence ensures the Doppler resolution of the target while exhibiting locally low sidelobes in the Doppler responses of jammers.Simulation results demonstrate that this algorithm effectively enhances the radar's resistance to jammers and improves target detection performance. Furthermore, it operates independently of prior detection information and maintains low computational complexity.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142408816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modal Excitation in Feedback Delay Networks 反馈延迟网络中的模态激励
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-23 DOI: 10.1109/LSP.2024.3466790
Sebastian J. Schlecht;Matteo Scerbo;Enzo De Sena;Vesa Välimäki
Feedback delay networks (FDNs) are used in audio processing and synthesis. The modal shapes of the system describe the modal excitation by input and output signals. Previously, the Ehrlich-Aberth method was used to find modes in large FDNs. Here, the method is extended to the corresponding eigenvectors indicating the modal shape. In particular, the computational complexity of the proposed analysis method does not depend on the delay-line lengths and is thus suitable for large FDNs, such as artificial reverberators. We show the relation between the compact generalized eigenvectors in the delay state space and the spatially extended modal shapes in the state space. We illustrate this method with an example FDN in which the suggested modal excitation control does not increase the computational cost. The modal shapes can help optimize input and output gains. This letter teaches how selecting the input and output points along the delay lines of an FDN adjusts the spectral shape of the system output.
反馈延迟网络(FDN)用于音频处理和合成。系统的模态形状描述了输入和输出信号的模态激励。以前,人们使用 Ehrlich-Aberth 方法来寻找大型 FDN 的模态。在这里,该方法扩展到了表示模态形状的相应特征向量。特别是,所提分析方法的计算复杂度与延迟线长度无关,因此适用于大型 FDN,如人工混响器。我们展示了延迟状态空间中的紧凑广义特征向量与状态空间中的空间扩展模态形状之间的关系。我们用一个 FDN 例子来说明这种方法,其中建议的模态激励控制不会增加计算成本。模态振型有助于优化输入和输出增益。这封信介绍了如何沿着 FDN 的延迟线选择输入和输出点来调整系统输出的频谱形状。
{"title":"Modal Excitation in Feedback Delay Networks","authors":"Sebastian J. Schlecht;Matteo Scerbo;Enzo De Sena;Vesa Välimäki","doi":"10.1109/LSP.2024.3466790","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466790","url":null,"abstract":"Feedback delay networks (FDNs) are used in audio processing and synthesis. The modal shapes of the system describe the modal excitation by input and output signals. Previously, the Ehrlich-Aberth method was used to find modes in large FDNs. Here, the method is extended to the corresponding eigenvectors indicating the modal shape. In particular, the computational complexity of the proposed analysis method does not depend on the delay-line lengths and is thus suitable for large FDNs, such as artificial reverberators. We show the relation between the compact generalized eigenvectors in the delay state space and the spatially extended modal shapes in the state space. We illustrate this method with an example FDN in which the suggested modal excitation control does not increase the computational cost. The modal shapes can help optimize input and output gains. This letter teaches how selecting the input and output points along the delay lines of an FDN adjusts the spectral shape of the system output.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion Augmentation and Pose Generation Based Pre-Training Method for Robust Visible-Infrared Person Re-Identification 基于扩散增强和姿势生成的预训练方法,用于可靠的可见光-红外人员再识别
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-23 DOI: 10.1109/LSP.2024.3466792
Rui Sun;Guoxi Huang;Ruirui Xie;Xuebin Wang;Long Chen
Cross-Modal Visible-Infrared Person Re-identification (VI-REID) constitutes a vital application for constructing all-time surveillance systems. However, the current VI-REID model exhibits significant performance deterioration in noisy environments. Existing algorithms endeavor to mitigate this challenge through fine-tuning stages. We contend that, in contrast to fine-tuning stages, the pre-training phase can effectively exploit the attributes of extensive unlabeled data, thereby facilitating the development of a robust VI-REID model. Therefore, in this paper, we propose a pre-training method for VI-REID based on Diffusion Augmentation and Pose Generation (DAPG), aiming to enhance the robustness and recognition rate of VI-REID models in the presence of damaged scenes. Multiple transfer experiments on the SYSU-MM01 and RegDB datasets demonstrate that our method outperforms existing self-supervised methods, as evidenced by the results.
跨模态可见光-红外人员再识别(VI-REID)是构建全时监控系统的一项重要应用。然而,当前的 VI-REID 模型在嘈杂环境中表现出明显的性能下降。现有算法试图通过微调阶段来缓解这一挑战。我们认为,与微调阶段不同,预训练阶段可以有效利用大量未标记数据的属性,从而促进稳健的 VI-REID 模型的开发。因此,本文提出了一种基于扩散增强和姿态生成(DAPG)的VI-REID预训练方法,旨在提高VI-REID模型在受损场景下的鲁棒性和识别率。在SYSU-MM01和RegDB数据集上进行的多次转移实验证明,我们的方法优于现有的自监督方法,这一点从结果中可见一斑。
{"title":"Diffusion Augmentation and Pose Generation Based Pre-Training Method for Robust Visible-Infrared Person Re-Identification","authors":"Rui Sun;Guoxi Huang;Ruirui Xie;Xuebin Wang;Long Chen","doi":"10.1109/LSP.2024.3466792","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466792","url":null,"abstract":"Cross-Modal Visible-Infrared Person Re-identification (VI-REID) constitutes a vital application for constructing all-time surveillance systems. However, the current VI-REID model exhibits significant performance deterioration in noisy environments. Existing algorithms endeavor to mitigate this challenge through fine-tuning stages. We contend that, in contrast to fine-tuning stages, the pre-training phase can effectively exploit the attributes of extensive unlabeled data, thereby facilitating the development of a robust VI-REID model. Therefore, in this paper, we propose a pre-training method for VI-REID based on Diffusion Augmentation and Pose Generation (DAPG), aiming to enhance the robustness and recognition rate of VI-REID models in the presence of damaged scenes. Multiple transfer experiments on the SYSU-MM01 and RegDB datasets demonstrate that our method outperforms existing self-supervised methods, as evidenced by the results.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142383361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Edge-and-Mask Integration-Driven Diffusion Models for Medical Image Segmentation 用于医学图像分割的边缘与掩膜集成驱动扩散模型
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-23 DOI: 10.1109/LSP.2024.3466608
Qian Tang;Qikui Zhu;Yuxuan Xiong;Yongchao Xu;Bo Du
Denoising diffusion probabilistic models (DDPMs) exhibit significant potential in the realm of medical image segmentation. Nevertheless, current DDPM implementations rely on original image features as conditional information, thus lacking the ability to specifically emphasize edge information, a critical aspect in addressing the primary challenge of segmentation. Furthermore, the necessary semantic features for conditioning the diffusion process lack effective alignment with the noise embedding. To address the above issues, we propose a novel edge-and-mask integration-driven diffusion model (EMidDiff). Specifically, 1) an edge-and-mask condition strategy is proposed for the segmentation diffusion model to effectively leverage rich semantic features, particularly the edge feature. 2) A novel co-attention guidance block is designed to align the segmentation map and condition features. The experimental results on brain tumor segmentation and optic-cup segmentation underscore the effectiveness of our approach, surpassing the performance of some state-of-the-art segmentation diffusion models.
去噪扩散概率模型(DDPM)在医学图像分割领域展现出巨大的潜力。然而,目前的 DDPM 实现依赖于原始图像特征作为条件信息,因此缺乏特别强调边缘信息的能力,而边缘信息是解决分割这一主要挑战的关键方面。此外,用于调节扩散过程的必要语义特征与噪声嵌入缺乏有效的一致性。为了解决上述问题,我们提出了一种新颖的边缘与掩码整合驱动扩散模型(EMidDiff)。具体来说,1)为分割扩散模型提出了边缘与掩码条件策略,以有效利用丰富的语义特征,尤其是边缘特征。2) 设计了一个新颖的共同注意引导块,以对齐分割图和条件特征。脑肿瘤分割和视杯分割的实验结果表明,我们的方法非常有效,其性能超过了一些最先进的分割扩散模型。
{"title":"Edge-and-Mask Integration-Driven Diffusion Models for Medical Image Segmentation","authors":"Qian Tang;Qikui Zhu;Yuxuan Xiong;Yongchao Xu;Bo Du","doi":"10.1109/LSP.2024.3466608","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466608","url":null,"abstract":"Denoising diffusion probabilistic models (DDPMs) exhibit significant potential in the realm of medical image segmentation. Nevertheless, current DDPM implementations rely on original image features as conditional information, thus lacking the ability to specifically emphasize edge information, a critical aspect in addressing the primary challenge of segmentation. Furthermore, the necessary semantic features for conditioning the diffusion process lack effective alignment with the noise embedding. To address the above issues, we propose a novel edge-and-mask integration-driven diffusion model (EMidDiff). Specifically, 1) an edge-and-mask condition strategy is proposed for the segmentation diffusion model to effectively leverage rich semantic features, particularly the edge feature. 2) A novel co-attention guidance block is designed to align the segmentation map and condition features. The experimental results on brain tumor segmentation and optic-cup segmentation underscore the effectiveness of our approach, surpassing the performance of some state-of-the-art segmentation diffusion models.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142383431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Digital Superresolution Method With Minimal Sensitivity to Shift Estimation Error 对移位估计误差敏感度最小的数字超分辨率方法
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-23 DOI: 10.1109/LSP.2024.3466008
Neethu S. Ravi;Rakesh Kumar;Bradley M. Ratliff
Superresolution (SR) methods become essential when an undersampled low-resolution (LR) image is unable to provide accurate target detection. The estimation of an HR image from a single LR image is ill-posed problem, and hence requires prior information. More the constraints, better is the reconstruction accuracy and this forms the basis of most of the contemporary state-of-the-art (SOTA) superresolution methods such as SRCNN and SRGAN which implement prior information as training set. Yet another approach to overcome the ill-posedness of the problem is to have multiple diverse LR images with the potential to reconstruct accurate HR image. Here we present the performance analysis of a generalized sampling theorem (GST) based multi-frame SR method. A simulation study using Gaussian targets is conducted, and a comparative performance analysis of the GST multi-frame SR method with the traditional multi-frame interpolation schemes and SOTA methods is presented using the percentage mean square error ($%$MSE) and Structural Similarity Index Measure (SSIM). Our findings indicate that the GST SR method can outperform traditional interpolation and the SOTA methods.
当采样不足的低分辨率(LR)图像无法提供准确的目标检测时,超分辨率(SR)方法就变得至关重要。从单幅低分辨率图像估计高分辨率图像是一个难以解决的问题,因此需要先验信息。约束条件越多,重建精度就越高,这就构成了大多数当代最先进(SOTA)超分辨率方法的基础,如 SRCNN 和 SRGAN,它们都将先验信息作为训练集。另一种克服问题不确定性的方法是使用多幅不同的 LR 图像来重建准确的 HR 图像。在此,我们介绍了基于广义抽样定理(GST)的多帧 SR 方法的性能分析。我们使用高斯目标进行了模拟研究,并使用均方误差百分比($%$MSE)和结构相似性指数测量(SSIM)对 GST 多帧 SR 方法与传统多帧插值方案和 SOTA 方法进行了性能比较分析。研究结果表明,GST SR 方法的性能优于传统插值法和 SOTA 方法。
{"title":"A Digital Superresolution Method With Minimal Sensitivity to Shift Estimation Error","authors":"Neethu S. Ravi;Rakesh Kumar;Bradley M. Ratliff","doi":"10.1109/LSP.2024.3466008","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466008","url":null,"abstract":"Superresolution (SR) methods become essential when an undersampled low-resolution (LR) image is unable to provide accurate target detection. The estimation of an HR image from a single LR image is ill-posed problem, and hence requires prior information. More the constraints, better is the reconstruction accuracy and this forms the basis of most of the contemporary state-of-the-art (SOTA) superresolution methods such as SRCNN and SRGAN which implement prior information as training set. Yet another approach to overcome the ill-posedness of the problem is to have multiple diverse LR images with the potential to reconstruct accurate HR image. Here we present the performance analysis of a generalized sampling theorem (GST) based multi-frame SR method. A simulation study using Gaussian targets is conducted, and a comparative performance analysis of the GST multi-frame SR method with the traditional multi-frame interpolation schemes and SOTA methods is presented using the percentage mean square error (\u0000<inline-formula><tex-math>$%$</tex-math></inline-formula>\u0000MSE) and Structural Similarity Index Measure (SSIM). Our findings indicate that the GST SR method can outperform traditional interpolation and the SOTA methods.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Algorithm for Multi-Target Angle and Range-Velocity Estimation With MIMO-OFDM Communication Waveforms 利用 MIMO-OFDM 通信波形进行多目标角度和距离-速度估计的新算法
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-23 DOI: 10.1109/LSP.2024.3466788
Zhuangxin Fang;Zhiyong Luo
In this letter, we address the challenge of employing multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) to simultaneously estimate the range, velocity, and angle of multiple moving targets at the transmitter, while maintaining communication. Specifically, we present a mathematical model for radar sensing signal processing and elaborate on a successive estimation scheme. We reveal that the compensation processing algorithm proposed in existing successive estimation scheme suffers from the problem of ambiguous matching, which fails to properly match the angle and the range-velocity for targets with close angles. To address the ambiguous matching problem, this letter proposes a novel compensation processing algorithm for range-velocity estimation. Based on minimum variance unbiased estimation, the proposed compensation processing algorithm fully exploits the estimated angle information to extract corresponding range-velocity information for eliminating the interference from targets at close angles.
在这封信中,我们探讨了采用多输入多输出(MIMO)正交频分复用(OFDM)技术,在发射机上同时估计多个移动目标的距离、速度和角度,同时保持通信的难题。具体而言,我们提出了雷达传感信号处理的数学模型,并详细阐述了连续估计方案。我们发现,现有连续估计方案中提出的补偿处理算法存在模糊匹配问题,无法正确匹配角度和距离-速度接近的目标。为解决模糊匹配问题,本文提出了一种新的测距-测速估计补偿处理算法。基于最小方差无偏估计,所提出的补偿处理算法充分利用估计的角度信息来提取相应的测距-测速信息,以消除近角目标的干扰。
{"title":"A Novel Algorithm for Multi-Target Angle and Range-Velocity Estimation With MIMO-OFDM Communication Waveforms","authors":"Zhuangxin Fang;Zhiyong Luo","doi":"10.1109/LSP.2024.3466788","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466788","url":null,"abstract":"In this letter, we address the challenge of employing multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) to simultaneously estimate the range, velocity, and angle of multiple moving targets at the transmitter, while maintaining communication. Specifically, we present a mathematical model for radar sensing signal processing and elaborate on a successive estimation scheme. We reveal that the compensation processing algorithm proposed in existing successive estimation scheme suffers from the problem of ambiguous matching, which fails to properly match the angle and the range-velocity for targets with close angles. To address the ambiguous matching problem, this letter proposes a novel compensation processing algorithm for range-velocity estimation. Based on minimum variance unbiased estimation, the proposed compensation processing algorithm fully exploits the estimated angle information to extract corresponding range-velocity information for eliminating the interference from targets at close angles.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142408860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Complex Immittance Spectral Frequency With the Perceptual-Metric-Based Codebook Search 利用基于感知-度量的码本搜索实现高效的复浸透谱频率
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-09-20 DOI: 10.1109/LSP.2024.3466012
Byeongho Jo;Seungkwon Beack
Complex-valued frequency-domain linear predictive coding (CLPC) has been developed for audio coding. Recently, representations for efficiently quantizing CLPC coefficients have been proposed, including the complex immittance spectral frequency (CISF). The CISF has limitations in that it requires signalling the sequential information to eliminate ambiguity and the highest-order coefficient (HOC) for reconstructing the CLPC coefficients. This study developed a modified CISF-based method that eliminates the need for additional information by utilizing intermediate complex polynomial properties. Furthermore, a perceptual-metric-based codebook search was proposed to improve quantization efficiency. The experimental results show robust quantization performance, while listening tests demonstrate superior audio quality compared to MPEG-D USAC long TCX at 12 kbps.
为音频编码开发了复值频域线性预测编码(CLPC)。最近,有人提出了对 CLPC 系数进行有效量化的表示方法,其中包括复值频谱频率 (CISF)。CISF 有其局限性,因为它需要传递序列信息以消除歧义,并需要最高阶系数(HOC)来重构 CLPC 系数。本研究开发了一种基于 CISF 的改进方法,通过利用中间复多项式特性,消除了对额外信息的需求。此外,还提出了一种基于感知度量的编码本搜索方法,以提高量化效率。实验结果表明,该方法具有稳健的量化性能,而听力测试表明,与 12 kbps 的 MPEG-D USAC 长 TCX 相比,该方法的音频质量更胜一筹。
{"title":"Efficient Complex Immittance Spectral Frequency With the Perceptual-Metric-Based Codebook Search","authors":"Byeongho Jo;Seungkwon Beack","doi":"10.1109/LSP.2024.3466012","DOIUrl":"https://doi.org/10.1109/LSP.2024.3466012","url":null,"abstract":"Complex-valued frequency-domain linear predictive coding (CLPC) has been developed for audio coding. Recently, representations for efficiently quantizing CLPC coefficients have been proposed, including the complex immittance spectral frequency (CISF). The CISF has limitations in that it requires signalling the sequential information to eliminate ambiguity and the highest-order coefficient (HOC) for reconstructing the CLPC coefficients. This study developed a modified CISF-based method that eliminates the need for additional information by utilizing intermediate complex polynomial properties. Furthermore, a perceptual-metric-based codebook search was proposed to improve quantization efficiency. The experimental results show robust quantization performance, while listening tests demonstrate superior audio quality compared to MPEG-D USAC long TCX at 12 kbps.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Signal Processing Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1