首页 > 最新文献

IEEE Signal Processing Letters最新文献

英文 中文
Geometry-Guided Point Generation for 3D Object Detection 用于 3D 物体检测的几何引导点生成技术
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-20 DOI: 10.1109/LSP.2024.3503359
Kai Wang;Mingliang Zhou;Qing Lin;Guanglin Niu;Xiaowei Zhang
Point cloud completion 3D object detectors effectively tackle the challenge of incomplete shapes in sparse point clouds by generating pseudo points to improve detection performance. However, the absence of guidance provided by the heatmap information and the geometric shape information renders the precise recovery of object shapes an arduous task. To this end, we propose a Geometry-guided Point Generation for 3D Object Detection, named GgPG. Specifically, we first design a 3D heatmap auxiliary supervision subnetwork to enhance the quality of object proposals by capturing the actual size and position of the object within the 3D heatmap representation. Moreover, we introduce a density-aware point generation module that employs Kernel Density Estimation (KDE) to embed the point density into the grid point's feature representation, thereby enabling the completion of more precise object shapes. Our GgPG achieves progressive performance in both Waymo and KITTI benchmarks, notably GgPG outperforms PGRCNN by +1.02$%$, +1.18$%$, and +0.56$%$ on the vehicle, pedestrian, and cyclist under LEVEL$_$ 2 mAPH classes on Waymo Open Dataset, respectively.
点云补全三维目标检测器通过生成伪点,有效地解决了稀疏点云中形状不完整的难题,提高了检测性能。然而,缺乏热图信息和几何形状信息的指导,使得物体形状的精确恢复是一项艰巨的任务。为此,我们提出了一种用于三维物体检测的几何引导点生成方法,称为GgPG。具体来说,我们首先设计了一个3D热图辅助监督子网,通过捕获物体在3D热图表示中的实际尺寸和位置来提高物体建议的质量。此外,我们引入了一个密度感知的点生成模块,该模块使用核密度估计(KDE)将点密度嵌入到网格点的特征表示中,从而能够完成更精确的物体形状。我们的GgPG在Waymo和KITTI基准测试中都实现了渐进式性能,特别是在Waymo开放数据集上LEVEL$_$ 2 mAPH类下,GgPG在车辆、行人和骑自行车者上的表现分别优于PGRCNN +1.02$%$、+1.18$%$和+0.56$%$。
{"title":"Geometry-Guided Point Generation for 3D Object Detection","authors":"Kai Wang;Mingliang Zhou;Qing Lin;Guanglin Niu;Xiaowei Zhang","doi":"10.1109/LSP.2024.3503359","DOIUrl":"https://doi.org/10.1109/LSP.2024.3503359","url":null,"abstract":"Point cloud completion 3D object detectors effectively tackle the challenge of incomplete shapes in sparse point clouds by generating pseudo points to improve detection performance. However, the absence of guidance provided by the heatmap information and the geometric shape information renders the precise recovery of object shapes an arduous task. To this end, we propose a Geometry-guided Point Generation for 3D Object Detection, named GgPG. Specifically, we first design a 3D heatmap auxiliary supervision subnetwork to enhance the quality of object proposals by capturing the actual size and position of the object within the 3D heatmap representation. Moreover, we introduce a density-aware point generation module that employs Kernel Density Estimation (KDE) to embed the point density into the grid point's feature representation, thereby enabling the completion of more precise object shapes. Our GgPG achieves progressive performance in both Waymo and KITTI benchmarks, notably GgPG outperforms PGRCNN by +1.02\u0000<inline-formula><tex-math>$%$</tex-math></inline-formula>\u0000, +1.18\u0000<inline-formula><tex-math>$%$</tex-math></inline-formula>\u0000, and +0.56\u0000<inline-formula><tex-math>$%$</tex-math></inline-formula>\u0000 on the vehicle, pedestrian, and cyclist under LEVEL\u0000<inline-formula><tex-math>$_$</tex-math></inline-formula>\u0000 2 mAPH classes on Waymo Open Dataset, respectively.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"136-140"},"PeriodicalIF":3.2,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142844509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maximum-Likelihood Active Device Detection and Channel Estimation in One-Bit MIMO System 位MIMO系统中最大似然有源设备检测与信道估计
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-20 DOI: 10.1109/LSP.2024.3503365
Mingye Ge;Yatao Liu;Mingjie Shao;Haixia Zhang
The future machine-type communication in internet-of-things (IoT) systems involves a massive number of devices sporadically communicating with a base station (BS) equipped with multiple antennas. Detecting active devices and estimating their associated channels are crucial but challenging due to the large number of potential devices and the small fraction of active devices. Existing studies assume high-resolution analog-to-digital converters (ADCs) at the BS, while there is a growing interest in implementing low-resolution ADCs, particularly one-bit ADCs, in massive multiple-input multiple-output (MIMO) systems. This paper focuses on the joint one-bit active device detection and channel estimation problem. We consider the maximum-likelihood approach and propose a novel expectation maximization (EM) algorithm with acceleration. On the theoretical aspect, we provide the convergent computational complexity analysis for the accelerated EM algorithm. The proposed method, evaluated through numerical simulations, outperforms benchmark algorithms in terms of both estimation accuracy and computational complexity.
物联网(IoT)系统中未来的机器类型通信涉及大量设备偶尔与配备多个天线的基站(BS)通信。检测有源设备并估计其相关通道至关重要,但由于潜在设备数量众多而有源设备的比例很小,因此具有挑战性。现有研究假设BS采用高分辨率模数转换器(adc),而在大规模多输入多输出(MIMO)系统中实现低分辨率adc,特别是1位adc的兴趣越来越大。本文主要研究联合位有源器件检测和信道估计问题。考虑最大似然方法,提出了一种新的加速期望最大化算法。在理论方面,给出了加速EM算法的收敛计算复杂度分析。通过数值模拟评估,该方法在估计精度和计算复杂度方面都优于基准算法。
{"title":"Maximum-Likelihood Active Device Detection and Channel Estimation in One-Bit MIMO System","authors":"Mingye Ge;Yatao Liu;Mingjie Shao;Haixia Zhang","doi":"10.1109/LSP.2024.3503365","DOIUrl":"https://doi.org/10.1109/LSP.2024.3503365","url":null,"abstract":"The future machine-type communication in internet-of-things (IoT) systems involves a massive number of devices sporadically communicating with a base station (BS) equipped with multiple antennas. Detecting active devices and estimating their associated channels are crucial but challenging due to the large number of potential devices and the small fraction of active devices. Existing studies assume high-resolution analog-to-digital converters (ADCs) at the BS, while there is a growing interest in implementing low-resolution ADCs, particularly one-bit ADCs, in massive multiple-input multiple-output (MIMO) systems. This paper focuses on the joint one-bit active device detection and channel estimation problem. We consider the maximum-likelihood approach and propose a novel expectation maximization (EM) algorithm with acceleration. On the theoretical aspect, we provide the convergent computational complexity analysis for the accelerated EM algorithm. The proposed method, evaluated through numerical simulations, outperforms benchmark algorithms in terms of both estimation accuracy and computational complexity.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"141-145"},"PeriodicalIF":3.2,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142844295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inpainting-Driven Graph Learning via Explainable Neural Networks 基于可解释神经网络的绘画驱动图形学习
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-18 DOI: 10.1109/LSP.2024.3501273
Subbareddy Batreddy;Pushkal Mishra;Yaswanth Kakarla;Aditya Siripuram
Given partial measurements of a time-varying graph signal, we propose an algorithm to simultaneously estimate both the underlying graph topology and the missing measurements. The proposed algorithm operates by training an interpretable neural network, designed from the unrolling framework. The proposed technique can be used as a graph learning and/or a graph signal reconstruction algorithm. This work builds on prior work in graph learning by tailoring the learned graph to the signal reconstruction task; and also enhances prior work in graph signal reconstruction by allowing the underlying graph to be unknown.
给定时变图信号的部分测量值,我们提出了一种同时估计底层图拓扑和缺失测量值的算法。该算法通过训练一个可解释的神经网络来运行,该神经网络是根据展开框架设计的。所提出的技术可以用作图学习和/或图信号重建算法。这项工作建立在先前的图学习工作的基础上,通过将学习到的图调整到信号重建任务;并且通过允许底层图是未知的,增强了先前在图信号重构中的工作。
{"title":"Inpainting-Driven Graph Learning via Explainable Neural Networks","authors":"Subbareddy Batreddy;Pushkal Mishra;Yaswanth Kakarla;Aditya Siripuram","doi":"10.1109/LSP.2024.3501273","DOIUrl":"https://doi.org/10.1109/LSP.2024.3501273","url":null,"abstract":"Given partial measurements of a time-varying graph signal, we propose an algorithm to simultaneously estimate both the underlying graph topology and the missing measurements. The proposed algorithm operates by training an interpretable neural network, designed from the unrolling framework. The proposed technique can be used as a graph learning and/or a graph signal reconstruction algorithm. This work builds on prior work in graph learning by tailoring the learned graph to the signal reconstruction task; and also enhances prior work in graph signal reconstruction by allowing the underlying graph to be unknown.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"111-115"},"PeriodicalIF":3.2,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SCA and IBCD Hybrid Algorithm Based Secure Beamforming Optimization for IRS-Assisted Multiuser CR-SWIPT System 基于SCA和IBCD混合算法的irs辅助多用户CR-SWIPT系统安全波束形成优化
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-18 DOI: 10.1109/LSP.2024.3501286
Xiaorong Xu;Shuo Yang;Zhaoting Liu;Jun Wu;Jianrong Bao
This letter investigates the design and optimization of secure beamforming in an intelligent reflecting surface (IRS)-assisted multiuser cognitive radio simultaneous wireless information and power transfer (CR-SWIPT) system. The proposed method leverages IRS to address cognitive energy harvesting nodes as potential eavesdroppers (Eve). The objective is to maximize the achievable secrecy rate while satisfying multiple constraints, such as transmit power control, energy harvesting, phase shifts, and maximum tolerable interference power. To solve this highly non-convex optimization problem, we propose a hybrid algorithm that combines successive convex approximation (SCA) and inexact block coordinate descent (IBCD). By decomposing the problem into three sub-problems, local optimal beamforming matrices and phase-shift matrices are obtained using the SCA method and complex circle manifold (CCM) method, respectively. Simulation results show that the secrecy rate at the SWIPT information decoding node in the IRS-assisted multiuser CR-SWIPT system improves significantly, with an approximate 40% enhancement compared to the random phase shift scheme with maximum transmit power. The effectiveness of the proposed algorithm is further validated through secrecy rate performance evaluations under different system configurations.
本文研究了智能反射面(IRS)辅助多用户认知无线电同步无线信息和电力传输(CR-SWIPT)系统中安全波束形成的设计和优化。该方法利用IRS将认知能量收集节点定位为潜在窃听者(Eve)。目标是在满足发射功率控制、能量收集、相移和最大可容忍干扰功率等多个约束的同时,最大限度地提高可实现的保密率。为了解决这一高度非凸优化问题,我们提出了一种结合连续凸逼近(SCA)和不精确块坐标下降(IBCD)的混合算法。通过将问题分解为三个子问题,分别采用SCA方法和复圆流形(CCM)方法得到了局部最优波束形成矩阵和相移矩阵。仿真结果表明,在irs辅助下的多用户CR-SWIPT系统中,SWIPT信息解码节点的保密率显著提高,与最大发射功率的随机相移方案相比,提高了约40%。通过不同系统配置下的保密率性能评估,进一步验证了该算法的有效性。
{"title":"SCA and IBCD Hybrid Algorithm Based Secure Beamforming Optimization for IRS-Assisted Multiuser CR-SWIPT System","authors":"Xiaorong Xu;Shuo Yang;Zhaoting Liu;Jun Wu;Jianrong Bao","doi":"10.1109/LSP.2024.3501286","DOIUrl":"https://doi.org/10.1109/LSP.2024.3501286","url":null,"abstract":"This letter investigates the design and optimization of secure beamforming in an intelligent reflecting surface (IRS)-assisted multiuser cognitive radio simultaneous wireless information and power transfer (CR-SWIPT) system. The proposed method leverages IRS to address cognitive energy harvesting nodes as potential eavesdroppers (Eve). The objective is to maximize the achievable secrecy rate while satisfying multiple constraints, such as transmit power control, energy harvesting, phase shifts, and maximum tolerable interference power. To solve this highly non-convex optimization problem, we propose a hybrid algorithm that combines successive convex approximation (SCA) and inexact block coordinate descent (IBCD). By decomposing the problem into three sub-problems, local optimal beamforming matrices and phase-shift matrices are obtained using the SCA method and complex circle manifold (CCM) method, respectively. Simulation results show that the secrecy rate at the SWIPT information decoding node in the IRS-assisted multiuser CR-SWIPT system improves significantly, with an approximate 40% enhancement compared to the random phase shift scheme with maximum transmit power. The effectiveness of the proposed algorithm is further validated through secrecy rate performance evaluations under different system configurations.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"336-340"},"PeriodicalIF":3.2,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142890273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Instantaneous Frequency Estimation Based on Reassignment Operators and Linear Chirp Points Detection 基于重配算子和线性啁啾点检测的瞬时频率估计
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-18 DOI: 10.1109/LSP.2024.3501272
Marcelo A. Colominas;Sylvain Meignen
This paper aims at building a new instantaneous frequency (IF) estimator of the modes making up non-stationary multi-component signals, using reassignment operators used in Fourier-based synchrosqueezing transforms (FSSTs) and linear chirp points detection. Reassignment operators provide with different IF estimates, depending on the assumption made on the local polynomial order of the phase of the studied mode. In practice, it is difficult to estimate locally which order fits the best, and to choose too high an order, typically larger than two, when not necessary results in both an inaccurate estimation and an increased sensitivity to noise. To circumvent this, we propose to localize linear chirp points in modes to find out where second order phase approximation is sufficient for the estimation, and then build a new IF estimate based on a weighted spline approximation based on these points. Numerical results show the improvement brought by the proposed approach in noisy situations over classical IF estimators used in FSSTs.
本文利用基于傅立叶的同步压缩变换(FSSTs)和线性啁啾点检测中使用的重分配算子,对组成非平稳多分量信号的模态建立一种新的瞬时频率估计器。根据对所研究模式的相位的局部多项式阶数的假设,重赋算子提供了不同的中频估计。在实践中,很难在局部估计哪个阶最适合,并且在不必要的情况下选择过高的阶,通常大于2,会导致不准确的估计和对噪声的敏感性增加。为了避免这种情况,我们建议在模式中局部化线性啁啾点,以找出二阶相位近似足以进行估计的地方,然后基于这些点的加权样条近似构建新的中频估计。数值结果表明,与FSSTs中使用的经典中频估计方法相比,该方法在有噪声情况下有明显改善。
{"title":"Instantaneous Frequency Estimation Based on Reassignment Operators and Linear Chirp Points Detection","authors":"Marcelo A. Colominas;Sylvain Meignen","doi":"10.1109/LSP.2024.3501272","DOIUrl":"https://doi.org/10.1109/LSP.2024.3501272","url":null,"abstract":"This paper aims at building a new instantaneous frequency (IF) estimator of the modes making up non-stationary multi-component signals, using reassignment operators used in Fourier-based synchrosqueezing transforms (FSSTs) and linear chirp points detection. Reassignment operators provide with different IF estimates, depending on the assumption made on the local polynomial order of the phase of the studied mode. In practice, it is difficult to estimate locally which order fits the best, and to choose too high an order, typically larger than two, when not necessary results in both an inaccurate estimation and an increased sensitivity to noise. To circumvent this, we propose to localize linear chirp points in modes to find out where second order phase approximation is sufficient for the estimation, and then build a new IF estimate based on a weighted spline approximation based on these points. Numerical results show the improvement brought by the proposed approach in noisy situations over classical IF estimators used in FSSTs.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"106-110"},"PeriodicalIF":3.2,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Independent Components Time-Frequency Purification With Channel Consensus Against Adversarial Attack in SSVEP-Based BCIs 基于 SSVEP 的 BCI 中独立成分时频净化与信道共识对抗对抗性攻击
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-18 DOI: 10.1109/LSP.2024.3501274
Hangjie Yi;Jingsheng Qian;Yuhang Ming;Wanzeng Kong
The Steady State Visual Evoked Potential (SSVEP) paradigm has been widely employed in various Brain-Computer Interface (BCI) systems. However, recent studies indicate that SSVEP is vulnerable to adversarial attacks, resulting in manipulated results and drastic degradation in recognition performance, which pose inconveniences and even risks to users. Noticing the fact that the adversarial attack on SSVEP is done by adding subtle waveform perturbations into random EEG channels, we propose Independent Components Time-Frequency Purification with Channel Consensus (ICTFP-CC) as a defensive strategy. In particular, we first detect and remove suspicious perturbations with independent component analysis from the time and frequency domain, and then reconstruct the purified EEG signals. Additionally, we introduce a voting mechanism to achieve channel consensus and enhance overall robustness. We conducted experiments on two public datasets and three SSVEP recognition algorithms. The results demonstrate that our method can significantly improve the classification accuracy and information transfer rate of attacked SSVEP signals by a maximum of 46.79 (%) and 62.87 (bits/min).
稳态视觉诱发电位(SSVEP)模式已广泛应用于各种脑机接口(BCI)系统。然而,最近的研究表明,SSVEP容易受到对抗性攻击,导致结果被操纵,识别性能急剧下降,给用户带来不便甚至风险。注意到对SSVEP的对抗性攻击是通过在随机EEG通道中添加微妙的波形扰动来完成的,我们提出了信道共识的独立分量时频净化(ICTFP-CC)作为防御策略。特别地,我们首先从时域和频域用独立分量分析检测和去除可疑的扰动,然后重建纯化后的脑电信号。此外,我们引入了一种投票机制来实现通道共识并增强整体鲁棒性。我们在两个公共数据集和三种SSVEP识别算法上进行了实验。结果表明,该方法能显著提高被攻击SSVEP信号的分类准确率和信息传输速率,分别提高46.79(%)和62.87 (bits/min)。
{"title":"Independent Components Time-Frequency Purification With Channel Consensus Against Adversarial Attack in SSVEP-Based BCIs","authors":"Hangjie Yi;Jingsheng Qian;Yuhang Ming;Wanzeng Kong","doi":"10.1109/LSP.2024.3501274","DOIUrl":"https://doi.org/10.1109/LSP.2024.3501274","url":null,"abstract":"The Steady State Visual Evoked Potential (SSVEP) paradigm has been widely employed in various Brain-Computer Interface (BCI) systems. However, recent studies indicate that SSVEP is vulnerable to adversarial attacks, resulting in manipulated results and drastic degradation in recognition performance, which pose inconveniences and even risks to users. Noticing the fact that the adversarial attack on SSVEP is done by adding subtle waveform perturbations into random EEG channels, we propose Independent Components Time-Frequency Purification with Channel Consensus (ICTFP-CC) as a defensive strategy. In particular, we first detect and remove suspicious perturbations with independent component analysis from the time and frequency domain, and then reconstruct the purified EEG signals. Additionally, we introduce a voting mechanism to achieve channel consensus and enhance overall robustness. We conducted experiments on two public datasets and three SSVEP recognition algorithms. The results demonstrate that our method can significantly improve the classification accuracy and information transfer rate of attacked SSVEP signals by a maximum of 46.79 (%) and 62.87 (bits/min).","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"116-120"},"PeriodicalIF":3.2,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Free Meal: Boosting Semi-Supervised Polyp Segmentation by Harvesting Negative Samples 免费用餐:通过采集阴性样本提升半监督息肉分割能力
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-18 DOI: 10.1109/LSP.2024.3501957
Xinyu Xiong;Wenxue Li;Jie Ma;Duojun Huang;Siying Li
Existing semi-supervised polyp segmentation methods assume that unlabeled images are positive, containing lesions to be annotated, while neglecting negative samples that are widely available in practice. This letter reveals that harvesting lesion-free negative samples can effectively boost polyp segmentation performance. Directly extending the labeled set with negative samples is sub-optimal since it introduces potential class imbalance. To overcome this challenge, we first introduce a data augmentation strategy named TypeMix. By fusing unlabeled samples with negative samples, the network can better benefit from diverse features provided by negatives while alleviating the potential side effects. Furthermore, it is observed that the number of negative samples significantly exceeds that of lesion samples. To reduce redundancy and improve training efficiency, we propose a dynamic informativeness-aware sampling strategy, prioritizing the active selection of high-valuable negative samples. Extensive experiments on public datasets demonstrate that our simple but effective strategies are enough to consistently outperform other state-of-the-art methods, offering new possibilities for future work from a data collection perspective.
现有的半监督息肉分割方法假设未标记的图像是阳性的,包含需要注释的病变,而忽略了在实践中广泛存在的阴性样本。这封信揭示了收获无病变阴性样本可以有效地提高息肉分割性能。直接用负样本扩展标记集是次优的,因为它引入了潜在的类不平衡。为了克服这个挑战,我们首先引入一个名为TypeMix的数据增强策略。通过将未标记的样本与负样本融合,网络可以更好地利用负提供的多种特征,同时减轻潜在的副作用。此外,观察到阴性样本的数量明显超过病变样本的数量。为了减少冗余,提高训练效率,我们提出了一种动态信息感知采样策略,优先选择高价值的负样本。在公共数据集上进行的大量实验表明,我们简单而有效的策略足以始终优于其他最先进的方法,从数据收集的角度为未来的工作提供了新的可能性。
{"title":"Free Meal: Boosting Semi-Supervised Polyp Segmentation by Harvesting Negative Samples","authors":"Xinyu Xiong;Wenxue Li;Jie Ma;Duojun Huang;Siying Li","doi":"10.1109/LSP.2024.3501957","DOIUrl":"https://doi.org/10.1109/LSP.2024.3501957","url":null,"abstract":"Existing semi-supervised polyp segmentation methods assume that unlabeled images are positive, containing lesions to be annotated, while neglecting negative samples that are widely available in practice. This letter reveals that harvesting lesion-free negative samples can effectively boost polyp segmentation performance. Directly extending the labeled set with negative samples is sub-optimal since it introduces potential class imbalance. To overcome this challenge, we first introduce a data augmentation strategy named TypeMix. By fusing unlabeled samples with negative samples, the network can better benefit from diverse features provided by negatives while alleviating the potential side effects. Furthermore, it is observed that the number of negative samples significantly exceeds that of lesion samples. To reduce redundancy and improve training efficiency, we propose a dynamic informativeness-aware sampling strategy, prioritizing the active selection of high-valuable negative samples. Extensive experiments on public datasets demonstrate that our simple but effective strategies are enough to consistently outperform other state-of-the-art methods, offering new possibilities for future work from a data collection perspective.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"131-135"},"PeriodicalIF":3.2,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142844571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Audio Watermarking Against Manipulation Attacks Based on Deep Learning 基于深度学习的鲁棒音频水印技术对抗篡改攻击
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-18 DOI: 10.1109/LSP.2024.3501285
Shuangbing Wen;Qishan Zhang;Tao Hu;Jun Li
Artificial intelligence technology has been developing rapidly, and speech synthesis models have become increasingly mature, capable of generating highly realistic synthetic audio used to disseminate misinformation, which poses a serious security risk problem. Digital watermarking technology can effectively protect digital content. Deep learning is currently achieving significant research success in digital watermarking. However, the current robustness against audio manipulation remains understudied. Based on this, we propose a robust audio watermarking method based on deep learning against manipulation attacks. Specifically, the embedding of watermarking information is performed in the encoder and the extraction of watermarking information is performed in the decoder; In addition, various audio attacks are simulated during iterative training, a sampling noise layer is used to increase robustness, and a discriminator is used to distinguish between encoded audio and original audio to improve the invisibility of the watermark. We comprehensively evaluate the performance of our model against various manipulation attacks. Experimental results demonstrate that the framework effectively embeds and extracts watermarked signals, exhibiting strong robustness.
人工智能技术发展迅速,语音合成模型日趋成熟,能够生成高度逼真的合成音频,用于传播虚假信息,带来了严重的安全隐患问题。数字水印技术可以有效保护数字内容。目前,深度学习在数字水印领域的研究取得了重大成果。然而,目前针对音频篡改的鲁棒性研究仍然不足。基于此,我们提出了一种基于深度学习的鲁棒音频水印方法,以对抗操纵攻击。具体来说,在编码器中嵌入水印信息,在解码器中提取水印信息;此外,在迭代训练过程中模拟各种音频攻击,使用采样噪声层提高鲁棒性,使用鉴别器区分编码音频和原始音频以提高水印的隐蔽性。我们全面评估了我们的模型在应对各种操纵攻击时的性能。实验结果表明,该框架能有效嵌入和提取水印信号,并表现出很强的鲁棒性。
{"title":"Robust Audio Watermarking Against Manipulation Attacks Based on Deep Learning","authors":"Shuangbing Wen;Qishan Zhang;Tao Hu;Jun Li","doi":"10.1109/LSP.2024.3501285","DOIUrl":"https://doi.org/10.1109/LSP.2024.3501285","url":null,"abstract":"Artificial intelligence technology has been developing rapidly, and speech synthesis models have become increasingly mature, capable of generating highly realistic synthetic audio used to disseminate misinformation, which poses a serious security risk problem. Digital watermarking technology can effectively protect digital content. Deep learning is currently achieving significant research success in digital watermarking. However, the current robustness against audio manipulation remains understudied. Based on this, we propose a robust audio watermarking method based on deep learning against manipulation attacks. Specifically, the embedding of watermarking information is performed in the encoder and the extraction of watermarking information is performed in the decoder; In addition, various audio attacks are simulated during iterative training, a sampling noise layer is used to increase robustness, and a discriminator is used to distinguish between encoded audio and original audio to improve the invisibility of the watermark. We comprehensively evaluate the performance of our model against various manipulation attacks. Experimental results demonstrate that the framework effectively embeds and extracts watermarked signals, exhibiting strong robustness.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"126-130"},"PeriodicalIF":3.2,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142844570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boundary Refinement Network for Polyp Segmentation With Deformable Attention 利用可变形注意力进行息肉分割的边界细化网络
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-18 DOI: 10.1109/LSP.2024.3501283
Zijian Li;Zhiyong Yang;Wangsheng Wu;Zihang Guo;Dongdong Zhu
Early and accurate polyp segmentation is crucial for the diagnosis and treatment of colorectal cancer. However, polyp segmentation faces many challenges: different polyp sizes, complex shapes, and ambiguous intestinal wall boundaries. To solve these problems, we propose a novel polyp segmentation network named DeformSegNet. Specifically, we first introduce a polyp perception module (PPM), which combines the dynamic multi-kernel spatial selection network (DMS-Net) and a transformer encoder to effectively locate polyps of different sizes. Next, we design a deformation-aware separable module (DSM), which consists of deformable attention that adaptively adjusts the sampling position, enabling the network to adapt to complex and diverse polyp boundaries. Finally, a cross-attention aggregation module (CAAM) effectively retains low-level features, further enhancing the boundary features and suppressing false positives. DeformSegNet achieves competitive segmentation accuracy on five polyp datasets, demonstrating excellent learning and generalization capabilities.
早期准确的息肉分割对于结直肠癌的诊断和治疗至关重要。然而,息肉分割面临许多挑战:息肉大小不一、形状复杂、肠壁边界模糊。为了解决这些问题,我们提出了一种名为 DeformSegNet 的新型息肉分割网络。 具体来说,我们首先引入了息肉感知模块(PPM),该模块结合了动态多核空间选择网络(DMS-Net)和变压器编码器,可有效定位不同大小的息肉。接着,我们设计了变形感知可分离模块(DSM),该模块由可变形注意力组成,可自适应地调整采样位置,使网络能够适应复杂多样的息肉边界。最后,交叉注意力聚合模块(CAAM)能有效保留低级特征,进一步增强边界特征并抑制误报。DeformSegNet 在五个息肉数据集上实现了极具竞争力的分割准确率,展示了卓越的学习和泛化能力。
{"title":"Boundary Refinement Network for Polyp Segmentation With Deformable Attention","authors":"Zijian Li;Zhiyong Yang;Wangsheng Wu;Zihang Guo;Dongdong Zhu","doi":"10.1109/LSP.2024.3501283","DOIUrl":"https://doi.org/10.1109/LSP.2024.3501283","url":null,"abstract":"Early and accurate polyp segmentation is crucial for the diagnosis and treatment of colorectal cancer. However, polyp segmentation faces many challenges: different polyp sizes, complex shapes, and ambiguous intestinal wall boundaries. To solve these problems, we propose a novel polyp segmentation network named DeformSegNet. Specifically, we first introduce a polyp perception module (PPM), which combines the dynamic multi-kernel spatial selection network (DMS-Net) and a transformer encoder to effectively locate polyps of different sizes. Next, we design a deformation-aware separable module (DSM), which consists of deformable attention that adaptively adjusts the sampling position, enabling the network to adapt to complex and diverse polyp boundaries. Finally, a cross-attention aggregation module (CAAM) effectively retains low-level features, further enhancing the boundary features and suppressing false positives. DeformSegNet achieves competitive segmentation accuracy on five polyp datasets, demonstrating excellent learning and generalization capabilities.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"121-125"},"PeriodicalIF":3.2,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sketched Over-Parametrized Projected Gradient Descent for Sparse Spike Estimation 稀疏尖峰估计的草图化过参数化投影梯度下降
IF 3.2 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-15 DOI: 10.1109/LSP.2024.3488496
Pierre-Jean Bénard;Yann Traonmilin;Jean-François Aujol
We consider the problem of recovering off-the-grid spikes from linear measurements in the context of Single Molecule Localization Microscopy (SMLM). State of the art model-based methods such as Over-Parametrized Continuous Orthogonal Matching Pursuit (OP-COMP) with Projected Gradient Descent (PGD) have been shown to successfully recover those signals. The computational cost of these methods scales linearly with the number of measurements. When this number of measurements is large with respect to the dimensionality of the signal, we propose to reduce it with a so-called sketching operator. Based on recent results on compressive sensing in the space of measures, we approximate the ideal sketching operator (benefiting from theoretical recovery guarantees), in the context of SMLM. This sketching method coupled to OP-COMP with PGD shows significant improvements in calculation time in realistic synthetic microscopy experiments.
我们考虑了在单分子定位显微镜(SMLM)的背景下从线性测量中恢复离网尖峰的问题。最先进的基于模型的方法,如带投影梯度下降(PGD)的过参数化连续正交匹配追踪(OP-COMP)已经被证明可以成功地恢复这些信号。这些方法的计算成本与测量次数成线性关系。当这个测量数相对于信号的维数很大时,我们建议用所谓的草图算子来减少它。基于测度空间中压缩感知的最新结果,我们在SMLM的背景下近似了理想的草图算子(受益于理论恢复保证)。这种写生方法与OP-COMP和PGD相结合,在现实合成显微镜实验中显示出计算时间的显著改善。
{"title":"Sketched Over-Parametrized Projected Gradient Descent for Sparse Spike Estimation","authors":"Pierre-Jean Bénard;Yann Traonmilin;Jean-François Aujol","doi":"10.1109/LSP.2024.3488496","DOIUrl":"https://doi.org/10.1109/LSP.2024.3488496","url":null,"abstract":"We consider the problem of recovering off-the-grid spikes from linear measurements in the context of Single Molecule Localization Microscopy (SMLM). State of the art model-based methods such as Over-Parametrized Continuous Orthogonal Matching Pursuit (OP-COMP) with Projected Gradient Descent (PGD) have been shown to successfully recover those signals. The computational cost of these methods scales linearly with the number of measurements. When this number of measurements is large with respect to the dimensionality of the signal, we propose to reduce it with a so-called sketching operator. Based on recent results on compressive sensing in the space of measures, we approximate the ideal sketching operator (benefiting from theoretical recovery guarantees), in the context of SMLM. This sketching method coupled to OP-COMP with PGD shows significant improvements in calculation time in realistic synthetic microscopy experiments.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"16-20"},"PeriodicalIF":3.2,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Signal Processing Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1