Uncertainty-Based Streaming ASR With Evidential Deep Learning
Pub Date: 2026-01-23, DOI: 10.1109/OJSP.2026.3657308, IEEE Open Journal of Signal Processing, vol. 7, pp. 373-381
Hiroaki Sato;Asahi Sakuma;Ryuga Sugano;Tadashi Kumano;Yoshihiko Kawai;Shinji Watanabe;Tetsuji Ogawa
Attention-based encoder-decoder (AED) models achieve high accuracy in offline automatic speech recognition (ASR), but their application to streaming remains challenging due to the lack of mechanisms for regulating token emission. Existing approaches include monotonic attention, forced alignment with external models providing token-level boundaries, and encoder-based emission control methods. However, these methods either require structural modifications, complicate the training pipeline, or show limited accuracy. In addition, local agreement has been proposed as a method enabling streaming without retraining, but it incurs fixed delays corresponding to the input window size and premature commitments. To address these limitations, we propose Evidential Streaming TRAnsformer (ESTRA), a framework that leverages evidential deep learning (EDL) to estimate uncertainty. ESTRA models token probabilities with a Dirichlet distribution and introduces hierarchical and direct Kullback–Leibler divergence losses to ensure uncertainty decreases progressively as more speech is observed. During inference, token emission is controlled by comparing uncertainty against a threshold, suppressing premature outputs without fixed delays. Experiments on the LibriSpeech benchmark show that ESTRA achieves streaming performance comparable to offline AED models, surpasses local agreement in robustness under small input windows, and reduces 50th-percentile latency by avoiding fixed window-size delays, while leaving room for improvement at the 90th percentile. Furthermore, it provides more reliable control of token emission than probability- or entropy-based baselines, demonstrating the effectiveness of uncertainty as an indicator. ESTRA offers a promising approach to streaming ASR, with results supporting the effectiveness of uncertainty-driven token emission.
Spherical Acoustic Spatial Entropy: Predicting Acoustic Scene Complexity in Virtual Environments
Pub Date: 2026-01-23, DOI: 10.1109/OJSP.2026.3657297, IEEE Open Journal of Signal Processing, vol. 7, pp. 144-153
Luca Resti;Amelia Gully;Michael McLoughlin;Gavin Kearney;Alena Denisova
The objective quantification of Acoustic Scene Complexity (ASC) remains a significant challenge. While existing entropy-based metrics capture spectro-temporal variability, a metric accounting for the spatial distribution of sources has been lacking. We introduce Spherical Acoustic Spatial Entropy (SASE), a novel information-theoretic metric designed for Virtual Acoustic Environments (VAEs). SASE leverages ground-truth spatial data, utilizing an equal-area spherical partition around the listener and weighting source contributions by their perceptual loudness (ITU-R BS.1770-4). We validated SASE through a psychoacoustic experiment (N=21) using a $2\times 2\times 2$ factorial design that manipulated masker count, spatial distribution, and motion. SASE was evaluated alongside energy and spectral entropy metrics against subjective ratings of complexity, effort, and spatial spread. Results show that SASE mean was the most robust predictor of perceived complexity in condition-level ratings ($R^{2}=0.714$, $p=0.008$), outperforming spectral and energy entropy. A random-effects pooling of participant regression coefficients confirmed this relationship at the population level ($R^{2}_{\text{pseudo}}=0.740$, $p < .001$). Furthermore, a model combining SASE mean with spectral entropy standard deviation explained 84.4% of the variance in perceived complexity, indicating spatial and spectro-temporal metrics capture complementary scene dynamics. SASE provides an objective measure of spatial complexity, enhancing existing frameworks for predicting ASC in virtual environments.
Sci-Phi: A Large Language Model Spatial Audio Descriptor
Pub Date: 2026-01-23, DOI: 10.1109/OJSP.2026.3657300, IEEE Open Journal of Signal Processing, vol. 7, pp. 276-284
Xilin Jiang;Hannes Gamper;Sebastian Braun
Acoustic scene perception involves describing the type of sounds, their timing, their direction and distance, as well as their loudness and reverberation. While audio language models excel in sound recognition, single-channel input fundamentally limits spatial understanding. This work presents Sci-Phi, a spatial audio large language model with dual spatial and spectral encoders that estimates a complete parameter set for all sound sources and the surrounding environment. Learning from over 4,000 hours of synthetic first-order Ambisonics recordings with accompanying metadata, Sci-Phi enumerates and describes up to four directional sound sources in one pass, alongside non-directional background sounds and room characteristics. We evaluate the model with a permutation-invariant protocol and 15 metrics covering content, location, timing, loudness, and reverberation, and analyze its robustness across source counts, signal-to-noise ratios, reverberation levels, and challenging mixtures of acoustically, spatially, or temporally similar sources. Notably, Sci-Phi generalizes to real room impulse responses with only minor performance degradation. Overall, this work establishes the first audio LLM capable of full spatial-scene description, with strong potential for real-world deployment. Demo: https://sci-phi-audio.github.io/demo
{"title":"Sci-Phi: A Large Language Model Spatial Audio Descriptor","authors":"Xilin Jiang;Hannes Gamper;Sebastian Braun","doi":"10.1109/OJSP.2026.3657300","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3657300","url":null,"abstract":"Acoustic scene perception involves describing the type of sounds, their timing, their direction and distance, as well as their loudness and reverberation. While audio language models excel in sound recognition, single-channel input fundamentally limits spatial understanding. This work presents <italic>Sci-Phi</i>, a spatial audio large language model with dual spatial and spectral encoders that estimates a complete parameter set for all sound sources and the surrounding environment. Learning from over 4,000 hours of synthetic first-order Ambisonics recordings including metadata, <italic>Sci-Phi</i> enumerates and describes up to four directional sound sources in one pass, alongside non-directional background sounds and room characteristics. We evaluate the model with a permutation-invariant protocol and 15 metrics covering content, location, timing, loudness, and reverberation, and analyze its robustness across source counts, signal-to-noise ratios, reverberation levels, and challenging mixtures of acoustically, spatially, or temporally similar sources. Notably, <italic>Sci-Phi</i> generalizes to real room impulse responses with only minor performance degradation. Overall, this work establishes the first audio LLM capable of full spatial-scene description, with strong potential for real-world deployment. Demo: <uri>https://sci-phi-audio.github.io/demo</uri>","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"276-284"},"PeriodicalIF":2.7,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362973","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Distributionally Robust Ultra-Reliable Resource Allocation via Double Tail Waterfilling Under Fading Risk
Pub Date: 2026-01-23, DOI: 10.1109/OJSP.2026.3657284, IEEE Open Journal of Signal Processing, vol. 7, pp. 154-164
Gokberk Yaylali;Dionysis Kalogerias
Optimal resource allocation in wireless systems remains a fundamental challenge due to the inherent adversities caused by channel fading. Modern wireless applications require efficient allocation schemes that maximize total network utility while ensuring robust and reliable system performance. Although optimal on average, ergodic-optimal policies, commonly realized via stochastic waterfilling schemes, are susceptible to the statistical dispersion of commonly heavy-tailed or highly volatile fading channels, particularly in terms of both instantaneous power policy fluctuations and frequent service outages (due to deep fade events). By violating established power-level and quality-of-service specifications, such policies essentially sabotage fulfillment of provider-specific power/energy targets on the one hand, and user-perceived system reliability on the other. At the other extreme, short-term-optimal policies, commonly relying on deterministic waterfilling, or maximally averse minimax-optimal policies strictly satisfy specifications but are computationally demanding and impractical, while also being suboptimal in any long-term regime. To address these challenges, we introduce a distributionally robust formulation of the constrained stochastic resource allocation problem in the classical point-to-point interference-free multi-terminal network by leveraging Conditional Value-at-Risk (CVaR) as a coherent measure of fading and/or fluctuation risk relevant to both transmission power and achievable rate distributions. We derive a closed-form parameterized expression for the CVaR-optimal resource policy, which is of remarkably simple and interpretable form, along with subgradient-based update schemes for the corresponding CVaR quantile levels of both transmission power and achievable rates. Building on this, we develop a primal-dual double tail waterfilling scheme which iteratively computes globally optimal policies achieving ultra-reliable long-term rate performance, but with near-short-term characteristics. Extensive numerical experiments corroborate the effectiveness of the proposed approach.
{"title":"Distributionally Robust Ultra-Reliable Resource Allocation via Double Tail Waterfilling Under Fading Risk","authors":"Gokberk Yaylali;Dionysis Kalogerias","doi":"10.1109/OJSP.2026.3657284","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3657284","url":null,"abstract":"Optimal resource allocation in wireless systems remains a fundamental challenge due to the inherent adversities caused by channel fading.Modern wireless applications require efficient allocation schemes that maximize total network utility ensuring robust and reliable system performance. Although optimal on average, ergodic-optimal policies, commonly realized via stochastic waterfilling schemes, are susceptible to statistical dispersion of commonly heavy-tailed or highly volatile fading channels, particularly in terms of <italic>both</i> instantaneous power policy fluctuations <italic>and</i> frequent service outages (due to deep fade events), violating established power-level and quality-of-service specifications, essentially sabotaging fulfillment of provider-specific power/energy targets on the one hand, and user-perceived system reliability on the other. At the other extreme, short-term-optimal policies, commonly relying on deterministic waterfilling, or maximally averse minimax-optimal policies, strictly satisfy specifications but are computationally demanding, impractical, while also being suboptimal in any long-term regime. To address these challenges, we introduce a distributionally robust formulation of the constrained stochastic resource allocation problem in the classical point-to-point interference-free multi-terminal network by leveraging Conditional Value-at-Risk (CVaR) as a coherent measure of fading and/or fluctuation risk relevant to both transmission power and achievable rate distributions. We derive a closed-form parameterized expression for the CVaR-optimal resource policy which is of remarkably simple and interpretable form, along with subgradient-based update schemes for the corresponding CVaR quantile levels to both transmission power and achievable rates. Building on this, we develop a primal-dual <italic>double tail waterfilling</i> scheme which iteratively computes <italic>globally optimal policies achieving ultra-reliable long-term rate performance, but with near-short-term characteristics</i>. Extensive numerical experiments corroborate the effectiveness of the proposed approach.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"154-164"},"PeriodicalIF":2.7,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362910","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Leveraging Beam Search Information for Confidence Estimation in E2E ASR
Pub Date: 2026-01-23, DOI: 10.1109/OJSP.2026.3657286, IEEE Open Journal of Signal Processing, vol. 7, pp. 125-133
Yichen Jia;Hugo Van hamme
To estimate confidence for end-to-end Automatic Speech Recognition (ASR) systems, recent research has proposed Confidence Estimation Modules that incorporate features from the backbone ASR model. Most existing approaches, however, are architecture-dependent. In this paper, we propose the Score-Rank Confidence Estimation Module (SR-CEM), a lightweight module that leverages beam search information to generate token- and word-level confidence scores. Specifically, SR-CEM constructs features by combining the scores and ranks of tokens within a hypothesis. Experiments show that SR-CEM achieves effective calibration on both in-domain and out-of-domain English data. On the in-domain test set, it attains a Maximum Calibration Error of 4.50% and an Expected Calibration Error of 0.30% at the token level, significantly outperforming softmax confidence (20.04% and 1.75%, respectively). At the word level, SR-CEM achieves 8.17% and 0.35%, compared to 17.91% and 1.67% for softmax confidence. Furthermore, we demonstrate its robustness across hybrid and transducer ASR architectures with different decoding strategies, as well as on Dutch, noisy, and conversational speech conditions. Our main finding is that SR-CEM is particularly effective in reducing Maximum Calibration Error, which is critical for reliable downstream use of ASR outputs, while maintaining architecture independence and generality across diverse evaluation conditions.
{"title":"Leveraging Beam Search Information for Confidence Estimation in E2E ASR","authors":"Yichen Jia;Hugo Van hamme","doi":"10.1109/OJSP.2026.3657286","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3657286","url":null,"abstract":"To estimate confidence for end-to-end Automatic Speech Recognition (ASR) systems, recent research has proposed Confidence Estimation Modules that incorporate features from the backbone ASR model. Most existing approaches, however, are architecture-dependent. In this paper, we propose the Score-Rank Confidence Estimation Module (SR-CEM), a lightweight module that leverages beam search information to generate token- and word-level confidence scores. Specifically, SR-CEM constructs features by combining the scores and ranks of tokens within a hypothesis. Experiments show that SR-CEM achieves effective calibration on both in-domain and out-of-domain English data. On the in-domain test set, it attains a Maximum Calibration Error of 4.50% and an Expected Calibration Error of 0.30% at the token level, significantly outperforming softmax confidence (20.04% and 1.75%, respectively). At the word level, SR-CEM achieves 8.17% and 0.35%, compared to 17.91% and 1.67% from softmax confidence. Furthermore, we demonstrate its robustness across hybrid and transducer ASR architectures with different decoding strategies, as well as on Dutch, noisy and conversational speech conditions. Our main finding is that SR-CEM is particularly effective in reducing Maximum Calibration Error, which is critical for reliable downstream use of ASR outputs, while maintaining architecture independence and generality across diverse evaluation conditions.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"125-133"},"PeriodicalIF":2.7,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362960","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Fully Complex-Valued Underwater Acoustic Signal Enhancement Model for Passive Sonar Systems
Pub Date: 2026-01-20, DOI: 10.1109/OJSP.2026.3656063, IEEE Open Journal of Signal Processing, vol. 7, pp. 101-115
Zhengzhe Zhang;Jie Zhang;Haoyin Yan;Hengshuang Liu;Junhua Liu
Passive sonar systems offer the advantages of stealth and low energy consumption, but face highly complex signal conditions. Underwater acoustic signal (UWAS) enhancement for passive sonar systems aims to improve the quality of vessel-radiated signals captured by hydrophones, thereby facilitating subsequent tasks such as target recognition. However, conventional methods might struggle with the intricate marine environment and weak target signals. In this work, we propose a fully Complex-valued U-Net based Multidimensional Attention Network (CUMA-Net), with all modules operating in the complex domain to jointly exploit magnitude and phase information. CUMA-Net employs a complex-valued encoder-decoder, which captures multiscale features for spectral mapping. To boost representation power and emphasize line-spectrum components, we incorporate a complex-valued multidimensional attention module. This module includes a complex-valued time-frequency conformer to model dependencies along the temporal and frequency axes. Complementarily, a complex convolutional block attention module extracts features across spatial and channel dimensions. To guide training under low signal-to-noise ratio (SNR) conditions, we propose a normalized mean squared error loss tailored for spectrogram reconstruction. Results on a public dataset verify that CUMA-Net achieves superior UWAS enhancement performance, while the improved signal quality further benefits vessel classification. Furthermore, we explore the impact of input frequency resolution on both enhancement and classification performance.
{"title":"A Fully Complex-Valued Underwater Acoustic Signal Enhancement Model for Passive Sonar Systems","authors":"Zhengzhe Zhang;Jie Zhang;Haoyin Yan;Hengshuang Liu;Junhua Liu","doi":"10.1109/OJSP.2026.3656063","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3656063","url":null,"abstract":"Passive sonar systems offer stealth and low-energy consumption advantages while facing highly complex signal conditions. Underwater acoustic signal (UWAS) enhancement for passive sonar systems aims to improve the quality of vessel-radiated signals captured by hydrophones, thereby facilitating subsequent tasks like target recognition. However, conventional methods might struggle due to the intricate marine environment and weak target signals. In this work, we propose a fully Complex-valued U-Net based Multidimensional Attention Network (CUMA-Net), with all modules operating in the complex domain to jointly exploit magnitude and phase information. CUMA-Net employs a complex-valued encoder-decoder, which captures multiscale features for spectral mapping. To boost representation power and emphasize line spectrum components, we incorporate a complex-valued multidimensional attention module. This module includes a complex-valued time-frequency conformer to model dependencies along temporal and frequency axes. Complementarily, a complex convolutional block attention module extracts features across spatial and channel dimensions. To guide training under low SNR conditions, we propose a normalized mean squared error loss tailored for spectrogram reconstruction. Results on a public dataset verify that CUMA-Net achieves superior UWAS enhancement performance, while the improved signal quality further benefits vessel classification. Furthermore, we explore the impact of input frequency resolution on both enhancement and classification performance.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"101-115"},"PeriodicalIF":2.7,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11359482","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Deep Learning-Based Event Data Coding: A Joint Spatiotemporal and Polarity Solution
Pub Date: 2026-01-20, DOI: 10.1109/OJSP.2026.3656104, IEEE Open Journal of Signal Processing, vol. 7, pp. 222-237
Abdelrahman Seleem;André F. R. Guarda;Nuno M. M. Rodrigues;Fernando Pereira
Neuromorphic vision sensors, commonly referred to as event cameras, generate a massive number of pixel-level events, composed of spatiotemporal and polarity information, thus demanding highly efficient coding solutions. Existing solutions focus on lossless coding of event data, assuming that no distortion is acceptable for the target use cases, which mostly include computer vision tasks such as classification and recognition. One promising coding approach exploits the similarity between event data and point clouds, both being sets of 3D points, which makes it possible to code event data with current point cloud coding solutions, typically adopting a two-point-cloud representation, one for each event polarity. This paper proposes a novel lossy Deep Learning-based Joint Event data Coding (DL-JEC) solution, which for the first time adopts a single-point-cloud representation in which the event polarity plays the role of a point cloud attribute, thus enabling the exploitation of the correlation between the geometry/spatiotemporal and polarity event information. Moreover, this paper also proposes novel adaptive voxel binarization strategies for DL-JEC, optimized for either quality-oriented or computer vision task-oriented purposes, which maximize the performance for the task at hand. DL-JEC achieves significant compression performance gains compared with relevant conventional and DL-based state-of-the-art event data coding solutions, notably the MPEG G-PCC and JPEG Pleno PCC standards. Furthermore, it is shown that lossy event data coding, at significantly reduced rates relative to lossless coding, is possible without compromising the performance of the target computer vision task, notably event classification, thus changing the current event data coding paradigm.
{"title":"Deep Learning-Based Event Data Coding: A Joint Spatiotemporal and Polarity Solution","authors":"Abdelrahman Seleem;André F. R. Guarda;Nuno M. M. Rodrigues;Fernando Pereira","doi":"10.1109/OJSP.2026.3656104","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3656104","url":null,"abstract":"Neuromorphic vision sensors, commonly referred to as event cameras, generate a massive number of pixel-level events, composed by spatiotemporal and polarity information, thus demanding highly efficient coding solutions. Existing solutions focus on lossless coding of event data, assuming that no distortion is acceptable for the target use cases, mostly including computer vision tasks such as classification and recognition. One promising coding approach exploits the similarity between event data and point clouds, both being sets of 3D points, thus allowing to use current point cloud coding solutions to code event data, typically adopting a two-point clouds representation, one for each event polarity. This paper proposes a novel lossy Deep Learning-based Joint Event data Coding (DL-JEC) solution, which adopts for the first time a single-point cloud representation, where the event polarity plays the role of a point cloud attribute, thus enabling to exploit the correlation between the geometry/spatiotemporal and polarity event information. Moreover, this paper also proposes novel adaptive voxel binarization strategies which may be used in DL-JEC, optimized for either quality-oriented or computer vision task-oriented purposes which allow to maximize the performance for the task at hand. DL-JEC can achieve significant compression performance gains when compared with relevant conventional and DL-based state-of-the-art event data coding solutions, notably the MPEG G-PCC and JPEG Pleno PCC standards. Furthermore, it is shown that it is possible to use lossy event data coding, with significantly reduced rate regarding lossless coding, without compromising the target computer vision task performance, notably event classification, thus changing the current event data coding paradigm.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"222-237"},"PeriodicalIF":2.7,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11359485","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Test-Time Adaptation for Speech Enhancement via Domain Invariant Embedding Transformation
Pub Date: 2026-01-20, DOI: 10.1109/OJSP.2026.3656059, IEEE Open Journal of Signal Processing, vol. 7, pp. 134-143
Tobias Raichle;Niels Edinger;Bin Yang
Deep learning-based speech enhancement models achieve remarkable performance when test distributions match training conditions, but often degrade when deployed in unpredictable real-world environments with domain shifts. To address this challenge, we present LaDen, the first test-time adaptation method specifically designed for speech enhancement. Our approach leverages powerful pre-trained speech representations to perform latent denoising, approximating clean speech representations through a linear transformation of noisy embeddings. We show that this transformation generalizes well across domains, enabling effective pseudo-labeling for target domains without labeled target data. The resulting pseudo-labels then drive test-time adaptation of speech enhancement models across diverse acoustic environments. We propose a comprehensive benchmark spanning multiple datasets with various domain shifts, including changes in noise types, speaker characteristics, and languages. Our extensive experiments demonstrate that LaDen consistently outperforms baseline methods across perceptual metrics, particularly for speaker and language domain shifts.
{"title":"Test-Time Adaptation for Speech Enhancement via Domain Invariant Embedding Transformation","authors":"Tobias Raichle;Niels Edinger;Bin Yang","doi":"10.1109/OJSP.2026.3656059","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3656059","url":null,"abstract":"Deep learning-based speech enhancement models achieve remarkable performance when test distributions match training conditions, but often degrade when deployed in unpredictable real-world environments with domain shifts. To address this challenge, we present laden, the first test-time adaptation method specifically designed for speech enhancement. Our approach leverages powerful pre-trained speech representations to perform latent denoising, approximating clean speech representations through a linear transformation of noisy embeddings. We show that this transformation generalizes well across domains, enabling effective pseudo-labeling for target domains without labeled target data. The resulting pseudo-labels enable effective test-time adaptation of speech enhancement models across diverse acoustic environments. We propose a comprehensive benchmark spanning multiple datasets with various domain shifts, including changes in noise types, speaker characteristics, and languages. Our extensive experiments demonstrate that LaDen consistently outperforms baseline methods across perceptual metrics, particularly for speaker and language domain shifts.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"134-143"},"PeriodicalIF":2.7,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11359505","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Design of Acoustic Equalization Filters for Headphones Based on Low-Rank Regularization
Pub Date: 2026-01-20, DOI: 10.1109/OJSP.2026.3656057, IEEE Open Journal of Signal Processing, vol. 7, pp. 173-184
Florian Hilgemann;Peter Jax
The use of equalization filters to achieve acoustic transparency can improve the sound quality of hearables and hearing aids. Finite impulse response (FIR) filters guarantee stability and offer a listening impression close to the open ear, but their implementation may conflict with the resource constraints typical of hearing devices. Infinite impulse response (IIR) filters are commonly used to meet these constraints, but their design often lacks stability and performance guarantees. Therefore, we consider indirect IIR filter design methods that extend FIR filter designs with an IIR approximation step. To mitigate the performance degradation caused by the IIR approximation, we establish a formal connection between the optimization variable and the IIR approximation error, and propose an approximation-aware design algorithm based on the nuclear norm heuristic. The evaluation considers the design of hear-through filters using real-world measurement data. The proposed approach can reduce the time-domain mean squared error by up to $6\,\text{dB}$ compared to conventional methods, and shows high robustness against between-person variance. Thus, the results offer an improvement in hearing device personalization within practical constraints.
{"title":"Design of Acoustic Equalization Filters for Headphones Based on Low-Rank Regularization","authors":"Florian Hilgemann;Peter Jax","doi":"10.1109/OJSP.2026.3656057","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3656057","url":null,"abstract":"The use of equalization filters to achieve acoustic transparency can improve the sound quality of hearables and hearing aids. Finite impulse response (FIR) filters guarantee stability and offer a listening impression close to the open ear, but their implementation may conflict with the resource constraints typical of hearing devices. Infinite impulse response (IIR) filters are commonly used to meet these constraints, but their design often lacks stability and performance guarantees. Therefore, we consider indirect IIR filter design methods that extend FIR filter designs with an IIR approximation step. To mitigate the performance degradation caused by the IIR approximation, we establish a formal connection between optimization variable and IIR approximation error, and propose an approximation-aware design algorithm based on the nuclear norm heuristic. The evaluation considers the design of hear-through filters using real-world measurement data. The proposed approach can reduce the time-domain mean-squared error by up to <inline-formula><tex-math>$text{6},text{dB}$</tex-math></inline-formula> compared to conventional methods, and shows a high robustness against between-person variance. Thus, the results offer an improvement in hearing device personalization within practical constraints.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"173-184"},"PeriodicalIF":2.7,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11359448","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Semi-Blind Channel Estimation for Single-Carrier With Frequency-Domain Equalization Systems in the Presence of Colored Noise: A Cyclic Zero Insertion Approach
Pub Date: 2026-01-19, DOI: 10.1109/OJSP.2026.3655219, IEEE Open Journal of Signal Processing, vol. 7, pp. 213-221
Yi-Sheng Chen;Hung-Shuo Chang
In this paper, we propose a semi-blind channel estimation method for single-carrier with frequency-domain equalization (SC-FDE) systems in the presence of colored noise. The method introduces a cyclic zero insertion approach in which a single zero is periodically inserted into the transmitted sequence. This insertion induces a special structure in the autocorrelation matrix of the received signal, enabling effective channel estimation even in environments with colored noise. By extracting the channel product coefficients from the autocorrelation matrix, we construct a Hermitian matrix whose dominant eigenvector corresponds to the channel impulse response. Simulation results demonstrate the effectiveness of the proposed method.
{"title":"Semi-Blind Channel Estimation for Single-Carrier With Frequency-Domain Equalization Systems in the Presence of Colored Noise : A Cyclic Zero Insertion Approach","authors":"Yi-Sheng Chen;Hung-Shuo Chang","doi":"10.1109/OJSP.2026.3655219","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3655219","url":null,"abstract":"In this paper, we propose a semi-blind channel estimation method for single-carrier with frequency-domain equalization (SC-FDE) systems in the presence of colored noise. The method introduces a cyclic zero insertion approach where a single zero is periodically inserted into the transmitted sequence. This way induces a special structure of the autocorrelation matrix of the received signal, enabling effective channel estimation even in environments with colored noise. By extracting the channel product coefficients from the autocorrelation matrix, we construct a Hermitian matrix whose dominant eigenvector corresponds to the channel impulse response. Simulation results demonstrate the effectiveness of the proposed method.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"213-221"},"PeriodicalIF":2.7,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11358533","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}