Jakob Möderl;Erik Leitinger;Franz Pernkopf;Klaus Witrisal
{"title":"Variational Inference of Structured Line Spectra Exploiting Group-Sparsity","authors":"Jakob Möderl;Erik Leitinger;Franz Pernkopf;Klaus Witrisal","doi":"10.1109/TSP.2024.3493603","DOIUrl":null,"url":null,"abstract":"This paper introduces a method of decomposing a signal into several groups of related spectral lines. The frequencies of the spectral lines in each group are related to a parameter common to all spectral lines within the same group, such as the fundamental frequency of a harmonic series of spectral lines. The parameters of each group are estimated on a continuum by the proposed variational expectation-maximization (EM) algorithm. Additionally, the number of groups and the number of spectral lines within each group are inferred through a group-sparse solution, obtained by latent variables in a hierarchical Bernoulli-Gamma-Gaussian prior model inspired by sparse Bayesian learning (SBL). The performance of the proposed algorithm is demonstrated on three tasks: multi-pitch estimation, extended object detection using radar signals, and variational mode decomposition (VMD). On the Bach 10 dataset, which contains recordings of ten musical pieces, the proposed algorithm outperforms state-of-the-art model-based and machine-learning-based multi-pitch estimation algorithms in terms of fundamental frequency, i.e. pitch, detection accuracy. In addition, the extended object detection task demonstrates how incorporating knowledge of the structural relationships between spectral lines into the estimation procedure can lead to performance gains compared to assuming independent spectral lines, especially under low signal-to-noise ratio (SNR) conditions. Finally, the VMD task is included to further demonstrate the versatility of the proposed algorithm.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"74 ","pages":"499-513"},"PeriodicalIF":5.8000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10746854","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10746854/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces a method of decomposing a signal into several groups of related spectral lines. The frequencies of the spectral lines in each group are related to a parameter common to all spectral lines within the same group, such as the fundamental frequency of a harmonic series of spectral lines. The parameters of each group are estimated on a continuum by the proposed variational expectation-maximization (EM) algorithm. Additionally, the number of groups and the number of spectral lines within each group are inferred through a group-sparse solution, obtained by latent variables in a hierarchical Bernoulli-Gamma-Gaussian prior model inspired by sparse Bayesian learning (SBL). The performance of the proposed algorithm is demonstrated on three tasks: multi-pitch estimation, extended object detection using radar signals, and variational mode decomposition (VMD). On the Bach 10 dataset, which contains recordings of ten musical pieces, the proposed algorithm outperforms state-of-the-art model-based and machine-learning-based multi-pitch estimation algorithms in terms of fundamental frequency, i.e. pitch, detection accuracy. In addition, the extended object detection task demonstrates how incorporating knowledge of the structural relationships between spectral lines into the estimation procedure can lead to performance gains compared to assuming independent spectral lines, especially under low signal-to-noise ratio (SNR) conditions. Finally, the VMD task is included to further demonstrate the versatility of the proposed algorithm.
期刊介绍:
The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.