首页 > 最新文献

2022 30th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Safe importance sampling based on partial posteriors and neural variational approximations 基于部分后验和神经变分近似的安全重要性抽样
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909576
F. Llorente, E. Curbelo, L. Martino, P. Olmos, D. Delgado
In this work, we present two novel importance sampling (IS) methods, which can be considered safe in the sense that they avoid catastrophic scenarios where the IS estimators could have infinite variance. This is obtained by using a population of proposal densities where each one is wider than the posterior distribution. In fact, we consider partial posterior distributions (i.e., considering a smaller number of data) as proposal densities. Neuronal variational approximations are also discussed. The experimental results show the benefits of the proposed schemes.
在这项工作中,我们提出了两种新的重要性抽样(IS)方法,它们可以被认为是安全的,因为它们避免了IS估计器可能具有无限方差的灾难性场景。这是通过使用建议密度的总体来获得的,其中每个密度都比后验分布更宽。事实上,我们考虑部分后验分布(即考虑较少数量的数据)作为建议密度。神经变分逼近也进行了讨论。实验结果表明了所提方案的有效性。
{"title":"Safe importance sampling based on partial posteriors and neural variational approximations","authors":"F. Llorente, E. Curbelo, L. Martino, P. Olmos, D. Delgado","doi":"10.23919/eusipco55093.2022.9909576","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909576","url":null,"abstract":"In this work, we present two novel importance sampling (IS) methods, which can be considered safe in the sense that they avoid catastrophic scenarios where the IS estimators could have infinite variance. This is obtained by using a population of proposal densities where each one is wider than the posterior distribution. In fact, we consider partial posterior distributions (i.e., considering a smaller number of data) as proposal densities. Neuronal variational approximations are also discussed. The experimental results show the benefits of the proposed schemes.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"PP 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126361305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating and Reproducing Ambience in Ambisonic Recordings 预估与再现双音录音中的氛围
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909850
L. McCormack, A. Politis
Spatial audio coding and reproduction methods are often based on the estimation of primary directional and secondary ambience components. This paper details a study into the estimation and subsequent reproduction of the ambient components found in ambisonic sound scenes. More specifically, two different ambience estimation approaches are investigated. The first estimates the ambient Ambisonic signals through a source-separation and spatial subtraction approach, and there-fore requires an estimate of both the number of sources and their directions. The second instead requires only the number of sources to be known, and employs a multi-channel Wiener filter (MWF) to obtain the estimated ambient signals. One approach for reproducing estimated ambient signals is through a signal processing chain of: a plane-wave decomposition, signal decor-relation, and subsequent spatialisation for the target playback setup. However, this reproduction approach may be sensitive to spatial and signal fidelity degradations incurred during the beamforming and decorrelation operations. Therefore, an optimal mixing alternative is proposed for this reproduction task, which achieves spatially incoherent rendering of ambience directly for the target playback setup; bypassing intermediate plane-wave decomposition and excessive decorrelation. Listening tests indicate improved perceived quality when using the proposed reproduction method in conjunction with both tested ambience estimation approaches.
空间音频编码和再现方法通常基于对主要方向分量和次要环境分量的估计。本文详细研究了在双声场景中发现的环境成分的估计和随后的再现。更具体地说,研究了两种不同的环境估计方法。第一种方法是通过源分离和空间减法来估计周围的双声信号,因此需要估计源的数量和方向。第二种方法只需要知道信号源的数量,并采用多通道维纳滤波器(MWF)来获得估计的环境信号。再现估计环境信号的一种方法是通过信号处理链:平面波分解,信号去相关,以及随后的目标回放设置的空间化。然而,这种再现方法可能对波束形成和去相关操作期间产生的空间和信号保真度下降很敏感。因此,本文提出了一种最佳混合方案,该方案可直接为目标播放设置实现环境的空间非相干渲染;绕过中间平面波分解和过度去相关。听力测试表明,当将拟议的再现方法与两种已测试的氛围估计方法结合使用时,感知质量有所提高。
{"title":"Estimating and Reproducing Ambience in Ambisonic Recordings","authors":"L. McCormack, A. Politis","doi":"10.23919/eusipco55093.2022.9909850","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909850","url":null,"abstract":"Spatial audio coding and reproduction methods are often based on the estimation of primary directional and secondary ambience components. This paper details a study into the estimation and subsequent reproduction of the ambient components found in ambisonic sound scenes. More specifically, two different ambience estimation approaches are investigated. The first estimates the ambient Ambisonic signals through a source-separation and spatial subtraction approach, and there-fore requires an estimate of both the number of sources and their directions. The second instead requires only the number of sources to be known, and employs a multi-channel Wiener filter (MWF) to obtain the estimated ambient signals. One approach for reproducing estimated ambient signals is through a signal processing chain of: a plane-wave decomposition, signal decor-relation, and subsequent spatialisation for the target playback setup. However, this reproduction approach may be sensitive to spatial and signal fidelity degradations incurred during the beamforming and decorrelation operations. Therefore, an optimal mixing alternative is proposed for this reproduction task, which achieves spatially incoherent rendering of ambience directly for the target playback setup; bypassing intermediate plane-wave decomposition and excessive decorrelation. Listening tests indicate improved perceived quality when using the proposed reproduction method in conjunction with both tested ambience estimation approaches.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114147452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Multiroom Speech Emotion Recognition 多房间语音情感识别
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909798
Erez Shalev, I. Cohen
Automated audio systems, such as speech emotion recognition, can benefit from the ability to work from another room. No research has yet been conducted on the effectiveness of such systems when the sound source originates in a different room than the target system, and the sound has to travel between the rooms through the wall. New advancements in room-impulse-response generators enable a large-scale simulation of audio sources from adjacent rooms and integration into a training dataset. Such a capability improves the performance of datadriven methods such as deep learning. This paper presents the first evaluation of multiroom speech emotion recognition systems. The isolating policies due to COVID-19 presented many cases of isolated individuals suffering emotional difficulties, where such capabilities would be very beneficial. We perform training, with and without an audio simulation generator, and compare the results of three different models on real data recorded in a real multiroom audio scene. We show that models trained without the new generator achieve poor results when presented with multiroom data. We proceed to show that augmentation using the new generator improves the performances for all three models. Our results demonstrate the advantage of using such a generator. Furthermore, testing with two different deep learning architectures shows that the generator improves the results independently of the given architecture.
自动音频系统,如语音情感识别,可以从在另一个房间工作的能力中受益。当声源来自与目标系统不同的房间,并且声音必须穿过墙壁在房间之间传播时,还没有对这种系统的有效性进行过研究。房间脉冲响应发生器的新进展使来自相邻房间的音频源的大规模模拟和集成到训练数据集成为可能。这种能力提高了数据驱动方法(如深度学习)的性能。本文对多房间语音情感识别系统进行了初步评价。由于新冠疫情的隔离政策,出现了许多孤立的个人遭受情感困难的案例,这种能力将非常有益。我们在有和没有音频模拟生成器的情况下进行训练,并在真实的多房间音频场景中记录的真实数据上比较三种不同模型的结果。我们表明,没有新生成器训练的模型在呈现多房间数据时效果不佳。我们进一步证明,使用新生成器的增强提高了所有三个模型的性能。我们的结果证明了使用这种发生器的优点。此外,对两种不同深度学习架构的测试表明,生成器独立于给定架构提高了结果。
{"title":"Multiroom Speech Emotion Recognition","authors":"Erez Shalev, I. Cohen","doi":"10.23919/eusipco55093.2022.9909798","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909798","url":null,"abstract":"Automated audio systems, such as speech emotion recognition, can benefit from the ability to work from another room. No research has yet been conducted on the effectiveness of such systems when the sound source originates in a different room than the target system, and the sound has to travel between the rooms through the wall. New advancements in room-impulse-response generators enable a large-scale simulation of audio sources from adjacent rooms and integration into a training dataset. Such a capability improves the performance of datadriven methods such as deep learning. This paper presents the first evaluation of multiroom speech emotion recognition systems. The isolating policies due to COVID-19 presented many cases of isolated individuals suffering emotional difficulties, where such capabilities would be very beneficial. We perform training, with and without an audio simulation generator, and compare the results of three different models on real data recorded in a real multiroom audio scene. We show that models trained without the new generator achieve poor results when presented with multiroom data. We proceed to show that augmentation using the new generator improves the performances for all three models. Our results demonstrate the advantage of using such a generator. Furthermore, testing with two different deep learning architectures shows that the generator improves the results independently of the given architecture.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116055966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multiple Sound Source Localization Based on Stochastic Modeling of Spatial Gradient Spectra 基于空间梯度谱随机建模的多声源定位
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909524
Natsuki Ueno, H. Kameoka
We propose source localization methods for multiple sound sources. The proposed method requires only an observation of a sound pressure and its spatial gradient at one fixed point, which can be realized by a small microphone array. The key idea is to utilize the partial differential equation relating the observed signals and the source position, which was originally proposed for the direct method for the single source localization problem. We extend this framework using stochastic modeling and proposed a method for the mutliple source localization in the presence of noises. Two source localization methods are proposed: one is the expectation-minimization algorithm for a given number of sources, and the other is the variational Bayesian inference for an unknown number of sources. By numerical experiments, the localization accuracies of the two proposed methods are compared with the baseline method.
提出了多声源的声源定位方法。该方法只需要在一个固定点上观测声压及其空间梯度,这可以通过一个小的传声器阵列来实现。其关键思想是利用观测信号与源位置相关的偏微分方程,该方程最初是针对单源定位问题的直接方法提出的。我们利用随机建模扩展了这一框架,并提出了一种存在噪声的多源定位方法。提出了两种源定位方法:一种是针对给定数量源的期望最小化算法,另一种是针对未知数量源的变分贝叶斯推理。通过数值实验,将两种方法的定位精度与基线方法进行了比较。
{"title":"Multiple Sound Source Localization Based on Stochastic Modeling of Spatial Gradient Spectra","authors":"Natsuki Ueno, H. Kameoka","doi":"10.23919/eusipco55093.2022.9909524","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909524","url":null,"abstract":"We propose source localization methods for multiple sound sources. The proposed method requires only an observation of a sound pressure and its spatial gradient at one fixed point, which can be realized by a small microphone array. The key idea is to utilize the partial differential equation relating the observed signals and the source position, which was originally proposed for the direct method for the single source localization problem. We extend this framework using stochastic modeling and proposed a method for the mutliple source localization in the presence of noises. Two source localization methods are proposed: one is the expectation-minimization algorithm for a given number of sources, and the other is the variational Bayesian inference for an unknown number of sources. By numerical experiments, the localization accuracies of the two proposed methods are compared with the baseline method.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"106 1-2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116114572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Few-shot learning for E2E speech recognition: architectural variants for support set generation 端到端语音识别的少量学习:支持集生成的架构变体
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909613
Dhanya Eledath, Narasimha Rao Thurlapati, V. Pavithra, Tirthankar Banerjee, V. Ramasubramanian
In this paper, we propose two architectural variants of our recent adaptation of a ‘few shot-learning’ (FSL) framework ‘Matching Networks’ (MN) to end-to-end (E2E) continuous speech recognition (CSR) in a formulation termed ‘MN-CTC’ which involves a CTC-loss based end-to-end episodic training of MN and an associated CTC-based decoding of continuous speech. An important component of the MN theory is the labelled support-set during training and inference. The architectural variants proposed and studied here for E2E CSR, namely, the ‘Uncoupled MN-CTC’ and the ‘Coupled MN-CTC’, address this problem of generating supervised support sets from continuous speech. While the ‘Uncoupled MN-CTC’ generates the support-sets ‘outside’ the MN-architecture, the ‘Coupled MN-CTC’ variant is a derivative framework which generates the support set ‘within’ the MN-architecture through a multi-task formulation coupling the support-set generation loss and the main MN-CTC loss for jointly optimizing the support-sets and the embedding functions of MN. On TIMIT and Librispeech datasets, we establish the ‘few-shot’ effectiveness of the proposed variants with PER and LER performances and also demonstrate the cross-domain applicability of the MN-CTC formulation with a Librispeech trained ‘Coupled MN-CTC’ variant inferencing on TIMIT low resource target-corpus with a 8% (absolute) LER advantage over a single-domain (TIMIT only) scenario.
在本文中,我们提出了我们最近将“少量射击学习”(FSL)框架“匹配网络”(MN)改编为端到端(E2E)连续语音识别(CSR)的两个架构变体,其公式称为“MN- ctc”,其中包括基于ctc损失的端到端MN情景训练和相关的基于ctc的连续语音解码。MN理论的一个重要组成部分是训练和推理过程中的标记支持集。本文提出并研究了E2E CSR的架构变体,即“不耦合的MN-CTC”和“耦合的MN-CTC”,解决了从连续语音中生成监督支持集的问题。“不耦合的MN- ctc”生成MN架构“外部”的支持集,而“耦合的MN- ctc”变体是一个衍生框架,它通过多任务公式耦合支持集生成损失和主要MN- ctc损失来生成MN架构“内部”的支持集,以共同优化MN的支持集和嵌入函数。在TIMIT和librisspeech数据集上,我们建立了具有PER和LER性能的拟议变体的“少射”有效性,并通过librisspeech训练的“耦合MN-CTC”变体推理在TIMIT低资源目标语料库上证明了MN-CTC公式的跨域适用性,比单域(仅TIMIT)场景具有8%(绝对)的LER优势。
{"title":"Few-shot learning for E2E speech recognition: architectural variants for support set generation","authors":"Dhanya Eledath, Narasimha Rao Thurlapati, V. Pavithra, Tirthankar Banerjee, V. Ramasubramanian","doi":"10.23919/eusipco55093.2022.9909613","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909613","url":null,"abstract":"In this paper, we propose two architectural variants of our recent adaptation of a ‘few shot-learning’ (FSL) framework ‘Matching Networks’ (MN) to end-to-end (E2E) continuous speech recognition (CSR) in a formulation termed ‘MN-CTC’ which involves a CTC-loss based end-to-end episodic training of MN and an associated CTC-based decoding of continuous speech. An important component of the MN theory is the labelled support-set during training and inference. The architectural variants proposed and studied here for E2E CSR, namely, the ‘Uncoupled MN-CTC’ and the ‘Coupled MN-CTC’, address this problem of generating supervised support sets from continuous speech. While the ‘Uncoupled MN-CTC’ generates the support-sets ‘outside’ the MN-architecture, the ‘Coupled MN-CTC’ variant is a derivative framework which generates the support set ‘within’ the MN-architecture through a multi-task formulation coupling the support-set generation loss and the main MN-CTC loss for jointly optimizing the support-sets and the embedding functions of MN. On TIMIT and Librispeech datasets, we establish the ‘few-shot’ effectiveness of the proposed variants with PER and LER performances and also demonstrate the cross-domain applicability of the MN-CTC formulation with a Librispeech trained ‘Coupled MN-CTC’ variant inferencing on TIMIT low resource target-corpus with a 8% (absolute) LER advantage over a single-domain (TIMIT only) scenario.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130116163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems 基于ResNet和tdnn的说话人识别系统中噪声鲁棒性和噪声补偿的综合探索
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909726
Mohammad MohammadAmini, D. Matrouf, J. Bonastre, Sandipana Dowerah, R. Serizel, D. Jouvet
In this paper, a comprehensive exploration of noise robustness and noise compensation of ResNet and TDNN speaker recognition systems is presented. Firstly the robustness of the TDNN and ResNet in the presence of noise, reverberation, and both distortions is explored. Our experimental results show that in all cases the ResNet system is more robust than TDNN. After that, a noise compensation task is done with denoising autoen-coder (DAE) over the x-vectors extracted from both systems. We explored two scenarios: 1) compensation of artificial noise with artificial data, 2) compensation of real noise with artificial data. The second case is the most desired scenario, because it makes noise compensation affordable without having real data to train denoising techniques. The experimental results show that in the first scenario noise compensation gives significant improvement with TDNN while this improvement in Resnet is not significant. In the second scenario, we achieved 15% improvement of EER over VoiCes Eval challenge in both TDNN and ResNet systems. In most cases the performance of ResNet without compensation is superior to TDNN with noise compensation.
本文对ResNet和TDNN说话人识别系统的噪声鲁棒性和噪声补偿进行了全面的研究。首先探讨了TDNN和ResNet在存在噪声、混响和两种失真时的鲁棒性。实验结果表明,在所有情况下,ResNet系统都比TDNN具有更强的鲁棒性。之后,对从两个系统中提取的x向量进行去噪自动编码器(DAE)的噪声补偿任务。我们探索了两种场景:1)用人工数据补偿人工噪声,2)用人工数据补偿真实噪声。第二种情况是最理想的情况,因为它使噪音补偿负担得起,而没有真正的数据来训练去噪技术。实验结果表明,在第一种情况下,噪声补偿对TDNN有显著的改善,而在Resnet中这种改善不显著。在第二种情况下,我们在TDNN和ResNet系统中实现了15%的EER比voice Eval挑战的改进。在大多数情况下,无补偿的ResNet的性能优于有噪声补偿的TDNN。
{"title":"A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems","authors":"Mohammad MohammadAmini, D. Matrouf, J. Bonastre, Sandipana Dowerah, R. Serizel, D. Jouvet","doi":"10.23919/eusipco55093.2022.9909726","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909726","url":null,"abstract":"In this paper, a comprehensive exploration of noise robustness and noise compensation of ResNet and TDNN speaker recognition systems is presented. Firstly the robustness of the TDNN and ResNet in the presence of noise, reverberation, and both distortions is explored. Our experimental results show that in all cases the ResNet system is more robust than TDNN. After that, a noise compensation task is done with denoising autoen-coder (DAE) over the x-vectors extracted from both systems. We explored two scenarios: 1) compensation of artificial noise with artificial data, 2) compensation of real noise with artificial data. The second case is the most desired scenario, because it makes noise compensation affordable without having real data to train denoising techniques. The experimental results show that in the first scenario noise compensation gives significant improvement with TDNN while this improvement in Resnet is not significant. In the second scenario, we achieved 15% improvement of EER over VoiCes Eval challenge in both TDNN and ResNet systems. In most cases the performance of ResNet without compensation is superior to TDNN with noise compensation.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122871725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse-Aware Approach for Covariance Conversion in FDD Systems FDD系统中协方差转换的稀疏感知方法
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909956
C. López, J. Riba
This paper proposes a practical way to solve the Uplink-Downlink Covariance Conversion (UDCC) problem in a frequency Division Duplex (FDD) communication system. The UDCC problem consists in the estimation of the Downlink (DL) spatial covariance matrix from the prior knowledge of the Uplink (UL) spatial covariance matrix without the need of a feedback transmission from the User Equipment (UE) to the Base Station (BS). Estimating the DL sample spatial covariance matrix is unfeasible in current massive Multiple-Input Multiple-Output (MIMO) deployments in frequency selective or fast fading channels due to the required large training overhead. Our method is based on the application of sparse filtering ideas to the estimation of a quantized version of the so-called Angular Power Spectrum (APS), being the common factor between the UL and DL spatial channel covariance matrices.
提出了一种解决频分双工(FDD)通信系统中上下行协方差转换(UDCC)问题的实用方法。UDCC问题是在不需要从用户设备(UE)到基站(BS)的反馈传输的情况下,根据上行链路(UL)空间协方差矩阵的先验知识估计下行链路(DL)空间协方差矩阵。由于需要大量的训练开销,在当前频率选择或快速衰落信道中的大规模多输入多输出(MIMO)部署中,估计深度学习样本空间协方差矩阵是不可行的。我们的方法是基于稀疏滤波思想的应用,以估计所谓的角功率谱(APS)的量化版本,是UL和DL空间信道协方差矩阵之间的共同因素。
{"title":"Sparse-Aware Approach for Covariance Conversion in FDD Systems","authors":"C. López, J. Riba","doi":"10.23919/eusipco55093.2022.9909956","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909956","url":null,"abstract":"This paper proposes a practical way to solve the Uplink-Downlink Covariance Conversion (UDCC) problem in a frequency Division Duplex (FDD) communication system. The UDCC problem consists in the estimation of the Downlink (DL) spatial covariance matrix from the prior knowledge of the Uplink (UL) spatial covariance matrix without the need of a feedback transmission from the User Equipment (UE) to the Base Station (BS). Estimating the DL sample spatial covariance matrix is unfeasible in current massive Multiple-Input Multiple-Output (MIMO) deployments in frequency selective or fast fading channels due to the required large training overhead. Our method is based on the application of sparse filtering ideas to the estimation of a quantized version of the so-called Angular Power Spectrum (APS), being the common factor between the UL and DL spatial channel covariance matrices.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122940394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gridless Joint Delay-DoA-Doppler Estimation Using OFDM Signals: A Multilevel Hankel Matrix Approach 基于OFDM信号的无网格联合时延多普勒估计:一种多电平汉克尔矩阵方法
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909935
Ziyu Zhou, Wei Dai
This paper investigates the problem of joint es-timation of delay, direction of arrival (DoA), and Doppler when an orthogonal frequency-division multiplexing (OFDM) signal is used for probing. A gridless approach is taken where the above three parameters live on a continuous space rather than a discrete grid. A low-rank multilevel Hankel matrix is used to capture the underlying structure of the back-scattered signals. A convex optimization, termed as Hankel nuclear norm minimization (HNNM), is developed for denoising and parameter estimation, and solved by alternating direction method of multi-pliers (ADMM). Simulations demonstrate that HNNM is robust to noise, and can go beyond the minimum separation bound required by another gridless method atomic norm minimization.
研究了正交频分复用(OFDM)信号探测时时延、到达方向和多普勒的联合估计问题。采用无网格方法,上述三个参数驻留在连续空间而不是离散网格上。采用低秩多电平汉克尔矩阵捕获后向散射信号的底层结构。提出了一种用于去噪和参数估计的凸优化方法——汉克尔核范数最小化(HNNM),并采用多钳子交替方向法(ADMM)求解。仿真结果表明,HNNM对噪声具有较强的鲁棒性,并且可以超越原子范数最小化方法所要求的最小分离界限。
{"title":"Gridless Joint Delay-DoA-Doppler Estimation Using OFDM Signals: A Multilevel Hankel Matrix Approach","authors":"Ziyu Zhou, Wei Dai","doi":"10.23919/eusipco55093.2022.9909935","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909935","url":null,"abstract":"This paper investigates the problem of joint es-timation of delay, direction of arrival (DoA), and Doppler when an orthogonal frequency-division multiplexing (OFDM) signal is used for probing. A gridless approach is taken where the above three parameters live on a continuous space rather than a discrete grid. A low-rank multilevel Hankel matrix is used to capture the underlying structure of the back-scattered signals. A convex optimization, termed as Hankel nuclear norm minimization (HNNM), is developed for denoising and parameter estimation, and solved by alternating direction method of multi-pliers (ADMM). Simulations demonstrate that HNNM is robust to noise, and can go beyond the minimum separation bound required by another gridless method atomic norm minimization.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"11 13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131501708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Graph Topology Learning with Non-Convex Penalties 具有非凸惩罚的动态图拓扑学习
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909609
Reza Mirzaeifard, Vinay Chakravarthi Gogineni, Naveen K. D. Venkategowda, Stefan Werner
This paper presents a majorization-minimization-based framework for learning time-varying graphs from spatial-temporal measurements with non-convex penalties. The proposed approach infers time-varying graphs by using the log-likelihood function in conjunction with two non-convex regularizers. Using the log-likelihood function under a total positivity constraint, we can construct the Laplacian matrix from the off-diagonal elements of the precision matrix. Furthermore, we employ non-convex regularizer functions to constrain the changes in graph topology and associated weight evolution to be sparse. The experimental results demonstrate that our proposed method outperforms the state-of-the-art methods in sparse and non-sparse situations.
本文提出了一种基于最大化最小化的框架,用于从具有非凸惩罚的时空测量中学习时变图。该方法利用对数似然函数和两个非凸正则化器来推断时变图。利用全正约束下的对数似然函数,我们可以从精度矩阵的非对角元素构造拉普拉斯矩阵。此外,我们使用非凸正则化函数来约束图拓扑的变化和相关权演化为稀疏。实验结果表明,本文提出的方法在稀疏和非稀疏情况下都优于现有的方法。
{"title":"Dynamic Graph Topology Learning with Non-Convex Penalties","authors":"Reza Mirzaeifard, Vinay Chakravarthi Gogineni, Naveen K. D. Venkategowda, Stefan Werner","doi":"10.23919/eusipco55093.2022.9909609","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909609","url":null,"abstract":"This paper presents a majorization-minimization-based framework for learning time-varying graphs from spatial-temporal measurements with non-convex penalties. The proposed approach infers time-varying graphs by using the log-likelihood function in conjunction with two non-convex regularizers. Using the log-likelihood function under a total positivity constraint, we can construct the Laplacian matrix from the off-diagonal elements of the precision matrix. Furthermore, we employ non-convex regularizer functions to constrain the changes in graph topology and associated weight evolution to be sparse. The experimental results demonstrate that our proposed method outperforms the state-of-the-art methods in sparse and non-sparse situations.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127603378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Transfer learning for human activity classification in multiple radar setups 多雷达环境下人类活动分类的迁移学习
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909851
Jérémy Fix, Israel David Hinostroza Sáenz, Chengfang Ren, G. Manfredi, T. Letertre
Deep Learning techniques require vast amount of data for a proper training. In human activity classification using radar signals, the data acquisition can be very expensive and takes a lot of time, but radar databases are starting to be available to the public. In this work we show that we can use these available radar databases to pretrain a neural network that will finish its training on the final radar data even though the radar configuration is different (geometry configuration and carrier frequency).
深度学习技术需要大量的数据来进行适当的训练。在利用雷达信号进行人类活动分类的过程中,数据采集成本高,耗时长,但雷达数据库已开始向公众开放。在这项工作中,我们表明,我们可以使用这些可用的雷达数据库来预训练神经网络,即使雷达配置不同(几何配置和载波频率),该神经网络也将在最终雷达数据上完成训练。
{"title":"Transfer learning for human activity classification in multiple radar setups","authors":"Jérémy Fix, Israel David Hinostroza Sáenz, Chengfang Ren, G. Manfredi, T. Letertre","doi":"10.23919/eusipco55093.2022.9909851","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909851","url":null,"abstract":"Deep Learning techniques require vast amount of data for a proper training. In human activity classification using radar signals, the data acquisition can be very expensive and takes a lot of time, but radar databases are starting to be available to the public. In this work we show that we can use these available radar databases to pretrain a neural network that will finish its training on the final radar data even though the radar configuration is different (geometry configuration and carrier frequency).","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133607531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 30th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1