首页 > 最新文献

2020 28th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Gated Recurrent Networks for Video Super Resolution 用于视频超分辨率的门控循环网络
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287713
Santiago López-Tapia, Alice Lucas, R. Molina, A. Katsaggelos
Despite the success of Recurrent Neural Networks in tasks involving temporal video processing, few works in Video Super-Resolution (VSR) have employed them. In this work we propose a new Gated Recurrent Convolutional Neural Network for VSR adapting some of the key components of a Gated Recurrent Unit. Our model employs a deformable attention module to align the features calculated at the previous time step with the ones in the current step and then uses a gated operation to combine them. This allows our model to effectively reuse previously calculated features and exploit longer temporal relationships between frames without the need of explicit motion compensation. The experimental validation shows that our approach outperforms current VSR learning based models in terms of perceptual quality and temporal consistency.
尽管递归神经网络在涉及时间视频处理的任务中取得了成功,但在视频超分辨率(VSR)领域很少使用它们。在这项工作中,我们提出了一种新的门控循环卷积神经网络用于VSR,该网络采用了门控循环单元的一些关键组件。我们的模型使用一个可变形的注意力模块将前一个时间步计算的特征与当前步骤的特征对齐,然后使用门控操作将它们组合起来。这使得我们的模型可以有效地重用先前计算的特征,并在不需要显式运动补偿的情况下利用帧之间更长的时间关系。实验验证表明,我们的方法在感知质量和时间一致性方面优于当前基于VSR学习的模型。
{"title":"Gated Recurrent Networks for Video Super Resolution","authors":"Santiago López-Tapia, Alice Lucas, R. Molina, A. Katsaggelos","doi":"10.23919/Eusipco47968.2020.9287713","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287713","url":null,"abstract":"Despite the success of Recurrent Neural Networks in tasks involving temporal video processing, few works in Video Super-Resolution (VSR) have employed them. In this work we propose a new Gated Recurrent Convolutional Neural Network for VSR adapting some of the key components of a Gated Recurrent Unit. Our model employs a deformable attention module to align the features calculated at the previous time step with the ones in the current step and then uses a gated operation to combine them. This allows our model to effectively reuse previously calculated features and exploit longer temporal relationships between frames without the need of explicit motion compensation. The experimental validation shows that our approach outperforms current VSR learning based models in terms of perceptual quality and temporal consistency.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"13 1","pages":"700-704"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78371605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Signal Analysis Using Local Polynomial Approximations 用局部多项式逼近的信号分析
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287801
R. Wildhaber, Elizabeth Ren, F. Waldmann, Hans-Andrea Loeliger
Local polynomial approximations represent a versatile feature space for time-domain signal analysis. The parameters of such polynomial approximations can be computed by efficient recursions using autonomous linear state space models and often allow analytical solutions for quantities of interest. The approach is illustrated by practical examples including the estimation of the delay difference between two acoustic signals and template matching in electrocardiogram signals with local variations in amplitude and time scale.
局部多项式近似为时域信号分析提供了一个通用的特征空间。这种多项式近似的参数可以通过使用自主线性状态空间模型的有效递归来计算,并且通常允许对感兴趣的量进行解析解。通过实际算例说明了该方法的有效性,包括估计两声信号之间的延迟差,以及对局部幅度和时间尺度变化的心电图信号进行模板匹配。
{"title":"Signal Analysis Using Local Polynomial Approximations","authors":"R. Wildhaber, Elizabeth Ren, F. Waldmann, Hans-Andrea Loeliger","doi":"10.23919/Eusipco47968.2020.9287801","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287801","url":null,"abstract":"Local polynomial approximations represent a versatile feature space for time-domain signal analysis. The parameters of such polynomial approximations can be computed by efficient recursions using autonomous linear state space models and often allow analytical solutions for quantities of interest. The approach is illustrated by practical examples including the estimation of the delay difference between two acoustic signals and template matching in electrocardiogram signals with local variations in amplitude and time scale.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"52 6 1","pages":"2239-2243"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77790220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Super-Resolution Time-of-Arrival Estimation using Neural Networks 基于神经网络的超分辨率到达时间估计
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287673
Yao-Shan Hsiao, Mingyu Yang, Hun-Seok Kim
This paper presents a learning-based algorithm that estimates the time of arrival (ToA) of radio frequency (RF) signals from channel frequency response (CFR) measurements for wireless localization applications. A generator neural network is proposed to enhance the effective bandwidth of the narrowband CFR measurement and to produce a high-resolution estimation of channel impulse response (CIR). In addition, two regressor neural networks are introduced to perform a two-step coarsefine ToA estimation based on the enhanced CIR. For simulated channels, the proposed method achieves 9% – 58% improved root mean squared error (RMSE) for distance ranging and up to 22% improved false detection rate compared with conventional super-resolution algorithms. For real-world measured channels, the proposed method exhibits an improvement of 1.3m in distance error at 90 percentile.
本文提出了一种基于学习的算法,用于无线定位应用中,从信道频率响应(CFR)测量中估计射频(RF)信号的到达时间(ToA)。为了提高窄带CFR测量的有效带宽,并对信道脉冲响应(CIR)进行高分辨率估计,提出了一种生成器神经网络。此外,引入两个回归神经网络进行基于增强CIR的两步粗化ToA估计,对于模拟信道,与传统超分辨率算法相比,该方法距离测距的均方根误差(RMSE)提高了9% ~ 58%,误检率提高了22%。对于实际测量信道,该方法在90百分位处的距离误差提高了1.3m。
{"title":"Super-Resolution Time-of-Arrival Estimation using Neural Networks","authors":"Yao-Shan Hsiao, Mingyu Yang, Hun-Seok Kim","doi":"10.23919/Eusipco47968.2020.9287673","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287673","url":null,"abstract":"This paper presents a learning-based algorithm that estimates the time of arrival (ToA) of radio frequency (RF) signals from channel frequency response (CFR) measurements for wireless localization applications. A generator neural network is proposed to enhance the effective bandwidth of the narrowband CFR measurement and to produce a high-resolution estimation of channel impulse response (CIR). In addition, two regressor neural networks are introduced to perform a two-step coarsefine ToA estimation based on the enhanced CIR. For simulated channels, the proposed method achieves 9% – 58% improved root mean squared error (RMSE) for distance ranging and up to 22% improved false detection rate compared with conventional super-resolution algorithms. For real-world measured channels, the proposed method exhibits an improvement of 1.3m in distance error at 90 percentile.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"35 1","pages":"1692-1696"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81689834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Automatic Image Colorization based on Multi-Discriminators Generative Adversarial Networks 基于多鉴别器生成对抗网络的图像自动着色
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287792
Youssef Mourchid, M. Donias, Y. Berthoumieu
This paper presents a deep automatic colorization approach which avoids any manual intervention. Recently Generative Adversarial Network (GANs) approaches have proven their effectiveness for image colorization tasks. Inspired by GANs methods, we propose a novel colorization model that produces more realistic quality results. The model employs an additional discriminator which works in the feature domain. Using a feature discriminator, our generator produces structural high-frequency features instead of noisy artifacts. To achieve the required level of details in the colorization process, we incorporate non-adversarial losses from recent image style transfer techniques. Besides, the generator architecture follows the general shape of U-Net, to transfer information more effectively between distant layers. The performance of the proposed model was evaluated quantitatively as well as qualitatively with places365 dataset. Results show that the proposed model achieves more realistic colors with less artifacts compared to the state-of-the-art approaches.
本文提出了一种避免人工干预的深度自动着色方法。近年来,生成对抗网络(GANs)方法已经证明了其在图像着色任务中的有效性。受gan方法的启发,我们提出了一种新的着色模型,可以产生更逼真的质量结果。该模型采用了一个附加的识别器,该识别器在特征域中工作。使用特征鉴别器,我们的生成器产生结构性高频特征,而不是噪声伪影。为了在着色过程中达到所需的细节水平,我们从最近的图像风格转移技术中纳入了非对抗性损失。此外,发生器架构遵循U-Net的一般形状,以便在远距离层之间更有效地传输信息。使用places365数据集对所提出模型的性能进行了定量和定性评估。结果表明,与最先进的方法相比,所提出的模型实现了更真实的颜色和更少的伪影。
{"title":"Automatic Image Colorization based on Multi-Discriminators Generative Adversarial Networks","authors":"Youssef Mourchid, M. Donias, Y. Berthoumieu","doi":"10.23919/Eusipco47968.2020.9287792","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287792","url":null,"abstract":"This paper presents a deep automatic colorization approach which avoids any manual intervention. Recently Generative Adversarial Network (GANs) approaches have proven their effectiveness for image colorization tasks. Inspired by GANs methods, we propose a novel colorization model that produces more realistic quality results. The model employs an additional discriminator which works in the feature domain. Using a feature discriminator, our generator produces structural high-frequency features instead of noisy artifacts. To achieve the required level of details in the colorization process, we incorporate non-adversarial losses from recent image style transfer techniques. Besides, the generator architecture follows the general shape of U-Net, to transfer information more effectively between distant layers. The performance of the proposed model was evaluated quantitatively as well as qualitatively with places365 dataset. Results show that the proposed model achieves more realistic colors with less artifacts compared to the state-of-the-art approaches.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"23 1","pages":"1532-1536"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78857899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Distributed combined acoustic echo cancellation and noise reduction using GEVD-based distributed adaptive node specific signal estimation with prior knowledge 基于gevd的分布式自适应节点特定信号估计与先验知识的分布式组合声回波抵消与降噪
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287337
Santiago Ruiz, T. Waterschoot, M. Moonen
Distributed combined acoustic echo cancellation (AEC) and noise reduction (NR) in a wireless acoustic sensor network (WASN) is tackled by using a specific version of the PK-GEVD-DANSE algorithm (cfr. [1]). Although this algorithm was initially developed for distributed NR with partial prior knowledge of the desired speech steering vector, it is shown that it can also be used for AEC combined with NR. Simulations have been carried out using centralized and distributed batch-mode implementations to verify the performance of the algorithm in terms of AEC quantified with the echo return loss enhancement (ERLE), as well as in terms of the NR quantified with the signal- to-noise ratio (SNR).
使用特定版本的PK-GEVD-DANSE算法(cfr)解决了无线声传感器网络(nn)中的分布式联合声回波抵消(AEC)和降噪(NR)问题。[1])。虽然该算法最初是针对具有部分先验知识的期望语音转向向量的分布式NR开发的,但研究表明,它也可以用于与NR结合的AEC。使用集中式和分布式批处理模式进行了仿真,以验证该算法在用回波回波损耗增强(ERLE)量化的AEC方面的性能,以及用信噪比(SNR)量化的NR方面的性能。
{"title":"Distributed combined acoustic echo cancellation and noise reduction using GEVD-based distributed adaptive node specific signal estimation with prior knowledge","authors":"Santiago Ruiz, T. Waterschoot, M. Moonen","doi":"10.23919/Eusipco47968.2020.9287337","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287337","url":null,"abstract":"Distributed combined acoustic echo cancellation (AEC) and noise reduction (NR) in a wireless acoustic sensor network (WASN) is tackled by using a specific version of the PK-GEVD-DANSE algorithm (cfr. [1]). Although this algorithm was initially developed for distributed NR with partial prior knowledge of the desired speech steering vector, it is shown that it can also be used for AEC combined with NR. Simulations have been carried out using centralized and distributed batch-mode implementations to verify the performance of the algorithm in terms of AEC quantified with the echo return loss enhancement (ERLE), as well as in terms of the NR quantified with the signal- to-noise ratio (SNR).","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"363 1","pages":"206-210"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76557926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Transmit Beampattern Synthesis for MIMO Radar with One-Bit DACs 基于位dac的MIMO雷达发射波束图合成
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287840
Tong Wei, B. Liao, Peng Xiao, Ziyang Cheng
In this paper, the problem of transmit beampattern synthesis (i.e., transmit beamforming) in multiple input multiple output (MIMO) radar which deploys one-bit digital-to-analog converts (DACs) is investigated. We aim to design appropriate transmit signal sequences, which are quantized by one-bit DACs, such that the amount of transmit energy can be focused into mainlobe region as much as possible, meanwhile, the leakage power of sidelobe region is minimized. It is shown that these requirements can be simultaneously fulfilled by minimizing the integrated sidelobe to mainlobe ratio (ISMR) of transmit beampattern with discrete binary constraints. According to this concept, we utilize the alternating direction multiplier method (ADMM) framework to solve the resulting nonconvex problem. Simulation results will demonstrate the effectiveness and improved performance of the proposed method.
本文研究了多输入多输出(MIMO)雷达的发射波束图合成(即发射波束形成)问题,该雷达采用1位数模转换器(dac)。我们的目标是设计合适的发射信号序列,通过位dac进行量化,使发射能量尽可能集中在主瓣区域,同时使副瓣区域的泄漏功率最小。结果表明,在离散二进制约束下,通过最小化发射波束方向图的综合副瓣与主瓣比(ISMR),可以同时满足上述要求。根据这一概念,我们利用交替方向乘子法(ADMM)框架来解决由此产生的非凸问题。仿真结果将证明该方法的有效性和改进的性能。
{"title":"Transmit Beampattern Synthesis for MIMO Radar with One-Bit DACs","authors":"Tong Wei, B. Liao, Peng Xiao, Ziyang Cheng","doi":"10.23919/Eusipco47968.2020.9287840","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287840","url":null,"abstract":"In this paper, the problem of transmit beampattern synthesis (i.e., transmit beamforming) in multiple input multiple output (MIMO) radar which deploys one-bit digital-to-analog converts (DACs) is investigated. We aim to design appropriate transmit signal sequences, which are quantized by one-bit DACs, such that the amount of transmit energy can be focused into mainlobe region as much as possible, meanwhile, the leakage power of sidelobe region is minimized. It is shown that these requirements can be simultaneously fulfilled by minimizing the integrated sidelobe to mainlobe ratio (ISMR) of transmit beampattern with discrete binary constraints. According to this concept, we utilize the alternating direction multiplier method (ADMM) framework to solve the resulting nonconvex problem. Simulation results will demonstrate the effectiveness and improved performance of the proposed method.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"19 1","pages":"1827-1830"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87485898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Graph Signal Processing Framework for the Classification of Temporal Brain Data 脑时态数据分类的图信号处理框架
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287486
Sarah Itani, D. Thanou
Graph Signal Processing (GSP) addresses the analysis of data living on an irregular domain which can be modeled with a graph. This capability is of great interest for the study of brain connectomes. In this case, data lying on the nodes of the graph are considered as signals (e.g., fMRI time-series) that have a strong dependency on the graph topology (e.g., brain structural connectivity). In this paper, we adopt GSP tools to build features related to the frequency content of the signals. To make these features highly discriminative, we apply an extension of the Fukunaga-Koontz transform. We then use these new features to train a decision tree for the prediction of autism spectrum disorder. Interestingly, our framework outperforms state-of-the-art methods on the publicly available ABIDE dataset.
图信号处理(GSP)解决了对不规则域上的数据的分析,这些数据可以用图来建模。这种能力对于大脑连接体的研究具有重要意义。在这种情况下,位于图节点上的数据被视为信号(例如,fMRI时间序列),这些信号强烈依赖于图拓扑(例如,大脑结构连接)。在本文中,我们采用GSP工具来构建与信号频率内容相关的特征。为了使这些特征具有高度的判别性,我们应用了Fukunaga-Koontz变换的扩展。然后我们使用这些新的特征来训练一个决策树来预测自闭症谱系障碍。有趣的是,我们的框架在公开可用的ABIDE数据集上优于最先进的方法。
{"title":"A Graph Signal Processing Framework for the Classification of Temporal Brain Data","authors":"Sarah Itani, D. Thanou","doi":"10.23919/Eusipco47968.2020.9287486","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287486","url":null,"abstract":"Graph Signal Processing (GSP) addresses the analysis of data living on an irregular domain which can be modeled with a graph. This capability is of great interest for the study of brain connectomes. In this case, data lying on the nodes of the graph are considered as signals (e.g., fMRI time-series) that have a strong dependency on the graph topology (e.g., brain structural connectivity). In this paper, we adopt GSP tools to build features related to the frequency content of the signals. To make these features highly discriminative, we apply an extension of the Fukunaga-Koontz transform. We then use these new features to train a decision tree for the prediction of autism spectrum disorder. Interestingly, our framework outperforms state-of-the-art methods on the publicly available ABIDE dataset.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"38 1","pages":"1180-1184"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87088152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Improving Energy Compaction of Adaptive Fourier Decomposition 改进自适应傅里叶分解的能量压缩
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287567
A. Borowicz
Adaptive Fourier decomposition (AFD) provides an expansion of an analytic function into a sum of basic signals, called mono-components. Unlike the Fourier series decomposition, the AFD is based on an adaptive rational orthogonal system, hence it is better suited for analyzing non-stationary data. The most popular algorithm for the AFD decomposes any signal in such a way that the energy of the low-frequency components is maximized. Unfortunately, this results in poor energy compaction of high-frequency components. In this paper, we develop a novel algorithm for the AFD. The key idea is to maximize the energy of any components no matter how big or small the corresponding frequencies are. A comparative evaluation was conducted of the signal reconstruction efficiency of the proposed approach and several conventional algorithms by using speech recordings. The experimental results show that with the new algorithm, it is possible to get a better performance in terms of the reconstruction quality and energy compaction property.
自适应傅立叶分解(AFD)将解析函数扩展为称为单分量的基本信号和。与傅立叶级数分解不同,AFD基于自适应有理正交系统,因此更适合于分析非平稳数据。最流行的AFD算法分解任何信号的方式是使低频分量的能量最大化。不幸的是,这导致高频元件的能量压缩不良。在本文中,我们开发了一种新的AFD算法。关键思想是使任何元件的能量最大化,不管相应的频率有多大或多小。利用语音记录对比评价了该方法与几种传统算法的信号重构效率。实验结果表明,新算法在重构质量和能量压缩性能方面都有较好的表现。
{"title":"Improving Energy Compaction of Adaptive Fourier Decomposition","authors":"A. Borowicz","doi":"10.23919/Eusipco47968.2020.9287567","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287567","url":null,"abstract":"Adaptive Fourier decomposition (AFD) provides an expansion of an analytic function into a sum of basic signals, called mono-components. Unlike the Fourier series decomposition, the AFD is based on an adaptive rational orthogonal system, hence it is better suited for analyzing non-stationary data. The most popular algorithm for the AFD decomposes any signal in such a way that the energy of the low-frequency components is maximized. Unfortunately, this results in poor energy compaction of high-frequency components. In this paper, we develop a novel algorithm for the AFD. The key idea is to maximize the energy of any components no matter how big or small the corresponding frequencies are. A comparative evaluation was conducted of the signal reconstruction efficiency of the proposed approach and several conventional algorithms by using speech recordings. The experimental results show that with the new algorithm, it is possible to get a better performance in terms of the reconstruction quality and energy compaction property.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"62 1","pages":"2348-2352"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86369905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multipitch tracking in music signals using Echo State Networks 回声状态网络在音乐信号中的多音高跟踪
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287638
P. Steiner, Simon Stone, P. Birkholz, A. Jalalvand
Currently, convolutional neural networks (CNNs) define the state of the art for multipitch tracking in music signals. Echo State Networks (ESNs), a recently introduced recurrent neural network architecture, achieved similar results as CNNs for various tasks, such as phoneme or digit recognition. However, they have not yet received much attention in the community of Music Information Retrieval. The core of ESNs is a group of unordered, randomly connected neurons, i.e., the reservoir, by which the low-dimensional input space is non-linearly transformed into a high-dimensional feature space. Because only the weights of the connections between the reservoir and the output are trained using linear regression, ESNs are easier to train than deep neural networks. This paper presents a first exploration of ESNs for the challenging task of multipitch tracking in music signals. The best results presented in this paper were achieved with a bidirectional two-layer ESN with 20 000 neurons in each layer. Although the final F-score of 0.7198 still falls below the state of the art (0.7370), the proposed ESN-based approach serves as a baseline for further investigations of ESNs in audio signal processing in the future.
目前,卷积神经网络(cnn)定义了音乐信号中多音高跟踪的最新技术。回声状态网络(Echo State Networks, ESNs)是最近引入的一种循环神经网络架构,在各种任务(如音素或数字识别)上取得了与cnn相似的结果。然而,它们在音乐信息检索界还没有得到足够的重视。ESNs的核心是一组无序、随机连接的神经元,即存储库,通过它将低维输入空间非线性转换为高维特征空间。因为只有储层和输出之间的连接权值是用线性回归训练的,所以esn比深度神经网络更容易训练。本文首次探索了ESNs用于音乐信号中多音高跟踪的挑战性任务。本文给出的最佳结果是双向双层回声状态网络,每层有20,000个神经元。虽然最终的f值0.7198仍然低于目前的水平(0.7370),但所提出的基于esn的方法可以作为未来音频信号处理中进一步研究esn的基线。
{"title":"Multipitch tracking in music signals using Echo State Networks","authors":"P. Steiner, Simon Stone, P. Birkholz, A. Jalalvand","doi":"10.23919/Eusipco47968.2020.9287638","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287638","url":null,"abstract":"Currently, convolutional neural networks (CNNs) define the state of the art for multipitch tracking in music signals. Echo State Networks (ESNs), a recently introduced recurrent neural network architecture, achieved similar results as CNNs for various tasks, such as phoneme or digit recognition. However, they have not yet received much attention in the community of Music Information Retrieval. The core of ESNs is a group of unordered, randomly connected neurons, i.e., the reservoir, by which the low-dimensional input space is non-linearly transformed into a high-dimensional feature space. Because only the weights of the connections between the reservoir and the output are trained using linear regression, ESNs are easier to train than deep neural networks. This paper presents a first exploration of ESNs for the challenging task of multipitch tracking in music signals. The best results presented in this paper were achieved with a bidirectional two-layer ESN with 20 000 neurons in each layer. Although the final F-score of 0.7198 still falls below the state of the art (0.7370), the proposed ESN-based approach serves as a baseline for further investigations of ESNs in audio signal processing in the future.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"2012 1","pages":"126-130"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86352897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
User Activity And Data Detection For MIMO Uplink C-RAN Using Bayesian Learning 基于贝叶斯学习的MIMO上行链路C-RAN用户活动和数据检测
Pub Date : 2021-01-24 DOI: 10.23919/Eusipco47968.2020.9287867
Anupama Rajoriya, Vidushi Katiyar, Rohit Budhiraja
We investigate user activity and data detection problem in a multiple-input multiple-output uplink cloud-radio access network, where the data matrix over a time-frame has overlapped burst sparsity due to sporadic user activity. We exploit this sparsity to recover data by proposing a weighted prior-sparse Bayesian learning algorithm. The proposed algorithm, due to carefully selected prior, captures not only the overlapped burst sparsity across time but also the block sparsity due to multi-user antennas. We also derive hyperparameter updates, and estimate the weight parameters using the support estimated via index-wise log-likelihood ratio test. We numerically demonstrate that the proposed algorithm has much lower bit error rate than the state-of-the-art competing algorithms.
我们研究了多输入多输出上行云无线接入网络中的用户活动和数据检测问题,其中数据矩阵在一个时间框架内由于零星的用户活动而重叠突发稀疏性。我们通过提出一种加权先验-稀疏贝叶斯学习算法来利用这种稀疏性来恢复数据。该算法通过对先验条件的仔细选择,既能捕获重叠突发稀疏度,又能捕获多用户天线引起的块稀疏度。我们还推导了超参数更新,并使用通过索引对数似然比检验估计的支持度来估计权重参数。数值结果表明,该算法具有较低的误码率。
{"title":"User Activity And Data Detection For MIMO Uplink C-RAN Using Bayesian Learning","authors":"Anupama Rajoriya, Vidushi Katiyar, Rohit Budhiraja","doi":"10.23919/Eusipco47968.2020.9287867","DOIUrl":"https://doi.org/10.23919/Eusipco47968.2020.9287867","url":null,"abstract":"We investigate user activity and data detection problem in a multiple-input multiple-output uplink cloud-radio access network, where the data matrix over a time-frame has overlapped burst sparsity due to sporadic user activity. We exploit this sparsity to recover data by proposing a weighted prior-sparse Bayesian learning algorithm. The proposed algorithm, due to carefully selected prior, captures not only the overlapped burst sparsity across time but also the block sparsity due to multi-user antennas. We also derive hyperparameter updates, and estimate the weight parameters using the support estimated via index-wise log-likelihood ratio test. We numerically demonstrate that the proposed algorithm has much lower bit error rate than the state-of-the-art competing algorithms.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"21 1","pages":"1742-1746"},"PeriodicalIF":0.0,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89220420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2020 28th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1