首页 > 最新文献

2022 30th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Cooperative Pose Estimation in a Robotic Swarm: Framework, Simulation and Experimental Results 机器人群中的协同姿态估计:框架、仿真和实验结果
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909666
Siwei Zhang, Kimon Cokona, R. Pöhlmann, E. Staudinger, T. Wiedemann, A. Dammann
Swarm robotics has gained an increasing attention in applications like extraterrestrial exploration and disaster management, due to the ability of simultaneously observing at different locations and avoiding a single point of failure. In order to operate autonomously, robots in a swarm need to know their precise poses, including their positions, velocities and orientations. When external navigation infrastructures like the global navigation satellite systems (GNSS) are not ubiquitously accessible, the swarm of robots need to rely on internal measurements to estimate their poses. In this paper, we propose a cooperative 3D pose estimation framework, based on the insights of sensor characteristics that we gained from outdoor swarm navigation experiments. A decentralized particle filter (DPF) operates on each robot to estimate its pose via fusing radio-based ranging, inertial sensor data, control commands and the pose estimates of its neighbors. This framework is integrated in the swarm navigation ecosystem developed at the German Aerospace Center (DLR), and is unified for both simulations and experiments.
由于能够同时在不同位置观察并避免单点故障,群机器人在外星探索和灾害管理等应用中获得了越来越多的关注。为了自主操作,成群的机器人需要知道它们的精确姿势,包括它们的位置、速度和方向。当像全球导航卫星系统(GNSS)这样的外部导航基础设施不是无处不在时,机器人群需要依靠内部测量来估计它们的姿势。在本文中,我们基于从室外群体导航实验中获得的传感器特性的见解,提出了一种协同三维姿态估计框架。分散式粒子滤波器(DPF)通过融合基于无线电的测距、惯性传感器数据、控制命令和邻居的姿态估计,对每个机器人进行估计。该框架集成在德国航空航天中心(DLR)开发的群导航生态系统中,并统一用于模拟和实验。
{"title":"Cooperative Pose Estimation in a Robotic Swarm: Framework, Simulation and Experimental Results","authors":"Siwei Zhang, Kimon Cokona, R. Pöhlmann, E. Staudinger, T. Wiedemann, A. Dammann","doi":"10.23919/eusipco55093.2022.9909666","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909666","url":null,"abstract":"Swarm robotics has gained an increasing attention in applications like extraterrestrial exploration and disaster management, due to the ability of simultaneously observing at different locations and avoiding a single point of failure. In order to operate autonomously, robots in a swarm need to know their precise poses, including their positions, velocities and orientations. When external navigation infrastructures like the global navigation satellite systems (GNSS) are not ubiquitously accessible, the swarm of robots need to rely on internal measurements to estimate their poses. In this paper, we propose a cooperative 3D pose estimation framework, based on the insights of sensor characteristics that we gained from outdoor swarm navigation experiments. A decentralized particle filter (DPF) operates on each robot to estimate its pose via fusing radio-based ranging, inertial sensor data, control commands and the pose estimates of its neighbors. This framework is integrated in the swarm navigation ecosystem developed at the German Aerospace Center (DLR), and is unified for both simulations and experiments.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121998378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Minimum Variance Distortionless Response Spectral Estimator with Kronecker Product Filters 基于Kronecker积滤波器的最小方差无失真响应谱估计
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909584
Xianrui Wang, J. Benesty, Gongping Huang, Jingdong Chen
Spectral estimation is of significant practical importance in a wide range of applications. This paper proposes a minimum variance distortionless response (MVDR) method for spectral estimation based on the Kronecker product. Taking advantage of the particular structure of the Fourier vector, we decompose it as a Kronecker product of two shorter vectors. Then, we design the spectral estimation filters under the same structure, i.e., as a Kronecker product of two filters. Consequently, the conventional MVDR spectrum problem is transformed to one of estimating two filters of much shorter lengths. Since it has much fewer parameters to estimate, the proposed method is able to achieve better performance than its conventional counterpart, particularly when the number of available signal samples is small. Also presented in this paper is the generalization to the estimation of the cross-spectrum and coherence function.
光谱估计在广泛的应用中具有重要的实际意义。提出了一种基于Kronecker积的最小方差无失真响应(MVDR)谱估计方法。利用傅里叶向量的特殊结构,我们把它分解成两个短向量的克罗内克积。然后,我们在相同的结构下设计了光谱估计滤波器,即两个滤波器的Kronecker积。因此,将传统的MVDR频谱问题转化为估计两个更短长度的滤波器的问题。由于需要估计的参数要少得多,因此所提出的方法能够获得比传统方法更好的性能,特别是当可用信号样本数量较少时。本文还介绍了对交叉谱和相干函数估计的推广。
{"title":"A Minimum Variance Distortionless Response Spectral Estimator with Kronecker Product Filters","authors":"Xianrui Wang, J. Benesty, Gongping Huang, Jingdong Chen","doi":"10.23919/eusipco55093.2022.9909584","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909584","url":null,"abstract":"Spectral estimation is of significant practical importance in a wide range of applications. This paper proposes a minimum variance distortionless response (MVDR) method for spectral estimation based on the Kronecker product. Taking advantage of the particular structure of the Fourier vector, we decompose it as a Kronecker product of two shorter vectors. Then, we design the spectral estimation filters under the same structure, i.e., as a Kronecker product of two filters. Consequently, the conventional MVDR spectrum problem is transformed to one of estimating two filters of much shorter lengths. Since it has much fewer parameters to estimate, the proposed method is able to achieve better performance than its conventional counterpart, particularly when the number of available signal samples is small. Also presented in this paper is the generalization to the estimation of the cross-spectrum and coherence function.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122629216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Message Passing-based Inference in Switching Autoregressive Models 交换自回归模型中基于消息传递的推理
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909828
Albert Podusenko, B. V. Erp, Dmitry V. Bagaev, Ismail Senöz, B. Vries
The switching autoregressive model is a flexible model for signals generated by non-stationary processes. Unfortunately, evaluation of the exact posterior distributions of the latent variables for a switching autoregressive model is analytically intractable, and this limits the applicability of switching autoregressive models in practical signal processing tasks. In this paper we present a message passing-based approach for computing approximate posterior distributions in the switching autoregressive model. Our solution tracks approximate posterior distributions in a modular way and easily extends to more complicated model variations. The proposed message passing algorithm is verified and validated on synthetic and acoustic data sets respectively.
开关自回归模型对于非平稳过程产生的信号是一种灵活的模型。不幸的是,对切换自回归模型的潜在变量的精确后验分布的评估在分析上是难以解决的,这限制了切换自回归模型在实际信号处理任务中的适用性。在本文中,我们提出了一种基于消息传递的方法来计算开关自回归模型中的近似后验分布。我们的解决方案以模块化的方式跟踪近似后验分布,并且很容易扩展到更复杂的模型变化。在合成数据集和声学数据集上分别对所提出的消息传递算法进行了验证和验证。
{"title":"Message Passing-based Inference in Switching Autoregressive Models","authors":"Albert Podusenko, B. V. Erp, Dmitry V. Bagaev, Ismail Senöz, B. Vries","doi":"10.23919/eusipco55093.2022.9909828","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909828","url":null,"abstract":"The switching autoregressive model is a flexible model for signals generated by non-stationary processes. Unfortunately, evaluation of the exact posterior distributions of the latent variables for a switching autoregressive model is analytically intractable, and this limits the applicability of switching autoregressive models in practical signal processing tasks. In this paper we present a message passing-based approach for computing approximate posterior distributions in the switching autoregressive model. Our solution tracks approximate posterior distributions in a modular way and easily extends to more complicated model variations. The proposed message passing algorithm is verified and validated on synthetic and acoustic data sets respectively.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116303850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Acoustic Model Adaptation In Reverberant Conditions Using Multi-task Learned Embeddings 基于多任务学习嵌入的混响条件下声学模型自适应
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909579
Aditya Raikar, Meet H. Soni, Ashish Panda, S. Kopparapu
Acoustic environment plays a major role in the performance of a large-scale Automatic Speech Recognition (ASR) system. It becomes a lot more challenging when substantial amount of distortions, such as background noise and reverberations are present. Of late, it has been standard to use i-vectors for Acoustic Model (AM) adaptation. Embeddings from Single Task Learned (STL) neural network systems, such as x-vectors and r-vectors, have also been used to a varying degree of success. This paper proposes the use of Multi Task Learned (MTL) embeddings for large vocabulary hybrid acoustic model adaptation in reverberant environments. MTL embeddings are extracted from an affine layer of the deep neural network trained on multiple tasks such as speaker information and room information. Our experiments show that the proposed MTL embeddings outperform i-vectors, x-vectors and r-vectors for AM adaptation in reverberant conditions. Besides, it has been demonstrated that the proposed MTL-embeddings can be fused with i-vectors to provide further improvement. We provide results on artificially reverberated Librispeech data as well as real world reverberated HRRE data. On Librispeech database, the proposed method provides an improvement of 10.9% and 8.7% relative to i-vector in reverberated test-clean and test-other data respectively, while an improvement of 7% is observed relative to i-vector when the proposed system is tested on HRRE dataset.
声环境对大规模自动语音识别系统的性能起着至关重要的作用。当大量的失真,如背景噪音和混响存在时,它变得更具挑战性。最近,使用i向量进行声学模型(AM)适配已成为标准。来自单任务学习(STL)神经网络系统的嵌入,如x向量和r向量,也已被用于不同程度的成功。本文提出了在混响环境下使用多任务学习(MTL)嵌入来适应大词汇混合声学模型。MTL嵌入是从深度神经网络的仿射层中提取的,深度神经网络是在多个任务(如说话者信息和房间信息)上训练的。我们的实验表明,在混响条件下,所提出的MTL嵌入在调幅适应方面优于i向量、x向量和r向量。此外,还证明了所提出的mtl嵌入可以与i向量融合,以提供进一步的改进。我们提供了人工混响librisspeech数据和真实世界混响HRRE数据的结果。在librisspeech数据库上,该方法在混响test-clean和test-other数据上相对于i-vector分别提高了10.9%和8.7%,在HRRE数据集上相对于i-vector提高了7%。
{"title":"Acoustic Model Adaptation In Reverberant Conditions Using Multi-task Learned Embeddings","authors":"Aditya Raikar, Meet H. Soni, Ashish Panda, S. Kopparapu","doi":"10.23919/eusipco55093.2022.9909579","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909579","url":null,"abstract":"Acoustic environment plays a major role in the performance of a large-scale Automatic Speech Recognition (ASR) system. It becomes a lot more challenging when substantial amount of distortions, such as background noise and reverberations are present. Of late, it has been standard to use i-vectors for Acoustic Model (AM) adaptation. Embeddings from Single Task Learned (STL) neural network systems, such as x-vectors and r-vectors, have also been used to a varying degree of success. This paper proposes the use of Multi Task Learned (MTL) embeddings for large vocabulary hybrid acoustic model adaptation in reverberant environments. MTL embeddings are extracted from an affine layer of the deep neural network trained on multiple tasks such as speaker information and room information. Our experiments show that the proposed MTL embeddings outperform i-vectors, x-vectors and r-vectors for AM adaptation in reverberant conditions. Besides, it has been demonstrated that the proposed MTL-embeddings can be fused with i-vectors to provide further improvement. We provide results on artificially reverberated Librispeech data as well as real world reverberated HRRE data. On Librispeech database, the proposed method provides an improvement of 10.9% and 8.7% relative to i-vector in reverberated test-clean and test-other data respectively, while an improvement of 7% is observed relative to i-vector when the proposed system is tested on HRRE dataset.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126653473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition 基于多项式特征值分解的分布式麦克风阵列语音增强
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909555
Emilie D'Olne, Vincent W. Neo, P. Naylor
As the number of connected devices equipped with multiple microphones increases, scientific interest in distributed microphone array processing grows. Current beamforming methods heavily rely on estimating quantities related to array geometry, which is extremely challenging in real, non-stationary environments. Recent work on polynomial eigenvalue decomposition (PEVD) has shown promising results for speech enhancement in singular arrays without requiring the estimation of any array-related parameter [1]. This work extends these results to the realm of distributed microphone arrays, and further presents a novel framework for speech enhancement in distributed microphone arrays using PEVD. The proposed approach is shown to almost always outperform optimum beamformers located at arrays closest to the desired speaker. Moreover, the proposed approach exhibits very strong robustness to steering vector errors.
随着配备多个麦克风的连接设备数量的增加,对分布式麦克风阵列处理的科学兴趣也在增长。目前的波束形成方法严重依赖于与阵列几何形状相关的估计量,这在真实的非平稳环境中极具挑战性。最近对多项式特征值分解(PEVD)的研究表明,在不需要估计任何与阵列相关的参数的情况下,奇异阵列的语音增强结果很有希望[1]。这项工作将这些结果扩展到分布式麦克风阵列领域,并进一步提出了一个使用PEVD在分布式麦克风阵列中进行语音增强的新框架。所提出的方法几乎总是优于位于最靠近所需扬声器的阵列的最佳波束形成器。此外,该方法对转向矢量误差具有很强的鲁棒性。
{"title":"Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition","authors":"Emilie D'Olne, Vincent W. Neo, P. Naylor","doi":"10.23919/eusipco55093.2022.9909555","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909555","url":null,"abstract":"As the number of connected devices equipped with multiple microphones increases, scientific interest in distributed microphone array processing grows. Current beamforming methods heavily rely on estimating quantities related to array geometry, which is extremely challenging in real, non-stationary environments. Recent work on polynomial eigenvalue decomposition (PEVD) has shown promising results for speech enhancement in singular arrays without requiring the estimation of any array-related parameter [1]. This work extends these results to the realm of distributed microphone arrays, and further presents a novel framework for speech enhancement in distributed microphone arrays using PEVD. The proposed approach is shown to almost always outperform optimum beamformers located at arrays closest to the desired speaker. Moreover, the proposed approach exhibits very strong robustness to steering vector errors.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124141726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
On Interpretability of CNNs for Multimodal Medical Image Segmentation 多模态医学图像分割的cnn可解释性研究
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909776
Srdan Lazendic, Jens Janssens, Shaoguang Huang, A. Pižurica
Despite their huge potential, deep learning-based models are still not trustful enough to warrant their adoption in clinical practice. The research on the interpretability and explainability of deep learning is currently attracting huge attention. Multilayer Convolutional Sparse Coding (ML-CSC) data model, provides a model-based explanation of convolutional neural networks (CNNs). In this article, we extend the ML-CSC framework towards multimodal data for medical image segmentation, and propose a merged joint feature extraction ML-CSC model. This work generalizes and improves upon our previous model, by deriving a more elegant approach that merges feature extraction and convolutional sparse coding in a unified framework. A segmentation study on a multimodal magnetic resonance imaging (MRI) dataset confirms the effectiveness of the proposed approach. We also supply an interpretability study regarding the involved model parameters.
尽管有巨大的潜力,但基于深度学习的模型仍然不够可信,不足以保证在临床实践中采用。关于深度学习的可解释性和可解释性的研究目前备受关注。多层卷积稀疏编码(ML-CSC)数据模型为卷积神经网络(cnn)提供了一种基于模型的解释。在本文中,我们将ML-CSC框架扩展到医学图像分割的多模态数据,并提出了一个合并的联合特征提取ML-CSC模型。这项工作在我们之前的模型上进行了推广和改进,通过推导出一种更优雅的方法,将特征提取和卷积稀疏编码合并在一个统一的框架中。对多模态磁共振成像(MRI)数据集的分割研究证实了所提出方法的有效性。我们还提供了关于所涉及的模型参数的可解释性研究。
{"title":"On Interpretability of CNNs for Multimodal Medical Image Segmentation","authors":"Srdan Lazendic, Jens Janssens, Shaoguang Huang, A. Pižurica","doi":"10.23919/eusipco55093.2022.9909776","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909776","url":null,"abstract":"Despite their huge potential, deep learning-based models are still not trustful enough to warrant their adoption in clinical practice. The research on the interpretability and explainability of deep learning is currently attracting huge attention. Multilayer Convolutional Sparse Coding (ML-CSC) data model, provides a model-based explanation of convolutional neural networks (CNNs). In this article, we extend the ML-CSC framework towards multimodal data for medical image segmentation, and propose a merged joint feature extraction ML-CSC model. This work generalizes and improves upon our previous model, by deriving a more elegant approach that merges feature extraction and convolutional sparse coding in a unified framework. A segmentation study on a multimodal magnetic resonance imaging (MRI) dataset confirms the effectiveness of the proposed approach. We also supply an interpretability study regarding the involved model parameters.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126414184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Ensembles of Gaussian process latent variable models 高斯过程潜在变量模型的集成
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909949
Marzieh Ajirak, Yuhao Liu, P. Djurić
In this paper, we address the classification and dimensionality reduction via ensembles of Gaussian Process Latent Variable Models (GPLVMs). The underlying idea is to have a diverse representation of latent spaces represented by an ensemble of GPLVMs. Each GPLVM of the ensemble has its own projections of the high dimensional observed data on a low dimensional latent space. These models are weighted using importance sampling. Since in practical settings, neither the kernel of the GPLVM nor the dimension of the latent space is known, it is logical to engage an ensemble of GPLVMs based on different kernels and for each of them estimate the dimension of the lower dimensional space. We demonstrate the advantage of working with ensembles for classification and show the performance of dimensionality reduction of our method with numerical simulations.
在本文中,我们通过高斯过程潜在变量模型(gplvm)的集成来解决分类和降维问题。其基本思想是拥有由gplvm集合表示的潜在空间的不同表示。集合的每个GPLVM都有自己的高维观测数据在低维潜在空间上的投影。这些模型使用重要性抽样进行加权。由于在实际设置中,既不知道GPLVM的核,也不知道潜在空间的维数,因此,基于不同核的GPLVM集成并为每个GPLVM估计较低维空间的维数是合乎逻辑的。我们通过数值模拟证明了使用集成进行分类的优势,并展示了我们的方法的降维性能。
{"title":"Ensembles of Gaussian process latent variable models","authors":"Marzieh Ajirak, Yuhao Liu, P. Djurić","doi":"10.23919/eusipco55093.2022.9909949","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909949","url":null,"abstract":"In this paper, we address the classification and dimensionality reduction via ensembles of Gaussian Process Latent Variable Models (GPLVMs). The underlying idea is to have a diverse representation of latent spaces represented by an ensemble of GPLVMs. Each GPLVM of the ensemble has its own projections of the high dimensional observed data on a low dimensional latent space. These models are weighted using importance sampling. Since in practical settings, neither the kernel of the GPLVM nor the dimension of the latent space is known, it is logical to engage an ensemble of GPLVMs based on different kernels and for each of them estimate the dimension of the lower dimensional space. We demonstrate the advantage of working with ensembles for classification and show the performance of dimensionality reduction of our method with numerical simulations.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124418968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Algorithmic Advances for the Adjacency Spectral Embedding 邻接谱嵌入算法研究进展
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909610
Marcelo Fiori, Bernardo Marenco, Federico Larroca, P. Bermolen, G. Mateos
The Random Dot Product Graph (RDPG) is a popular generative graph model for relational data. RDPGs postulate there exist latent positions for each node, and specifies the edge formation probabilities via the inner product of the corresponding latent vectors. The embedding task of estimating these latent positions from observed graphs is usually posed as a non-convex matrix factorization problem. The workhorse Adjacency Spectral Embedding offers an approximate solution obtained via the eigendecomposition of the adjacency matrix, which enjoys solid statistical guarantees but can be computationally intensive and is formally solving a surrogate problem. In this paper, we bring to bear recent non-convex optimization advances and demonstrate their impact to RDPG inference. We develop first-order gradient descent methods to better solve the original optimization problem, and to accommodate broader network embedding applications in an organic way. The effectiveness of the resulting graph representation learning framework is demonstrated on both synthetic and real data. We show the algorithms are scalable, robust to missing network data, and can track the latent positions over time when the graphs are acquired in a streaming fashion.
随机点积图(RDPG)是一种流行的关系数据生成图模型。RDPGs假设每个节点存在潜在位置,并通过相应潜在向量的内积指定边缘形成概率。从观察到的图中估计这些潜在位置的嵌入任务通常被视为一个非凸矩阵分解问题。主要的邻接谱嵌入提供了通过邻接矩阵的特征分解获得的近似解,它具有可靠的统计保证,但计算量大,并且是正式解决代理问题。在本文中,我们介绍了最近的非凸优化进展,并证明了它们对RDPG推理的影响。我们开发了一阶梯度下降方法来更好地解决原始优化问题,并以有机的方式适应更广泛的网络嵌入应用。所得到的图表示学习框架的有效性在合成数据和实际数据上都得到了验证。我们展示了算法具有可扩展性,对缺失的网络数据具有鲁棒性,并且当以流方式获取图形时,可以随着时间的推移跟踪潜在位置。
{"title":"Algorithmic Advances for the Adjacency Spectral Embedding","authors":"Marcelo Fiori, Bernardo Marenco, Federico Larroca, P. Bermolen, G. Mateos","doi":"10.23919/eusipco55093.2022.9909610","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909610","url":null,"abstract":"The Random Dot Product Graph (RDPG) is a popular generative graph model for relational data. RDPGs postulate there exist latent positions for each node, and specifies the edge formation probabilities via the inner product of the corresponding latent vectors. The embedding task of estimating these latent positions from observed graphs is usually posed as a non-convex matrix factorization problem. The workhorse Adjacency Spectral Embedding offers an approximate solution obtained via the eigendecomposition of the adjacency matrix, which enjoys solid statistical guarantees but can be computationally intensive and is formally solving a surrogate problem. In this paper, we bring to bear recent non-convex optimization advances and demonstrate their impact to RDPG inference. We develop first-order gradient descent methods to better solve the original optimization problem, and to accommodate broader network embedding applications in an organic way. The effectiveness of the resulting graph representation learning framework is demonstrated on both synthetic and real data. We show the algorithms are scalable, robust to missing network data, and can track the latent positions over time when the graphs are acquired in a streaming fashion.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127981128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Fast Disparity Estimation from a Single Compressed Light Field Measurement 基于单压缩光场测量的快速视差估计
Pub Date : 2022-08-29 DOI: 10.48550/arXiv.2209.11342
Emmanuel Martinez, Edwin Vargas, H. Arguello
The abundant spatial and angular information from light fields has allowed the development of multiple disparity estimation approaches. However, the acquisition of light fields requires high storage and processing cost, limiting the use of this technology in practical applications. To overcome these drawbacks, the compressive sensing (CS) theory has allowed the development of optical architectures to acquire a single coded light field measurement. This measurement is decoded using an optimization algorithm or deep neural network that requires high computational costs. The traditional approach for disparity estimation from compressed light fields requires first recovering the entire light field and then a post-processing step, thus requiring long times. In contrast, this work proposes a fast disparity estimation from a single compressed measurement by omitting the recovery step required in traditional approaches. Specifically, we propose to jointly optimize an optical architecture for acquiring a single coded light field snapshot and a convolutional neural network (CNN) for estimating the disparity maps. Experimentally, the proposed method estimates disparity maps comparable with those obtained from light fields reconstructed using deep learning approaches. Furthermore, the proposed method is 20 times faster in training and inference than the best method that estimates the disparity from reconstructed light fields.
光场中丰富的空间和角度信息使得多种视差估计方法得以发展。然而,光场的获取需要较高的存储和处理成本,限制了该技术在实际应用中的应用。为了克服这些缺点,压缩感知(CS)理论允许光学架构的发展,以获得单一编码光场测量。该测量使用需要高计算成本的优化算法或深度神经网络进行解码。传统的压缩光场视差估计方法需要先恢复整个光场,然后再进行后处理,耗时较长。相比之下,这项工作提出了一个快速的视差估计从一个单一的压缩测量,省略了传统方法所需的恢复步骤。具体来说,我们建议共同优化用于获取单个编码光场快照的光学架构和用于估计视差图的卷积神经网络(CNN)。在实验中,该方法估计的视差图与使用深度学习方法重建的光场得到的视差图相当。此外,该方法的训练和推理速度比最优的从重建光场估计视差的方法快20倍。
{"title":"Fast Disparity Estimation from a Single Compressed Light Field Measurement","authors":"Emmanuel Martinez, Edwin Vargas, H. Arguello","doi":"10.48550/arXiv.2209.11342","DOIUrl":"https://doi.org/10.48550/arXiv.2209.11342","url":null,"abstract":"The abundant spatial and angular information from light fields has allowed the development of multiple disparity estimation approaches. However, the acquisition of light fields requires high storage and processing cost, limiting the use of this technology in practical applications. To overcome these drawbacks, the compressive sensing (CS) theory has allowed the development of optical architectures to acquire a single coded light field measurement. This measurement is decoded using an optimization algorithm or deep neural network that requires high computational costs. The traditional approach for disparity estimation from compressed light fields requires first recovering the entire light field and then a post-processing step, thus requiring long times. In contrast, this work proposes a fast disparity estimation from a single compressed measurement by omitting the recovery step required in traditional approaches. Specifically, we propose to jointly optimize an optical architecture for acquiring a single coded light field snapshot and a convolutional neural network (CNN) for estimating the disparity maps. Experimentally, the proposed method estimates disparity maps comparable with those obtained from light fields reconstructed using deep learning approaches. Furthermore, the proposed method is 20 times faster in training and inference than the best method that estimates the disparity from reconstructed light fields.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127465257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Group equivariant networks for leakage detection in vacuum bagging 真空装袋泄漏检测的群等变网络
Pub Date : 2022-08-29 DOI: 10.23919/eusipco55093.2022.9909715
Christoph Brauer, D. Lorenz, Lionel Tondji
The incorporation of prior knowledge into the ma-chine learning pipeline is subject of informed machine learning. Spatial invariances constitute a class of prior knowledge that can be taken into account especially in the design of model architectures or through virtual training examples. In this contribution, we investigate fully connected neural network architectures that are equivariant with respect to the dihedral group of order eight. This is practically motivated by the application of leakage detection in vacuum bagging which plays an important role in the manufacturing of fiber composite components. Our approach for the derivation of an equivariant architecture is constructive and transferable to other symmetry groups. It starts from a standard network architecture and results in a specific kind of weight sharing in each layer. In numerical experiments, we compare equivariant and standard networks on a novel leakage detection dataset. Our results indicate that group equivariant networks can capture the application specific prior knowledge much better than standard networks, even if the latter are trained on augmented data.
将先验知识整合到机器学习管道中是知情机器学习的主题。空间不变性构成了一类先验知识,特别是在设计模型架构或通过虚拟训练示例时可以考虑到它。在这篇文章中,我们研究了关于八阶二面体群的等变的全连接神经网络架构。这实际上是由于真空装袋中泄漏检测的应用,在纤维复合材料部件的制造中起着重要的作用。我们对等变结构的推导方法是建设性的,并且可转移到其他对称群。它从一个标准的网络体系结构开始,并在每一层中产生一种特定的权重共享。在数值实验中,我们在一个新的泄漏检测数据集上比较了等变网络和标准网络。我们的研究结果表明,群体等变网络可以比标准网络更好地捕获特定应用的先验知识,即使后者是在增强数据上训练的。
{"title":"Group equivariant networks for leakage detection in vacuum bagging","authors":"Christoph Brauer, D. Lorenz, Lionel Tondji","doi":"10.23919/eusipco55093.2022.9909715","DOIUrl":"https://doi.org/10.23919/eusipco55093.2022.9909715","url":null,"abstract":"The incorporation of prior knowledge into the ma-chine learning pipeline is subject of informed machine learning. Spatial invariances constitute a class of prior knowledge that can be taken into account especially in the design of model architectures or through virtual training examples. In this contribution, we investigate fully connected neural network architectures that are equivariant with respect to the dihedral group of order eight. This is practically motivated by the application of leakage detection in vacuum bagging which plays an important role in the manufacturing of fiber composite components. Our approach for the derivation of an equivariant architecture is constructive and transferable to other symmetry groups. It starts from a standard network architecture and results in a specific kind of weight sharing in each layer. In numerical experiments, we compare equivariant and standard networks on a novel leakage detection dataset. Our results indicate that group equivariant networks can capture the application specific prior knowledge much better than standard networks, even if the latter are trained on augmented data.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130457233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 30th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1