首页 > 最新文献

2018 26th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Time-Frequency-Bin-Wise Beamformer Selection and Masking for Speech Enhancement in Underdetermined Noisy Scenarios 欠确定噪声环境下语音增强的时频方向波束形成器选择和掩蔽
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553299
K. Yamaoka, Andreas Brendel, Nobutaka Ono, S. Makino, M. Buerger, Takeshi Yamada, Walter Kellermann
In this paper, we present a speech enhancement method using two microphones for underdetermined situations. A conventional speech enhancement method for underdetermined situations is time-frequency masking, where speech is enhanced by multiplying zero or one to each time-frequency component appropriately. Extending this method, we switch multiple preconstructed beamformers at each time-frequency bin, each of which suppresses a particular interferer. This method can suppress an interferer even when both the target and an interferer are simultaneously active at a given time-frequency bin. As a switching criterion, selection of minimum value of the outputs of the all beamformers at each time-frequency bin is investigated. Additionally, another method using direction of arrival estimation is also investigated. In experiments, we confirmed that the proposed methods were superior to conventional time-frequency masking and fixed beamforming in the performance of speech enhancement.
在本文中,我们提出了一种使用双麦克风的语音增强方法。对于欠确定情况,传统的语音增强方法是时频掩蔽,通过对每个时频分量适当地乘以0或1来增强语音。扩展此方法,我们在每个时频本处切换多个预构造波束形成器,每个波束形成器抑制一个特定的干扰。这种方法可以抑制干扰源,即使目标和干扰源在给定的时间-频率域中同时处于活动状态。研究了各波束形成器在各时频本处输出最小值的选择作为切换准则。此外,还研究了另一种利用到达方向估计的方法。在实验中,我们证实了所提出的方法在语音增强性能上优于传统的时频掩蔽和固定波束形成。
{"title":"Time-Frequency-Bin-Wise Beamformer Selection and Masking for Speech Enhancement in Underdetermined Noisy Scenarios","authors":"K. Yamaoka, Andreas Brendel, Nobutaka Ono, S. Makino, M. Buerger, Takeshi Yamada, Walter Kellermann","doi":"10.23919/EUSIPCO.2018.8553299","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553299","url":null,"abstract":"In this paper, we present a speech enhancement method using two microphones for underdetermined situations. A conventional speech enhancement method for underdetermined situations is time-frequency masking, where speech is enhanced by multiplying zero or one to each time-frequency component appropriately. Extending this method, we switch multiple preconstructed beamformers at each time-frequency bin, each of which suppresses a particular interferer. This method can suppress an interferer even when both the target and an interferer are simultaneously active at a given time-frequency bin. As a switching criterion, selection of minimum value of the outputs of the all beamformers at each time-frequency bin is investigated. Additionally, another method using direction of arrival estimation is also investigated. In experiments, we confirmed that the proposed methods were superior to conventional time-frequency masking and fixed beamforming in the performance of speech enhancement.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"249 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132166895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Sparsity Based Framework for Spatial Sound Reproduction in Spherical Harmonic Domain 基于稀疏度的球谐域空间声音再现框架
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553209
Gyanajyoti Routray, R. Hegde
In this paper, a novel sparsity based framework is proposed for accurate spatial sound field reproduction in spherical harmonic domain. The proposed framework can effectively reduce the number of loudspeakers required to reproduce the desired sound field using higher order ambisonics (HOA) over a fixed listening area. Although H OA provides accurate reproduction of spatial sound, it has a disadvantage in terms of the restriction on the area of sound reproduction. This area can be increased with the increase in the number of loudspeakers during reproduction. In order to limit the use of a large number of loudspeakers the sparse nature of the weight vector in the HOA signal model is utilized in this work. The problem of obtaining the weight vector is first formulated as a constrained optimization problem which is difficult to solve due to orthogonality property of the spherical harmonic matrix. This problem is therefore reformulated to exploit the sparse nature of the weight vector. The solution is then obtained by using the Bregman iteration method. Experiments on sound field reproduction in free space using the proposed sparsity based method are conducted using loudspeaker arrays. Performance improvements are noted when compared to least squares and compressed sensing methods in terms of sound field reproduction accuracy, subjective, and objective evaluations.
本文提出了一种基于稀疏度的球谐域空间声场精确再现框架。所提出的框架可以有效地减少在固定收听区域使用高阶双声系统(HOA)再现所需声场所需的扬声器数量。虽然hoa提供了空间声音的精确再现,但它在声音再现面积的限制方面存在缺点。在重放过程中,这个区域可以随着扬声器数量的增加而增加。为了限制大量扬声器的使用,本文利用了HOA信号模型中权向量的稀疏特性。首先将权向量的求解问题表述为一个由于球调和矩阵的正交性而难以求解的约束优化问题。因此,这个问题被重新表述,以利用权向量的稀疏性质。然后用布雷格曼迭代法求解。利用该方法在自由空间进行了基于稀疏度的声场再现实验。在声场再现精度、主观和客观评价方面,与最小二乘和压缩感知方法相比,性能有所提高。
{"title":"Sparsity Based Framework for Spatial Sound Reproduction in Spherical Harmonic Domain","authors":"Gyanajyoti Routray, R. Hegde","doi":"10.23919/EUSIPCO.2018.8553209","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553209","url":null,"abstract":"In this paper, a novel sparsity based framework is proposed for accurate spatial sound field reproduction in spherical harmonic domain. The proposed framework can effectively reduce the number of loudspeakers required to reproduce the desired sound field using higher order ambisonics (HOA) over a fixed listening area. Although H OA provides accurate reproduction of spatial sound, it has a disadvantage in terms of the restriction on the area of sound reproduction. This area can be increased with the increase in the number of loudspeakers during reproduction. In order to limit the use of a large number of loudspeakers the sparse nature of the weight vector in the HOA signal model is utilized in this work. The problem of obtaining the weight vector is first formulated as a constrained optimization problem which is difficult to solve due to orthogonality property of the spherical harmonic matrix. This problem is therefore reformulated to exploit the sparse nature of the weight vector. The solution is then obtained by using the Bregman iteration method. Experiments on sound field reproduction in free space using the proposed sparsity based method are conducted using loudspeaker arrays. Performance improvements are noted when compared to least squares and compressed sensing methods in terms of sound field reproduction accuracy, subjective, and objective evaluations.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132396687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Path Orthogonal Matching Pursuit for k-Sparse Image Reconstruction k-稀疏图像重构的路径正交匹配追踪
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553497
T. Emerson, T. Doster, C. Olson
We introduce a path-augmentation step to the standard orthogonal matching pursuit algorithm. Our augmentation may be applied to any algorithm that relies on the selection and sorting of high-correlation atoms during an analysis or identification phase by generating a “path” between the two highest-correlation atoms. Here we investigate two types of path: a linear combination (Euclidean geodesic) and a construction relying on an optimal transport map (2-Wasserstein geodesic). We test our extension by generating k-sparse reconstructions of faces using an eigen-face dictionary learned from a subset of the data. We show that our method achieves lower reconstruction error for fixed sparsity levels than either orthogonal matching pursuit or generalized orthogonal matching pursuit.
我们在标准的正交匹配追踪算法中引入了一个路径增广步骤。我们的扩展可以应用于任何在分析或识别阶段依赖于高相关原子的选择和排序的算法,通过生成两个最高相关原子之间的“路径”。在这里,我们研究了两种类型的路径:线性组合(欧几里得测地线)和依赖于最优运输图的构造(2-Wasserstein测地线)。我们通过使用从数据子集中学习到的特征-面部字典生成人脸的k-稀疏重建来测试我们的扩展。结果表明,该方法在固定稀疏度下的重建误差比正交匹配追踪和广义正交匹配追踪都要小。
{"title":"Path Orthogonal Matching Pursuit for k-Sparse Image Reconstruction","authors":"T. Emerson, T. Doster, C. Olson","doi":"10.23919/EUSIPCO.2018.8553497","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553497","url":null,"abstract":"We introduce a path-augmentation step to the standard orthogonal matching pursuit algorithm. Our augmentation may be applied to any algorithm that relies on the selection and sorting of high-correlation atoms during an analysis or identification phase by generating a “path” between the two highest-correlation atoms. Here we investigate two types of path: a linear combination (Euclidean geodesic) and a construction relying on an optimal transport map (2-Wasserstein geodesic). We test our extension by generating k-sparse reconstructions of faces using an eigen-face dictionary learned from a subset of the data. We show that our method achieves lower reconstruction error for fixed sparsity levels than either orthogonal matching pursuit or generalized orthogonal matching pursuit.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132661763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An acoustic image-source characterisation of surface profiles 表面轮廓的声像源表征
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553206
P. Dawson, E. D. Sena, P. Naylor
The image-source method models the specular reflection from a plane by means of a secondary source positioned at the source's reflected image. The method has been widely used in acoustics to model the reverberant field of rectangular rooms, but can also be used for general-shaped rooms and non-flat reflectors. This paper explores the relationship between the physical properties of a non-flat reflector and the statistical properties of the associated cloud of image-sources. It is shown here that the standard deviation of the image-sources is strongly correlated with the ratio between depth and width of the reflector's spatial features.
像源法通过位于源的反射像处的二次源对平面的镜面反射进行建模。该方法在声学中已广泛用于矩形房间的混响场建模,但也可用于一般形状房间和非平面反射器。本文探讨了非平面反射器的物理特性与相关图像源云的统计特性之间的关系。如图所示,图像源的标准偏差与反射器空间特征的深度和宽度之比密切相关。
{"title":"An acoustic image-source characterisation of surface profiles","authors":"P. Dawson, E. D. Sena, P. Naylor","doi":"10.23919/EUSIPCO.2018.8553206","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553206","url":null,"abstract":"The image-source method models the specular reflection from a plane by means of a secondary source positioned at the source's reflected image. The method has been widely used in acoustics to model the reverberant field of rectangular rooms, but can also be used for general-shaped rooms and non-flat reflectors. This paper explores the relationship between the physical properties of a non-flat reflector and the statistical properties of the associated cloud of image-sources. It is shown here that the standard deviation of the image-sources is strongly correlated with the ratio between depth and width of the reflector's spatial features.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"161 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134500192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Computationally Efficient Image Super Resolution from Totally Aliased Low Resolution Images 从完全混叠的低分辨率图像中计算高效的图像超分辨率
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553237
Adarsh Kumar, N. Narendra, P. Balamuralidhar, M. Chandra
This paper considers the problem of super-resolution (SR) image reconstruction from a set of totally aliased low resolution (LR) images with different unknown sub-pixel offsets. By assuming the translational motion model, a linear compact representation between the LR image spectrums and SR image spectrum, based on multi-coset sampling is provided. Based on this model, we formulate the joint estimation of the unknown shifts and SR image spectrum as a dictionary learning problem and alternating minimization approach is employed to solve this joint estimation. Two different approaches for obtaining the SR image; one based on estimated shifts and another based on estimate SR spectrum are described. The significant advantage of the proposed approach is the smaller matrix sizes to be handled during the computation; typically on the order of number of images and enhancement factors, and is completely independent on the actual dimensions of LR and SR images, hence requiring significantly lesser resources than the current state of the art approaches. Brief simulation results are also provided to demonstrate the efficacy of this approach.
本文研究了一组具有不同未知亚像素偏移量的全混叠低分辨率图像的超分辨率图像重建问题。通过假设平移运动模型,给出了基于多共集采样的LR图像频谱和SR图像频谱之间的线性紧凑表示。基于该模型,我们将未知位移和SR图像频谱的联合估计表述为字典学习问题,并采用交替最小化方法求解该联合估计。获取SR图像的两种不同方法;描述了一种基于估计位移的方法和一种基于估计SR谱的方法。该方法的显著优点是在计算过程中需要处理的矩阵尺寸较小;通常取决于图像数量和增强因子的顺序,并且完全独立于LR和SR图像的实际尺寸,因此比目前最先进的方法所需的资源要少得多。简要的仿真结果验证了该方法的有效性。
{"title":"Computationally Efficient Image Super Resolution from Totally Aliased Low Resolution Images","authors":"Adarsh Kumar, N. Narendra, P. Balamuralidhar, M. Chandra","doi":"10.23919/EUSIPCO.2018.8553237","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553237","url":null,"abstract":"This paper considers the problem of super-resolution (SR) image reconstruction from a set of totally aliased low resolution (LR) images with different unknown sub-pixel offsets. By assuming the translational motion model, a linear compact representation between the LR image spectrums and SR image spectrum, based on multi-coset sampling is provided. Based on this model, we formulate the joint estimation of the unknown shifts and SR image spectrum as a dictionary learning problem and alternating minimization approach is employed to solve this joint estimation. Two different approaches for obtaining the SR image; one based on estimated shifts and another based on estimate SR spectrum are described. The significant advantage of the proposed approach is the smaller matrix sizes to be handled during the computation; typically on the order of number of images and enhancement factors, and is completely independent on the actual dimensions of LR and SR images, hence requiring significantly lesser resources than the current state of the art approaches. Brief simulation results are also provided to demonstrate the efficacy of this approach.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133822811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Fusion via Tensor Network Summation 基于张量网络求和的特征融合
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553605
G. G. Calvi, I. Kisil, D. Mandic
Tensor networks (TNs) have been earning considerable attention as multiway data analysis tools owing to their ability to tackle the curse of dimensionality through the representation of large-scale tensors via smaller-scale interconnections of their intrinsic features. However, despite the obvious benefits, the current treatment of TNs as stand-alone entities does not take full advantage of their underlying structure and the associated feature localization. To this end, we exploit the analogy with feature fusion to propose a rigorous framework for the combination of TNs, with a particular focus on their summation as a natural way of their combination. The proposed framework is shown to allow for feature combination of any number of tensors, as long as their TN representation topologies are isomorphic. Simulations involving multi-class classification of an image dataset show the benefits of the proposed framework.
张量网络(TNs)作为一种多路数据分析工具,由于其能够通过其内在特征的较小尺度互连来表示大规模张量,从而解决维度的诅咒,因此受到了相当大的关注。然而,尽管有明显的好处,目前将神经网络作为独立实体处理并没有充分利用其底层结构和相关特征定位。为此,我们利用与特征融合的类比,提出了一个严格的tnn组合框架,特别关注它们的总和作为它们组合的自然方式。所提出的框架被证明允许任意数量张量的特征组合,只要它们的TN表示拓扑是同构的。涉及图像数据集的多类分类的仿真显示了该框架的优点。
{"title":"Feature Fusion via Tensor Network Summation","authors":"G. G. Calvi, I. Kisil, D. Mandic","doi":"10.23919/EUSIPCO.2018.8553605","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553605","url":null,"abstract":"Tensor networks (TNs) have been earning considerable attention as multiway data analysis tools owing to their ability to tackle the curse of dimensionality through the representation of large-scale tensors via smaller-scale interconnections of their intrinsic features. However, despite the obvious benefits, the current treatment of TNs as stand-alone entities does not take full advantage of their underlying structure and the associated feature localization. To this end, we exploit the analogy with feature fusion to propose a rigorous framework for the combination of TNs, with a particular focus on their summation as a natural way of their combination. The proposed framework is shown to allow for feature combination of any number of tensors, as long as their TN representation topologies are isomorphic. Simulations involving multi-class classification of an image dataset show the benefits of the proposed framework.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"525 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133733408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Graph representation using mutual information for graph model discrimination 利用互信息的图表示进行图模型判别
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553169
Francisco Hawas, P. Djurić
We present a novel approach of graph representation based on mutual information of a random walk in a graph. This representation, as any global metric of a graph, can be used to identify the model generator of the observed network. In this study, we use our graph representation combined with Random Forest (RF) to discriminate between Erdos-Renyi (ER), Stochastic Block Model (SBM) and Planted Clique (PC) models. We also combine our graph representation with a Squared Mahalanobis Distance (SMD)-based test to reject a model given an observed network. We test the proposed method with computer simulations.
提出了一种基于图中随机游走互信息的图表示方法。这种表示,作为图的任何全局度量,可以用来识别观察到的网络的模型生成器。在这项研究中,我们使用我们的图表示结合随机森林(RF)来区分Erdos-Renyi (ER),随机块模型(SBM)和植团(PC)模型。我们还将我们的图表示与基于平方马氏距离(SMD)的测试相结合,以拒绝给定观察网络的模型。我们用计算机模拟测试了所提出的方法。
{"title":"Graph representation using mutual information for graph model discrimination","authors":"Francisco Hawas, P. Djurić","doi":"10.23919/EUSIPCO.2018.8553169","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553169","url":null,"abstract":"We present a novel approach of graph representation based on mutual information of a random walk in a graph. This representation, as any global metric of a graph, can be used to identify the model generator of the observed network. In this study, we use our graph representation combined with Random Forest (RF) to discriminate between Erdos-Renyi (ER), Stochastic Block Model (SBM) and Planted Clique (PC) models. We also combine our graph representation with a Squared Mahalanobis Distance (SMD)-based test to reject a model given an observed network. We test the proposed method with computer simulations.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115208731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Satellite Image Segmentation with Deep Residual Architectures for Time-Critical Applications 基于深度残差结构的卫星图像分割
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553545
Sina Ghassemi, C. Sandu, A. Fiandrotti, F. G. Tonolo, P. Boccardo, Gianluca Francini, E. Magli
We address the problem of training a convolutional neural network for satellite images segmentation in emergency situations, where response time constraints prevent training the network from scratch. Such case is particularly challenging due to the large intra-class statistics variations between training images and images to be segmented captured at different locations by different sensors. We propose a convolutional encoder-decoder network architecture where the encoder builds upon a residual architecture. We show that our proposed architecture enables learning features suitable to generalize the learning process across images with different statistics. Our architecture can accurately segment images that have no reference in the training set, whereas a minimal refinement of the trained network significantly boosts the segmentation accuracy.
我们解决了在紧急情况下训练卷积神经网络用于卫星图像分割的问题,在这种情况下,响应时间的限制阻止了从头开始训练网络。这种情况特别具有挑战性,因为训练图像和由不同传感器在不同位置捕获的待分割图像之间的类内统计差异很大。我们提出了一种卷积编码器-解码器网络架构,其中编码器建立在残差架构之上。我们表明,我们提出的架构使学习特征适用于具有不同统计量的图像的学习过程。我们的架构可以准确地分割训练集中没有参考的图像,而对训练网络进行最小的细化可以显著提高分割精度。
{"title":"Satellite Image Segmentation with Deep Residual Architectures for Time-Critical Applications","authors":"Sina Ghassemi, C. Sandu, A. Fiandrotti, F. G. Tonolo, P. Boccardo, Gianluca Francini, E. Magli","doi":"10.23919/EUSIPCO.2018.8553545","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553545","url":null,"abstract":"We address the problem of training a convolutional neural network for satellite images segmentation in emergency situations, where response time constraints prevent training the network from scratch. Such case is particularly challenging due to the large intra-class statistics variations between training images and images to be segmented captured at different locations by different sensors. We propose a convolutional encoder-decoder network architecture where the encoder builds upon a residual architecture. We show that our proposed architecture enables learning features suitable to generalize the learning process across images with different statistics. Our architecture can accurately segment images that have no reference in the training set, whereas a minimal refinement of the trained network significantly boosts the segmentation accuracy.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115398721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Asymmetric Supercardioid Beamforming Using Circular Microphone Arrays 利用圆形传声器阵列的非对称超心波束形成
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553582
Y. Buchris, I. Cohen, J. Benesty
We present a joint-diagonalization based approach for a closed-form solution of the asymmetric supercardioid, implemented with circular differential microphone arrays. These arrays are characterized as compact frequency-invariant su-perdirective beamformers, allowing perfect steering for all azimuthal directions. Experimental results show that the asymmetric supercardioid yields superior performance in terms of white noise gain, directivity factor, and front-to-back ratio, when additional directional attenuation constraints are imposed in order to suppress interfering signals.
我们提出了一种基于联合对角化的非对称超心线闭合解的方法,该方法由圆形差分传声器阵列实现。这些阵列的特点是紧凑的频率不变超定向波束形成器,允许所有方位方向的完美控制。实验结果表明,当施加额外的方向衰减约束以抑制干扰信号时,非对称超心线在白噪声增益、指向性因子和前后比方面具有优异的性能。
{"title":"Asymmetric Supercardioid Beamforming Using Circular Microphone Arrays","authors":"Y. Buchris, I. Cohen, J. Benesty","doi":"10.23919/EUSIPCO.2018.8553582","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553582","url":null,"abstract":"We present a joint-diagonalization based approach for a closed-form solution of the asymmetric supercardioid, implemented with circular differential microphone arrays. These arrays are characterized as compact frequency-invariant su-perdirective beamformers, allowing perfect steering for all azimuthal directions. Experimental results show that the asymmetric supercardioid yields superior performance in terms of white noise gain, directivity factor, and front-to-back ratio, when additional directional attenuation constraints are imposed in order to suppress interfering signals.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"167 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124162704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Motor Condition Monitoring by Empirical Wavelet Transform 基于经验小波变换的电机状态监测
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553566
L. Eren, Y. Çekiç, M. Devaney
Bearing faults are by far the biggest single source of motor failures. Both fast Fourier (frequency based) and wavelet (time-scale based) transforms are used commonly in analyzing raw vibration or current data to detect bearing faults. A hybrid method, Empirical Wavelet Transform (EWT), is used in this study to provide better accuracy in detecting faults from bearing vibration data. In the proposed method, the raw vibration data is processed by fast Fourier transform. Then, the Fourier spectrum of the vibration signal is divided into segments adaptively with each segment containing part of the frequency band. Next, the wavelet transform is applied to all segments. Finally, inverse Fourier transform is utilized to obtain time domain signal with the frequency band of interest from EWT coefficients to detect bearing faults. The bearing fault related segments are identified by comparing rms values of healthy bearing vibration signal segments with the same segments of faulty bearing. The main advantage of the proposed method is the possibility of extracting the segments of interest from the original vibration data for determining both fault type and severity.
轴承故障是迄今为止电机故障的最大单一来源。快速傅立叶变换(基于频率)和小波变换(基于时间尺度)通常用于分析原始振动或电流数据以检测轴承故障。本文采用经验小波变换(EWT)混合方法对轴承振动数据进行故障检测,提高了故障检测的精度。该方法对原始振动数据进行快速傅里叶变换处理。然后,自适应地将振动信号的傅立叶谱分成若干段,每段包含部分频带;接下来,将小波变换应用于所有段。最后,利用傅里叶反变换从小波变换系数中得到感兴趣频带的时域信号,进行轴承故障检测。通过比较健康轴承振动信号段与故障轴承振动信号段的均方根值,识别出轴承故障相关段。该方法的主要优点是可以从原始振动数据中提取出感兴趣的部分,从而确定故障类型和严重程度。
{"title":"Motor Condition Monitoring by Empirical Wavelet Transform","authors":"L. Eren, Y. Çekiç, M. Devaney","doi":"10.23919/EUSIPCO.2018.8553566","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553566","url":null,"abstract":"Bearing faults are by far the biggest single source of motor failures. Both fast Fourier (frequency based) and wavelet (time-scale based) transforms are used commonly in analyzing raw vibration or current data to detect bearing faults. A hybrid method, Empirical Wavelet Transform (EWT), is used in this study to provide better accuracy in detecting faults from bearing vibration data. In the proposed method, the raw vibration data is processed by fast Fourier transform. Then, the Fourier spectrum of the vibration signal is divided into segments adaptively with each segment containing part of the frequency band. Next, the wavelet transform is applied to all segments. Finally, inverse Fourier transform is utilized to obtain time domain signal with the frequency band of interest from EWT coefficients to detect bearing faults. The bearing fault related segments are identified by comparing rms values of healthy bearing vibration signal segments with the same segments of faulty bearing. The main advantage of the proposed method is the possibility of extracting the segments of interest from the original vibration data for determining both fault type and severity.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114360911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2018 26th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1