首页 > 最新文献

2018 26th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Modeling Time of Arrival Probability Distribution and TDOA Bias in Acoustic Emission Testing 声发射测试中到达时间概率分布和TDOA偏差的建模
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553447
Carlos A. Prete Junior, V. Nascimento, C. G. Lopes
Acoustic emission testing is widely used by industry to detect and localize faults in structures, but estimated source positions often show significant bias in real tests as a consequence of Time Difference of Arrival (TDOA) bias. In this work, a model for TDOA bias is developed considering the time of arrival was estimated using the fixed threshold algorithm, as well as theoretical upper and lower bounds for it. In addition, we derive the time of arrival probability distribution function in terms of the noise distribution and acoustic emission waveform for the fixed threshold algorithm, showing that, contrary to usual practice, it in general cannot be well approximated by a Gaussian distribution.
声发射测试在工业上被广泛用于结构故障的检测和定位,但由于到达时差(TDOA)偏差,在实际测试中估计的震源位置往往会出现明显的偏差。本文建立了基于固定阈值算法估计到达时间的TDOA偏差模型,并给出了该模型的理论上限和下限。此外,我们根据固定阈值算法的噪声分布和声发射波形导出了到达时间的概率分布函数,表明与通常的做法相反,它通常不能很好地近似于高斯分布。
{"title":"Modeling Time of Arrival Probability Distribution and TDOA Bias in Acoustic Emission Testing","authors":"Carlos A. Prete Junior, V. Nascimento, C. G. Lopes","doi":"10.23919/EUSIPCO.2018.8553447","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553447","url":null,"abstract":"Acoustic emission testing is widely used by industry to detect and localize faults in structures, but estimated source positions often show significant bias in real tests as a consequence of Time Difference of Arrival (TDOA) bias. In this work, a model for TDOA bias is developed considering the time of arrival was estimated using the fixed threshold algorithm, as well as theoretical upper and lower bounds for it. In addition, we derive the time of arrival probability distribution function in terms of the noise distribution and acoustic emission waveform for the fixed threshold algorithm, showing that, contrary to usual practice, it in general cannot be well approximated by a Gaussian distribution.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114474295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unified Stochastic Reverberation Modeling 统一随机混响模型
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553562
R. Badeau
In the field of room acoustics, it is well known that reverberation can be characterized statistically in a particular region of the time-frequency domain (after the transition time and above Schroeder's frequency). Since the 1950s, various formulas have been established, focusing on particular aspects of reverberation: exponential decay over time, correlations between frequencies, correlations between sensors at each frequency, and time-frequency distribution. In this paper, we introduce a new stochastic reverberation model, that permits us to retrieve all these well-known results within a common mathematical framework. To the best of our knowledge, this is the first time that such a unification work is presented. The benefits are multiple: several new formulas generalizing the classical results are established, that jointly characterize the spatial, temporal and spectral properties of late reverberation.
在室内声学领域,众所周知,混响可以在时频域的特定区域(在过渡时间之后,在施罗德频率之上)进行统计表征。自20世纪50年代以来,已经建立了各种公式,重点关注混响的特定方面:随时间的指数衰减,频率之间的相关性,每个频率的传感器之间的相关性以及时频分布。在本文中,我们引入了一个新的随机混响模型,使我们能够在一个共同的数学框架内检索所有这些众所周知的结果。据我们所知,这是第一次提出这样一个统一的工作。这样做的好处是多方面的:建立了几个推广经典结果的新公式,这些公式共同表征了晚混响的空间、时间和频谱特性。
{"title":"Unified Stochastic Reverberation Modeling","authors":"R. Badeau","doi":"10.23919/EUSIPCO.2018.8553562","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553562","url":null,"abstract":"In the field of room acoustics, it is well known that reverberation can be characterized statistically in a particular region of the time-frequency domain (after the transition time and above Schroeder's frequency). Since the 1950s, various formulas have been established, focusing on particular aspects of reverberation: exponential decay over time, correlations between frequencies, correlations between sensors at each frequency, and time-frequency distribution. In this paper, we introduce a new stochastic reverberation model, that permits us to retrieve all these well-known results within a common mathematical framework. To the best of our knowledge, this is the first time that such a unification work is presented. The benefits are multiple: several new formulas generalizing the classical results are established, that jointly characterize the spatial, temporal and spectral properties of late reverberation.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115003437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Free-Walking 3D Pedestrian Large Trajectory Reconstruction from IMU Sensors 基于IMU传感器的自由行走三维行人大轨迹重建
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553462
Haoyu Li, S. Derrode, L. Benyoussef, W. Pieczynski
This paper presents a pedestrian navigation algorithm based on a foot-mounted 9 Degree of Freedom (DOF) Inertial Measurement Unit (IMU), which provides tri-axial accelerations, angular rates and magnetics. Most algorithms used worldwide employ Zero Velocity Update (ZUPT) to reduce the tremendous error of integration from acceleration to displacement. The crucial part in ZUPT is to detect stance phase precisely. A cyclic left-to-right style Hidden Markov Model is introduced in this work which is able to appropriately model the periodic nature of signals. Stance detection is then made unsupervised by using a suited learning algorithm. Then orientation estimation is performed independently by a quaternion-based method, a simplified error-state Extended Kalman Filter (EKF) assists trajectory reconstruction in 3D space, neither extra method nor prior knowledge is needed to estimate the height. Experimental results on large free-walking trajectories show that the proposed algorithm can provide more accurate locations, especially in z-axis compared to competitive algorithms, w.r.t. to a ground-truth obtained using OpenStreetMap.
本文提出了一种基于足部9自由度惯性测量单元(IMU)的行人导航算法,该单元提供三轴加速度、角速度和磁力。为了减小从加速度到位移积分的巨大误差,世界上大多数算法都采用了零速度更新(ZUPT)。ZUPT的关键是精确地检测姿态相位。本文介绍了一种从左到右的循环隐马尔可夫模型,该模型能够适当地模拟信号的周期性。然后使用合适的学习算法使姿态检测无监督。然后采用基于四元数的方法独立进行方向估计,利用简化的误差状态扩展卡尔曼滤波(EKF)辅助三维空间的轨迹重建,高度估计不需要额外的方法和先验知识。在大型自由行走轨迹上的实验结果表明,与竞争算法相比,该算法可以提供更精确的位置,特别是在z轴上,w.r.t.使用OpenStreetMap获得的地面真实值。
{"title":"Free-Walking 3D Pedestrian Large Trajectory Reconstruction from IMU Sensors","authors":"Haoyu Li, S. Derrode, L. Benyoussef, W. Pieczynski","doi":"10.23919/EUSIPCO.2018.8553462","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553462","url":null,"abstract":"This paper presents a pedestrian navigation algorithm based on a foot-mounted 9 Degree of Freedom (DOF) Inertial Measurement Unit (IMU), which provides tri-axial accelerations, angular rates and magnetics. Most algorithms used worldwide employ Zero Velocity Update (ZUPT) to reduce the tremendous error of integration from acceleration to displacement. The crucial part in ZUPT is to detect stance phase precisely. A cyclic left-to-right style Hidden Markov Model is introduced in this work which is able to appropriately model the periodic nature of signals. Stance detection is then made unsupervised by using a suited learning algorithm. Then orientation estimation is performed independently by a quaternion-based method, a simplified error-state Extended Kalman Filter (EKF) assists trajectory reconstruction in 3D space, neither extra method nor prior knowledge is needed to estimate the height. Experimental results on large free-walking trajectories show that the proposed algorithm can provide more accurate locations, especially in z-axis compared to competitive algorithms, w.r.t. to a ground-truth obtained using OpenStreetMap.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114635540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Inferring User Gender from User Generated Visual Content on a Deep Semantic Space 基于深度语义空间从用户生成的视觉内容推断用户性别
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553425
David Semedo, João Magalhães, Flávio Martins
In this paper we address the task of gender classification on picture sharing social media networks such as Instagram and Flickr. We aim to infer the gender of an user given only a small set of the images shared in its profile. We make the assumption that user's images contain a collection of visual elements that implicitly encode discriminative patterns that allow inferring its gender, in a language independent way. This information can then be used in personalisation and recommendation. Our main hypothesis is that semantic visual features are more adequate for discriminating high-level classes. The gender detection task is formalised as: given an user's profile, represented as a bag of images, we want to infer the gender of the user. Social media profiles can be noisy and contain confounding factors, therefore we classify bags of user-profile‘s images to provide a more robust prediction. Experiments using a dataset from the picture sharing social network Instagram show that the use of multiple images is key to improve detection performance. Moreover, we verify that deep semantic features are more suited for gender detection than low-level image representations. The methods proposed can infer the gender with precision scores higher than 0.825, and the best performing method achieving 0.911 precision.
在本文中,我们解决了图片共享社交媒体网络(如Instagram和Flickr)的性别分类任务。我们的目标是仅根据用户个人资料中共享的一小部分图像来推断用户的性别。我们假设用户的图像包含一组视觉元素,这些元素隐式编码了判别模式,可以以一种独立于语言的方式推断其性别。这些信息可以用于个性化和推荐。我们的主要假设是语义视觉特征更适合于判别高级类。性别检测任务形式化为:给定用户的个人资料,表示为一袋图像,我们想要推断用户的性别。社交媒体个人资料可能是嘈杂的,并且包含混淆因素,因此我们对用户个人资料图像进行分类,以提供更稳健的预测。使用图片分享社交网络Instagram的数据集进行的实验表明,使用多张图像是提高检测性能的关键。此外,我们验证了深度语义特征比低级图像表征更适合于性别检测。所提方法的性别推断精度得分均在0.825以上,其中性能最好的方法达到了0.911的精度。
{"title":"Inferring User Gender from User Generated Visual Content on a Deep Semantic Space","authors":"David Semedo, João Magalhães, Flávio Martins","doi":"10.23919/EUSIPCO.2018.8553425","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553425","url":null,"abstract":"In this paper we address the task of gender classification on picture sharing social media networks such as Instagram and Flickr. We aim to infer the gender of an user given only a small set of the images shared in its profile. We make the assumption that user's images contain a collection of visual elements that implicitly encode discriminative patterns that allow inferring its gender, in a language independent way. This information can then be used in personalisation and recommendation. Our main hypothesis is that semantic visual features are more adequate for discriminating high-level classes. The gender detection task is formalised as: given an user's profile, represented as a bag of images, we want to infer the gender of the user. Social media profiles can be noisy and contain confounding factors, therefore we classify bags of user-profile‘s images to provide a more robust prediction. Experiments using a dataset from the picture sharing social network Instagram show that the use of multiple images is key to improve detection performance. Moreover, we verify that deep semantic features are more suited for gender detection than low-level image representations. The methods proposed can infer the gender with precision scores higher than 0.825, and the best performing method achieving 0.911 precision.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114728293","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Novel Online Generalized Possibilistic Clustering Algorithm for Big Data Processing 一种新的大数据处理的在线广义可能性聚类算法
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553146
Spyridoula D. Xenaki, K. Koutroumbas, A. Rontogiannis
In this paper a novel efficient online possibilistic c-means clustering algorithm, called Online Generalized Adaptive Possibilistic C-Means (O-GAPCM), is presented. The algorithm extends the abilities of the Adaptive Possibilistic C-Means (APCM) algorithm, allowing the study of cases where the data form compact and hyper-ellipsoidally shaped clusters in the feature space. In addition, the algorithm performs online processing, that is the data vectors are processed one-by-one and their impact is memorized to suitably defined parameters. It also embodies new procedures for creating new clusters and merging existing ones. Thus, O-GAPCM is able to unravel on its own the number and the actual hyper-ellipsoidal shape of the physical clusters formed by the data. Experimental results verify the effectiveness of O-GAPCM both in terms of accuracy and time efficiency.
本文提出了一种新的高效在线可能性c均值聚类算法——在线广义自适应可能性c均值聚类算法(O-GAPCM)。该算法扩展了自适应可能性c均值(APCM)算法的能力,允许研究数据在特征空间中形成紧凑和超椭球形聚类的情况。此外,该算法进行在线处理,即对数据向量逐一处理,并将其影响记忆到适当定义的参数中。它还包含了创建新集群和合并现有集群的新过程。因此,O-GAPCM能够自行解开由数据形成的物理星团的数量和实际超椭球形状。实验结果验证了O-GAPCM在精度和时间效率方面的有效性。
{"title":"A Novel Online Generalized Possibilistic Clustering Algorithm for Big Data Processing","authors":"Spyridoula D. Xenaki, K. Koutroumbas, A. Rontogiannis","doi":"10.23919/EUSIPCO.2018.8553146","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553146","url":null,"abstract":"In this paper a novel efficient online possibilistic c-means clustering algorithm, called Online Generalized Adaptive Possibilistic C-Means (O-GAPCM), is presented. The algorithm extends the abilities of the Adaptive Possibilistic C-Means (APCM) algorithm, allowing the study of cases where the data form compact and hyper-ellipsoidally shaped clusters in the feature space. In addition, the algorithm performs online processing, that is the data vectors are processed one-by-one and their impact is memorized to suitably defined parameters. It also embodies new procedures for creating new clusters and merging existing ones. Thus, O-GAPCM is able to unravel on its own the number and the actual hyper-ellipsoidal shape of the physical clusters formed by the data. Experimental results verify the effectiveness of O-GAPCM both in terms of accuracy and time efficiency.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116247326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Accelerated stochastic multiplicative update with gradient averaging for nonnegative matrix factorizations 非负矩阵分解的梯度平均加速随机乘法更新
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553610
Hiroyuki Kasai
Nonnegative matrix factorization (NMF) is a powerful tool in data analysis by discovering latent features and part-based patterns from high-dimensional data, and is a special case in which factor matrices have low-rank nonnegative constraints. Applying NMF into huge-size matrices, we specifically address stochastic multiplicative update (MU) rule, which is the most popular, but which has slow convergence property. This present paper introduces a gradient averaging technique of stochastic gradient on the stochastic MU rule, and proposes an accelerated stochastic multiplicative update rule: SAGMU. Extensive computational experiments using both synthetic and real-world datasets demonstrate the effectiveness of SAGMU.
非负矩阵分解(NMF)是一种从高维数据中发现潜在特征和基于部件的模式的强大数据分析工具,是因子矩阵具有低秩非负约束的特殊情况。将NMF应用到大矩阵中,我们特别解决了随机乘法更新(MU)规则,这是最流行的规则,但收敛速度慢。本文介绍了随机梯度在随机MU规则上的梯度平均技术,并提出了一种加速的随机乘法更新规则:SAGMU。使用合成数据集和真实数据集的大量计算实验证明了SAGMU的有效性。
{"title":"Accelerated stochastic multiplicative update with gradient averaging for nonnegative matrix factorizations","authors":"Hiroyuki Kasai","doi":"10.23919/EUSIPCO.2018.8553610","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553610","url":null,"abstract":"Nonnegative matrix factorization (NMF) is a powerful tool in data analysis by discovering latent features and part-based patterns from high-dimensional data, and is a special case in which factor matrices have low-rank nonnegative constraints. Applying NMF into huge-size matrices, we specifically address stochastic multiplicative update (MU) rule, which is the most popular, but which has slow convergence property. This present paper introduces a gradient averaging technique of stochastic gradient on the stochastic MU rule, and proposes an accelerated stochastic multiplicative update rule: SAGMU. Extensive computational experiments using both synthetic and real-world datasets demonstrate the effectiveness of SAGMU.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122162812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Deep Neural Networks for Joint Voice Activity Detection and Speaker Localization 联合语音活动检测和说话人定位的深度神经网络
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553461
Paolo Vecchiotti, E. Principi, S. Squartini, F. Piazza
Detecting the presence of speakers and suitably localize them in indoor environments undoubtedly represent two important tasks in the speech processing community. Several algorithms have been proposed for Voice Activity Detection (VAD) and Speaker LOCalization (SLOC) so far, while their accomplishment by means of a joint integrated model has not received much attention. In particular, no studies focused on cooperative exploitation of VAD and SLOC information by means of machine learning have been conducted, up to the authors' knowledge. That is why the authors propose in this work a data driven approach for joint speech detection and speaker localization, relying on Convolutional Neural Network (CNN) which simultaneously process LogMel and GCC-PHAT Patterns features. The proposed algorithm is compared with a two-stage model composed by the cascade of a neural network (NN) based VAD and an NN based SLOC, discussed in previous authors' contributions. Computer simulations, accomplished against the DIRHA dataset addressing a multi-room acoustic environment, show that the proposed method allows to achieve a remarkable relative reduction of speech activity detection error equal to 33% compared to the original NN based VAD. Moreover, the overall localization accuracy is improved as well, by employing the joint model as speech detector and the standard neural SLOC system in cascade.
在室内环境中检测说话人的存在并对其进行适当的定位无疑是语音处理领域的两项重要任务。目前,针对语音活动检测(VAD)和说话人定位(SLOC)已经提出了几种算法,但通过联合集成模型实现这些算法的研究并不多见。特别是,据作者所知,还没有针对VAD和SLOC信息的机器学习协同开发的研究。这就是为什么作者在这项工作中提出了一种数据驱动的联合语音检测和说话人定位方法,依靠卷积神经网络(CNN)同时处理LogMel和GCC-PHAT模式特征。将所提出的算法与先前作者所讨论的由基于VAD的神经网络级联和基于SLOC的神经网络级联组成的两阶段模型进行比较。针对多房间声学环境的DIRHA数据集完成的计算机模拟表明,与原始的基于神经网络的VAD相比,所提出的方法可以实现语音活动检测误差的显著降低,相对降低误差为33%。此外,采用联合模型作为语音检测器和标准神经SLOC系统级联,提高了整体定位精度。
{"title":"Deep Neural Networks for Joint Voice Activity Detection and Speaker Localization","authors":"Paolo Vecchiotti, E. Principi, S. Squartini, F. Piazza","doi":"10.23919/EUSIPCO.2018.8553461","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553461","url":null,"abstract":"Detecting the presence of speakers and suitably localize them in indoor environments undoubtedly represent two important tasks in the speech processing community. Several algorithms have been proposed for Voice Activity Detection (VAD) and Speaker LOCalization (SLOC) so far, while their accomplishment by means of a joint integrated model has not received much attention. In particular, no studies focused on cooperative exploitation of VAD and SLOC information by means of machine learning have been conducted, up to the authors' knowledge. That is why the authors propose in this work a data driven approach for joint speech detection and speaker localization, relying on Convolutional Neural Network (CNN) which simultaneously process LogMel and GCC-PHAT Patterns features. The proposed algorithm is compared with a two-stage model composed by the cascade of a neural network (NN) based VAD and an NN based SLOC, discussed in previous authors' contributions. Computer simulations, accomplished against the DIRHA dataset addressing a multi-room acoustic environment, show that the proposed method allows to achieve a remarkable relative reduction of speech activity detection error equal to 33% compared to the original NN based VAD. Moreover, the overall localization accuracy is improved as well, by employing the joint model as speech detector and the standard neural SLOC system in cascade.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116799688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Estimation of Pitch Targets from Speech Signals by Joint Regularized Optimization 基于联合正则化优化的语音基音目标估计
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8552945
P. Birkholz, Patrick Schmaser, Yi Xu
This paper presents a novel method to estimate the pitch target parameters of the target approximation model (TAM). The TAM allows the compact representation of natural pitch contours on a solid theoretical basis and can be used as an intonation model for text-to-speech synthesis. In contrast to previous approaches, the method proposed here estimates the parameters of all targets jointly, uses 5th-order (instead of 3rd-order) linear systems to model the target approximation process, and uses regularization to avoid unnatural pitch targets. The effect of these features on the modeling error and the target parameter distributions are shown. The proposed method has been made available as the open-source software tool TargetOptimizer.
提出了一种新的目标近似模型(TAM)的俯仰目标参数估计方法。TAM允许在坚实的理论基础上紧凑地表示自然音高轮廓,并且可以用作文本到语音合成的语调模型。与以往的方法相比,本文提出的方法联合估计所有目标的参数,使用5阶(而不是3阶)线性系统来建模目标逼近过程,并使用正则化来避免非自然俯pitch目标。给出了这些特征对建模误差和目标参数分布的影响。所提出的方法已经作为开源软件工具TargetOptimizer提供。
{"title":"Estimation of Pitch Targets from Speech Signals by Joint Regularized Optimization","authors":"P. Birkholz, Patrick Schmaser, Yi Xu","doi":"10.23919/EUSIPCO.2018.8552945","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8552945","url":null,"abstract":"This paper presents a novel method to estimate the pitch target parameters of the target approximation model (TAM). The TAM allows the compact representation of natural pitch contours on a solid theoretical basis and can be used as an intonation model for text-to-speech synthesis. In contrast to previous approaches, the method proposed here estimates the parameters of all targets jointly, uses 5th-order (instead of 3rd-order) linear systems to model the target approximation process, and uses regularization to avoid unnatural pitch targets. The effect of these features on the modeling error and the target parameter distributions are shown. The proposed method has been made available as the open-source software tool TargetOptimizer.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116844926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Information Subspace-Based Fusion for Vehicle Classification 基于信息子空间的车辆分类融合
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553445
Sally Ghanem, Ashkan Panahi, H. Krim, R. Kerekes, J. Mattingly
Union of Subspaces (UoS) is a new paradigm for signal modeling and processing, which is capable of identifying more complex trends in data sets than simple linear models. Relying on a bi-sparsity pursuit framework and advanced nonsmooth optimization techniques, the Robust Subspace Recovery (RoSuRe) algorithm was introduced in the recent literature as a reliable and numerically efficient algorithm to unfold unions of subspaces. In this study, we apply RoSuRe to prospect the structure of a data type (e.g. sensed data on vehicle through passive audio and magnetic observations). Applying RoSuRe to the observation data set, we obtain a new representation of the time series, respecting an underlying UoS model. We subsequently employ Spectral Clustering on the new representations of the data set. The classification performance on the dataset shows a considerable improvement compared to direct application of other unsupervised clustering methods.
子空间联合(UoS)是信号建模和处理的一种新范式,它能够识别数据集中比简单线性模型更复杂的趋势。基于双稀疏追求框架和先进的非光滑优化技术,鲁棒子空间恢复(RoSuRe)算法作为一种可靠且数值高效的展开子空间并集的算法在最近的文献中被引入。在本研究中,我们应用RoSuRe来预测数据类型的结构(例如,通过无源音频和磁观测在车辆上感知数据)。将RoSuRe应用于观测数据集,我们获得了时间序列的新表示,尊重底层UoS模型。我们随后在数据集的新表示上使用谱聚类。与直接应用其他无监督聚类方法相比,该方法在数据集上的分类性能有了相当大的提高。
{"title":"Information Subspace-Based Fusion for Vehicle Classification","authors":"Sally Ghanem, Ashkan Panahi, H. Krim, R. Kerekes, J. Mattingly","doi":"10.23919/EUSIPCO.2018.8553445","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553445","url":null,"abstract":"Union of Subspaces (UoS) is a new paradigm for signal modeling and processing, which is capable of identifying more complex trends in data sets than simple linear models. Relying on a bi-sparsity pursuit framework and advanced nonsmooth optimization techniques, the Robust Subspace Recovery (RoSuRe) algorithm was introduced in the recent literature as a reliable and numerically efficient algorithm to unfold unions of subspaces. In this study, we apply RoSuRe to prospect the structure of a data type (e.g. sensed data on vehicle through passive audio and magnetic observations). Applying RoSuRe to the observation data set, we obtain a new representation of the time series, respecting an underlying UoS model. We subsequently employ Spectral Clustering on the new representations of the data set. The classification performance on the dataset shows a considerable improvement compared to direct application of other unsupervised clustering methods.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120854058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Analysis vs Synthesis-based Regularization for Combined Compressed Sensing and Parallel MRI Reconstruction at 7 Tesla 7特斯拉压缩感知与MRI并行重建的分析与综合正则化
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553476
Hamza Cherkaoui, L. Gueddari, C. Lazarus, A. Grigis, F. Poupon, A. Vignaud, S. Farrens, Jean-Luc Starck, P. Ciuciu
Compressed Sensing (CS) has allowed a significant reduction of acquisition times in MRI, especially in the high spatial resolution (e.g., 400 $mu{mathrm{m}}$) context. Nonlinear CS reconstruction usually relies on analysis (e.g., Total Variation) or synthesis (e.g., wavelet) based priors and $ell_{1}$ regularization to promote sparsity in the transform domain. Here, we compare the performance of several orthogonal wavelet transforms with those of tight frames for MR image reconstruction in the CS setting combined with parallel imaging (multiple receiver coil). We show that overcomplete dictionaries such as the fast curvelet transform provide improved image quality as compared to orthogonal transforms. For doing so, we rely on an analysis-based formulation where the underlying $ell_{1}$ regularized criterion is minimized using a primal dual splitting method (e.g., Condat-V $tilde{u}$ algorithm). Validation is performed on ex-vivo baboon brain $T^{*}_{2}$ MRI data collected at 7 Tesla and restrospectively under-sampled using non-Cartesian schemes (radial and Sparkling). We show that multiscale analysis priors based on tight frames instead of orthogonal transforms achieve better image quality (pSNR, SSIM) in particular at low signal-to-noise ratio.
压缩感知(CS)可以显著减少MRI的采集时间,特别是在高空间分辨率(例如400 $mu{mathrm{m}}$)的情况下。非线性CS重构通常依赖于基于先验的分析(如Total Variation)或综合(如小波)和$ell_{1}$正则化来提高变换域的稀疏性。在这里,我们比较了几个正交小波变换与紧帧小波变换在CS设置下结合并行成像(多个接收器线圈)的磁共振图像重建的性能。我们表明,与正交变换相比,像快速曲线变换这样的过完备字典提供了更好的图像质量。为此,我们依赖于基于分析的公式,其中使用原始对偶分裂方法(例如,Condat-V $tilde{u}$算法)最小化底层$ell_{1}$正则化标准。在离体狒狒大脑$T^{*}_{2}$上进行验证,在7特斯拉采集MRI数据,并使用非笛卡尔格式(径向和气泡)回顾性欠采样。我们证明了基于紧帧的多尺度分析先验而不是正交变换可以获得更好的图像质量(pSNR, SSIM),特别是在低信噪比下。
{"title":"Analysis vs Synthesis-based Regularization for Combined Compressed Sensing and Parallel MRI Reconstruction at 7 Tesla","authors":"Hamza Cherkaoui, L. Gueddari, C. Lazarus, A. Grigis, F. Poupon, A. Vignaud, S. Farrens, Jean-Luc Starck, P. Ciuciu","doi":"10.23919/EUSIPCO.2018.8553476","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553476","url":null,"abstract":"Compressed Sensing (CS) has allowed a significant reduction of acquisition times in MRI, especially in the high spatial resolution (e.g., 400 $mu{mathrm{m}}$) context. Nonlinear CS reconstruction usually relies on analysis (e.g., Total Variation) or synthesis (e.g., wavelet) based priors and $ell_{1}$ regularization to promote sparsity in the transform domain. Here, we compare the performance of several orthogonal wavelet transforms with those of tight frames for MR image reconstruction in the CS setting combined with parallel imaging (multiple receiver coil). We show that overcomplete dictionaries such as the fast curvelet transform provide improved image quality as compared to orthogonal transforms. For doing so, we rely on an analysis-based formulation where the underlying $ell_{1}$ regularized criterion is minimized using a primal dual splitting method (e.g., Condat-V $tilde{u}$ algorithm). Validation is performed on ex-vivo baboon brain $T^{*}_{2}$ MRI data collected at 7 Tesla and restrospectively under-sampled using non-Cartesian schemes (radial and Sparkling). We show that multiscale analysis priors based on tight frames instead of orthogonal transforms achieve better image quality (pSNR, SSIM) in particular at low signal-to-noise ratio.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123935097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
期刊
2018 26th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1