首页 > 最新文献

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

英文 中文
Kalman Filtering and Clustering in Sensor Networks 传感器网络中的卡尔曼滤波与聚类
S. Talebi, Stefan Werner, V. Koivunen
In this work, a distributed Kalman filtering and clustering framework for sensor networks tasked with tracking multiple state vector sequences is developed. This is achieved through recursively updating the likelihood of a state vector estimation from one agent offering valid information about the state vector of its neighbors, given the available observation data. These likelihoods then form the diffusion coefficients, used for information fusion over the sensor network. For rigour, the mean and mean square behavior of the developed Kalman filtering and clustering framework is analyzed, convergence criteria are established, and the performance of the developed framework is demonstrated in a simulation example.
在这项工作中,开发了一种用于跟踪多个状态向量序列的传感器网络的分布式卡尔曼滤波和聚类框架。这是通过递归地更新一个代理的状态向量估计的可能性来实现的,该代理提供有关其邻居状态向量的有效信息,给定可用的观测数据。这些可能性然后形成扩散系数,用于传感器网络上的信息融合。为提高算法的严密性,分析了所开发的卡尔曼滤波聚类框架的均值和均方行为,建立了收敛准则,并通过仿真实例验证了所开发框架的性能。
{"title":"Kalman Filtering and Clustering in Sensor Networks","authors":"S. Talebi, Stefan Werner, V. Koivunen","doi":"10.1109/ICASSP.2018.8462039","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462039","url":null,"abstract":"In this work, a distributed Kalman filtering and clustering framework for sensor networks tasked with tracking multiple state vector sequences is developed. This is achieved through recursively updating the likelihood of a state vector estimation from one agent offering valid information about the state vector of its neighbors, given the available observation data. These likelihoods then form the diffusion coefficients, used for information fusion over the sensor network. For rigour, the mean and mean square behavior of the developed Kalman filtering and clustering framework is analyzed, convergence criteria are established, and the performance of the developed framework is demonstrated in a simulation example.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"32 1","pages":"4309-4313"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88550520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Data Injection Attack on Decentralized Optimization 分散优化中的数据注入攻击
Sissi Xiaoxiao Wu, Hoi-To Wai, A. Scaglione, A. Nedić, Amir Leshem
This paper studies the security aspect of gossip-based decentralized optimization algorithms for multi agent systems against data injection attacks. Our contributions are two-fold. First, we show that the popular distributed projected gradient method (by Nedić et al.) can be attacked by coordinated insider attacks, in which the attackers are able to steer the final state to a point of their choosing. Second, we propose a metric that can be computed locally by the trustworthy agents processing their own iterates and those of their neighboring agents. This metric can be used by the trustworthy agents to detect and localize the attackers. We conclude the paper by supporting our findings with numerical experiments.
本文研究了多智能体系统中基于八卦的分散优化算法的安全性,以防止数据注入攻击。我们的贡献是双重的。首先,我们证明了流行的分布式投影梯度方法(由nedidic等人提出)可以受到协同内部攻击的攻击,攻击者能够将最终状态引导到他们选择的点。其次,我们提出了一个度量,该度量可以由可信赖的代理处理自己的迭代和相邻代理的迭代在本地计算。可信代理可以使用该度量来检测和定位攻击者。最后,我们用数值实验来支持我们的发现。
{"title":"Data Injection Attack on Decentralized Optimization","authors":"Sissi Xiaoxiao Wu, Hoi-To Wai, A. Scaglione, A. Nedić, Amir Leshem","doi":"10.1109/ICASSP.2018.8462528","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462528","url":null,"abstract":"This paper studies the security aspect of gossip-based decentralized optimization algorithms for multi agent systems against data injection attacks. Our contributions are two-fold. First, we show that the popular distributed projected gradient method (by Nedić et al.) can be attacked by coordinated insider attacks, in which the attackers are able to steer the final state to a point of their choosing. Second, we propose a metric that can be computed locally by the trustworthy agents processing their own iterates and those of their neighboring agents. This metric can be used by the trustworthy agents to detect and localize the attackers. We conclude the paper by supporting our findings with numerical experiments.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"24 1","pages":"3644-3648"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89145823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
End-to-End Multi-Speaker Speech Recognition 端到端多说话人语音识别
Shane Settle, Jonathan Le Roux, Takaaki Hori, Shinji Watanabe, J. Hershey
Current advances in deep learning have resulted in a convergence of methods across a wide range of tasks, opening the door for tighter integration of modules that were previously developed and optimized in isolation. Recent ground-breaking works have produced end-to-end deep network methods for both speech separation and end-to-end automatic speech recognition (ASR). Speech separation methods such as deep clustering address the challenging cocktail-party problem of distinguishing multiple simultaneous speech signals. This is an enabling technology for real-world human machine interaction (HMI). However, speech separation requires ASR to interpret the speech for any HMI task. Likewise, ASR requires speech separation to work in an unconstrained environment. Although these two components can be trained in isolation and connected after the fact, this paradigm is likely to be sub-optimal, since it relies on artificially mixed data. In this paper, we develop the first fully end-to-end, jointly trained deep learning system for separation and recognition of overlapping speech signals. The joint training framework synergistically adapts the separation and recognition to each other. As an additional benefit, it enables training on more realistic data that contains only mixed signals and their transcriptions, and thus is suited to large scale training on existing transcribed data.
当前深度学习的进展导致了各种任务方法的融合,为以前孤立开发和优化的模块更紧密地集成打开了大门。最近的突破性工作已经产生了用于语音分离和端到端自动语音识别(ASR)的端到端深度网络方法。语音分离方法,如深度聚类,解决了识别多个同时语音信号的鸡尾酒会问题。这是一种支持现实世界人机交互(HMI)的技术。然而,语音分离需要ASR为任何HMI任务解释语音。同样,ASR要求语音分离在不受约束的环境中工作。尽管这两个组件可以单独训练并在事后连接起来,但这种范式可能不是最优的,因为它依赖于人为混合的数据。在本文中,我们开发了第一个完全端到端、联合训练的深度学习系统,用于分离和识别重叠语音信号。联合训练框架协同适应了彼此的分离和识别。作为一个额外的好处,它可以在更真实的数据上进行训练,这些数据只包含混合信号及其转录,因此适合于在现有转录数据上进行大规模训练。
{"title":"End-to-End Multi-Speaker Speech Recognition","authors":"Shane Settle, Jonathan Le Roux, Takaaki Hori, Shinji Watanabe, J. Hershey","doi":"10.1109/ICASSP.2018.8461893","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8461893","url":null,"abstract":"Current advances in deep learning have resulted in a convergence of methods across a wide range of tasks, opening the door for tighter integration of modules that were previously developed and optimized in isolation. Recent ground-breaking works have produced end-to-end deep network methods for both speech separation and end-to-end automatic speech recognition (ASR). Speech separation methods such as deep clustering address the challenging cocktail-party problem of distinguishing multiple simultaneous speech signals. This is an enabling technology for real-world human machine interaction (HMI). However, speech separation requires ASR to interpret the speech for any HMI task. Likewise, ASR requires speech separation to work in an unconstrained environment. Although these two components can be trained in isolation and connected after the fact, this paradigm is likely to be sub-optimal, since it relies on artificially mixed data. In this paper, we develop the first fully end-to-end, jointly trained deep learning system for separation and recognition of overlapping speech signals. The joint training framework synergistically adapts the separation and recognition to each other. As an additional benefit, it enables training on more realistic data that contains only mixed signals and their transcriptions, and thus is suited to large scale training on existing transcribed data.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"38 1","pages":"4819-4823"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74298807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 63
How are the Centered Kernel Principal Components Relevant to Regression Task? -An Exact Analysis 中心核主成分是如何与回归任务相关的?——准确的分析
M. Yukawa, K. Müller, Yuto Ogino
We present an exact analytic expression of the contributions of the kernel principal components to the relevant information in a nonlinear regression problem. A related study has been presented by Braun, Buhmann, and Müller in 2008, where an upper bound of the contributions was given for a general supervised learning problem but with “uncentered” kernel PCAs. Our analysis clarifies that the relevant information of a kernel regression under explicit centering operation is contained in a finite number of leading kernel principal components, as in the “uncentered” kernel-Pca case, if the kernel matches the underlying nonlinear function so that the eigenvalues of the centered kernel matrix decay quickly. We compare the regression performances of the least-square-based methods with the centered and uncentered kernel PCAs by simulations.
本文给出了非线性回归问题中核主成分对相关信息贡献的精确解析表达式。Braun, Buhmann和m ller在2008年提出了一项相关研究,其中给出了一般监督学习问题的贡献上限,但具有“非中心”核pca。我们的分析表明,在显式定心操作下,核回归的相关信息包含在有限数量的主要核主成分中,如在“无中心”核pca情况下,如果核匹配底层非线性函数,则中心核矩阵的特征值会迅速衰减。通过仿真比较了基于最小二乘的方法与有中心和无中心核pca的回归性能。
{"title":"How are the Centered Kernel Principal Components Relevant to Regression Task? -An Exact Analysis","authors":"M. Yukawa, K. Müller, Yuto Ogino","doi":"10.1109/ICASSP.2018.8462392","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462392","url":null,"abstract":"We present an exact analytic expression of the contributions of the kernel principal components to the relevant information in a nonlinear regression problem. A related study has been presented by Braun, Buhmann, and Müller in 2008, where an upper bound of the contributions was given for a general supervised learning problem but with “uncentered” kernel PCAs. Our analysis clarifies that the relevant information of a kernel regression under explicit centering operation is contained in a finite number of leading kernel principal components, as in the “uncentered” kernel-Pca case, if the kernel matches the underlying nonlinear function so that the eigenvalues of the centered kernel matrix decay quickly. We compare the regression performances of the least-square-based methods with the centered and uncentered kernel PCAs by simulations.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"1 1","pages":"2841-2845"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82497587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large-Scale Regularized Sumcor GCCA via Penalty-Dual Decomposition 基于惩罚对偶分解的大规模正则化Sumcor GCCA
Charilaos I. Kanatsoulis, Xiao Fu, N. Sidiropoulos, Mingyi Hong
The sum-of-correlations (SUMCOR) generalized canonical correlation analysis (GCCA) aims at producing low-dimensional representations of multiview data via enforcing pairwise similarity of the reduced-dimension views. SUMCOR has been applied to a large variety of applications including blind separation, multilingual word embedding, and cross-modality retrieval. Despite the NP-hardness of SUMCOR, recent work has proposed effective algorithms for handling it at very large scale. However, the existing scalable algorithms are not easy to extend to incorporate structural regularization and prior information - which are critical for real-world applications where outliers and modeling mismatches are present. In this work, we propose a new computational framework for large-scale SUMCOR GCCA. The algorithm can easily incorporate a suite of structural regularizers which are frequently used in data analytics, has lightweight updates and low memory complexity, and can be easily implemented in a parallel fashion. The proposed algorithm is also guaranteed to converge to a Karush-Kuhn-Tucker (KKT) point of the regularized SUMCOR problem. Carefully designed simulations are employed to demonstrate the effectiveness of the proposed algorithm.
关联和(SUMCOR)广义典型相关分析(GCCA)旨在通过增强降维视图的两两相似性来生成多视图数据的低维表示。SUMCOR已经应用于各种各样的应用,包括盲分离、多语言词嵌入和跨模态检索。尽管SUMCOR具有np -硬度,但最近的工作已经提出了用于大规模处理它的有效算法。然而,现有的可扩展算法不容易扩展到包含结构正则化和先验信息-这对于存在异常值和建模不匹配的现实应用至关重要。在这项工作中,我们提出了一种新的大规模SUMCOR GCCA计算框架。该算法可以很容易地结合一套经常用于数据分析的结构正则器,具有轻量级更新和低内存复杂性,并且可以很容易地以并行方式实现。该算法还保证收敛到正则化SUMCOR问题的一个Karush-Kuhn-Tucker点。通过精心设计的仿真,验证了所提算法的有效性。
{"title":"Large-Scale Regularized Sumcor GCCA via Penalty-Dual Decomposition","authors":"Charilaos I. Kanatsoulis, Xiao Fu, N. Sidiropoulos, Mingyi Hong","doi":"10.1109/ICASSP.2018.8462354","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462354","url":null,"abstract":"The sum-of-correlations (SUMCOR) generalized canonical correlation analysis (GCCA) aims at producing low-dimensional representations of multiview data via enforcing pairwise similarity of the reduced-dimension views. SUMCOR has been applied to a large variety of applications including blind separation, multilingual word embedding, and cross-modality retrieval. Despite the NP-hardness of SUMCOR, recent work has proposed effective algorithms for handling it at very large scale. However, the existing scalable algorithms are not easy to extend to incorporate structural regularization and prior information - which are critical for real-world applications where outliers and modeling mismatches are present. In this work, we propose a new computational framework for large-scale SUMCOR GCCA. The algorithm can easily incorporate a suite of structural regularizers which are frequently used in data analytics, has lightweight updates and low memory complexity, and can be easily implemented in a parallel fashion. The proposed algorithm is also guaranteed to converge to a Karush-Kuhn-Tucker (KKT) point of the regularized SUMCOR problem. Carefully designed simulations are employed to demonstrate the effectiveness of the proposed algorithm.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"7 1","pages":"6363-6367"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84153948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bitwise Neural Networks for Efficient Single-Channel Source Separation 高效单通道源分离的位神经网络
Minje Kim, P. Smaragdis
We present Bitwise Neural Networks (BNN) as an efficient hardware-friendly solution to single-channel source separation tasks in resource-constrained environments. In the proposed BNN system, we replace all the real-valued operations during the feedforward process of a Deep Neural Network (DNN) with bitwise arithmetic (e.g. the XNOR operation between bipolar binaries in place of multiplications). Thanks to the fully bitwise run-time operations, the BNN system can serve as an alternative solution where efficient real-time processing is critical, for example real-time speech enhancement in embedded systems. Furthermore, we also propose a binarization scheme to convert the input signals into bit strings so that the BNN parameters learn the Boolean mapping between input binarized mixture signals and their target Ideal Binary Masks (IBM). Experiments on the single-channel speech denoising tasks show that the efficient BNN-based source separation system works well with an acceptable performance loss compared to a comprehensive real-valued network, while consuming a minimal amount of resources.
我们提出了位神经网络(BNN)作为资源受限环境下单通道源分离任务的有效硬件友好解决方案。在提出的BNN系统中,我们用位算术取代了深度神经网络(DNN)前馈过程中的所有实值运算(例如,双极二进制之间的XNOR运算代替乘法)。由于完全按位运行时操作,BNN系统可以作为高效实时处理至关重要的替代解决方案,例如嵌入式系统中的实时语音增强。此外,我们还提出了一种二值化方案,将输入信号转换为位串,使BNN参数学习输入二值化混合信号与其目标理想二进制掩码(IBM)之间的布尔映射。对单通道语音去噪任务的实验表明,与综合实值网络相比,基于bnn的高效源分离系统性能良好,性能损失可接受,同时消耗的资源最少。
{"title":"Bitwise Neural Networks for Efficient Single-Channel Source Separation","authors":"Minje Kim, P. Smaragdis","doi":"10.1109/ICASSP.2018.8461824","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8461824","url":null,"abstract":"We present Bitwise Neural Networks (BNN) as an efficient hardware-friendly solution to single-channel source separation tasks in resource-constrained environments. In the proposed BNN system, we replace all the real-valued operations during the feedforward process of a Deep Neural Network (DNN) with bitwise arithmetic (e.g. the XNOR operation between bipolar binaries in place of multiplications). Thanks to the fully bitwise run-time operations, the BNN system can serve as an alternative solution where efficient real-time processing is critical, for example real-time speech enhancement in embedded systems. Furthermore, we also propose a binarization scheme to convert the input signals into bit strings so that the BNN parameters learn the Boolean mapping between input binarized mixture signals and their target Ideal Binary Masks (IBM). Experiments on the single-channel speech denoising tasks show that the efficient BNN-based source separation system works well with an acceptable performance loss compared to a comprehensive real-valued network, while consuming a minimal amount of resources.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"30 1","pages":"701-705"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78677812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Particle Filtering and Inference for Limit Order Books in High Frequency Finance 高频金融中限价订单的粒子滤波与推理
Pinzhang Wang, Lin Li, S. Godsill
This paper investigates the on-line analysis of high-frequency financial order book data using Bayesian modelling techniques. Order book data involves evolving queues of orders at different prices, and here we propose that the order book shape is proportional to a gamma or inverse-gamma density function. Inference for these models is implemented on-line using particle filters and evaluated on a high-frequency EURUSD foreign exchange limit order book. The two possible order book shapes are tested using particle filter marginal likelihood estimates and in addition, heat maps are constructed based on the inference results to reveal the imbalance of order distributions between the two sides of an order book, thereby offering valuable insights into the movements of future prices.
本文研究了利用贝叶斯建模技术对高频金融订单数据进行在线分析。订单簿数据涉及不同价格的不断变化的订单队列,这里我们提出订单簿形状与gamma或逆gamma密度函数成正比。这些模型的推理使用粒子滤波器在线实现,并在高频欧元美元外汇限价订单簿上进行评估。使用粒子滤波边际似然估计对两种可能的订单形状进行测试,此外,基于推理结果构建热图,以揭示订单簿两侧订单分布的不平衡,从而为未来价格的走势提供有价值的见解。
{"title":"Particle Filtering and Inference for Limit Order Books in High Frequency Finance","authors":"Pinzhang Wang, Lin Li, S. Godsill","doi":"10.1109/ICASSP.2018.8462072","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462072","url":null,"abstract":"This paper investigates the on-line analysis of high-frequency financial order book data using Bayesian modelling techniques. Order book data involves evolving queues of orders at different prices, and here we propose that the order book shape is proportional to a gamma or inverse-gamma density function. Inference for these models is implemented on-line using particle filters and evaluated on a high-frequency EURUSD foreign exchange limit order book. The two possible order book shapes are tested using particle filter marginal likelihood estimates and in addition, heat maps are constructed based on the inference results to reveal the imbalance of order distributions between the two sides of an order book, thereby offering valuable insights into the movements of future prices.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"55 1","pages":"4264-4268"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75292007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Identifying Susceptible Agents in Time Varying Opinion Dynamics Through Compressive Measurements 通过压缩测量识别时变意见动态中的敏感因素
Hoi-To Wai, A. Ozdaglar, A. Scaglione
We provide a compressive-measurement based method to detect susceptible agents who may receive misinformation through their contact with ‘stubborn agents’ whose goal is to influence the opinions of agents in the network. We consider a DeGroot-type opinion dynamics model where regular agents revise their opinions by linearly combining their neighbors' opinions, but stubborn agents, while influencing others, do not change their opinions. Our proposed method hinges on estimating the temporal difference vector of network-wide opinions, computed at time instances when the stubborn agents interact. We show that this temporal difference vector has approximately the same support as the locations of the susceptible agents. Moreover, both the interaction instances and the temporal difference vector can be estimated from a small number of aggregated opinions. The performance of our method is studied both analytically and empirically. We show that the detection error decreases when the social network is better connected, or when the stubborn agents are ‘less talkative’.
我们提供了一种基于压缩测量的方法来检测易受影响的代理,这些代理可能通过与“顽固代理”的接触接收到错误信息,而“顽固代理”的目标是影响网络中代理的意见。我们考虑了一个degroot类型的意见动态模型,其中常规代理通过线性结合邻居的意见来修改他们的意见,而顽固代理在影响他人的同时不改变他们的意见。我们提出的方法依赖于估计网络范围内意见的时间差向量,在顽固代理交互的时间实例中计算。我们表明,这个时间差异向量与易感因子的位置具有大致相同的支持度。此外,交互实例和时间差向量都可以从少量的汇总意见中估计出来。本文对该方法的性能进行了分析和实证研究。我们表明,当社交网络连接得更好时,或者当顽固的代理“不那么健谈”时,检测误差会减少。
{"title":"Identifying Susceptible Agents in Time Varying Opinion Dynamics Through Compressive Measurements","authors":"Hoi-To Wai, A. Ozdaglar, A. Scaglione","doi":"10.1109/ICASSP.2018.8462377","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462377","url":null,"abstract":"We provide a compressive-measurement based method to detect susceptible agents who may receive misinformation through their contact with ‘stubborn agents’ whose goal is to influence the opinions of agents in the network. We consider a DeGroot-type opinion dynamics model where regular agents revise their opinions by linearly combining their neighbors' opinions, but stubborn agents, while influencing others, do not change their opinions. Our proposed method hinges on estimating the temporal difference vector of network-wide opinions, computed at time instances when the stubborn agents interact. We show that this temporal difference vector has approximately the same support as the locations of the susceptible agents. Moreover, both the interaction instances and the temporal difference vector can be estimated from a small number of aggregated opinions. The performance of our method is studied both analytically and empirically. We show that the detection error decreases when the social network is better connected, or when the stubborn agents are ‘less talkative’.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"36 1","pages":"4114-4118"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77438276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Blind Estimation of the Speech Transmission Index for Speech Quality Prediction 用于语音质量预测的语音传输指标的盲估计
Prem Seetharaman, G. Mysore, P. Smaragdis, Bryan Pardo
The speech transmission index (STI) of a listening position within a given room indicates the quality and intelligibility of speech uttered in that room. The measure is very reliable for predicting speech intelligibility in many room conditions but requires an STI measurement of the impulse response for the room. We present a method for blindly estimating the STI without measuring or modeling the impulse response of the room using deep convolutional neural networks. Our model is trained entirely using simulated room impulse responses combined with clean speech examples from the DAPS dataset [1] and works directly on PCM audio. Our experiments show that our method predicts true STI with a high degree of accuracy – an average error of under 4%. It can also distinguish between different STI conditions to a level of granularity that is comparable to humans.
语音传输指数(STI)在一个给定的房间内的听者位置表示在该房间发出的语音的质量和可理解性。在许多房间条件下,该测量方法对于预测语音可理解性非常可靠,但需要对房间的脉冲响应进行STI测量。我们提出了一种使用深度卷积神经网络在不测量或模拟房间脉冲响应的情况下盲目估计STI的方法。我们的模型完全使用模拟的房间脉冲响应结合DAPS数据集[1]的干净语音示例进行训练,并直接在PCM音频上工作。我们的实验表明,我们的方法预测真实STI的准确度很高——平均误差低于4%。它还可以将不同的STI条件区分到与人类相当的粒度水平。
{"title":"Blind Estimation of the Speech Transmission Index for Speech Quality Prediction","authors":"Prem Seetharaman, G. Mysore, P. Smaragdis, Bryan Pardo","doi":"10.1109/ICASSP.2018.8461827","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8461827","url":null,"abstract":"The speech transmission index (STI) of a listening position within a given room indicates the quality and intelligibility of speech uttered in that room. The measure is very reliable for predicting speech intelligibility in many room conditions but requires an STI measurement of the impulse response for the room. We present a method for blindly estimating the STI without measuring or modeling the impulse response of the room using deep convolutional neural networks. Our model is trained entirely using simulated room impulse responses combined with clean speech examples from the DAPS dataset [1] and works directly on PCM audio. Our experiments show that our method predicts true STI with a high degree of accuracy – an average error of under 4%. It can also distinguish between different STI conditions to a level of granularity that is comparable to humans.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"32 1","pages":"591-595"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77922312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
On the Sample Complexity of Graphical Model Selection from Non-Stationary Samples 基于非平稳样本的图形模型选择的样本复杂度
Nguyen Tran, A. Jung
We characterize the sample size required for accurate graphical model selection from non-stationary samples. The observed samples are modeled as a zero-mean Gaussian random process whose samples are uncorrelated but have different covariance matrices. This includes the case where observations form stationary or underspread processes. We derive a sufficient condition on the required sample size by analyzing a simple sparse neighborhood regression method.
我们描述了从非平稳样本中精确选择图形模型所需的样本量。观察到的样本被建模为零均值高斯随机过程,其样本不相关,但具有不同的协方差矩阵。这包括观测形成平稳过程或欠扩散过程的情况。通过分析一种简单的稀疏邻域回归方法,得到了所需样本量的充分条件。
{"title":"On the Sample Complexity of Graphical Model Selection from Non-Stationary Samples","authors":"Nguyen Tran, A. Jung","doi":"10.1109/ICASSP.2018.8462689","DOIUrl":"https://doi.org/10.1109/ICASSP.2018.8462689","url":null,"abstract":"We characterize the sample size required for accurate graphical model selection from non-stationary samples. The observed samples are modeled as a zero-mean Gaussian random process whose samples are uncorrelated but have different covariance matrices. This includes the case where observations form stationary or underspread processes. We derive a sufficient condition on the required sample size by analyzing a simple sparse neighborhood regression method.","PeriodicalId":6638,"journal":{"name":"2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"479 1","pages":"6314-6317"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74599205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1