ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

英文中文

Sample Space-time Covariance Matrix Estimation 样本时空协方差矩阵估计

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-16 DOI: 10.1109/ICASSP.2019.8683339

Connor Delaosa, J. Pestana, N. Goddard, S. Somasundaram, Stephan Weiss

Estimation errors are incurred when calculating the sample space-time covariance matrix. We formulate the variance of this estimator when operating on a finite sample set, compare it to known results, and demonstrate its precision in simulations. The variance of the estimation links directly to previously explored perturbation of the analytic eigenvalues and eigenspaces of a parahermitian cross-spectral density matrix when estimated from finite data.

在计算样本空时协方差矩阵时会产生估计误差。我们在有限样本集上计算了该估计量的方差，将其与已知结果进行了比较，并在模拟中证明了其精度。当从有限数据估计时，估计的方差直接与先前探索的parparhertian交叉谱密度矩阵的解析特征值和特征空间的扰动联系在一起。

引用次数: 13

Iterative Approximation of Analytic Eigenvalues of a Parahermitian Matrix EVD parehermite矩阵EVD解析特征值的迭代逼近

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-16 DOI: 10.1109/ICASSP.2019.8682407

Stephan Weiss, I. Proudler, Fraser K. Coutts, J. Pestana

We present an algorithm that extracts analytic eigenvalues from a parahermitian matrix. Operating in the discrete Fourier transform domain, an inner iteration re-establishes the lost association between bins via a maximum likelihood sequence detection driven by a smoothness criterion. An outer iteration continues until a desired accuracy for the approximation of the extracted eigenvalues has been achieved. The approach is compared to existing algorithms.

提出了一种从parparhertian矩阵中提取解析特征值的算法。在离散傅里叶变换域中操作，内部迭代通过平滑准则驱动的最大似然序列检测重新建立箱之间丢失的关联。外部迭代继续进行，直到所提取的特征值的近似值达到所需的精度。将该方法与现有算法进行了比较。

引用次数: 15

Aggregation Graph Neural Networks 聚合图神经网络

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-15 DOI: 10.1109/ICASSP.2019.8682975

Fernando Gama, A. Marques, Alejandro Ribeiro, G. Leus

Graph neural networks (GNNs) regularize classical neural networks by exploiting the underlying irregular structure supporting graph data, extending its application to broader data domains. The aggregation GNN presented here is a novel GNN that exploits the fact that the data collected at a single node by means of successive local exchanges with neighbors exhibits a regular structure. Thus, regular convolution and regular pooling yield an appropriately regularized GNN. To address some scalability issues that arise when collecting all the information at a single node, we propose a multi-node aggregation GNN that constructs regional features that are later aggregated into more global features and so on. We show superior performance in a source localization problem on synthetic graphs and on the authorship attribution problem.

图神经网络(gnn)通过利用支持图数据的底层不规则结构对经典神经网络进行正则化，将其应用扩展到更广泛的数据领域。本文提出的聚合GNN是一种新颖的GNN，它利用了在单个节点上通过与相邻节点的连续本地交换收集的数据呈现规则结构的事实。因此，规则卷积和规则池化产生一个适当正则化的GNN。为了解决在单个节点上收集所有信息时出现的一些可扩展性问题，我们提出了一个多节点聚合GNN，该GNN构建区域特征，然后聚合成更多的全局特征，等等。我们在合成图的源定位问题和作者归属问题上表现出优异的性能。

引用次数: 8

Securing Smartphone Handwritten Pin Codes with Recurrent Neural Networks 用循环神经网络保护智能手机手写Pin码

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-15 DOI: 10.1109/ICASSP.2019.8683871

Gaël Le Lan, Vincent Frey

This paper investigates the use of recurrent neural networks to secure PIN code based authentication on smartphones, in a scenario where the user is invited to draw digits on the touchscreen. From the sequence of successive positions of the users finger on the touchscreen, a bidirectional recurrent neural network computes a discriminative embedding in terms of writer traits, carrying the contextual information of the written digit. This allows to reject impostors who would have knowledge of the PIN code. The neural network is trained to recognize both users and digits of a training dataset. Evaluations are run on two datasets of 43 and 33 users, respectively, absent from the training dataset. Results show that when enrolling the users on 4 examples of each digit, the Equal Error Rate reaches 4.9% for a 4-digit PIN code. Including digit value prediction during training is key to achieve good performances.

本文研究了在用户被邀请在触摸屏上画数字的情况下，使用循环神经网络来保护智能手机上基于PIN码的身份验证。从用户手指在触摸屏上的连续位置序列中，一个双向循环神经网络根据书写者的特征计算出一个判别嵌入，并携带书写数字的上下文信息。这允许拒绝知道PIN码的冒名顶替者。神经网络被训练来识别训练数据集的用户和数字。评估分别在训练数据集中缺失的43和33个用户的两个数据集上运行。结果表明，当对每个数字的4个示例进行注册时，4位PIN码的平均错误率达到4.9%。在训练过程中包含数字值预测是获得良好表现的关键。

引用次数: 6

A History-based Stopping Criterion in Recursive Bayesian State Estimation 递归贝叶斯状态估计中基于历史的停止准则

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-15 DOI: 10.1109/ICASSP.2019.8683726

Y. Marghi, Aziz Koçanaoğulları, M. Akçakaya, Deniz Erdoğmuş

In dynamic state-space models, the state can be estimated through recursive computation of the posterior distribution of the state given all measurements. In scenarios where active sensing/querying is possible, a hard decision is made when the state posterior achieves a pre-set confidence threshold. This mandate to meet a hard threshold may sometimes unnecessarily require more queries. In application domains where sensing/querying cost is of concern, some potential accuracy may be sacrificed for greater gains in sensing cost. In this paper, we (a) propose a criterion based on a linear combination of state posterior and its changes, (b) show that for discrete-valued state estimation scenarios the proposed objective is more likely to sort correct and incorrect estimates appropriately compared to just looking at the posterior, and finally (c) demonstrate that the method can lead to significant human intent estimation speed increase without significant loss of accuracy in a brain-computer interface application.

在动态状态空间模型中，可以通过递归计算给定所有测量值的状态后验分布来估计状态。在可能进行主动感知/查询的场景中，当状态后验达到预设的置信度阈值时，就会做出艰难的决策。这种满足硬阈值的要求有时可能不必要地需要更多查询。在关注传感/查询成本的应用领域中，为了获得更大的传感成本收益，可能会牺牲一些潜在的准确性。在本文中，我们(a)提出了一个基于状态后验及其变化的线性组合的准则，(b)表明，对于离散值状态估计场景，与只看后验相比，所提出的目标更有可能适当地对正确和不正确的估计进行分类，最后(c)证明该方法可以在脑机接口应用中显著提高人类意图估计速度，而不会显著降低准确性。

引用次数: 3

Audio Feature Generation for Missing Modality Problem in Video Action Recognition 视频动作识别中模态缺失问题的音频特征生成

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-14 DOI: 10.1109/ICASSP.2019.8682513

Hu-Cheng Lee, Chih-Yu Lin, P. Hsu, Winston H. Hsu

Despite the recent success of multi-modal action recognition in videos, in reality, we usually confront the situation that some data are not available beforehand, especially for multi-modal data. For example, while vision and audio data are required to address the multi-modal action recognition, audio tracks in videos are easily lost due to the broken files or the limitation of devices. To cope with this sound-missing problem, we present an approach to simulating deep audio feature from merely spatial-temporal vision data. We demonstrate that adding the simulating sound feature can significantly assist the multi-modal action recognition task. Evaluating our method on the Moments in Time (MIT) Dataset , we show that our proposed method performs favorably against the two-stream architecture, enabling a richer understanding of multi-modal action recognition in video.

尽管近年来视频中的多模态动作识别取得了成功，但在现实生活中，我们经常会遇到一些事先没有数据的情况，特别是对于多模态数据。例如，虽然需要视觉和音频数据来处理多模态动作识别，但视频中的音轨很容易由于文件损坏或设备的限制而丢失。为了解决这种声音缺失问题，我们提出了一种仅从时空视觉数据模拟深度音频特征的方法。我们证明了添加模拟声音特征可以显著地辅助多模态动作识别任务。在时间矩(MIT)数据集上评估我们的方法，我们表明我们提出的方法在两流架构下表现良好，能够更丰富地理解视频中的多模态动作识别。

引用次数: 6

Data-selective LMS-Newton and LMS-Quasi-Newton Algorithms 数据选择性LMS-Newton和lms -准牛顿算法

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-14 DOI: 10.1109/ICASSP.2019.8683076

C. Tsinos, P. Diniz

The huge volume of data that are available today requires data-selective processing approaches that avoid the costs in computational complexity via appropriately treating the non-innovative data. In this paper, extensions of the well-known adaptive filtering LMS-Newton and LMS-Quasi-Newton Algorithms are developed that enable data selection while also addressing the censorship of outliers that emerge due to high measurement errors. The proposed solutions allow the prescription of how often the acquired data are expected to be incorporated into the learning process based on some a priori information regarding the environment. Simulation results on both synthetic and real-world data verify the effectiveness of the proposed algorithms that may achieve significant reductions in computational costs without sacrificing estimation accuracy due to the selection of the data.

当今的海量数据需要数据选择性处理方法，通过适当处理非创新数据来避免计算复杂性的成本。在本文中，开发了著名的自适应滤波LMS-Newton和lms -准牛顿算法的扩展，使数据选择成为可能，同时也解决了由于高测量误差而出现的异常值的审查问题。建议的解决方案允许根据有关环境的一些先验信息，规定预期将获得的数据纳入学习过程的频率。在合成数据和实际数据上的仿真结果验证了所提出算法的有效性，该算法可以在不牺牲由于数据选择而导致的估计精度的情况下显著降低计算成本。

引用次数: 7

Inductive Conformal Predictor for Sparse Coding Classifiers: Applications to Image Classification 稀疏编码分类器的归纳共形预测器:在图像分类中的应用

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-14 DOI: 10.1109/ICASSP.2019.8682740

Sergio Matiz, K. Barner

Conformal prediction uses the degree of strangeness (nonconformity) of new data instances to determine the confidence values of new predictions. We propose an inductive conformal predictor for sparse coding classifiers, referred to as ICP-SCC. Our contribution is twofold: first, we present two nonconformity measures that produce reliable confidence values; second, we propose a batch mode active learning algorithm within the conformal prediction framework to improve classification performance by selecting training instances based on two criteria, informativeness and diversity. Experiments conducted on face and object recognition databases demonstrate that ICP-SCC improves the classification accuracy of state-of-the-art dictionary learning algorithms while producing reliable confidence values.

适形预测使用新数据实例的陌生程度(不一致性)来确定新预测的置信度值。我们提出了稀疏编码分类器的归纳共形预测器，称为ICP-SCC。我们的贡献是双重的:首先，我们提出了两个产生可靠置信度值的不符合度量;其次，在保形预测框架内提出了一种批处理模式主动学习算法，通过基于信息量和多样性两个标准选择训练实例来提高分类性能。在人脸和物体识别数据库上进行的实验表明，ICP-SCC提高了最先进的字典学习算法的分类精度，同时产生了可靠的置信度值。

引用次数: 0

Estimation of Gaze Region Using Two Dimensional Probabilistic Maps Constructed Using Convolutional Neural Networks 基于卷积神经网络构建的二维概率映射的注视区域估计

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-13 DOI: 10.1109/ICASSP.2019.8683794

S. Jha, C. Busso

Predicting the gaze of a user can have important applications in human computer interactions (HCI). They find applications in areas such as social interaction, driver distraction, human robot interaction and education. Appearance based models for gaze estimation have significantly improved due to recent advances in convolutional neural network (CNN). This paper proposes a method to predict the gaze of a user with deep models purely based on CNNs. A key novelty of the proposed model is that it produces a probabilistic map describing the gaze distribution (as opposed to predicting a single gaze direction). This approach is achieved by converting the regression problem into a classification problem, predicting the probability at the output instead of a single direction. The framework relies in a sequence of downsampling followed by upsampling to obtain the probabilistic gaze map. We observe that our proposed approach works better than a regression model in terms of prediction accuracy. The average mean squared error between the predicted gaze and the true gaze is observed to be 6.89◦ in a model trained and tested on the MSP-Gaze database, without any calibration or adaptation to the target user.

预测用户的注视在人机交互(HCI)中具有重要的应用。它们在社交互动、司机分心、人机互动和教育等领域得到了应用。由于卷积神经网络(CNN)的最新进展，基于外观的凝视估计模型得到了显着改进。本文提出了一种纯粹基于cnn的深度模型预测用户凝视的方法。该模型的一个关键新颖之处在于，它产生了一个描述凝视分布的概率图(而不是预测单一的凝视方向)。这种方法是通过将回归问题转换为分类问题来实现的，预测输出而不是单一方向的概率。该框架依赖于一系列的下采样和上采样来获得概率凝视图。我们观察到，我们提出的方法在预测精度方面比回归模型更好。在MSP-Gaze数据库上训练和测试的模型中，预测凝视和真实凝视之间的平均均方误差为6.89◦，没有任何校准或适应目标用户。

{"title":"Estimation of Gaze Region Using Two Dimensional Probabilistic Maps Constructed Using Convolutional Neural Networks","authors":"S. Jha, C. Busso","doi":"10.1109/ICASSP.2019.8683794","DOIUrl":"https://doi.org/10.1109/ICASSP.2019.8683794","url":null,"abstract":"Predicting the gaze of a user can have important applications in human computer interactions (HCI). They find applications in areas such as social interaction, driver distraction, human robot interaction and education. Appearance based models for gaze estimation have significantly improved due to recent advances in convolutional neural network (CNN). This paper proposes a method to predict the gaze of a user with deep models purely based on CNNs. A key novelty of the proposed model is that it produces a probabilistic map describing the gaze distribution (as opposed to predicting a single gaze direction). This approach is achieved by converting the regression problem into a classification problem, predicting the probability at the output instead of a single direction. The framework relies in a sequence of downsampling followed by upsampling to obtain the probabilistic gaze map. We observe that our proposed approach works better than a regression model in terms of prediction accuracy. The average mean squared error between the predicted gaze and the true gaze is observed to be 6.89◦ in a model trained and tested on the MSP-Gaze database, without any calibration or adaptation to the target user.","PeriodicalId":13203,"journal":{"name":"ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"92 1","pages":"3792-3796"},"PeriodicalIF":0.0,"publicationDate":"2019-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83892115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

In-Car Driver Authentication Using Wireless Sensing 使用无线传感的车内驾驶员身份验证

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2019-05-13 DOI: 10.1109/ICASSP.2019.8683522

Sai Deepika Regani, Qinyi Xu, Beibei Wang, Min Wu, K. J. R. Liu

Automobiles have become an essential part of everyday lives. In this work, we attempt to make them smarter by introducing the idea of in-car driver authentication using wireless sensing. Our aim is to develop a model which can recognize drivers automatically. Firstly, we address the problem of "changing in-car environments", where the existing wireless sensing based human identification system fails. To this end, we build the first in-car driver radio biometric dataset to understand the effect of changing environments on human radio biometrics. This dataset consists of radio biometrics of five people collected over a period of two months. We leverage this dataset-to create machine learning (ML) models that make the proposed system adaptive to new in-car environments. We obtained a maximum accuracy of 99.3% in classifying two drivers and 90.66% accuracy in validating a single driver.

汽车已经成为人们日常生活中必不可少的一部分。在这项工作中，我们试图通过引入使用无线传感的车内驾驶员身份验证的想法使它们更智能。我们的目标是开发一个能够自动识别驾驶员的模型。首先，我们解决了“车内环境变化”的问题，这是现有的基于无线传感的人体识别系统无法解决的问题。为此，我们建立了第一个车载驾驶员无线电生物识别数据集，以了解环境变化对人体无线电生物识别的影响。该数据集包括在两个月内收集的五个人的无线电生物识别信息。我们利用这个数据集来创建机器学习(ML)模型，使所提出的系统适应新的车内环境。我们在对两个驱动程序进行分类时获得了99.3%的最大准确率，在验证单个驱动程序时获得了90.66%的准确率。

引用次数: 3

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀