2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

英文中文

Image compression with Stochastic Winner-Take-All Auto-Encoder 随机赢家通吃自动编码器的图像压缩

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-14 DOI: 10.1109/ICASSP.2017.7952409

Thierry Dumas, A. Roumy, C. Guillemot

This paper addresses the problem of image compression using sparse representations. We propose a variant of autoencoder called Stochastic Winner-Take-All Auto-Encoder (SWTA AE). “Winner-Take-All” means that image patches compete with one another when computing their sparse representation and “Stochastic” indicates that a stochastic hyperparameter rules this competition during training. Unlike auto-encoders, SWTA AE performs variable rate image compression for images of any size after a single training, which is fundamental for compression. For comparison, we also propose a variant of Orthogonal Matching Pursuit (OMP) called Winner-Take-All Orthogonal Matching Pursuit (WTA OMP). In terms of rate-distortion trade-off, SWTA AE outperforms auto-encoders but it is worse than WTA OMP. Besides, SWTA AE can compete with JPEG in terms of rate-distortion.

本文解决了使用稀疏表示的图像压缩问题。我们提出了一种自编码器的变体，称为随机赢家通吃自编码器(SWTA AE)。“赢者通吃”意味着图像patch在计算其稀疏表示时相互竞争，“随机”表示在训练期间随机超参数控制这种竞争。与自编码器不同，SWTA AE在单次训练后对任何大小的图像执行可变速率图像压缩，这是压缩的基础。为了比较，我们还提出了正交匹配追求(OMP)的一种变体，称为赢者通吃的正交匹配追求(WTA OMP)。在速率失真权衡方面，SWTA AE优于自编码器，但不如WTA OMP。此外，SWTA AE在率失真方面可以与JPEG竞争。

引用次数: 25

Quantisation effects in PDMM: A first study for synchronous distributed averaging PDMM中的量化效应:同步分布平均的首次研究

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-14 DOI: 10.1109/ICASSP.2017.7952955

Daan H. M. Schellekens, Thomas W. Sherson, R. Heusdens

Large-scale networks of computing units, often characterised by the absence of central control, have become commonplace in many applications. To facilitate data processing in these large-scale networks, distributed signal processing is required. The iterative behaviour of distributed processing algorithms combined with energy, computational power, and bandwidth limitations imposed by such networks, place tight constraints on the transmission capacities of the individual nodes. In this paper we investigate the effects of subtractive dithered uniform quantisation in PDMM for the synchronous distributed averaging problem. This is done by deriving expressions for the mean squared error (MSE) that include quantisation noise. Also, the required data rate for quantised PDMM is considered. It was found that for practical applications quantisation in PDMM can be applied with a fixed-rate quantiser, such that significant data rate reduction can be achieved, without compromising the rate of convergence.

计算单元的大规模网络通常以缺乏中央控制为特征，在许多应用中已经变得司空见惯。为了便于在这些大规模网络中进行数据处理，需要进行分布式信号处理。分布式处理算法的迭代行为与此类网络施加的能量、计算能力和带宽限制相结合，对单个节点的传输能力施加了严格的约束。本文研究了PDMM中减法抖动均匀量化对同步分布平均问题的影响。这是通过推导包含量化噪声的均方误差(MSE)表达式来实现的。此外，还考虑了量化PDMM所需的数据速率。研究发现，在实际应用中，PDMM中的量化可以使用固定速率量化器，这样可以在不影响收敛速度的情况下实现显著的数据速率降低。

引用次数: 7

Blind image deconvolution using Student's-t prior with overlapping group sparsity 基于重叠群稀疏度的Student's-t先验盲图像反卷积

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-13 DOI: 10.1109/ICASSP.2017.7952470

Insu Jeon, Deokyoung Kang, S. Yoo

In this paper, we solve blind image deconvolution problem that is to remove blurs form a signal degraded image without any knowledge of the blur kernel. Since the problem is ill-posed, an image prior plays a significant role in accurate blind deconvolution. Traditional image prior assumes coefficients in filtered domains are sparse. However, it is assumed here that there exist additional structures over the sparse coefficients. Accordingly, we propose new problem formulation for the blind image deconvolution, which utilize the structural information by coupling Student's-t image prior with overlapping group sparsity. The proposed method resulted in an effective blind deconvolution algorithm that outperforms other state-of-the-art algorithms.

在本文中，我们解决了盲图像反卷积问题，即在不知道模糊核的情况下，从信号退化的图像中去除模糊。由于问题是病态的，图像先验在精确的盲反卷积中起着重要的作用。传统的图像先验假设滤波域的系数是稀疏的。然而，这里假设在稀疏系数上存在额外的结构。在此基础上，我们提出了一种新的盲图像反卷积问题公式，该公式通过耦合Student’s-t图像先验和重叠群稀疏性来利用图像的结构信息。提出的方法产生了一种有效的盲反卷积算法，优于其他最先进的算法。

引用次数: 1

Personalized video emotion tagging through a topic model 通过主题模型进行个性化视频情感标注

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-13 DOI: 10.1109/ICASSP.2017.7952680

Shan Wu, Shangfei Wang, Zhen Gao

The inherent dependencies among video content, personal characteristics, and perceptual emotion are crucial for personalized video emotion tagging, but have not been thoroughly exploited. To address this, we propose a novel topic model to capture such inherent dependencies. We assume that there are several potential human factors, or “topics,” that affect the personal characteristics and the personalized emotion responses to videos. During training, the proposed topic model exploits the latent space to model the relationships among personal characteristics, video content and video tagging behaviors. After learning, the proposed model can generate meaningful latent topics, which help personalized video emotion tagging. Efficient learning and inference algorithms of the model are proposed. Experimental results on the CP-QAE-I database demonstrate the effectiveness of the proposed approach in modeling complex relationships among video content, personal characteristics, and perceptual emotion, as well as its good performance in personalized video emotion.

视频内容、个人特征和感知情感之间的内在依赖关系是个性化视频情感标注的关键，但尚未得到充分利用。为了解决这个问题，我们提出了一个新的主题模型来捕获这种固有的依赖关系。我们假设有几个潜在的人为因素，或“主题”，会影响个人特征和对视频的个性化情感反应。在训练过程中，本文提出的话题模型利用潜在空间对个人特征、视频内容和视频标注行为之间的关系进行建模。经过学习，该模型可以生成有意义的潜在主题，有助于个性化视频情感标注。提出了高效的模型学习和推理算法。在CP-QAE-I数据库上的实验结果表明，该方法在视频内容、个人特征和感知情绪之间的复杂关系建模方面是有效的，并且在个性化视频情绪方面表现良好。

引用次数: 2

Acoustic imaging of sparse Sources with Orthogonal Matching Pursuit and clustering of basis vectors 基于正交匹配追踪和基向量聚类的稀疏源声成像

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-13 DOI: 10.1109/ICASSP.2017.7953314

Trond F. Bergh, Ines Hafizovic, S. Holm

We have devised a greedy method for finding solutions to the sparse Deconvolution Approach for the Mapping of Acoustic Sources inverse problem using a variant of Orthogonal Matching Pursuit. The algorithm has two stages, wherein the first stage consists of selecting a subset of the basis vectors iteratively via a regularized inverse of the point spread function, and the second stage consists of constructing point source solutions using this basis subset and its coefficients via hierarchical agglomerative clustering. We have evaluated the algorithm on both synthetic and real data, and show that the overall accuracy in terms of direction of arrival and reconstructed source power is better than four other state of the art methods.

针对声源映射反问题的稀疏反褶积方法，我们设计了一种贪心的求解方法，该方法采用正交匹配追踪的一种变体。该算法分为两个阶段，第一阶段是通过点扩散函数的正则化逆迭代选择基向量子集，第二阶段是通过分层聚类利用该基子集及其系数构造点源解。我们在合成数据和实际数据上对该算法进行了评估，并表明在到达方向和重建源功率方面的总体精度优于其他四种最先进的方法。

引用次数: 1

Super-resolution delay-Doppler estimation for sub-Nyquist radar via atomic norm minimization 基于原子范数最小化的亚奈奎斯特雷达超分辨率延迟多普勒估计

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-13 DOI: 10.1109/ICASSP.2017.7952973

Feng Xi, Shengyao Chen, Zhong Liu

This paper studies the estimation of the delay and Doppler parameters of the sub-Nyquist radars. By formulating the delay-Doppler estimation as the low-rank matrix recovery, we propose an atomic norm minimization-based estimation approach. With the recovered low-rank matrix, we determine and pair the delay and Doppler parameters of the radar targets. Numerical simulations demonstrate the superior performance of the proposed approach, as compared to the state-of-the-art approaches.

本文研究了亚奈奎斯特雷达的时延和多普勒参数的估计。通过将延迟多普勒估计表述为低秩矩阵恢复，我们提出了一种基于原子范数最小化的估计方法。利用恢复的低秩矩阵，确定并配对雷达目标的时延和多普勒参数。数值模拟结果表明，与现有的方法相比，该方法具有优越的性能。

引用次数: 5

Emotion recognition through integrating EEG and peripheral signals 结合脑电和外周信号的情绪识别

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-13 DOI: 10.1109/ICASSP.2017.7952681

Yangyang Shu, Shangfei Wang

The inherent dependencies among multiple physiological signals are crucial for multimodal emotion recognition, but have not been thoroughly exploited yet. This paper propose to use restricted Boltzmann machine (RBM) to model such dependencies.Specifically, the visible nodes of RBM represent EEG and peripheral physiological signals, and thus the connections between visible nodes and hidden nodes capture the intrinsic relations among multiple physiological signals. The RBM generates new representation from multiple physiological signals. Then, a support vector machine is adopted to recognize users' emotion states from the generated features. Furthermore, we extend the proposed fusion method for incomplete datas, since physiological signals are often corrupted due to artifacts. Specifically, we pre-train the RBM using all the complete data, then we update missing values and RBM parameters to minimize free energy of visible vectors using both complete and incomplete data. Experiments on two benchmark databases demonstrate the effectiveness of the proposed methods.

多种生理信号之间的内在依赖关系对多模态情绪识别至关重要，但目前尚未得到充分利用。本文提出用受限玻尔兹曼机(RBM)来建模这种依赖关系。具体来说，RBM的可见节点代表脑电图和外周生理信号，可见节点和隐藏节点之间的联系捕捉了多个生理信号之间的内在联系。RBM从多种生理信号中生成新的表征。然后，采用支持向量机从生成的特征中识别用户的情绪状态。此外，我们将所提出的融合方法扩展到不完整数据，因为生理信号经常因伪影而损坏。具体而言，我们使用所有完整数据对RBM进行预训练，然后使用完整和不完整数据更新缺失值和RBM参数以最小化可见向量的自由能。在两个基准数据库上的实验验证了所提方法的有效性。

引用次数: 31

Active speech control using wave-domain processing with a linear wall of dipole secondary sources 利用偶极子二次源的线性壁进行波域处理的主动语音控制

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-12 DOI: 10.1109/ICASSP.2017.7952197

Jacob Donley, C. Ritz, W. Kleijn

In this paper, we investigate the effects of compensating for wave-domain filtering delay in an active speech control system. An active control system utilising wave-domain processed basis functions is evaluated for a linear array of dipole secondary sources. The target control soundfield is matched in a least squares sense using orthogonal wavefields to a predicted future target soundfield. Filtering is implemented using a block-based short-time signal processing approach which induces an inherent delay. We present an autoregressive method for predictively compensating for the filter delay. An approach to block-length choice that maximises the soundfield control is proposed for a trade-off between soundfield reproduction accuracy and prediction accuracy. Results show that block-length choice has a significant effect on the active suppression of speech.

在本文中，我们研究了补偿波域滤波延迟在主动语音控制系统中的效果。利用波域处理基函数对偶极子二次源线性阵列进行主动控制。目标控制声场用正交波场与预测的未来目标声场进行最小二乘匹配。滤波采用基于块的短时信号处理方法实现，该方法会引起固有延迟。我们提出了一种预测补偿滤波器延迟的自回归方法。为了在声场再现精度和预测精度之间进行权衡，提出了一种最大化声场控制的块长度选择方法。结果表明，块长度选择对语音主动抑制有显著影响。

引用次数: 2

Atomic norm minimization for modal analysis with random spatial compression 随机空间压缩模态分析的原子范数最小化

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-12 DOI: 10.1109/ICASSP.2017.7952757

S. Li, Dehui Yang, M. Wakin

Identifying characteristic vibrational modes and frequencies is of great importance for monitoring the health of structures such as buildings and bridges. In this work, we address the problem of estimating the modal parameters of a structure from small amounts of vibrational data collected from wireless sensors distributed on the structure. We consider a randomized spatial compression scheme for minimizing the amount of data that is collected and transmitted by the sensors. Using the recent technique of atomic norm minimization, we show that under certain conditions exact recovery of the mode shapes and frequencies is possible. In addition, in a simulation based on synthetic data, our method outperforms a singular value decomposition (SVD) based method for modal analysis that uses the uncompressed data set.

识别振动特征模态和频率对于监测建筑物和桥梁等结构的健康状况具有重要意义。在这项工作中，我们解决了从分布在结构上的无线传感器收集的少量振动数据中估计结构模态参数的问题。我们考虑了一种随机空间压缩方案，以最小化传感器收集和传输的数据量。利用最近的原子范数最小化技术，我们证明了在一定条件下，模态振型和频率的精确恢复是可能的。此外，在基于合成数据的仿真中，我们的方法优于使用未压缩数据集的基于奇异值分解(SVD)的模态分析方法。

引用次数: 2

Orthogonal precoding for sidelobe suppression in DFT-based systems using block reflectors 基于dft的分块反射器旁瓣抑制的正交预编码

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-12 DOI: 10.1109/ICASSP.2017.7952849

I. Clarkson

Sidelobe suppression has always been an important part of crafting communications signals to keep interference with users of adjacent spectrum to a minimum. Systems based on the discrete Fourier transform, such as orthogonal frequency-division multiplexing (OFDM) and single-carrier frequency-division multiple access (SC-FDMA) are especially prone to out-of-band power leakage. Although many techniques have been proposed to suppress sidelobes in DFT-based systems, a satisfactory balance between computational complexity and out-of-band power leakage has remained elusive.

旁瓣抑制一直是制作通信信号的重要组成部分，以保持对相邻频谱用户的干扰降到最低。基于离散傅立叶变换的系统，如正交频分复用(OFDM)和单载波频分多址(SC-FDMA)，特别容易出现带外功率泄漏。尽管在基于dft的系统中已经提出了许多抑制副瓣的技术，但在计算复杂度和带外功率泄漏之间取得令人满意的平衡仍然是难以实现的。

引用次数: 12

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀