2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

英文中文

Steganography with two JPEGs of the same scene 对同一场景的两张jpeg进行隐写

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-11 DOI: 10.1109/ICASSP.2017.7952530

Tomáš Denemark, J. Fridrich

It is widely recognized that incorporating side-information at the sender can significantly improve steganographic security in practice. Currently, most side-informed schemes for digital images utilize a high quality “precover” image that is subsequently processed and then jointly quantized and embedded with a secret. In this paper, we investigate an alternative form of side-information in the form of two JPEG images of the same scene. The second JPEG image is used to determine the preferred polarity of embedding changes and to modulate their costs. Tests on real imagery show a very significant improvement in empirical security with respect to steganography utilizing a single JPEG image.

人们普遍认为，在发送方加入侧信息可以显著提高隐写术的安全性。目前，大多数侧面通知的数字图像方案利用高质量的“预覆盖”图像，随后进行处理，然后联合量化并嵌入一个秘密。在本文中，我们以同一场景的两张JPEG图像的形式研究了另一种形式的侧信息。第二个JPEG图像用于确定嵌入更改的首选极性并调节其代价。对真实图像的测试表明，相对于使用单个JPEG图像的隐写，经验安全性得到了非常显著的改进。

引用次数: 12

Stochastic Truncated Wirtinger Flow Algorithm for phase retrieval using boolean coded apertures 基于布尔编码孔径的随机截断Wirtinger流相位检索算法

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-11 DOI: 10.1109/ICASSP.2017.7953318

Samuel Pinilla, Camilo Noriega, H. Arguello

X-ray crystallography is an experimental technique to estimate the 3D atomic positions of the elements present in a crystal. This technique constructs the 3D structure from the phase of diffracted and patterned X-rays (DPX). Multiple intensity DPX measurements are acquired to solve the phase retrieval problem. The feasibility of implementing this technique depends on solving the phase retrieval problem using expensive multiple valued patterns and the Truncated Wirtinger Flow Algorithm. This paper presents a Stochastic Truncated Wirtinger Flow Algorithm (STWF) which solves the phase retrieval problem based on DPX measurements low-cost boolean block-unblock coded apertures. Several simulations are realized to demonstrate the convergence of the STWF algorithm and the optimal parameters of the boolean coded apertures. The results indicate that given the DPX measurements, the quality of reconstructed phase images using STWF attained up 24:63dB of PSNR.

x射线晶体学是一种估计晶体中元素三维原子位置的实验技术。该技术从衍射和图案x射线(DPX)的相位构建3D结构。为了解决相位恢复问题，需要获取多个强度DPX测量值。实现该技术的可行性取决于使用昂贵的多值模式和截断Wirtinger流算法来解决相位检索问题。提出了一种基于DPX测量的低成本布尔块-无块编码孔径的随机截断wwinger流算法(STWF)，解决了相位检索问题。通过仿真验证了STWF算法的收敛性和布尔编码孔径的最优参数。结果表明，在给定DPX测量值的情况下，使用STWF重建的相位图像的PSNR高达24:63dB。

引用次数: 10

Feature mapping for speaker diarization in noisy conditions 噪声条件下扬声器拨号化的特征映射

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-11 DOI: 10.1109/ICASSP.2017.7953197

Weixin Zhu, Wu Guo, Guoping Hu

Speaker diarization in noisy conditions is addressed in this paper. The regression-based DNN is first adopted to map the noisy acoustic features to the clean features, and then consensus clustering of the original and mapped features is used to fuse the diarization results. The experiments are conducted on the IFLY-DIAR-II database, which is a Chinese talk show database with various noise types, such as music, applause and laughter. Compared to the baseline system using PLP features, a 21.26% relative DER improvement can be achieved using the proposed algorithm.

本文研究了噪声条件下的扬声器偏振化问题。首先采用基于回归的深度神经网络将噪声特征映射到干净特征上，然后对原始特征和映射的特征进行一致聚类，融合特征化结果。实验是在IFLY-DIAR-II数据库上进行的，IFLY-DIAR-II数据库是一个中文脱口秀数据库，具有各种噪音类型，如音乐，掌声和笑声。与使用PLP特征的基线系统相比，使用该算法可以实现21.26%的相对DER改进。

引用次数: 4

From focal stacks to tensor display: A method for light field visualization without multi-view images 从焦点堆叠到张量显示:一种无多视点图像的光场可视化方法

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-11 DOI: 10.1109/ICASSP.2017.7952508

Yuto Kobayashi, Keita Takahashi, T. Fujii

A new type of light field display called a tensor display was investigated. Although this display consists of only a few light attenuating layers located in front of a backlight, many views can be emitted in different directions simultaneously without sacrificing the resolution of each view. The transmittance pattern of each layer is calculated from a light field, namely, a set of dense multi-view images (typically dozens) that are to be observed from different directions. However, preparing such images is often cumbersome for real objects. We propose a method that does not require multi-view images as the input; instead, a focal stack composed of only a few differently focused images is directly transformed into the layer patterns. Our method greatly reduces the data acquisition cost while also maintaining the quality of the output light field. We validated the method with experiments using synthetic light field datasets and a focal stack acquired by an ordinary camera.

研究了一种新型的光场显示——张量显示。虽然这种显示器仅由位于背光前面的几个光衰减层组成，但可以同时向不同方向发射许多视图，而不会牺牲每个视图的分辨率。每层的透光率模式是从一个光场计算出来的，即从不同方向观察到的一组密集的多视图图像(通常是几十个)。然而，为真实物体准备这样的图像通常很麻烦。我们提出了一种不需要多视图图像作为输入的方法;相反，仅由几个不同聚焦的图像组成的焦点堆栈直接转换为层模式。该方法在保证输出光场质量的同时，大大降低了数据采集成本。利用合成光场数据集和普通相机采集的焦叠进行了实验验证。

引用次数: 10

Jointly optimized transform domain temporal prediction and sub-pixel interpolation 联合优化变换域时间预测和亚像素插值

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-11 DOI: 10.1109/ICASSP.2017.7952365

Shunyao Li, Tejaswi Nanjundaswamy, K. Rose

Conventional pixel-domain block matching temporal (inter) prediction is suboptimal, since it ignores the underlying spatial correlation. Hence in our recent research we proposed transform domain temporal prediction (TDTP), wherein spatially decorrelated transform coefficients are individually predicted. Later we proposed extended block TDTP (EB-TDTP), which fully exploits spatial correlation around reference block boundaries. However, the transform domain temporal correlation exploited by (EB-)TDTP interferes with the frequency response of sub-pixel interpolation filters. Thus, in this paper, we propose to replace the standard sub-pixel interpolation with filters which are jointly designed with EB-TDTP based on statistics of the data, for either separable or non-separable interpolation structures. We also employ a two-loop asymptotic closed-loop (ACL) approach for statistically stable off-line design. Experiments show that our framework can achieve up to 1dB gain in PSNR over HEVC.

传统的像素域块匹配时间(间)预测是次优的，因为它忽略了潜在的空间相关性。因此，在我们最近的研究中，我们提出了变换域时间预测(TDTP)，其中空间去相关变换系数单独预测。随后，我们提出了扩展块TDTP (EB-TDTP)，它充分利用了参考块边界周围的空间相关性。然而，(EB-)TDTP利用的变换域时间相关性会干扰亚像素插值滤波器的频率响应。因此，对于可分或不可分的插值结构，本文提出用与EB-TDTP共同设计的基于数据统计的滤波器代替标准的亚像素插值。我们还采用了一种双环渐近闭环(ACL)方法来进行统计稳定的离线设计。实验表明，该框架可以在HEVC的PSNR下获得高达1dB的增益。

引用次数: 0

Spatio-temporal binary video inpainting via threshold dynamics 基于阈值动态的时空二值视频图像绘制

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-10 DOI: 10.1109/ICASSP.2017.7952471

Maria Oliver, Roberto P. Palomares, C. Ballester, G. Haro

We propose a new variational method for the completion of moving shapes through binary video inpainting that works by smoothly recovering the objects into an inpainting hole. We solve it by a simple dynamic shape analysis algorithm based on threshold dynamics. The model takes into account the optical flow and motion occlusions. The resulting inpainting algorithm diffuses the available information along the space and the visible trajectories of the pixels in time. We show its performance with examples from the Sintel dataset, which contains complex object motion and occlusions.

我们提出了一种新的变分方法来完成运动形状通过二进制视频补漆，工作原理是平滑地恢复到补漆孔的对象。采用基于阈值动力学的简单动态形状分析算法求解。该模型考虑了光流和运动遮挡。所得的图像绘制算法在时间上沿空间和像素的可见轨迹扩散可用信息。我们用sinintel数据集的例子来展示它的性能，其中包含复杂的物体运动和遮挡。

引用次数: 1

Estimation in autoregressive processes with partial observations 部分观测自回归过程的估计

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-10 DOI: 10.1109/ICASSP.2017.7952950

Milind Rao, T. Javidi, Yonina C. Eldar, A. Goldsmith

We consider the problem of estimating the covariance matrix and the transition matrix of vector autoregressive (VAR) processes from partial measurements. This model encompasses settings where there are limitations in the data acquisition of the underlying measurement systems so that data is lost or corrupted by noise. An estimator for the covariance matrix of the observations is first presented. More refined estimators, factoring in structural constraints on the covariance matrix such as sparsity, bandedness, sparsity of the inverse and low-rankness are then introduced that are particularly useful in the high-dimensional regime. These estimates are then used to perform system identification by estimating the state transition matrix with or without further structural assumptions. Non-asymptotic guarantees are presented for all estimators.

研究了向量自回归(VAR)过程的协方差矩阵和转移矩阵的估计问题。该模型包含了在底层测量系统的数据采集中存在限制的设置，以便数据丢失或被噪声损坏。首先给出了观测值协方差矩阵的估计量。更精细的估计，考虑到协方差矩阵的结构约束，如稀疏性、带性、逆稀疏性和低秩性，然后引入在高维状态下特别有用的估计。然后，这些估计被用于通过估计状态转移矩阵来执行系统识别，无论是否有进一步的结构假设。给出了所有估计量的非渐近保证。

引用次数: 10

Identifying FMRI dynamic connectivity states using affinity propagation clustering method: Application to schizophrenia 使用亲和传播聚类方法识别FMRI动态连接状态:在精神分裂症中的应用

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-10 DOI: 10.1109/ICASSP.2017.7952287

M. Salman, Yuhui Du, V. Calhoun

Numerous studies have shown that brain functional connectivity patterns can be time-varying over periods of tens of seconds. It is important to capture inherent non-stationary connectivity states for a better understanding of the influence of disease on brain connectivity. K-means has been widely used to extract the connectivity states from dynamic functional connectivity. However, K-means is dependent on initialization and can be exponentially slow in converging due to extensive noise in dynamic functional connectivity. In this work, we propose to use an affinity propagation clustering method to estimate the connectivity states. By applying K-means and the new method separately, we analyzed dynamic functional connectivity of 82 healthy controls and 82 schizophrenia patients, and then explored group differences between schizophrenia patients and healthy controls in the identified connectivity states. Both methods revealed that group differences mainly lay in visual, sensorimotor and frontal cortices. However, the new approach found more meaningful group differences than K-means. Our finding supports that our method is promising in exploring biomarkers of mental disorders.

大量研究表明，大脑功能连接模式可以在几十秒的时间内随时间变化。捕获固有的非平稳连接状态对于更好地理解疾病对大脑连接的影响是很重要的。K-means被广泛用于从动态功能连接中提取连接状态。然而，K-means依赖于初始化，并且由于动态函数连通性中广泛的噪声，其收敛速度可能呈指数级缓慢。在这项工作中，我们提出使用亲和传播聚类方法来估计连接状态。分别应用K-means和新方法分析了82例健康对照和82例精神分裂症患者的动态功能连接状态，并探讨了精神分裂症患者和健康对照在识别的连接状态上的组间差异。两种方法均显示组间差异主要存在于视觉、感觉运动和额叶皮层。然而，新方法发现了比K-means更有意义的群体差异。我们的发现支持了我们的方法在探索精神障碍的生物标志物方面是有希望的。

{"title":"Identifying FMRI dynamic connectivity states using affinity propagation clustering method: Application to schizophrenia","authors":"M. Salman, Yuhui Du, V. Calhoun","doi":"10.1109/ICASSP.2017.7952287","DOIUrl":"https://doi.org/10.1109/ICASSP.2017.7952287","url":null,"abstract":"Numerous studies have shown that brain functional connectivity patterns can be time-varying over periods of tens of seconds. It is important to capture inherent non-stationary connectivity states for a better understanding of the influence of disease on brain connectivity. K-means has been widely used to extract the connectivity states from dynamic functional connectivity. However, K-means is dependent on initialization and can be exponentially slow in converging due to extensive noise in dynamic functional connectivity. In this work, we propose to use an affinity propagation clustering method to estimate the connectivity states. By applying K-means and the new method separately, we analyzed dynamic functional connectivity of 82 healthy controls and 82 schizophrenia patients, and then explored group differences between schizophrenia patients and healthy controls in the identified connectivity states. Both methods revealed that group differences mainly lay in visual, sensorimotor and frontal cortices. However, the new approach found more meaningful group differences than K-means. Our finding supports that our method is promising in exploring biomarkers of mental disorders.","PeriodicalId":118243,"journal":{"name":"2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125620963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

D2L: Decentralized dictionary learning over dynamic networks D2L:动态网络上的去中心化字典学习

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-09 DOI: 10.1109/ICASSP.2017.7952924

Amir Daneshmand, Ying Sun, G. Scutari, F. Facchinei

The paper studies a general class of distributed dictionary learning (DL) problems where the learning task is distributed over a multi-agent network with (possibly) time-varying (non-symmetric) connectivity. This setting is relevant, for instance, in scenarios where massive amounts of data are not collocated but collected/stored in different spatial locations. We develop a unified distributed algorithmic framework for this class of non-convex problems and establish its asymptotic convergence. The new method hinges on Successive Convex Approximation (SCA) techniques while leveraging a novel broadcast protocol to disseminate information and distribute the computation over the network, which neither requires the double-stochasticity of the consensus matrices nor the knowledge of the graph sequence to implement. To the best of our knowledge, this is the first distributed scheme with provable convergence for DL (and more generally bi-convex) problems, over (time-varying) digraphs.

本文研究了一类一般的分布式字典学习(DL)问题，其中学习任务分布在具有(可能)时变(非对称)连接的多智能体网络上。例如，在大量数据不是并置而是收集/存储在不同空间位置的场景中，此设置是相关的。我们为这类非凸问题建立了一个统一的分布式算法框架，并建立了它的渐近收敛性。新方法依赖于连续凸逼近(SCA)技术，同时利用一种新的广播协议在网络上传播信息和分配计算，既不需要共识矩阵的双随机性，也不需要图序列的知识来实现。据我们所知，这是第一个在(时变)有向图上具有可证明的DL(以及更普遍的双凸)问题收敛性的分布式方案。

引用次数: 5

A distributed constrained-form support vector machine 分布式约束形式支持向量机

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2017-03-09 DOI: 10.1109/ICASSP.2017.7952956

François D. Côté, I. Psaromiligkos, W. Gross

Despite the importance of distributed learning, few fully distributed support vector machines exist. In this paper, not only do we provide a fully distributed nonlinear SVM; we propose the first distributed constrained-form SVM. In the fully distributed context, a dataset is distributed among networked agents that cannot divulge their data, let alone centralize the data, and can only communicate with their neighbors in the network. Our strategy is based on two algorithms: the Douglas-Rachford algorithm and the projection-gradient method. We validate our approach by demonstrating through simulations that it can train a classifier that agrees closely with the centralized solution.

尽管分布式学习很重要，但是完全分布式的支持向量机还很少。在本文中，我们不仅提供了一个全分布的非线性支持向量机;我们提出了第一个分布式约束形式支持向量机。在完全分布式上下文中，数据集分布在网络代理之间，这些代理不能泄露它们的数据，更不用说集中数据，并且只能与网络中的邻居通信。我们的策略基于两种算法:Douglas-Rachford算法和投影梯度方法。我们通过模拟来验证我们的方法，它可以训练一个与集中式解决方案非常一致的分类器。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀