Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific最新文献

英文中文

Redefining self-similarity in natural images for denoising using graph signal gradient 重新定义自然图像的自相似度，利用图信号梯度去噪

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041627

Jiahao Pang, Gene Cheung, Wei Hu, O. Au

Image denoising is the most basic inverse imaging problem. As an under-determined problem, appropriate definition of image priors to regularize the problem is crucial. Among recent proposed priors for image denoising are: i) graph Laplacian regularizer where a given pixel patch is assumed to be smooth in the graph-signal domain; and ii) self-similarity prior where image patches are assumed to recur throughout a natural image in non-local spatial regions. In our first contribution, we demonstrate that the graph Laplacian regularizer converges to a continuous time functional counterpart, and careful selection of its features can lead to a discriminant signal prior. In our second contribution, we redefine patch self-similarity in terms of patch gradients and argue that the new definition results in a more accurate estimate of the graph Laplacian matrix, and thus better image denoising performance. Experiments show that our designed algorithm based on graph Laplacian regularizer and gradient-based self-similarity can outperform non-local means (NLM) denoising by up to 1.4 dB in PSNR.

图像去噪是最基本的逆成像问题。作为一个欠定问题，适当的图像先验定义对问题进行正则化至关重要。最近提出的图像去噪的先验算法有:i)图拉普拉斯正则化，其中假设给定的像素块在图信号域中是光滑的;ii)自相似性先验，即假设图像斑块在非局部空间区域中在整个自然图像中反复出现。在我们的第一个贡献中，我们证明了图拉普拉斯正则化器收敛于连续时间函数对立物，并且仔细选择其特征可以导致判别信号先验。在我们的第二篇论文中，我们根据斑块梯度重新定义了斑块自相似性，并认为新的定义可以更准确地估计图拉普拉斯矩阵，从而获得更好的图像去噪性能。实验表明，基于图拉普拉斯正则化和基于梯度的自相似度的算法比非局部均值(NLM)去噪的PSNR提高了1.4 dB。

{"title":"Redefining self-similarity in natural images for denoising using graph signal gradient","authors":"Jiahao Pang, Gene Cheung, Wei Hu, O. Au","doi":"10.1109/APSIPA.2014.7041627","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041627","url":null,"abstract":"Image denoising is the most basic inverse imaging problem. As an under-determined problem, appropriate definition of image priors to regularize the problem is crucial. Among recent proposed priors for image denoising are: i) graph Laplacian regularizer where a given pixel patch is assumed to be smooth in the graph-signal domain; and ii) self-similarity prior where image patches are assumed to recur throughout a natural image in non-local spatial regions. In our first contribution, we demonstrate that the graph Laplacian regularizer converges to a continuous time functional counterpart, and careful selection of its features can lead to a discriminant signal prior. In our second contribution, we redefine patch self-similarity in terms of patch gradients and argue that the new definition results in a more accurate estimate of the graph Laplacian matrix, and thus better image denoising performance. Experiments show that our designed algorithm based on graph Laplacian regularizer and gradient-based self-similarity can outperform non-local means (NLM) denoising by up to 1.4 dB in PSNR.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"35 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114117807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 29

A clustering analysis of Chinese consonants based on functional load 基于功能负荷的汉语辅音聚类分析

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041637

Bin Wu, Jinsong Zhang, Yanlu Xie

This paper attempts to provide some insights about the relationship between the differentiability and the classification importance of consonants in Chinese speech communication. The two characteristics can be modelled by the perceptual distance and the functional load respectively. We have a clustering analysis of Chinese consonants based on functional load (FL) relied on mutual information (MI) between the text and its phoneme transcription. Then we compare our clustering result with that based on the perceptual distance by articulation tests. By experimenting on the Chinese newspaper corpus with millions of sentences, we find most phonemes at the same place of articulation with different manners tend to have large FLs pairwise. It is consistent with the result that those phonemes tend to have long perceptual distance pairwise.

本文试图对汉语语音交际中辅音的可区分性与分类重要性之间的关系提出一些见解。这两个特征可以分别用感知距离和功能负荷来表示。本文对汉语辅音进行了基于功能负荷的聚类分析，该聚类分析依赖于文本与音素转录之间的互信息。然后通过发音测试将我们的聚类结果与基于感知距离的聚类结果进行比较。通过对中文报纸语料库数百万句的实验，我们发现，在发音相同的位置，不同发音方式的大多数音素往往具有较大的成对外音。这与结果一致，即这些音素倾向于具有较长的成对感知距离。

引用次数: 7

Recursive neural network paraphrase identification for example-based dialog retrieval 基于实例的对话检索递归神经网络释义识别

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041777

Lasguido Nio, S. Sakti, Graham Neubig, T. Toda, Satoshi Nakamura

An example-based dialog model often require a lot of data collections to achieve a good performance. However, when it comes on handling an out of vocabulary (OOV) database queries, this approach resulting in weakness and inadequate handling of interactions between words in the sentence. In this work, we try to overcome this problem by utilizing recursive neural network paraphrase identification to improve the robustness of example-based dialog response retrieval. We model our dialog-pair database and user input query with distributed word representations, and employ recursive autoencoders and dynamic pooling to determine whether two sentences with arbitrary length have the same meaning. The distributed representations have the potential to improve handling of OOV cases, and the recursive structure can reduce confusion in example matching.

基于示例的对话框模型通常需要大量的数据收集才能获得良好的性能。然而，在处理词汇量不足(OOV)数据库查询时，这种方法会导致对句子中单词之间交互的处理不足。在这项工作中，我们试图通过使用递归神经网络释义识别来克服这个问题，以提高基于示例的对话响应检索的鲁棒性。我们使用分布式单词表示对对话对数据库和用户输入查询进行建模，并使用递归自动编码器和动态池来确定任意长度的两个句子是否具有相同的含义。分布式表示有可能改善对OOV情况的处理，递归结构可以减少示例匹配中的混淆。

引用次数: 0

Improved cross-layer cooperative MAC protocol for wireless ad hoc networks 改进的无线自组织网络跨层协作MAC协议

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041716

Quang-Trung Hoang, X. Tran

This paper considers the design of a cross-layer medium access control (MAC) protocol for wireless ad hoc cooperative networks. Specifically, we redesign the message exchange process of the MAC protocol previously proposed by Shan et al. By using a HRP signal with shorter length the proposed protocol can reduce the protocol overhead and thus improve the transmission reliability. We also propose to use only one HRP signal to resolve the collision among the helpers with the same cooperative rate. The proposed protocol achieves higher path throughput and lower end-to-end packet latency compared with that by Shan et al. and the traditional IEEE 802.11 MAC protocol.

研究了无线自组织协作网络的跨层介质访问控制(MAC)协议的设计。具体来说，我们重新设计了Shan等人之前提出的MAC协议的消息交换过程。该协议通过使用短长度的HRP信号，减少了协议开销，提高了传输的可靠性。我们还建议使用一个HRP信号来解决具有相同合作速率的helper之间的冲突。与Shan等人的协议和传统的IEEE 802.11 MAC协议相比，该协议实现了更高的路径吞吐量和更低的端到端数据包延迟。

引用次数: 9

Learning visual co-occurrence with auto-encoder for image super-resolution 学习视觉共现与自编码器的图像超分辨率

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041671

Yudong Liang, Jinjun Wang, Shizhou Zhang, Yihong Gong

This paper proposes a novel neural network learning the essential mapping function between the low resolution and high resolution image for Image superresolution problem. In our approach, patch recurrence property of small patches in natural image are utilized as a prior to train the network. An autoencoder neutral network is designed to reconstruct the high resolution patches. The constraint that the output of the coding part should be similar as the corresponding high resolution patches is imposed to ameliorate the illness nature of the superresolution problem. In fact, the degeneration mapping from the high resolution image to the low resolution image is also integrated in the network. Both visual improvements and objective assessments are demonstrated on true images.

针对图像超分辨率问题，提出了一种学习低分辨率和高分辨率图像基本映射函数的神经网络。在我们的方法中，利用自然图像中小块的块递归特性作为训练网络的先验。设计了一个自编码器神经网络来重建高分辨率的图像。为了改善超分辨率问题的病态性，对编码部分的输出施加了与相应的高分辨率补丁相似的约束。实际上，从高分辨率图像到低分辨率图像的退化映射也集成在网络中。在真实图像上演示了视觉改进和客观评估。

引用次数: 5

Compressed sensing based channel estimation for uplink OFDMA systems 基于压缩感知的OFDMA上行系统信道估计

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041569

K. Hayashi, Masanori Sakai, Takuya Kamenosono, Megumi Kaneko

The paper considers a time domain channel estimation approach for uplink OFDMA (Orthogonal Frequency Division Multiple Access) systems. Although frequency domain channel estimation schemes are widely used for those systems, we propose time domain channel estimation schemes by taking advantage of the sparsity of channel impulse response with compressed sensing. Numerical simulations show the merit of the proposed schemes, which demonstrates the validity of the time domain channel estimation approach for OFDMA systems.

本文研究了一种用于上行OFDMA(正交频分多址)系统的时域信道估计方法。虽然频域信道估计方案被广泛用于这些系统，但我们提出了利用信道脉冲响应的稀疏性和压缩感知的时域信道估计方案。数值仿真结果表明了所提方案的优点，验证了时域信道估计方法在OFDMA系统中的有效性。

引用次数: 3

Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording 异步语音录制远程语音识别的去噪自编码器和环境自适应

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041548

Longbiao Wang, Bo Ren, Yuma Ueda, A. Kai, Shunta Teraoka, T. Fukushima

In this paper, we propose a robust distant-talking speech recognition system with asynchronous speech recording. This is implemented by combining denoising autoencoder-based cepstral-domain dereverberation, automatic asynchronous speech (microphone or mobile terminal) selection and environment adaptation. Although applications using mobile terminals have attracted increasing attention, there are few studies that focus on distant-talking speech recognition with asynchronous mobile terminals. For the system proposed in this paper, after applying a denoising autoencoder in the cepstral domain of speech to suppress reverberation and performing Large Vocabulary Continuous Speech Recognition (LVCSR), we adopted automatic asynchronous mobile terminal selection and environment adaptation using speech segments from optimal mobile terminals. The proposed method was evaluated using a reverberant WSJCAMO corpus, which was emitted by a loudspeaker and recorded in a meeting room with multiple speakers by far-field multiple mobile terminals. By integrating a cepstral-domain denoising autoencoder and automatic mobile terminal selection with environment adaptation, the average Word Error Rate (WER) was reduced from 51.8% of the baseline system to 28.8%, i.e., the relative error reduction rate was 44.4% when using multi-condition acoustic models.

本文提出了一种鲁棒的异步语音录制远程语音识别系统。这是通过结合去噪自编码器的倒频域去噪、自动异步语音(麦克风或移动终端)选择和环境适应来实现的。尽管基于移动终端的语音识别应用越来越受到人们的关注，但针对基于异步移动终端的远程语音识别的研究却很少。本文提出的系统在语音的倒谱域采用去噪自编码器抑制混响，并进行大词汇量连续语音识别(LVCSR)后，利用最优移动终端的语音片段进行自动异步移动终端选择和环境自适应。采用WSJCAMO混响语料库对该方法进行了评价，该语料库由扬声器发射，并由远场多移动终端在有多个扬声器的会议室中录制。通过集成倒谱域去噪自编码器和具有环境自适应功能的移动终端自动选择，将平均单词错误率(WER)从基线系统的51.8%降低到28.8%，即多条件声学模型的相对错误率为44.4%。

{"title":"Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording","authors":"Longbiao Wang, Bo Ren, Yuma Ueda, A. Kai, Shunta Teraoka, T. Fukushima","doi":"10.1109/APSIPA.2014.7041548","DOIUrl":"https://doi.org/10.1109/APSIPA.2014.7041548","url":null,"abstract":"In this paper, we propose a robust distant-talking speech recognition system with asynchronous speech recording. This is implemented by combining denoising autoencoder-based cepstral-domain dereverberation, automatic asynchronous speech (microphone or mobile terminal) selection and environment adaptation. Although applications using mobile terminals have attracted increasing attention, there are few studies that focus on distant-talking speech recognition with asynchronous mobile terminals. For the system proposed in this paper, after applying a denoising autoencoder in the cepstral domain of speech to suppress reverberation and performing Large Vocabulary Continuous Speech Recognition (LVCSR), we adopted automatic asynchronous mobile terminal selection and environment adaptation using speech segments from optimal mobile terminals. The proposed method was evaluated using a reverberant WSJCAMO corpus, which was emitted by a loudspeaker and recorded in a meeting room with multiple speakers by far-field multiple mobile terminals. By integrating a cepstral-domain denoising autoencoder and automatic mobile terminal selection with environment adaptation, the average Word Error Rate (WER) was reduced from 51.8% of the baseline system to 28.8%, i.e., the relative error reduction rate was 44.4% when using multi-condition acoustic models.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128533131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Analysis of customer communication by employee in restaurant and lead time estimation 分析餐厅员工与客户的沟通情况，预估交货期

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041701

Masanori Takehara, Hiroya Nojiri, S. Tamura, S. Hayamizu, T. Kurata

Human behavior sensing and their analysis are great role to improve service quality and education of employees. This paper shows novel frameworks of detection of customer communication and lead time estimation(LTE) by using multi-sensored data, sound data and accounting data in the restaurant. They are useful for management about work environments and problems for employees. Lead time from order to delivery shows the quality of the service for customers. We found sound data of an employee's speech is useful for these techniques by speech ratio smoothing and POS sound detection.

人的行为感知及其分析对提高服务质量和员工教育具有重要作用。本文展示了通过使用餐厅的多传感器数据、声音数据和会计数据来检测客户沟通和交货时间估计(LTE)的新框架。它们对管理工作环境和员工问题很有用。从订货到交货的交货期显示了为客户提供的服务质量。通过语音比例平滑和POS语音检测，我们发现员工的语音数据对这些技术是有用的。

引用次数: 1

Constrained design of FIR filters with sparse coefficients 稀疏系数FIR滤波器的约束设计

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041561

Ryo Matsuoka, T. Baba, M. Okuda

We present an algorithm for the constrained design of FIR filters with sparse coefficients. In general, the filter design approach aims to minimize a filter order and maximize the filter performance. Although the FIR filter coefficients designed by the least squares method is optimal in the least squares sense, it is not necessarily optimal among the set of filters with the same number of multipliers, that is, less mean squared error can be achieved by a filter that has the same number of multipliers, but has longer impulse response with some zero-valued entries. Our method minimizes the number of nonzero entries in the impulse response together with the least squares error of its frequency response. In addition, we incorporate some constraints to the design and realize better performance than conventional constrained least squares design.

提出了一种具有稀疏系数的FIR滤波器的约束设计算法。一般来说，滤波器设计方法的目标是最小化滤波器阶数和最大化滤波器性能。虽然用最小二乘法设计的FIR滤波器系数在最小二乘意义上是最优的，但在具有相同乘数的滤波器集合中并不一定是最优的，即具有相同乘数的滤波器可以获得较小的均方误差，但具有一些零值项的滤波器具有较长的脉冲响应。我们的方法将脉冲响应中的非零项的数量及其频率响应的最小二乘误差最小化。此外，我们在设计中加入了一些约束，实现了比传统约束最小二乘设计更好的性能。

引用次数: 3

Multiple sound source tracking and identification via degenerate unmixing estimation technique and cardinality balanced multi-target multi-bernoulli filter (DUET-CBMeMBer) with track management 利用退化解混估计技术和带轨道管理的基数平衡多目标多伯努利滤波器(DUET-CBMeMBer)对多声源进行跟踪和识别

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

Pub Date : 2014-12-01 DOI: 10.1109/APSIPA.2014.7041731

Nicholas Chong, Shanhung Wong, S. Nordholm, I. Murray

In Source Separation research, "cocktail party problem" is a challenging problem that research into source separation aims to solve. Many attempts have been made to solve this complex problem. A logical approach would be to break down this complex problem into several smaller problems which are solved in different stages - each considering various aspects. In this paper, we are providing a robust solution to a part of the problem by localizing and tracking multiple moving speech sources in a room environment. Here we study the separation problem for unknown number of moving sources. The DUET-CBMeMBer method we outline is capable of estimating the number of sound sources as well as tracking and labelling them. This paper proposes a track management technique that identifies sound sources based on their trajectory as an extension to the DUET-CBMeMBer technique.

在源分离研究中，“鸡尾酒会问题”是源分离研究要解决的一个具有挑战性的问题。为了解决这个复杂的问题，人们作了许多尝试。一个合乎逻辑的方法是把这个复杂的问题分解成几个较小的问题，在不同的阶段解决——每个阶段考虑不同的方面。在本文中，我们通过定位和跟踪房间环境中的多个移动语音源，为部分问题提供了一个鲁棒的解决方案。本文研究了未知数量运动源的分离问题。我们概述的DUET-CBMeMBer方法能够估计声源的数量以及跟踪和标记它们。本文提出了一种基于声源轨迹识别声源的轨道管理技术，作为DUET-CBMeMBer技术的扩展。

引用次数: 5

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀