2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)最新文献

英文中文

Mining user interests from social media by fusing textual and visual features 通过融合文本和视觉特征从社交媒体中挖掘用户兴趣

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820713

Fang-Yu Chao, Jia Xu, Chia-Wen Lin

In this paper, we propose a framework that fuses textual and visual features of user generated social media data to mine the distribution of user interests. The proposed framework consists of three steps: feature extraction, model training, and user interest mining. We choose boards from popular users on Pinterest to collect training and test data. For each pin we extract the term-document matrices as textual features, bag of visual words as low-level visual features, and attributes as mid-level visual features. Representative features are then selected for training topic models using discriminative latent Dirichlet allocation (DLDA). In performance evaluation, pins collected from popular users are used to evaluate the classification accuracy and pins collected from other common users are used to evaluate the recommendation performance. Our experimental results show the efficacy of the proposed method.

在本文中，我们提出了一个融合用户生成的社交媒体数据的文本和视觉特征来挖掘用户兴趣分布的框架。该框架包括三个步骤:特征提取、模型训练和用户兴趣挖掘。我们从Pinterest上的热门用户中选择板来收集训练和测试数据。对于每个引脚，我们提取术语-文档矩阵作为文本特征，提取视觉词包作为低级视觉特征，提取属性作为中级视觉特征。然后使用判别潜狄利克雷分配(DLDA)为训练主题模型选择代表性特征。在性能评估中，从热门用户收集的pin用于评估分类准确性，从其他普通用户收集的pin用于评估推荐性能。实验结果表明了该方法的有效性。

引用次数: 0

Time-of-flight image enhancement for depth map generation 用于深度图生成的飞行时间图像增强

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820774

Yunseok Song, Yo-Sung Ho

Time-of-Flight (ToF) cameras are easily accessible in this era. They capture real distances of objects in a controlled environment. Yet, the ToF image may include disconnected boundaries between objects. In addition, certain objects are not capable of reflecting the infrared ray such as black hair. Such problems are caused by the physics of ToF. This paper proposes a method to compensate such errors by replacing them with reasonable distance data. The proposed method employs object boundary filtering, outlier elimination and iterative min/max averaging. After acquiring the enhanced ToF image, this can be applied to depth map generation by using the ToF camera with other color cameras. The experiment results show improved ToF images which lead to more accurate depth maps.

在这个时代，飞行时间(ToF)相机很容易获得。它们在受控环境中捕捉物体的真实距离。然而，ToF图像可能包括物体之间不连接的边界。此外，某些物体不能反射红外线，比如黑色的头发。这些问题是由ToF的物理特性引起的。本文提出了一种用合理的距离数据来补偿这些误差的方法。该方法采用目标边界滤波、离群值消除和最小/最大迭代平均。在获得增强的ToF图像后，可以将ToF相机与其他彩色相机一起使用，用于生成深度图。实验结果表明，改进后的ToF图像可以获得更精确的深度图。

引用次数: 1

Voice-pathology analysis based on AR-HMM 基于AR-HMM的语音病理分析

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820679

A. Sasou

Voice-pathology detection from a subject's voice is a promising technology for pre-diagnosis of larynx diseases. Glottal source estimation in particular plays a very important role in voice-pathology analysis. For more accurate estimation of the spectral envelope and glottal source of the pathology voice, we propose a method that can automatically generate the topology of the glottal source Hidden Markov Model (HMM), as well as estimate the Auto-Regressive (AR)-HMM parameter by combining AR-HMM parameter estimation and the Minimum Description Length-based Successive State Splitting (MDL-SSS) algorithm. The AR-HMM adopts a single Gaussian distribution for the output Probability Distribution Function (PDF) of each state in the glottal source HMM. In this paper, we propose a novel voice-pathology detection method based on the AR-HMM with automatic topology generation, which utilizes the output PDF variances normalized with regard to the maximum variance as clues for voice-pathology detection. We experimentally demonstrate that for normal voices, other normalized variances are distributed around a lower range than the maximum variance. This is because the PDF of the state just following vocal fold closure tends to have a maximum variance far greater than other variances. For pathology voices, the maximum variance and other variances are more closely distributed than for normal voices, possibly due to air leaking through the vocal folds. The experiment results confirmed the feasibility and fundamental validity of the proposed method.

从被试者的声音中进行语音病理检测是一种很有前途的喉疾病预诊断技术。声门源估计在语音病理分析中起着非常重要的作用。为了更准确地估计病理语音的频谱包络和声门源，我们提出了一种自动生成声门源隐马尔可夫模型(HMM)拓扑的方法，并将AR-HMM参数估计与基于最小描述长度的连续状态分裂(MDL-SSS)算法相结合，估计自回归(AR)-HMM参数。AR-HMM对声门源HMM中每个状态的输出概率分布函数(PDF)采用单高斯分布。本文提出了一种基于AR-HMM的语音病理检测方法，该方法利用输出的PDF方差按方差最大值归一化作为语音病理检测的线索。我们通过实验证明，对于正常的声音，其他归一化方差分布在比最大方差更低的范围内。这是因为声带闭合后状态的PDF的最大方差远远大于其他方差。对于病理声音，最大方差和其他方差的分布比正常声音更紧密，可能是由于空气通过声带泄漏。实验结果证实了该方法的可行性和基本有效性。

{"title":"Voice-pathology analysis based on AR-HMM","authors":"A. Sasou","doi":"10.1109/APSIPA.2016.7820679","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820679","url":null,"abstract":"Voice-pathology detection from a subject's voice is a promising technology for pre-diagnosis of larynx diseases. Glottal source estimation in particular plays a very important role in voice-pathology analysis. For more accurate estimation of the spectral envelope and glottal source of the pathology voice, we propose a method that can automatically generate the topology of the glottal source Hidden Markov Model (HMM), as well as estimate the Auto-Regressive (AR)-HMM parameter by combining AR-HMM parameter estimation and the Minimum Description Length-based Successive State Splitting (MDL-SSS) algorithm. The AR-HMM adopts a single Gaussian distribution for the output Probability Distribution Function (PDF) of each state in the glottal source HMM. In this paper, we propose a novel voice-pathology detection method based on the AR-HMM with automatic topology generation, which utilizes the output PDF variances normalized with regard to the maximum variance as clues for voice-pathology detection. We experimentally demonstrate that for normal voices, other normalized variances are distributed around a lower range than the maximum variance. This is because the PDF of the state just following vocal fold closure tends to have a maximum variance far greater than other variances. For pathology voices, the maximum variance and other variances are more closely distributed than for normal voices, possibly due to air leaking through the vocal folds. The experiment results confirmed the feasibility and fundamental validity of the proposed method.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"150 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113960604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

The covariance selection quality for graphs with junction trees through AUC bounds 通过AUC边界的连接树图的协方差选择质量

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820837

N. T. Khajavi, A. Kuh

We conduct a study of graphical models and discuss the quality of model selection approximation by formulating the problem as a detection problem and examine the area under the curve (AUC). We are specifically looking at the model selection problem for jointly Gaussian random vectors. For Gaussian distributions, this problem simplifies to the covariance selection problem which is widely discussed in literature by Dempster [1]. In this paper, we discuss graphical models such as the pth order Markov chain and the pth order star network interpretation which also have junction tree graphical representations and give the definition for the correlation approximation matrix (CAM) which contains all information about the model selection problem. We compute the model covariance matrix as well as the KL divergence between the original distribution and the approximated model distribution. We conduct some simulations which show that the quality of the selected model increases as the model order, p, increases.

我们对图形模型进行了研究，并通过将问题表述为检测问题并检查曲线下面积(AUC)来讨论模型选择近似的质量。我们特别关注联合高斯随机向量的模型选择问题。对于高斯分布，该问题简化为协方差选择问题，Dempster[1]在文献中进行了广泛的讨论。本文讨论了具有连接树图形表示的p阶马尔可夫链和p阶星形网络解释等图形模型，并给出了包含模型选择问题所有信息的相关逼近矩阵的定义。我们计算了模型协方差矩阵以及原始分布和近似模型分布之间的KL散度。仿真结果表明，所选模型的质量随着模型阶数p的增加而提高。

引用次数: 3

Saliency aware fast intra coding algorithm for HEVC HEVC显著性感知快速内编码算法

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820706

Liyuan Xiong, Wei Zhou, Xin Zhou, Guanwen Zhang, Ai Qing

In this paper, a saliency aware fast intra coding algorithm for HEVC is proposed consists of perceptual intra coding and fast intra prediction mode decision algorithm. Firstly, based on the visual saliency detection, an adaptive CU splitting method is proposed to reduce intra encoding complexity. Furthermore, quantization parameter is adaptively adjusted at the CU level according to the relative importance of each CU and distortion is efficiently controlled. Secondly, a fast intra prediction mode decision algorithm with step halving rough mode decision method and early modes pruning algorithm is presented to selectively check the potential modes and effectively reduce the complexity of computation. Experimental results show that 45.39% encoding time can be reduced by the proposed saliency aware fast intra coding algorithm. Furthermore, our proposed algorithm can achieves 2.18% bit rate reduction on average with negligible perceptual quality loss.

本文提出了一种基于显著性感知的HEVC快速帧内编码算法，该算法由感知帧内编码和帧内预测模式快速决策算法组成。首先，在视觉显著性检测的基础上，提出了一种自适应图像分割方法，以降低图像内编码复杂度;此外，该方法还能根据各单元的相对重要性自适应调整量化参数，有效地控制失真。其次，提出了一种采用步长减半粗模式决策方法和早期模式剪枝算法的快速内预测模式决策算法，选择性地检查潜在模式，有效地降低了计算复杂度;实验结果表明，基于显著性感知的快速帧内编码算法可将编码时间缩短45.39%。此外，我们提出的算法可以实现2.18%的平均比特率降低，而感知质量损失可以忽略不计。

引用次数: 3

Codestream level secure identification for JPEG 2000 images under various compression ratios 不同压缩比下JPEG 2000图像的码流级安全识别

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820726

Kenta Iida, H. Kiya

A secure identification scheme for JPEG 2000 code-streams is proposed in this paper. The aim is to securely identify JPEG 2000 images generated from the same original image, without decoding images. Features used for the identification are extracted from header parts in a JPEG 2000 codestream. The proposed scheme does not provide any false negative matches under various compression ratios, while most of image hashing-based schemes do not guarantee this performance. Existing identification schemes that do not provide any false negative matches can not be securely carried out. Due to such a situation, we propose an identification system based on a fuzzy commitment scheme, which is a well-known secure protocol for biometric template protection. Moreover, an error correction technique with 1-bit parity is considered to achieve the system. The experiment results show the proposed scheme is effective in terms of true positive matches, while keeping the security high.

提出了一种针对jpeg2000码流的安全识别方案。目的是在不解码图像的情况下，安全地识别由相同原始图像生成的JPEG 2000图像。用于标识的特征是从JPEG 2000码流的标头部分提取出来的。该方案在各种压缩比下都不提供任何假负匹配，而大多数基于图像哈希的方案都不能保证这种性能。现有的不提供任何假阴性匹配的识别方案无法安全地进行。针对这种情况，我们提出了一种基于模糊承诺方案的识别系统，这是一种众所周知的生物特征模板保护安全协议。此外，还考虑了采用1位奇偶校验的纠错技术来实现该系统。实验结果表明，该方案在保证高安全性的同时，在真正匹配方面是有效的。

引用次数: 3

Region based stereo matching method with gradient and distance information 基于梯度和距离信息的区域立体匹配方法

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820897

Yong-Jun Chang, Yo-Sung Ho

Stereo matching methods estimate depth information of captured images. One way to estimate accurate depth values is to use the distance information. This method enhances the disparity map by preserving the edge region. In order to preserve the depth discontinuity near the edge region, it uses the distance information as a new weighting value for the matching cost function. However, this method has a high complexity problem. To overcome this problem, we propose region based stereo matching method with gradient and distance information. Since the distance transform calculates the pixel distance from the edge region, we can classify whether the pixel is near the edge region or not. In other words, some regions near the edge have small distance transformed values. For this reason, our method divides regions depending on the value of distance transformed pixel. After that, different cost functions are applied to each region for improving the computation efficiency.

立体匹配方法估计捕获图像的深度信息。估计准确深度值的一种方法是使用距离信息。该方法通过保留边缘区域来增强视差图。为了保持边缘区域附近的深度不连续，它使用距离信息作为匹配代价函数的新的加权值。然而，这种方法有一个高复杂度的问题。为了克服这一问题，我们提出了一种基于梯度和距离信息的区域立体匹配方法。由于距离变换计算的是像素到边缘区域的距离，所以我们可以对像素是否靠近边缘区域进行分类。换句话说，边缘附近的一些区域具有较小的距离变换值。因此，我们的方法根据距离变换像素的值来划分区域。然后，在每个区域应用不同的代价函数，以提高计算效率。

引用次数: 0

Direction-of-arrival estimation via Khatri-Rao subspace using compressed sensing 基于压缩感知的Khatri-Rao子空间到达方向估计

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820807

Hirotaka Mukumoto, K. Hayashi, Megumi Kaneko

Direction-of-arrival (DOA) estimation via Khatri-Rao (KR) subspace with multiple signal classification (MUSIC) algorithm can cope with a higher number of incoming waves than that of sensors, while it requires the signals to be quasi-stationary and needs a larger number of “frames” than that of incoming waves. On the other hand, a hybrid approach of compressed sensing and MUSIC algorithm can estimate DOAs with snapshots less than the number of sources for noiseless observation, although the number of incoming waves must be less than that of sensors. Exploiting the fact that the frame in MUSIC via KR subspace corresponds to the snapshot in conventional MUSIC without observation noise, we propose a DOA estimation scheme using KR product array processing and compressed sensing, which can cope with a greater number of incoming waves than both that of sensors and that of frames. The validity of the proposed method is shown via numerical experiments.

基于Khatri-Rao (KR)子空间的多信号分类(MUSIC)算法的到达方向(DOA)估计可以处理比传感器更多的传入波，但它要求信号是准平稳的，并且需要比传入波更多的“帧数”。另一方面，压缩感知和MUSIC算法的混合方法可以在快照少于无噪声观测源数量的情况下估计doa，尽管输入波的数量必须少于传感器的数量。利用KR子空间MUSIC中的帧与传统MUSIC中的无观测噪声的快照相对应的事实，提出了一种基于KR积阵列处理和压缩感知的DOA估计方案，该方案比传感器和帧都能处理更多的传入波。数值实验验证了该方法的有效性。

引用次数: 1

Shape-adaptive image compression using lossy shape coding, SA-prediction, and SA-deblocking 使用有损形状编码、sa预测和sa块的形状自适应图像压缩

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820772

Li-Ang Chen, Jian-Jiun Ding, Yih-Cherng Lee

As the annoying blocking or ghost artifacts tend to appear in the conventional compression approaches either in the JPEG or JPEG2000 standards at low bitrate, the concept of the object-oriented image compression is proposed. This kind of methods is able to retain the image structural boundaries and therefore has relatively good visual qualities even in high compression ratios. In this paper, we propose a shape-adaptive image compression scheme employing an efficient lossy contour compression algorithm to encode the region information, which is usually the main overhead data in such systems. In addition, the prediction and deblocking techniques commonly used in novel compression approaches are also applied with the proposed shape-adaptive versions. Simulation results suggest that the proposed compression system is able to provide compressed images with much better visual qualities and more reasonable degradation forms compared to other prevailing methods.

针对JPEG或JPEG2000标准中传统压缩方法在低比特率下容易出现烦人的块或鬼影现象，提出了面向对象图像压缩的概念。这种方法能够保留图像的结构边界，因此即使在高压缩比下也具有较好的视觉质量。在本文中，我们提出了一种形状自适应图像压缩方案，采用一种有效的有损轮廓压缩算法来编码区域信息，这通常是这类系统中主要的开销数据。此外，在新的压缩方法中常用的预测和去块技术也被应用于所提出的形状自适应版本。仿真结果表明，与其他流行的压缩方法相比，所提出的压缩系统能够提供更好的视觉质量和更合理的退化形式的压缩图像。

引用次数: 2

Classification of footstep attributes using a vibration sensor 用振动传感器对脚步属性进行分类

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Pub Date : 2016-12-01 DOI: 10.1109/APSIPA.2016.7820756

F. Asano, Miyuki Fukushima

The evacuation of children and the elderly from disaster areas is sometimes difficult. This study aims to use a vibration sensor to estimate situations involving people who remain in a devastated building. This paper proposes a method to estimate the attributes of the people, such as their age or sex, based on the vibration data produced by their footsteps. The vibration data obtained through sensors are analyzed by a linear prediction method to extract the features, which are then classified using a support vector machine to estimate the attributes. The experimental results show that an accuracy of 80–95% was achieved for the classification of the sex and the type of shoes.

从灾区疏散儿童和老人有时是很困难的。这项研究的目的是使用振动传感器来估计在被摧毁的建筑物中仍然存在的人的情况。本文提出了一种基于人的脚步产生的振动数据来估计人的年龄或性别等属性的方法。对传感器获得的振动数据采用线性预测方法进行分析，提取特征，然后使用支持向量机进行分类，估计属性。实验结果表明，该方法对性别和鞋型的分类准确率达到80-95%。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀