首页 > 最新文献

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)最新文献

英文 中文
Highlighting root notes in chord recognition using cepstral features and multi-task learning 利用倒谱特征和多任务学习突出和弦识别中的根音
Mu Yang, Li Su, Yi-Hsuan Yang
A musical chord is usually described by its root note and the chord type. While a substantial amount of work has been done in the field of music information retrieval (MIR) to automate chord recognition, the role of root notes in this task has seldom received specific attention. In this paper, we present a new approach and empirical studies demonstrating improved accuracy in chord recognition by properly highlighting the information of the root notes. In the signal level, we propose to combine spectral features with features derived from the cepstrum to improve the identification of low pitches, which usually correspond to the root notes. In the model level, we propose a multi-task learning framework based on the neural nets to jointly consider chord recognition and root note recognition in training. We found that the improved accuracy can be attributed to better information about the sub-harmonics of the notes, and the emphasis of root notes in recognizing chords.
一个音乐和弦通常由它的根音和和弦类型来描述。虽然在音乐信息检索(MIR)领域已经做了大量的工作来自动识别和弦,但在这项任务中,词根音符的作用很少受到特别的关注。在本文中,我们提出了一种新的方法和实证研究表明,通过适当地突出根音符的信息,可以提高和弦识别的准确性。在信号层面,我们提出将频谱特征与倒谱衍生的特征相结合,以提高对低音的识别能力,因为低音通常与根音相对应。在模型层面,我们提出了一种基于神经网络的多任务学习框架,在训练中共同考虑和弦识别和根音识别。我们发现,准确度的提高可以归因于更好地了解音符的次谐波,以及识别和弦时根音的强调。
{"title":"Highlighting root notes in chord recognition using cepstral features and multi-task learning","authors":"Mu Yang, Li Su, Yi-Hsuan Yang","doi":"10.1109/APSIPA.2016.7820865","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820865","url":null,"abstract":"A musical chord is usually described by its root note and the chord type. While a substantial amount of work has been done in the field of music information retrieval (MIR) to automate chord recognition, the role of root notes in this task has seldom received specific attention. In this paper, we present a new approach and empirical studies demonstrating improved accuracy in chord recognition by properly highlighting the information of the root notes. In the signal level, we propose to combine spectral features with features derived from the cepstrum to improve the identification of low pitches, which usually correspond to the root notes. In the model level, we propose a multi-task learning framework based on the neural nets to jointly consider chord recognition and root note recognition in training. We found that the improved accuracy can be attributed to better information about the sub-harmonics of the notes, and the emphasis of root notes in recognizing chords.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126461814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Mesh-based image retargeting with spectral graph filtering 基于谱图滤波的网格图像重定位
Yuichi Tanaka, S. Yagyu, Akie Sakiyama, Masaki Onuki
We propose a calculation method of deformed image pixel positions for mesh-based image retargeting. Image retargeting is a sophisticated image resizing method which yields resized images with acceptable quality even if we resize the image into different aspect ratio from the original one. It often employs a mesh-based approach, where pixels are nodes of a graph and relationships between pixels are represented as its edges. In this paper, we reformulate a pixel position deformation of image retargeting as a spectral graph filtering with a graph signal processing-based approach. We validate our method through some image retargeting examples with an appropriately designed filter kernels in the graph spectral domain.
提出了一种基于网格的图像重定位中变形图像像素位置的计算方法。图像重定向是一种复杂的图像大小调整方法,即使我们将图像大小调整为与原始图像不同的宽高比,也能产生质量可接受的图像。它通常采用基于网格的方法,其中像素是图的节点,像素之间的关系表示为其边缘。本文采用基于图信号处理的方法,将图像重定位的像素位置变形重构为光谱图滤波。我们通过一些图像重定向实例验证了我们的方法,并在图谱域中设计了适当的滤波器核。
{"title":"Mesh-based image retargeting with spectral graph filtering","authors":"Yuichi Tanaka, S. Yagyu, Akie Sakiyama, Masaki Onuki","doi":"10.1109/APSIPA.2016.7820728","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820728","url":null,"abstract":"We propose a calculation method of deformed image pixel positions for mesh-based image retargeting. Image retargeting is a sophisticated image resizing method which yields resized images with acceptable quality even if we resize the image into different aspect ratio from the original one. It often employs a mesh-based approach, where pixels are nodes of a graph and relationships between pixels are represented as its edges. In this paper, we reformulate a pixel position deformation of image retargeting as a spectral graph filtering with a graph signal processing-based approach. We validate our method through some image retargeting examples with an appropriately designed filter kernels in the graph spectral domain.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127271640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discrete feature transform for low-complexity single-image super-resolution 低复杂度单图像超分辨率的离散特征变换
Jonghee Kim, Changick Kim
Dictionary-based super-resolution is actively studied with successful achievements. However, previous dictionary-based super-resolution methods exploit optimization or nearest neighbor search which has high complexity. In this paper, we propose a low-complexity super-resolution method called the discrete feature transform which performs feature extraction and nearest neighbor search at once. As a result, the proposed method achieves the lowest complexity among dictionary-based super-resolution methods with a comparable performance.
基于字典的超分辨技术得到了积极的研究,并取得了成功的成果。然而,以往基于字典的超分辨方法采用优化或最近邻搜索,其复杂度较高。本文提出了一种低复杂度的超分辨率方法——离散特征变换,它可以同时进行特征提取和最近邻搜索。结果表明,在性能相当的情况下,该方法是基于字典的超分辨率方法中复杂度最低的方法。
{"title":"Discrete feature transform for low-complexity single-image super-resolution","authors":"Jonghee Kim, Changick Kim","doi":"10.1109/APSIPA.2016.7820852","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820852","url":null,"abstract":"Dictionary-based super-resolution is actively studied with successful achievements. However, previous dictionary-based super-resolution methods exploit optimization or nearest neighbor search which has high complexity. In this paper, we propose a low-complexity super-resolution method called the discrete feature transform which performs feature extraction and nearest neighbor search at once. As a result, the proposed method achieves the lowest complexity among dictionary-based super-resolution methods with a comparable performance.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127469234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Frame rate up conversion for multiview video 帧率上升转换为多视图视频
Yoonmo Yang, Dohoon Lee, Byung Tae Oh
In this paper, we propose a new frame rate up conversion method for multiview video. The proposed method uses the depth map and neighboring view information for the improvement of motion estimation and compensation accuracy. In details, it decomposes a block into multiple layers with depth map. Then it estimates the occluded regions in the lower layer using their neighboring view information, which consequently leads more accurate motion estimation and compensation. The experimental results show that the proposed method highly improves the quality of the interpolated frames compared to the conventional methods.
本文提出了一种新的多视点视频帧率提升转换方法。该方法利用深度图和相邻视图信息,提高了运动估计和补偿精度。具体来说,它通过深度图将一个块分解成多个层。然后利用被遮挡区域的相邻视图信息对被遮挡区域进行估计,从而得到更精确的运动估计和补偿。实验结果表明,与传统插值方法相比,该方法大大提高了插值帧的质量。
{"title":"Frame rate up conversion for multiview video","authors":"Yoonmo Yang, Dohoon Lee, Byung Tae Oh","doi":"10.1109/APSIPA.2016.7820714","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820714","url":null,"abstract":"In this paper, we propose a new frame rate up conversion method for multiview video. The proposed method uses the depth map and neighboring view information for the improvement of motion estimation and compensation accuracy. In details, it decomposes a block into multiple layers with depth map. Then it estimates the occluded regions in the lower layer using their neighboring view information, which consequently leads more accurate motion estimation and compensation. The experimental results show that the proposed method highly improves the quality of the interpolated frames compared to the conventional methods.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"05 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127367435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Locality sensitive discriminant analysis for speaker verification 说话人验证的局部敏感判别分析
Danwei Cai, Weicheng Cai, Zhidong Ni, Ming Li
In this paper, we apply Locality Sensitive Discriminant Analysis (LSDA) to speaker verification system for intersession variability compensation. As opposed to LDA which fails to discover the local geometrical structure of the data manifold, LSDA finds a projection which maximizes the margin between i-vectors from different speakers at each local area. Since the number of samples varies in a wide range in each class, we improve LSDA by using adaptive k nearest neighbors in each class and modifying the corresponding within- and between-class weight matrix. In that way, each class has equal importance in LSDA's objective function. Experiments were carried out on the NIST 2010 speaker recognition evaluation (SRE) extended condition 5 female task, results show that our proposed adaptive k nearest neighbors based LSDA method significantly improves the conventional i-vector/PLDA baseline by 18% relative cost reduction and 28% relative equal error rate reduction.
本文将局域敏感判别分析(LSDA)应用于说话人验证系统的会话间可变性补偿。与LDA无法发现数据流形的局部几何结构相反,LSDA找到了一个投影,该投影使每个局部区域来自不同说话人的i向量之间的余量最大化。由于每个类的样本数量变化范围很大,我们通过在每个类中使用自适应k近邻并修改相应的类内和类间权重矩阵来改进LSDA。这样,每个类在LSDA的目标函数中具有同等的重要性。在NIST 2010 speaker recognition evaluation (SRE)扩展条件5女性任务上进行了实验,结果表明,我们提出的基于k近邻的自适应LSDA方法比传统的i-vector/PLDA基线显著降低了18%的相对成本和28%的相对平均错误率。
{"title":"Locality sensitive discriminant analysis for speaker verification","authors":"Danwei Cai, Weicheng Cai, Zhidong Ni, Ming Li","doi":"10.1109/APSIPA.2016.7820799","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820799","url":null,"abstract":"In this paper, we apply Locality Sensitive Discriminant Analysis (LSDA) to speaker verification system for intersession variability compensation. As opposed to LDA which fails to discover the local geometrical structure of the data manifold, LSDA finds a projection which maximizes the margin between i-vectors from different speakers at each local area. Since the number of samples varies in a wide range in each class, we improve LSDA by using adaptive k nearest neighbors in each class and modifying the corresponding within- and between-class weight matrix. In that way, each class has equal importance in LSDA's objective function. Experiments were carried out on the NIST 2010 speaker recognition evaluation (SRE) extended condition 5 female task, results show that our proposed adaptive k nearest neighbors based LSDA method significantly improves the conventional i-vector/PLDA baseline by 18% relative cost reduction and 28% relative equal error rate reduction.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114078379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Efficient deep neural networks for speech synthesis using bottleneck features 基于瓶颈特征的高效深度神经网络语音合成
Young-Sun Joo, Won-Suk Jun, Hong-Goo Kang
This paper proposes a cascading deep neural network (DNN) structure for speech synthesis system that consists of text-to-bottleneck (TTB) and bottleneck-to-speech (BTS) models. Unlike conventional single structure that requires a large database to find complicated mapping rules between linguistic and acoustic features, the proposed structure is very effective even if the available training database is inadequate. The bottle-neck feature utilized in the proposed approach represents the characteristics of linguistic features and its average acoustic features of several speakers. Therefore, it is more efficient to learn a mapping rule between bottleneck and acoustic features than to learn directly a mapping rule between linguistic and acoustic features. Experimental results show that the learning capability of the proposed structure is much higher than that of the conventional structures. Objective and subjective listening test results also verify the superiority of the proposed structure.
本文提出了一种用于语音合成系统的级联深度神经网络(DNN)结构,该结构由文本到瓶颈(TTB)和瓶颈到语音(BTS)模型组成。传统的单一结构需要庞大的数据库才能找到语言和声学特征之间复杂的映射规则,与之不同的是,即使可用的训练数据库不足,该结构也非常有效。该方法中使用的瓶颈特征代表了几个说话人的语言特征及其平均声学特征的特征。因此,学习瓶颈和声学特征之间的映射规则比直接学习语言和声学特征之间的映射规则更有效。实验结果表明,该结构的学习能力大大高于传统结构。客观和主观听力测试结果也验证了所提出结构的优越性。
{"title":"Efficient deep neural networks for speech synthesis using bottleneck features","authors":"Young-Sun Joo, Won-Suk Jun, Hong-Goo Kang","doi":"10.1109/APSIPA.2016.7820721","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820721","url":null,"abstract":"This paper proposes a cascading deep neural network (DNN) structure for speech synthesis system that consists of text-to-bottleneck (TTB) and bottleneck-to-speech (BTS) models. Unlike conventional single structure that requires a large database to find complicated mapping rules between linguistic and acoustic features, the proposed structure is very effective even if the available training database is inadequate. The bottle-neck feature utilized in the proposed approach represents the characteristics of linguistic features and its average acoustic features of several speakers. Therefore, it is more efficient to learn a mapping rule between bottleneck and acoustic features than to learn directly a mapping rule between linguistic and acoustic features. Experimental results show that the learning capability of the proposed structure is much higher than that of the conventional structures. Objective and subjective listening test results also verify the superiority of the proposed structure.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122412625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An improved simulated annealing genetic algorithm of EEG feature selection in sleep stage 睡眠阶段脑电信号特征选择的改进模拟退火遗传算法
Y. Ji, Xiangeng Bu, Jinwei Sun, Zhiyong Liu
In order to establish a more reliable and robust EEG model in sleep stages, the reasonable choice of modeling parameters is necessary. The function of this step is to select a subset of d features from a set of D features based on some optimization criterion, and provide the most optimal input features of classification. In the present study, an improved simulated annealing genetic algorithm (ISAGA) was proposed. 25 feature parameters were extracted from the sleep EEG in MIT-BIH polysomnography database. The feature selection results demonstrated that ISAGA can get a higher classification accuracy with fewer feature number than the correlation coefficient algorithm (CCA), genetic algorithm (GA), adaptive genetic algorithm (AGA) and simulated annealing genetic algorithm (SAGA). Compared to using all the features in sleep staging, the classification accuracy of ISAGA with optimal features is about 92.00%, which improved about 4.83%.
为了建立更可靠、鲁棒的睡眠阶段脑电模型,需要合理选择建模参数。这一步的功能是根据一定的优化准则从d个特征集中选择d个特征的子集,并提供最优的分类输入特征。提出了一种改进的模拟退火遗传算法(ISAGA)。从MIT-BIH多导睡眠图数据库中提取25个特征参数。特征选择结果表明,与相关系数算法(CCA)、遗传算法(GA)、自适应遗传算法(AGA)和模拟退火遗传算法(SAGA)相比,ISAGA能够以较少的特征个数获得更高的分类精度。与使用所有睡眠分期特征相比,使用最优特征的ISAGA分类准确率约为92.00%,提高了约4.83%。
{"title":"An improved simulated annealing genetic algorithm of EEG feature selection in sleep stage","authors":"Y. Ji, Xiangeng Bu, Jinwei Sun, Zhiyong Liu","doi":"10.1109/APSIPA.2016.7820683","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820683","url":null,"abstract":"In order to establish a more reliable and robust EEG model in sleep stages, the reasonable choice of modeling parameters is necessary. The function of this step is to select a subset of d features from a set of D features based on some optimization criterion, and provide the most optimal input features of classification. In the present study, an improved simulated annealing genetic algorithm (ISAGA) was proposed. 25 feature parameters were extracted from the sleep EEG in MIT-BIH polysomnography database. The feature selection results demonstrated that ISAGA can get a higher classification accuracy with fewer feature number than the correlation coefficient algorithm (CCA), genetic algorithm (GA), adaptive genetic algorithm (AGA) and simulated annealing genetic algorithm (SAGA). Compared to using all the features in sleep staging, the classification accuracy of ISAGA with optimal features is about 92.00%, which improved about 4.83%.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121724641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Color-distribution similarity by information theoretic divergence for color images 基于信息理论散度的彩色图像颜色分布相似度研究
Mizuki Murayama, Daisuke Oguro, H. Kikuchi, H. Huttunen, Yo-Sung Ho, Jaeho Shin
The divergence similarity between two color images is presented based on the Jensen-Shannon divergence to measure the color-distribution similarity. Subjective assessment experiments were developed to obtain mean opinion scores (MOS) of test images. It was found that the divergence similarity and MOS values showed statistically significant correlations.
基于Jensen-Shannon散度,提出了两幅彩色图像之间的散度相似度来衡量颜色分布的相似度。进行主观评价实验,得到测试图像的平均意见分数(MOS)。结果表明,差异相似度与MOS值具有显著的统计学相关性。
{"title":"Color-distribution similarity by information theoretic divergence for color images","authors":"Mizuki Murayama, Daisuke Oguro, H. Kikuchi, H. Huttunen, Yo-Sung Ho, Jaeho Shin","doi":"10.1109/APSIPA.2016.7820681","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820681","url":null,"abstract":"The divergence similarity between two color images is presented based on the Jensen-Shannon divergence to measure the color-distribution similarity. Subjective assessment experiments were developed to obtain mean opinion scores (MOS) of test images. It was found that the divergence similarity and MOS values showed statistically significant correlations.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122625817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Stereo matching using census transform of adaptive window sizes with gradient images 基于自适应窗口大小普查变换的梯度图像立体匹配
Jaeryun Ko, Yo-Sung Ho
The census transform in computing the matching cost of stereo matching is simple and robust under luminance variations in stereo image pairs; however, different disparity maps are generated depending on the shape and size of the census transform window. In this paper, we propose a stereo matching method with variable sizes of census transform windows based on the gradients of stereo images. Our experiment shows higher accuracy of disparity values in the area of depth discontinuities.
在立体图像对亮度变化的情况下,普查变换计算立体匹配的匹配代价简单、鲁棒;然而,根据人口普查变换窗口的形状和大小,会生成不同的视差图。本文提出了一种基于立体图像梯度的人口普查变换窗口大小可变的立体匹配方法。实验表明,深度不连续区域的视差值精度较高。
{"title":"Stereo matching using census transform of adaptive window sizes with gradient images","authors":"Jaeryun Ko, Yo-Sung Ho","doi":"10.1109/APSIPA.2016.7820827","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820827","url":null,"abstract":"The census transform in computing the matching cost of stereo matching is simple and robust under luminance variations in stereo image pairs; however, different disparity maps are generated depending on the shape and size of the census transform window. In this paper, we propose a stereo matching method with variable sizes of census transform windows based on the gradients of stereo images. Our experiment shows higher accuracy of disparity values in the area of depth discontinuities.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126815594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Adaptive subspace-constrained diagonal loading 自适应子空间约束对角加载
Yueh-Ting Tsai, B. Su, Yu Tsao, Syu-Siang Wang
Recently, a subspace-constrained diagonal loading (SSC-DL) method has been proposed for robust beamforming against the mismatched direction of arrival (DoA) issue. Although SSC-DL has outstanding output SINR performance, it is not clear how to choose the DL factor and subspace dimension in practice. The aim of the present study is to further investigate conditions on optimal parameters for SSC-DL and algorithms to determine them in realistic test conditions. First, we proposed to use the Capon power spectrum density to determine the desired signal power, which is then used to compute the optimal DL factor for SSC-DL. Next, a novel adaptive SSC-DL approach (adaptive-SSC-DL) is proposed, which can dynamically optimize the sub-space dimension based on the test conditions. Simulation results show that adaptive-SSC-DL provides higher output SINR than several existing methods and achieves comparable performance comparing to SSC-DL with ideal parameter setup.
近年来,针对到达方向不匹配问题,提出了一种子空间约束对角加载(SSC-DL)的鲁棒波束形成方法。虽然SSC-DL具有出色的输出SINR性能,但在实际中如何选择DL因子和子空间维数并不是很明确。本研究的目的是进一步研究SSC-DL最优参数的条件和在实际测试条件下确定它们的算法。首先,我们提出使用Capon功率谱密度来确定所需的信号功率,然后使用该功率谱密度来计算SSC-DL的最优DL因子。其次,提出了一种新的自适应SSC-DL方法(adaptive-SSC-DL),该方法可以根据测试条件动态优化子空间维度。仿真结果表明,自适应SSC-DL方法的输出信噪比高于几种现有方法,并且在理想参数设置下与SSC-DL方法性能相当。
{"title":"Adaptive subspace-constrained diagonal loading","authors":"Yueh-Ting Tsai, B. Su, Yu Tsao, Syu-Siang Wang","doi":"10.1109/APSIPA.2016.7820704","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820704","url":null,"abstract":"Recently, a subspace-constrained diagonal loading (SSC-DL) method has been proposed for robust beamforming against the mismatched direction of arrival (DoA) issue. Although SSC-DL has outstanding output SINR performance, it is not clear how to choose the DL factor and subspace dimension in practice. The aim of the present study is to further investigate conditions on optimal parameters for SSC-DL and algorithms to determine them in realistic test conditions. First, we proposed to use the Capon power spectrum density to determine the desired signal power, which is then used to compute the optimal DL factor for SSC-DL. Next, a novel adaptive SSC-DL approach (adaptive-SSC-DL) is proposed, which can dynamically optimize the sub-space dimension based on the test conditions. Simulation results show that adaptive-SSC-DL provides higher output SINR than several existing methods and achieves comparable performance comparing to SSC-DL with ideal parameter setup.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129176065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1