首页 > 最新文献

2013 2nd IAPR Asian Conference on Pattern Recognition最新文献

英文 中文
Planar Segmentation from Point Clouds via Graph Laplacian Regularized K-Planes 基于图拉普拉斯正则k平面的点云平面分割
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.15
Wei Sui, Lingfeng Wang, Huai-Yu Wu, Chunhong Pan
Extracting planar surfaces from 3D point clouds is an important and challenging step for generating building models as the obtained data are always noisy, missing and unorganised. In this paper, we present a novel graph Laplacian regularized K-planes method for segmenting piece-wise planar surfaces of urban building point clouds. The core ideas behind our model are from two aspects: 1) a linear projection model is utilized to fit planar surfaces globally, 2) a graph Laplacian regularization is applied to preserve smoothness of each plane locally. The two terms are combined as an objective function, which is minimized via an iterative updating algorithm. Comparative experiments on both synthetic and real data sets are performed. The results demonstrate the effectiveness and efficiency of our method.
从三维点云中提取平面表面是生成建筑模型的一个重要且具有挑战性的步骤,因为所获得的数据总是有噪声、缺失和无组织的。本文提出了一种新的图拉普拉斯正则k平面分割方法,用于城市建筑点云的逐块平面分割。该模型的核心思想来自两个方面:1)利用线性投影模型对平面进行全局拟合;2)利用图拉普拉斯正则化来保持各平面的局部光滑性。将这两项组合为一个目标函数,并通过迭代更新算法将其最小化。在合成数据集和真实数据集上进行了对比实验。结果表明了该方法的有效性和高效性。
{"title":"Planar Segmentation from Point Clouds via Graph Laplacian Regularized K-Planes","authors":"Wei Sui, Lingfeng Wang, Huai-Yu Wu, Chunhong Pan","doi":"10.1109/ACPR.2013.15","DOIUrl":"https://doi.org/10.1109/ACPR.2013.15","url":null,"abstract":"Extracting planar surfaces from 3D point clouds is an important and challenging step for generating building models as the obtained data are always noisy, missing and unorganised. In this paper, we present a novel graph Laplacian regularized K-planes method for segmenting piece-wise planar surfaces of urban building point clouds. The core ideas behind our model are from two aspects: 1) a linear projection model is utilized to fit planar surfaces globally, 2) a graph Laplacian regularization is applied to preserve smoothness of each plane locally. The two terms are combined as an objective function, which is minimized via an iterative updating algorithm. Comparative experiments on both synthetic and real data sets are performed. The results demonstrate the effectiveness and efficiency of our method.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114665793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consistent Segmentation Based Color Correction for Coarsely Registered Images 基于一致分割的粗配准图像颜色校正
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.72
Haoxing Wang, Longquan Dai, Xiaopeng Zhang
Local color correction methods transfer colors between corresponding regions. However, inconsistent segmentation between the source image and the target image tends to degrade the correction result. In this paper, we propose a local color correction technique for coarsely registered images. In the segmentation step, it enforces the consistent segmentation on both source and target images to alleviate the inaccurate registration problem. In the color transfer step, it uses the region confidences and the bilateral-filter-like color influence maps to improve the color correction result. The experiment shows the proposed method achieves improved color correction results compared with the global methods and the recent local color correction methods.
局部色彩校正方法在相应区域之间传递色彩。然而,源图像和目标图像分割不一致,容易降低校正结果。本文提出了一种用于粗配准图像的局部色彩校正技术。在分割步骤中,对源图像和目标图像进行一致性分割,以缓解配准不准确的问题。在颜色转移步骤中,使用区域置信度和类似于双边滤波器的颜色影响图来改善颜色校正结果。实验表明,与全局方法和当前的局部色彩校正方法相比,该方法取得了更好的色彩校正效果。
{"title":"Consistent Segmentation Based Color Correction for Coarsely Registered Images","authors":"Haoxing Wang, Longquan Dai, Xiaopeng Zhang","doi":"10.1109/ACPR.2013.72","DOIUrl":"https://doi.org/10.1109/ACPR.2013.72","url":null,"abstract":"Local color correction methods transfer colors between corresponding regions. However, inconsistent segmentation between the source image and the target image tends to degrade the correction result. In this paper, we propose a local color correction technique for coarsely registered images. In the segmentation step, it enforces the consistent segmentation on both source and target images to alleviate the inaccurate registration problem. In the color transfer step, it uses the region confidences and the bilateral-filter-like color influence maps to improve the color correction result. The experiment shows the proposed method achieves improved color correction results compared with the global methods and the recent local color correction methods.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"22 3 Suppl 7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115350663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Vehicle Detection in Satellite Images by Parallel Deep Convolutional Neural Networks 基于并行深度卷积神经网络的卫星图像车辆检测
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.33
Xueyun Chen, Shiming Xiang, Cheng-Lin Liu, Chunhong Pan
Deep convolutional Neural Networks (DNN) is the state-of-the-art machine learning method. It has been used in many recognition tasks including handwritten digits, Chinese words and traffic signs, etc. However, training and test DNN are time-consuming tasks. In practical vehicle detection application, both speed and accuracy are required. So increasing the speeds of DNN while keeping its high accuracy has significant meaning for many recognition and detection applications. We introduce parallel branches into the DNN. The maps of the layers of DNN are divided into several parallel branches, each branch has the same number of maps. There are not direct connections between different branches. Our parallel DNN (PNN) keeps the same structure and dimensions of the DNN, reducing the total number of connections between maps. The more number of branches we divide, the more swift the speed of the PNN is, the conventional DNN becomes a special form of PNN which has only one branch. Experiments on large vehicle database showed that the detection accuracy of PNN dropped slightly with the speed increasing. Even the fastest PNN (10 times faster than DNN), whose branch has only two maps, fully outperformed the traditional methods based on features (such as HOG, LBP). In fact, PNN provides a good solution way for compromising the speed and accuracy requirements in many applications.
深度卷积神经网络(DNN)是最先进的机器学习方法。它已被用于许多识别任务,包括手写数字、中文单词和交通标志等。然而,训练和测试深度神经网络是一项耗时的任务。在实际的车辆检测应用中,对速度和精度都有要求。因此,提高深度神经网络的速度,同时保持其高精度,对许多识别和检测应用具有重要意义。我们在DNN中引入平行分支。DNN各层的地图被分成几个平行的分支,每个分支有相同数量的地图。不同的分支之间没有直接的联系。我们的并行深度神经网络(PNN)保持了DNN的相同结构和维度,减少了映射之间的连接总数。分割的分支数越多,PNN的速度越快,传统深度神经网络成为只有一个分支的PNN的一种特殊形式。在大型车辆数据库上的实验表明,随着速度的增加,PNN的检测精度略有下降。即使是最快的PNN(比DNN快10倍),其分支只有两个地图,也完全优于基于特征的传统方法(如HOG, LBP)。事实上,PNN为许多应用中对速度和精度要求的妥协提供了一种很好的解决方案。
{"title":"Vehicle Detection in Satellite Images by Parallel Deep Convolutional Neural Networks","authors":"Xueyun Chen, Shiming Xiang, Cheng-Lin Liu, Chunhong Pan","doi":"10.1109/ACPR.2013.33","DOIUrl":"https://doi.org/10.1109/ACPR.2013.33","url":null,"abstract":"Deep convolutional Neural Networks (DNN) is the state-of-the-art machine learning method. It has been used in many recognition tasks including handwritten digits, Chinese words and traffic signs, etc. However, training and test DNN are time-consuming tasks. In practical vehicle detection application, both speed and accuracy are required. So increasing the speeds of DNN while keeping its high accuracy has significant meaning for many recognition and detection applications. We introduce parallel branches into the DNN. The maps of the layers of DNN are divided into several parallel branches, each branch has the same number of maps. There are not direct connections between different branches. Our parallel DNN (PNN) keeps the same structure and dimensions of the DNN, reducing the total number of connections between maps. The more number of branches we divide, the more swift the speed of the PNN is, the conventional DNN becomes a special form of PNN which has only one branch. Experiments on large vehicle database showed that the detection accuracy of PNN dropped slightly with the speed increasing. Even the fastest PNN (10 times faster than DNN), whose branch has only two maps, fully outperformed the traditional methods based on features (such as HOG, LBP). In fact, PNN provides a good solution way for compromising the speed and accuracy requirements in many applications.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"314 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122751171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 70
Real-Time Binary Descriptor Based Background Modeling 基于实时二进制描述符的背景建模
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.125
Wan-Chen Liu, Shu-Zhe Lin, Min-Hsiang Yang, Chun-Rong Huang
In this paper, we propose a new binary descriptor based background modeling approach which is robust to lighting changes and dynamic backgrounds in the environment. Instead of using traditional parametric models, our background models are constructed by background instances using binary descriptors computed from observed backgrounds. As shown in the experiments, our method can achieve better foreground detection results and fewer false alarms compared to the state-of-the-art methods.
本文提出了一种新的基于二元描述符的背景建模方法,该方法对光照变化和环境中的动态背景具有鲁棒性。我们的背景模型不是使用传统的参数模型,而是使用从观测背景中计算出的二进制描述符来构建背景实例。实验表明,与现有的方法相比,我们的方法可以获得更好的前景检测结果,并且可以减少误报。
{"title":"Real-Time Binary Descriptor Based Background Modeling","authors":"Wan-Chen Liu, Shu-Zhe Lin, Min-Hsiang Yang, Chun-Rong Huang","doi":"10.1109/ACPR.2013.125","DOIUrl":"https://doi.org/10.1109/ACPR.2013.125","url":null,"abstract":"In this paper, we propose a new binary descriptor based background modeling approach which is robust to lighting changes and dynamic backgrounds in the environment. Instead of using traditional parametric models, our background models are constructed by background instances using binary descriptors computed from observed backgrounds. As shown in the experiments, our method can achieve better foreground detection results and fewer false alarms compared to the state-of-the-art methods.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122795885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Learning Fingerprint Orientation Fields Using Continuous Restricted Boltzmann Machines 使用连续受限玻尔兹曼机学习指纹方向场
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.37
M. Sahasrabudhe, A. Namboodiri
We aim to learn local orientation field patterns in fingerprints and correct distorted field patterns in noisy fingerprint images. This is formulated as a learning problem and achieved using two continuous restricted Boltzmann machines. The learnt orientation fields are then used in conjunction with traditional Gabor based algorithms for fingerprint enhancement. Orientation fields extracted by gradient-based methods are local, and do not consider neighboring orientations. If some amount of noise is present in a fingerprint, then these methods perform poorly when enhancing the image, affecting fingerprint matching. This paper presents a method to correct the resulting noisy regions over patches of the fingerprint by training two continuous restricted Boltzmann machines. The continuous RBMs are trained with clean fingerprint images and applied to overlapping patches of the input fingerprint. Experimental results show that one can successfully restore patches of noisy fingerprint images.
我们的目标是学习指纹的局部方向场模式,并校正噪声指纹图像中的畸变场模式。这被表述为一个学习问题,并使用两个连续受限玻尔兹曼机来实现。然后将学习到的方向场与传统的基于Gabor的指纹增强算法结合使用。基于梯度的方法提取的方向场是局部的,不考虑相邻的方向。如果指纹中存在一定数量的噪声,那么这些方法在增强图像时表现不佳,影响指纹匹配。本文提出了一种通过训练两个连续受限玻尔兹曼机来校正指纹图像斑块上产生的噪声区域的方法。用干净的指纹图像训练连续rbm,并将其应用于输入指纹的重叠块上。实验结果表明,该方法可以成功地恢复带有噪声的指纹图像。
{"title":"Learning Fingerprint Orientation Fields Using Continuous Restricted Boltzmann Machines","authors":"M. Sahasrabudhe, A. Namboodiri","doi":"10.1109/ACPR.2013.37","DOIUrl":"https://doi.org/10.1109/ACPR.2013.37","url":null,"abstract":"We aim to learn local orientation field patterns in fingerprints and correct distorted field patterns in noisy fingerprint images. This is formulated as a learning problem and achieved using two continuous restricted Boltzmann machines. The learnt orientation fields are then used in conjunction with traditional Gabor based algorithms for fingerprint enhancement. Orientation fields extracted by gradient-based methods are local, and do not consider neighboring orientations. If some amount of noise is present in a fingerprint, then these methods perform poorly when enhancing the image, affecting fingerprint matching. This paper presents a method to correct the resulting noisy regions over patches of the fingerprint by training two continuous restricted Boltzmann machines. The continuous RBMs are trained with clean fingerprint images and applied to overlapping patches of the input fingerprint. Experimental results show that one can successfully restore patches of noisy fingerprint images.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126147391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Automatic Elements Extraction of Chinese Web News Using Prior Information of Content and Structure 基于内容和结构先验信息的中文网络新闻元素自动提取
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.52
Chengru Song, Shifeng Weng, Changshui Zhang
We propose a set of efficient processes for extracting all four elements of Chinese news web pages, namely news title, release date, news source and the main text. Our approach is based on a deep analysis of content and structure features of current Chinese news. We take content indicators as the key to recover tree structure of the main text. Additionally, we come up with the concept of Length-Distance Ratio to help improve performance. Our method rarely depends on selection of samples and has strong generalization ability regardless of training process, distinguishing itself from most existing methods. We have tested our approach on 1721 labeled Chinese news pages from 429 web sites. Results show that an 87% accuracy was achieved for news source extraction, and over 95% accuracy for other three elements.
我们提出了一套高效的中文新闻网页四要素提取流程,即新闻标题、发布日期、新闻来源和正文。我们的方法是基于对当前中国新闻内容和结构特征的深入分析。我们将内容指标作为恢复正文树状结构的关键。此外,我们提出了长距离比的概念,以帮助提高性能。我们的方法很少依赖于样本的选择,无论训练过程如何,都具有较强的泛化能力,区别于现有的大多数方法。我们在429个网站的1721个有标签的中文新闻页面上测试了我们的方法。结果表明,新闻源提取的准确率达到87%,其他三个元素的准确率超过95%。
{"title":"Automatic Elements Extraction of Chinese Web News Using Prior Information of Content and Structure","authors":"Chengru Song, Shifeng Weng, Changshui Zhang","doi":"10.1109/ACPR.2013.52","DOIUrl":"https://doi.org/10.1109/ACPR.2013.52","url":null,"abstract":"We propose a set of efficient processes for extracting all four elements of Chinese news web pages, namely news title, release date, news source and the main text. Our approach is based on a deep analysis of content and structure features of current Chinese news. We take content indicators as the key to recover tree structure of the main text. Additionally, we come up with the concept of Length-Distance Ratio to help improve performance. Our method rarely depends on selection of samples and has strong generalization ability regardless of training process, distinguishing itself from most existing methods. We have tested our approach on 1721 labeled Chinese news pages from 429 web sites. Results show that an 87% accuracy was achieved for news source extraction, and over 95% accuracy for other three elements.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129354926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Saliency Detection Using Color Spatial Variance Weighted Graph Model 基于颜色空间方差加权图模型的显著性检测
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.93
Xiaoyun Yan, Yuehuang Wang, Mengmeng Song, Man Jiang
Saliency detection as a recently active research field of computer vision, has a wide range of applications, such as pattern recognition, image retrieval, adaptive compression, target detection, etc. In this paper, we propose a saliency detection method based on color spatial variance weighted graph model, which is designed rely on a background prior. First, the original image is partitioned into small patches, then we use mean-shift clustering algorithm on this patches to get sorts of clustering centers that represents the main colors of whole image. In modeling stage, all patches and the clustering centers are denoted as nodes on a specific graph model. The saliency of each patch is defined as weighted sum of weights on shortest paths from the patch to all clustering centers, each shortest path is weighted according to color spatial variance. Our saliency detection method is computational efficient and outperformed the state of art methods by higher precision and better recall rates, when we took evaluation on the popular MSRA1000 database.
显著性检测是近年来计算机视觉研究的一个活跃领域,在模式识别、图像检索、自适应压缩、目标检测等方面有着广泛的应用。本文提出了一种基于颜色空间方差加权图模型的显著性检测方法,该方法的设计依赖于背景先验。首先将原始图像分割成小块,然后在小块上使用mean-shift聚类算法得到代表整幅图像主要颜色的各种聚类中心。在建模阶段,将所有的patch和聚类中心表示为特定图模型上的节点。每个patch的显著性定义为从patch到所有聚类中心的最短路径上的权值的加权和,每个最短路径根据颜色空间方差进行加权。当我们在流行的MSRA1000数据库上进行评估时,我们的显著性检测方法计算效率高,并且以更高的精度和更好的召回率优于当前的方法。
{"title":"Saliency Detection Using Color Spatial Variance Weighted Graph Model","authors":"Xiaoyun Yan, Yuehuang Wang, Mengmeng Song, Man Jiang","doi":"10.1109/ACPR.2013.93","DOIUrl":"https://doi.org/10.1109/ACPR.2013.93","url":null,"abstract":"Saliency detection as a recently active research field of computer vision, has a wide range of applications, such as pattern recognition, image retrieval, adaptive compression, target detection, etc. In this paper, we propose a saliency detection method based on color spatial variance weighted graph model, which is designed rely on a background prior. First, the original image is partitioned into small patches, then we use mean-shift clustering algorithm on this patches to get sorts of clustering centers that represents the main colors of whole image. In modeling stage, all patches and the clustering centers are denoted as nodes on a specific graph model. The saliency of each patch is defined as weighted sum of weights on shortest paths from the patch to all clustering centers, each shortest path is weighted according to color spatial variance. Our saliency detection method is computational efficient and outperformed the state of art methods by higher precision and better recall rates, when we took evaluation on the popular MSRA1000 database.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129293559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New Banknote Number Recognition Algorithm Based on Support Vector Machine 基于支持向量机的纸币号码识别新算法
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.115
S. Gai, Guowei Yang, S. Zhang, M. Wan
Detecting the banknote serial number is an important task in business transaction. In this paper, we propose a new banknote number recognition method. The preprocessing of each banknote image is used to locate position of the banknote number image. Each number image is divided into non-overlapping partitions and the average gray value of each partition is used as feature vector for recognition. The optimal kernel function is obtained by the semi-definite programming (SDP). The experimental results show that the proposed method outperforms MASK, BP, HMM, Single SVM classifiers.
纸币序列号的检测是商业交易中的一项重要工作。本文提出了一种新的钞票号码识别方法。对每张钞票图像进行预处理,定位钞票号码图像的位置。将数字图像分成互不重叠的分区,并将各分区的平均灰度值作为特征向量进行识别。利用半确定规划方法得到了最优核函数。实验结果表明,该方法优于MASK、BP、HMM和单个SVM分类器。
{"title":"New Banknote Number Recognition Algorithm Based on Support Vector Machine","authors":"S. Gai, Guowei Yang, S. Zhang, M. Wan","doi":"10.1109/ACPR.2013.115","DOIUrl":"https://doi.org/10.1109/ACPR.2013.115","url":null,"abstract":"Detecting the banknote serial number is an important task in business transaction. In this paper, we propose a new banknote number recognition method. The preprocessing of each banknote image is used to locate position of the banknote number image. Each number image is divided into non-overlapping partitions and the average gray value of each partition is used as feature vector for recognition. The optimal kernel function is obtained by the semi-definite programming (SDP). The experimental results show that the proposed method outperforms MASK, BP, HMM, Single SVM classifiers.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131465893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Multi-resolution Action Recognition Algorithm Using Wavelet Domain Features 使用小波域特征的多分辨率动作识别算法
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.143
H. Imtiaz, U. Mahbub, G. Schaefer, Md Atiqur Rahman Ahad
This paper proposes a novel approach for human action recognition using multi-resolution feature extraction based on the two-dimensional discrete wavelet transform (2D-DWT). Action representations can be considered as image templates, which can be useful for understanding various actions or gestures as well as for recognition and analysis. An action recognition scheme is developed based on extracting features from the frames of a video sequence. The proposed feature selection algorithm offers the advantage of very low feature dimensionality and therefore lower computational burden. It is shown that the use of wavelet-domain features enhances the distinguish ability of different actions, resulting in a very high within-class compactness and between-class separability of the extracted features, while certain undesirable phenomena, such as camera movement and change in camera distance from the subject, are less severe in the frequency domain. Principal component analysis is performed to further reduce the dimensionality of the feature space. Extensive experimentations on a standard benchmark database confirm that the proposed approach offers not only computational savings but also a very recognition accuracy.
本文提出了一种基于二维离散小波变换(2D-DWT)的多分辨率特征提取人类动作识别新方法。动作表征可视为图像模板,可用于理解各种动作或手势以及识别和分析。基于从视频序列的帧中提取特征,开发了一种动作识别方案。所提出的特征选择算法具有特征维度极低的优势,因此计算负担较轻。研究表明,小波域特征的使用增强了对不同动作的区分能力,从而使提取的特征具有很高的类内紧凑性和类间可分性,而某些不良现象,如摄像机移动和摄像机与被摄体距离的变化,在频域中则不那么严重。主成分分析可进一步降低特征空间的维度。在标准基准数据库上进行的大量实验证实,所提出的方法不仅节省了计算量,而且识别准确率也非常高。
{"title":"A Multi-resolution Action Recognition Algorithm Using Wavelet Domain Features","authors":"H. Imtiaz, U. Mahbub, G. Schaefer, Md Atiqur Rahman Ahad","doi":"10.1109/ACPR.2013.143","DOIUrl":"https://doi.org/10.1109/ACPR.2013.143","url":null,"abstract":"This paper proposes a novel approach for human action recognition using multi-resolution feature extraction based on the two-dimensional discrete wavelet transform (2D-DWT). Action representations can be considered as image templates, which can be useful for understanding various actions or gestures as well as for recognition and analysis. An action recognition scheme is developed based on extracting features from the frames of a video sequence. The proposed feature selection algorithm offers the advantage of very low feature dimensionality and therefore lower computational burden. It is shown that the use of wavelet-domain features enhances the distinguish ability of different actions, resulting in a very high within-class compactness and between-class separability of the extracted features, while certain undesirable phenomena, such as camera movement and change in camera distance from the subject, are less severe in the frequency domain. Principal component analysis is performed to further reduce the dimensionality of the feature space. Extensive experimentations on a standard benchmark database confirm that the proposed approach offers not only computational savings but also a very recognition accuracy.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126963557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Correlation-Based Facade Parsing Using Shape Grammar 使用形状语法的基于关联的Facade解析
Pub Date : 2013-11-05 DOI: 10.1109/ACPR.2013.81
Runze Zhang, Ruiling Deng, Xin He, Gang Zeng, Rui Gan, H. Zha
With strong inference of hierarchical and repetitive structures, semantic information has been widely used in dealing with urban scenes. In this paper, we present a super-pixel-based facade parsing framework which combines the top-down shape grammar splitting with bottom-up information aggregation: machine learning forecasts prior classes, super-pixels improve compactness, and boundary estimation divides the splitting into two procedures - raw and fine, providing a reasonable initial guess for the latter to achieve better random walk optimization results. We also put forward the correlation judging between floors for the purpose of compromising freedom degree reduction with style variety and flexibility, which is also introduced as alignment constraint term to extend the probability energy. Experiments show that our method converges fast and achieves the state-of-the-art results for different styles. Further study on understanding and reconstruction is in progress of exploiting these results.
语义信息具有很强的层次性和重复性,在城市场景处理中得到了广泛的应用。在本文中,我们提出了一种基于超像素的外观解析框架,将自顶向下的形状语法分割与自底向上的信息聚合相结合:机器学习预测先验类,超像素提高紧凑性,边界估计将分割分为原始和精细两个过程,为后者提供合理的初始猜测,以获得更好的随机行走优化结果。提出了楼层间的相关性判断,以折衷降低自由度与风格的多样性和灵活性,并将其作为对齐约束项引入,以扩展概率能量。实验表明,该方法收敛速度快,对不同风格的图像都能得到较好的结果。利用这些结果,正在进行进一步的理解和重建研究。
{"title":"Correlation-Based Facade Parsing Using Shape Grammar","authors":"Runze Zhang, Ruiling Deng, Xin He, Gang Zeng, Rui Gan, H. Zha","doi":"10.1109/ACPR.2013.81","DOIUrl":"https://doi.org/10.1109/ACPR.2013.81","url":null,"abstract":"With strong inference of hierarchical and repetitive structures, semantic information has been widely used in dealing with urban scenes. In this paper, we present a super-pixel-based facade parsing framework which combines the top-down shape grammar splitting with bottom-up information aggregation: machine learning forecasts prior classes, super-pixels improve compactness, and boundary estimation divides the splitting into two procedures - raw and fine, providing a reasonable initial guess for the latter to achieve better random walk optimization results. We also put forward the correlation judging between floors for the purpose of compromising freedom degree reduction with style variety and flexibility, which is also introduced as alignment constraint term to extend the probability energy. Experiments show that our method converges fast and achieves the state-of-the-art results for different styles. Further study on understanding and reconstruction is in progress of exploiting these results.","PeriodicalId":365633,"journal":{"name":"2013 2nd IAPR Asian Conference on Pattern Recognition","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116170001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2013 2nd IAPR Asian Conference on Pattern Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1