首页 > 最新文献

2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)最新文献

英文 中文
A combined two-stage local-spatial interest point matching algorithm 一种结合两阶段局部-空间兴趣点匹配算法
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779959
A. Dehghani, Alistair Sutherland
A local-spatial interest point matching algorithm for articulated human upper body tracking application is proposed in this paper. The first stage finds confidently matched pairs of interest points from the reference and target interest point lists through a local-feature-descriptors-based matching method. Applying two cross-checking and displacement-checking steps reduces the number of mismatched pairs and results confidently matched pairs. Using these confidently matched pairs, the second stage recovers more matched interest point pairs from the remaining unmatched through the graph matching by a cyclic string matching algorithm. The proposed approach benefits from the speed of local matching algorithms as well as the accuracy and robustness of spatial matching methods. In addition, it compensates for the reference list leakage problem. Experimental results show that the combined two-stage interest matching method efficiently improves the matching process for articulated human upper body tracking.
提出了一种局部空间兴趣点匹配算法,用于关节式人体上半身跟踪。第一阶段通过基于局部特征描述符的匹配方法,从参考点和目标兴趣点列表中找到自信匹配的兴趣点对。应用两个交叉检查和位移检查步骤减少了不匹配对的数量,并得到了自信的匹配对。第二阶段利用这些自信匹配的兴趣点对,通过循环字符串匹配算法从剩余的不匹配图中恢复更多匹配的兴趣点对。该方法既具有局部匹配算法的快速性,又具有空间匹配方法的准确性和鲁棒性。此外,它还补偿了参考表泄漏问题。实验结果表明,结合两阶段兴趣匹配方法有效地改善了关节人体上半身跟踪的匹配过程。
{"title":"A combined two-stage local-spatial interest point matching algorithm","authors":"A. Dehghani, Alistair Sutherland","doi":"10.1109/IRANIANMVIP.2013.6779959","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779959","url":null,"abstract":"A local-spatial interest point matching algorithm for articulated human upper body tracking application is proposed in this paper. The first stage finds confidently matched pairs of interest points from the reference and target interest point lists through a local-feature-descriptors-based matching method. Applying two cross-checking and displacement-checking steps reduces the number of mismatched pairs and results confidently matched pairs. Using these confidently matched pairs, the second stage recovers more matched interest point pairs from the remaining unmatched through the graph matching by a cyclic string matching algorithm. The proposed approach benefits from the speed of local matching algorithms as well as the accuracy and robustness of spatial matching methods. In addition, it compensates for the reference list leakage problem. Experimental results show that the combined two-stage interest matching method efficiently improves the matching process for articulated human upper body tracking.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115124168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The capacity of arithmetic compression based text steganography method 基于容量算法压缩的文本隐写方法
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779946
R. Saniei, K. Faez
Application of the lossless compression method to hide texts is considered as a novel trend in research projects. Evaluation of the proposed methods in the field of steganography reflects a variety of approaches to create covert communication via text files. The extensiveness of steganographic issues and the presence of a huge variety of approaches make it difficult to precisely compare and evaluate these methods. Therefore, in this article a new steganography method that uses a statistical compression technique called `arithmetic coding', will be presented. In addition, the comparison of this method capacity with other methods will be explained. The arithmetic coding technique that has very high compression rates, shall guarantee even a higher growth capacity and higher security compared to its similar techniques. Meanwhile, the secret messages were not revealed through rewriting or syntax/semantic checking and compared with similar methods, increased the capacity by up to 68.9%, and compared with other methods; this method improved the capacity of fifteen times.
应用无损压缩方法隐藏文本被认为是研究项目中的一个新趋势。对隐写术领域中提出的方法的评估反映了通过文本文件创建隐蔽通信的各种方法。隐写问题的广泛性和各种方法的存在使得很难精确地比较和评估这些方法。因此,在这篇文章中,一个新的隐写方法,使用统计压缩技术称为“算术编码”,将提出。此外,还将说明该方法的容量与其他方法的比较。算术编码技术具有很高的压缩率,与同类技术相比,它需要保证更高的增长能力和更高的安全性。同时,与同类方法相比,未通过重写或语法/语义检查对秘密信息进行披露,容量提高了68.9%,与其他方法相比;这种方法使容量提高了15倍。
{"title":"The capacity of arithmetic compression based text steganography method","authors":"R. Saniei, K. Faez","doi":"10.1109/IRANIANMVIP.2013.6779946","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779946","url":null,"abstract":"Application of the lossless compression method to hide texts is considered as a novel trend in research projects. Evaluation of the proposed methods in the field of steganography reflects a variety of approaches to create covert communication via text files. The extensiveness of steganographic issues and the presence of a huge variety of approaches make it difficult to precisely compare and evaluate these methods. Therefore, in this article a new steganography method that uses a statistical compression technique called `arithmetic coding', will be presented. In addition, the comparison of this method capacity with other methods will be explained. The arithmetic coding technique that has very high compression rates, shall guarantee even a higher growth capacity and higher security compared to its similar techniques. Meanwhile, the secret messages were not revealed through rewriting or syntax/semantic checking and compared with similar methods, increased the capacity by up to 68.9%, and compared with other methods; this method improved the capacity of fifteen times.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115500073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Single camera vehicles speed measurement 单摄像头车辆测速
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779976
A. Dehghani, A. Pourmohammad
Image and video processing techniques are one of the commonly used methods for traffics monitoring. This paper investigates the image processing techniques based vehicles speed measurement issue using only a fixed single camera. Therefore, a geometrical calculation based method is proposed. Based on this method, first a moving vehicle is detected in a video background and then the vehicle speed is estimated based on some geometrical calculations. A comparison is made between this method and two other same case vehicles speed measurement methods for evaluation. The simulations results on 160×112 pixels real recorded video images shows the average vehicles speed error less than %10.
图像和视频处理技术是交通监控常用的方法之一。本文研究了基于单台固定摄像机的车辆测速图像处理技术。因此,提出了一种基于几何计算的方法。该方法首先在视频背景中检测到移动的车辆,然后根据几何计算估计出车辆的速度。并将该方法与其他两种相同情况下的车速测量方法进行了比较评价。在160×112像素实录视频图像上的仿真结果表明,平均车速误差小于%10。
{"title":"Single camera vehicles speed measurement","authors":"A. Dehghani, A. Pourmohammad","doi":"10.1109/IRANIANMVIP.2013.6779976","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779976","url":null,"abstract":"Image and video processing techniques are one of the commonly used methods for traffics monitoring. This paper investigates the image processing techniques based vehicles speed measurement issue using only a fixed single camera. Therefore, a geometrical calculation based method is proposed. Based on this method, first a moving vehicle is detected in a video background and then the vehicle speed is estimated based on some geometrical calculations. A comparison is made between this method and two other same case vehicles speed measurement methods for evaluation. The simulations results on 160×112 pixels real recorded video images shows the average vehicles speed error less than %10.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125200408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A textural approach for recognizing architectural distortion in mammograms 一种识别乳房x光片结构扭曲的纹理方法
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779965
Elham Mohammadi, E. Fatemizadeh, H. Sheikhzadeh, Sahar Khoubani
Breast cancer is considered as the most important cause of death among women. Architectural distortions are very important signs of breast cancer and early detection of them is a rewarding work. In this paper we propose a method to recognize architectural distortion from normal parenchyma. In our proposed method, appropriate features are extracted by the analysis of oriented textures with the application of orientation component of recent the state-of-the-art local texture descriptor called Monogenic Binary Coding (MBC). In addition, we transform Region of Interests (ROIs) to polar coordinates in order to highlight some specific patterns in mammograms. Various classifiers are used over a set of mammograms from Digital Database for Screening Mammography (DDSM). The results show that proposed method is very encouraging. The best performance achieved is 91.25% in terms of the average accuracy using the Nearest Neighbor classifier.
乳腺癌被认为是妇女死亡的最重要原因。建筑变形是乳腺癌的重要标志,早期发现是一项有益的工作。本文提出了一种从正常实质中识别建筑变形的方法。该方法利用当前最先进的局部纹理描述符单基因二进制编码(MBC)的方向分量,对纹理进行定向分析,提取出合适的特征。此外,我们将兴趣区域(roi)转换为极坐标,以突出乳房x光片中的一些特定模式。不同的分类器被用于一组来自乳腺摄影筛查数字数据库(DDSM)的乳房x线照片。结果表明,所提出的方法是非常令人鼓舞的。使用最近邻分类器获得的最佳性能是91.25%的平均准确率。
{"title":"A textural approach for recognizing architectural distortion in mammograms","authors":"Elham Mohammadi, E. Fatemizadeh, H. Sheikhzadeh, Sahar Khoubani","doi":"10.1109/IRANIANMVIP.2013.6779965","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779965","url":null,"abstract":"Breast cancer is considered as the most important cause of death among women. Architectural distortions are very important signs of breast cancer and early detection of them is a rewarding work. In this paper we propose a method to recognize architectural distortion from normal parenchyma. In our proposed method, appropriate features are extracted by the analysis of oriented textures with the application of orientation component of recent the state-of-the-art local texture descriptor called Monogenic Binary Coding (MBC). In addition, we transform Region of Interests (ROIs) to polar coordinates in order to highlight some specific patterns in mammograms. Various classifiers are used over a set of mammograms from Digital Database for Screening Mammography (DDSM). The results show that proposed method is very encouraging. The best performance achieved is 91.25% in terms of the average accuracy using the Nearest Neighbor classifier.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130551089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Dynamic temporal error concealment for video data in error-prone environments 易出错环境下视频数据的动态时间错误隐藏
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779947
S. M. Marvasti-Zadeh, Hossein Ghanei-Yakhdan, S. Kasaei
Error concealment is a useful method for improving the damaged video quality in the decoder side. In this paper, a dynamic method with low computational complexity is presented to improve the visual quality of videos when up to 50% of the frames are damaged. In the proposed method, temporal replacement and the improved outer boundary matching algorithm are used for dynamical error concealment in inter-frames of videos. With the use of motion vectors (MVs) which are close to the damaged macroblock (MB) the method can determine whether the motion in specific areas is either regular, irregular, or zero. Then, based on this knowledge, different methods are performed. It adaptively selects a set of candidate MVs and external boundaries for comparison purposes. Furthermore, to increase the accuracy, depending on the correctness of adjacent MVs, a specific weight is given to the boundaries of adjacent MBs. Experimental results show that the proposed method enhances both objective and subjective quality of damaged frames without any considerable increase in complexity. The average PSNR in some frames of test sequences has increased about 1.01 dB more than the outer boundary matching algorithm.
错误隐藏是改善解码器端受损视频质量的有效方法。本文提出了一种低计算复杂度的动态方法,用于在高达50%帧损坏的情况下提高视频的视觉质量。该方法采用时间替换和改进的外边界匹配算法实现视频帧间的动态错误隐藏。该方法利用与受损宏块(MB)接近的运动向量(mv)来确定特定区域的运动是规则的、不规则的还是零。然后,基于这些知识,执行不同的方法。它自适应地选择一组候选mv和外部边界进行比较。此外,为了提高精度,根据相邻mv的正确性,对相邻mb的边界赋予特定的权重。实验结果表明,该方法在不增加图像复杂度的前提下,提高了图像的主观和客观质量。测试序列部分帧的平均信噪比比外边界匹配算法提高了1.01 dB左右。
{"title":"Dynamic temporal error concealment for video data in error-prone environments","authors":"S. M. Marvasti-Zadeh, Hossein Ghanei-Yakhdan, S. Kasaei","doi":"10.1109/IRANIANMVIP.2013.6779947","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779947","url":null,"abstract":"Error concealment is a useful method for improving the damaged video quality in the decoder side. In this paper, a dynamic method with low computational complexity is presented to improve the visual quality of videos when up to 50% of the frames are damaged. In the proposed method, temporal replacement and the improved outer boundary matching algorithm are used for dynamical error concealment in inter-frames of videos. With the use of motion vectors (MVs) which are close to the damaged macroblock (MB) the method can determine whether the motion in specific areas is either regular, irregular, or zero. Then, based on this knowledge, different methods are performed. It adaptively selects a set of candidate MVs and external boundaries for comparison purposes. Furthermore, to increase the accuracy, depending on the correctness of adjacent MVs, a specific weight is given to the boundaries of adjacent MBs. Experimental results show that the proposed method enhances both objective and subjective quality of damaged frames without any considerable increase in complexity. The average PSNR in some frames of test sequences has increased about 1.01 dB more than the outer boundary matching algorithm.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125717100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Pedestrian detection using principal components analysis of gradient distribution 行人检测采用主成分梯度分布分析
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779950
S. Mehralian, M. Palhang
In this paper we proposed a new method for pedestrian detection in images and videos. Our method uses a sliding window to search through images. In order to extract the features, each window is divided into overlapping cells and features are extracted from them. The feature that we extracted to describe each window is based on analysis of gradient distribution of each cell. After gradient distribution of a cell computed, the PCA is applied on it and using a mathematical expression that gauges the attitude of edges we got the feature of the cell. Putting the features of the cells next to each other forms the feature vector of the window. Then, the extracted features are classified using Support Vector Machine (SVM). Finally, the learned SVM model tested on the INRIA pedestrian dataset. The proposed method was compared with Histograms of Oriented Gradient (HOG) approach and the results show that our method has comparable detection accuracy as well as having more robustness when facing with noise.
本文提出了一种新的图像和视频行人检测方法。我们的方法使用滑动窗口来搜索图像。为了提取特征,将每个窗口划分为重叠的单元,并从中提取特征。我们提取的特征描述每个窗口是基于分析每个细胞的梯度分布。在计算出细胞的梯度分布后,将主成分分析应用到细胞上,利用测量边缘姿态的数学表达式得到细胞的特征。将单元格的特征彼此相邻,形成窗口的特征向量。然后,使用支持向量机(SVM)对提取的特征进行分类。最后,将学习到的SVM模型在INRIA行人数据集上进行测试。将该方法与直方图定向梯度(HOG)方法进行了比较,结果表明,该方法具有相当的检测精度,并且在面对噪声时具有更强的鲁棒性。
{"title":"Pedestrian detection using principal components analysis of gradient distribution","authors":"S. Mehralian, M. Palhang","doi":"10.1109/IRANIANMVIP.2013.6779950","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779950","url":null,"abstract":"In this paper we proposed a new method for pedestrian detection in images and videos. Our method uses a sliding window to search through images. In order to extract the features, each window is divided into overlapping cells and features are extracted from them. The feature that we extracted to describe each window is based on analysis of gradient distribution of each cell. After gradient distribution of a cell computed, the PCA is applied on it and using a mathematical expression that gauges the attitude of edges we got the feature of the cell. Putting the features of the cells next to each other forms the feature vector of the window. Then, the extracted features are classified using Support Vector Machine (SVM). Finally, the learned SVM model tested on the INRIA pedestrian dataset. The proposed method was compared with Histograms of Oriented Gradient (HOG) approach and the results show that our method has comparable detection accuracy as well as having more robustness when facing with noise.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"149 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121401501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Iterative back projection based image resolution enhancement 基于迭代反投影的图像分辨率增强
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779986
P. Rasti, H. Demirel, G. Anbarjafari
In this paper, we propose a new super resolution technique based on the interpolation followed by registering them using iterative back projection (IBP). Firstly the low resolution image is interpolated and then decimate it to four lower low resolution images. The four low resolution images are interpolated and registered by using IBP in order to generate a sharper high resolution image. The proposed method has been tested on Lena, Elaine, Pepper, and Baboon. The quantitative peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) results as well as the visual results show the superiority of the proposed technique over the conventional and alternative image super resolution techniques. For Lena's image, the PSNR is 6.21 dB higher than the bicubic interpolation.
本文提出了一种基于插值和迭代反投影(IBP)配准的超分辨率技术。首先对低分辨率图像进行插值,然后将其抽取为4张低分辨率图像。利用IBP对四幅低分辨率图像进行插值配准,得到更清晰的高分辨率图像。提出的方法已经在莉娜,伊莱恩,佩珀和狒狒身上进行了测试。定量峰值信噪比(PSNR)和结构相似指数(SSIM)结果以及视觉结果表明,该技术优于传统和替代图像超分辨率技术。对于Lena的图像,PSNR比双三次插值高6.21 dB。
{"title":"Iterative back projection based image resolution enhancement","authors":"P. Rasti, H. Demirel, G. Anbarjafari","doi":"10.1109/IRANIANMVIP.2013.6779986","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779986","url":null,"abstract":"In this paper, we propose a new super resolution technique based on the interpolation followed by registering them using iterative back projection (IBP). Firstly the low resolution image is interpolated and then decimate it to four lower low resolution images. The four low resolution images are interpolated and registered by using IBP in order to generate a sharper high resolution image. The proposed method has been tested on Lena, Elaine, Pepper, and Baboon. The quantitative peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) results as well as the visual results show the superiority of the proposed technique over the conventional and alternative image super resolution techniques. For Lena's image, the PSNR is 6.21 dB higher than the bicubic interpolation.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129310987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Fuzzy local binary patterns: A comparison between Min-Max and Dot-Sum operators in the application of facial expression recognition 模糊局部二值模式:最小-最大和点和算子在面部表情识别中的应用比较
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780002
M. Mohammadi, E. Fatemizadeh
The Local Binary Patterns (LBP) feature extraction method is a theoretically and computationally simple and efficient methodology for texture analysis. The LBP operator is used in many applications such as facial expression recognition and face recognition. The original LBP is based on hard thresholding the neighborhood of each pixel, which makes texture representation sensitive to noise. In addition, LBP cannot distinguish between a strong and a weak pattern. In order to enhance the LBP approach, Fuzzy Local Binary Patterns (FLBP) is proposed. In FLBP, any neighborhood does not represented only by one code, but, it is represented by all existing codes with different degrees. In FLBP, any fuzzy Intersection and Union operators may be used. In this study, the following operators are applied and their results are compared together: Dot-Sum, Min-Max and normalized Min-Max. Based on the extensive experiments, the fuzzy Min-Max operators are more useful and can improve the accuracy in the application of Facial Expression Recognition (FER) about 4% (i.e., form 82.98% to 86.88%).
局部二值模式(LBP)特征提取方法是一种理论上和计算上简单有效的纹理分析方法。LBP算子被广泛应用于面部表情识别和人脸识别等领域。原始的LBP基于每个像素邻域的硬阈值,使得纹理表示对噪声敏感。此外,LBP不能区分强和弱模式。为了改进LBP方法,提出了模糊局部二值模式(FLBP)。在FLBP中,任何邻域都不是只用一个码来表示,而是由现有的所有不同程度的码来表示。在FLBP中,可以使用任何模糊交算子和模糊联合算子。在本研究中,使用了以下算子,并将它们的结果进行了比较:Dot-Sum、Min-Max和归一化Min-Max。经过大量的实验,模糊最小-最大算子在面部表情识别(FER)的应用中更有用,可以将准确率提高约4%(即从82.98%提高到86.88%)。
{"title":"Fuzzy local binary patterns: A comparison between Min-Max and Dot-Sum operators in the application of facial expression recognition","authors":"M. Mohammadi, E. Fatemizadeh","doi":"10.1109/IRANIANMVIP.2013.6780002","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780002","url":null,"abstract":"The Local Binary Patterns (LBP) feature extraction method is a theoretically and computationally simple and efficient methodology for texture analysis. The LBP operator is used in many applications such as facial expression recognition and face recognition. The original LBP is based on hard thresholding the neighborhood of each pixel, which makes texture representation sensitive to noise. In addition, LBP cannot distinguish between a strong and a weak pattern. In order to enhance the LBP approach, Fuzzy Local Binary Patterns (FLBP) is proposed. In FLBP, any neighborhood does not represented only by one code, but, it is represented by all existing codes with different degrees. In FLBP, any fuzzy Intersection and Union operators may be used. In this study, the following operators are applied and their results are compared together: Dot-Sum, Min-Max and normalized Min-Max. Based on the extensive experiments, the fuzzy Min-Max operators are more useful and can improve the accuracy in the application of Facial Expression Recognition (FER) about 4% (i.e., form 82.98% to 86.88%).","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129253472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
On road vehicle make and model recognition via sparse feature coding 基于稀疏特征编码的道路车辆型号识别
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780025
A. Nazemi, M. Shafiee, Z. Azimifar
Automatic vehicle Make and Model Recognition (MMR) system offers a competent way to vehicle classification and recognition systems. This paper proposes a real time while robust vehicle make and model recognition system extracting the vehicle sub-image from the background and studies some sparse feature coding methods such as Orthogonal Matching Pursuit (OMP), some variation of Sparse Coding (SC) methods and compares them to choose the best one. Our method employs the sparse feature coding methods on dense Scale-Invariant Feature Transform (SIFT) features and Support Vector Machine (SVM) for classification. The proposed system is examined by an Iranian on road vehicles dataset, which its samples are in different point of views, various weather conditions and illuminations.
自动车型识别(MMR)系统为车辆分类识别提供了一种有效的方法。本文提出了一种从背景中提取车辆子图像的实时鲁棒的车型识别系统,研究了稀疏特征编码方法,如正交匹配追踪(OMP)方法、稀疏编码方法的几种变体(SC)方法,并对其进行了比较选择。该方法采用稀疏特征编码方法对密集尺度不变特征变换(SIFT)特征和支持向量机(SVM)进行分类。该系统由伊朗的道路车辆数据集进行了测试,其样本处于不同的角度、不同的天气条件和照明下。
{"title":"On road vehicle make and model recognition via sparse feature coding","authors":"A. Nazemi, M. Shafiee, Z. Azimifar","doi":"10.1109/IRANIANMVIP.2013.6780025","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780025","url":null,"abstract":"Automatic vehicle Make and Model Recognition (MMR) system offers a competent way to vehicle classification and recognition systems. This paper proposes a real time while robust vehicle make and model recognition system extracting the vehicle sub-image from the background and studies some sparse feature coding methods such as Orthogonal Matching Pursuit (OMP), some variation of Sparse Coding (SC) methods and compares them to choose the best one. Our method employs the sparse feature coding methods on dense Scale-Invariant Feature Transform (SIFT) features and Support Vector Machine (SVM) for classification. The proposed system is examined by an Iranian on road vehicles dataset, which its samples are in different point of views, various weather conditions and illuminations.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115297495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Human detection and tracking using new features combination in particle filter framework 在粒子滤波框架下利用新特征组合进行人体检测与跟踪
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780009
S. Rahimi, A. Aghagolzadeh, Hadi Seyedarabi
Human tracking is an interesting topic in computer vision domain. In this paper, a human detection and tracking algorithm based on new features combination in one camera system is proposed. In detection part, first, mixture of Gaussian background subtraction method is used to find moving regions, then histogram of oriented gradient (HOG) feature of these regions are extracted. At the end, SVM classifier is used to distinguish human from non-human according to their HOG features. In tracking part, first, color, cellular local binary pattern (Cell-LBP) and HOG features of humans are extracted, then their next positions are estimated using particle filter framework. Color, Cell-LBP and HOG features are used to model humans. Color is an effective feature in dealing with object deformation and partial occlusion but has some restriction in cases where background or objects have same color. Cell-LBP is an improved texture descriptor that is robust against partial occlusion, this feature compensates color's restriction. HOG is a shape descriptor that can separate humans from background and is robust against illumination changes. Combination of these three features improves tracking result despite challenges like partial occlusion, object's deformation and illumination changes. Experimental results show advantage of the proposed algorithm.
人体跟踪是计算机视觉领域一个有趣的研究课题。本文提出了一种基于新特征组合的单摄像机人体检测与跟踪算法。在检测部分,首先采用混合高斯背景法寻找运动区域,然后提取这些区域的定向梯度直方图(HOG)特征;最后,利用SVM分类器根据HOG特征对人与非人进行区分。在跟踪部分,首先提取人体的颜色、细胞局部二值模式(Cell-LBP)和HOG特征,然后利用粒子滤波框架估计其下一个位置;颜色、Cell-LBP和HOG特征用于人体建模。颜色是处理物体变形和局部遮挡的有效特征,但在背景或物体颜色相同的情况下有一定的限制。Cell-LBP是一种改进的纹理描述符,它对部分遮挡具有鲁棒性,补偿了颜色的限制。HOG是一种形状描述符,可以将人与背景分开,并且对光照变化具有鲁棒性。尽管存在部分遮挡、物体变形和光照变化等挑战,但这三个特征的结合改善了跟踪结果。实验结果表明了该算法的优越性。
{"title":"Human detection and tracking using new features combination in particle filter framework","authors":"S. Rahimi, A. Aghagolzadeh, Hadi Seyedarabi","doi":"10.1109/IRANIANMVIP.2013.6780009","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780009","url":null,"abstract":"Human tracking is an interesting topic in computer vision domain. In this paper, a human detection and tracking algorithm based on new features combination in one camera system is proposed. In detection part, first, mixture of Gaussian background subtraction method is used to find moving regions, then histogram of oriented gradient (HOG) feature of these regions are extracted. At the end, SVM classifier is used to distinguish human from non-human according to their HOG features. In tracking part, first, color, cellular local binary pattern (Cell-LBP) and HOG features of humans are extracted, then their next positions are estimated using particle filter framework. Color, Cell-LBP and HOG features are used to model humans. Color is an effective feature in dealing with object deformation and partial occlusion but has some restriction in cases where background or objects have same color. Cell-LBP is an improved texture descriptor that is robust against partial occlusion, this feature compensates color's restriction. HOG is a shape descriptor that can separate humans from background and is robust against illumination changes. Combination of these three features improves tracking result despite challenges like partial occlusion, object's deformation and illumination changes. Experimental results show advantage of the proposed algorithm.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116212947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1