2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)最新文献

英文中文

Image gray-level enhancement using Black Hole algorithm 利用黑洞算法增强图像灰度

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161633

Saber Yaghoobi, Saeed Hemayat, H. Mojallali

Image enhancement methods are known among the most important image processing techniques. Here, image enhancement is considered as an optimization problem and a new heuristic optimization algorithm namely the Black Hole is used to solve it. Image enhancement is a nonlinear optimization problem with its particular constraints and the enhancement process will be done by intensifying each pixel's content. In this paper, BH is employed to find the image's optimum parameters of the transfer function in order to get the best results. BH is used here for its simplicity, ease of implementation, and also its invincibility against the parameter tuning issues. Performance of the proposed enhancement algorithm is tested against some of the well-known enhancement techniques viz. GA, PSO, HE and CS, and the obtained results indicate the robustness and also the outperformance of the proposed algorithm among its other counterparts. Enhancement in opaque images consisting of immense dominant gray values can be listed as one of the proposed method's superiority to that of the other available in literature, which will turn the input image into an enhanced image, featuring embossed textures.

图像增强方法是最重要的图像处理技术之一。本文将图像增强视为一个优化问题，并采用一种新的启发式优化算法——黑洞算法来求解。图像增强是一个具有特定约束条件的非线性优化问题，增强过程将通过增强每个像素的内容来完成。为了得到最佳的结果，本文采用BH来寻找图像传递函数的最优参数。这里使用BH是因为它简单，易于实现，并且它对参数调整问题的不可战胜性。本文提出的增强算法的性能与一些著名的增强技术(如GA、PSO、HE和CS)进行了测试，得到的结果表明该算法具有鲁棒性，并且在其他同类算法中表现出优异的性能。由巨大的主导灰度值组成的不透明图像的增强可以被列为本文提出的方法优于文献中其他方法的优点之一，它将把输入图像变成具有浮雕纹理的增强图像。

{"title":"Image gray-level enhancement using Black Hole algorithm","authors":"Saber Yaghoobi, Saeed Hemayat, H. Mojallali","doi":"10.1109/PRIA.2015.7161633","DOIUrl":"https://doi.org/10.1109/PRIA.2015.7161633","url":null,"abstract":"Image enhancement methods are known among the most important image processing techniques. Here, image enhancement is considered as an optimization problem and a new heuristic optimization algorithm namely the Black Hole is used to solve it. Image enhancement is a nonlinear optimization problem with its particular constraints and the enhancement process will be done by intensifying each pixel's content. In this paper, BH is employed to find the image's optimum parameters of the transfer function in order to get the best results. BH is used here for its simplicity, ease of implementation, and also its invincibility against the parameter tuning issues. Performance of the proposed enhancement algorithm is tested against some of the well-known enhancement techniques viz. GA, PSO, HE and CS, and the obtained results indicate the robustness and also the outperformance of the proposed algorithm among its other counterparts. Enhancement in opaque images consisting of immense dominant gray values can be listed as one of the proposed method's superiority to that of the other available in literature, which will turn the input image into an enhanced image, featuring embossed textures.","PeriodicalId":163817,"journal":{"name":"2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127353647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 18

Recognition of Farsi handwriting strokes using profile HMM 利用侧面HMM识别波斯语书写笔划

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161646

Ali Katanforoush, Z. Rezvani

This paper aims to stroke recognition, where the strokes are connected forms of cursive handwritten scripts, and in particular, we concern on recognition of Farsi handwriting strokes. In Farsi and some other writing systems, connected letters have special shapes that are often unrecognizable from their separated shapes. Despite that quite efficient algorithms have been developed for recognition of handwritten digits and disjoint letters, adapting these algorithms to stroke recognition is so arduous that development of a holistic approach is preferable. In this paper, we develop a method for Farsi handwriting recognition based on profile-HMM and study aspects of modeling the spatiotemporal features of handwriting strokes. The modular architecture of profile-HMMs provides a flexible framework for stroke modeling. Stroke shrinking and elongation are naturally modeled by the recurrent states and the silent states of profile-HMMs and make the model insensitive to writing speed and subtle slides. Our experimental results show that the profile-HMM is quite robust with respect to downsampling of the curve points, also is robust with respect to various settings in the training procedure. Our method correctly recognizes the main stroke of 90.8%, 98.5%, and 99.2% of handwriting samples, respectively in the top first, top five, and top ten hits.

笔画是草书笔迹的一种连接形式，本文主要研究的是波斯语笔迹笔画的识别问题。在波斯语和其他一些书写系统中，连在一起的字母有特殊的形状，通常无法从它们分开的形状中识别出来。尽管已经开发了相当有效的算法来识别手写数字和不连贯的字母，但将这些算法应用于笔画识别是如此艰巨，因此开发一种整体方法是可取的。本文提出了一种基于轮廓hmm的波斯语手写识别方法，并对手写笔画的时空特征建模进行了研究。轮廓hmm的模块化结构为笔画建模提供了一个灵活的框架。笔画的收缩和伸长自然地由轮廓hmm的循环状态和沉默状态建模，使模型对书写速度和细微滑动不敏感。我们的实验结果表明，profile-HMM对于曲线点的下采样具有相当的鲁棒性，并且对于训练过程中的各种设置也具有鲁棒性。我们的方法正确识别了90.8%、98.5%和99.2%的笔迹样本的主笔划，分别是前一、前五和前十。

{"title":"Recognition of Farsi handwriting strokes using profile HMM","authors":"Ali Katanforoush, Z. Rezvani","doi":"10.1109/PRIA.2015.7161646","DOIUrl":"https://doi.org/10.1109/PRIA.2015.7161646","url":null,"abstract":"This paper aims to stroke recognition, where the strokes are connected forms of cursive handwritten scripts, and in particular, we concern on recognition of Farsi handwriting strokes. In Farsi and some other writing systems, connected letters have special shapes that are often unrecognizable from their separated shapes. Despite that quite efficient algorithms have been developed for recognition of handwritten digits and disjoint letters, adapting these algorithms to stroke recognition is so arduous that development of a holistic approach is preferable. In this paper, we develop a method for Farsi handwriting recognition based on profile-HMM and study aspects of modeling the spatiotemporal features of handwriting strokes. The modular architecture of profile-HMMs provides a flexible framework for stroke modeling. Stroke shrinking and elongation are naturally modeled by the recurrent states and the silent states of profile-HMMs and make the model insensitive to writing speed and subtle slides. Our experimental results show that the profile-HMM is quite robust with respect to downsampling of the curve points, also is robust with respect to various settings in the training procedure. Our method correctly recognizes the main stroke of 90.8%, 98.5%, and 99.2% of handwriting samples, respectively in the top first, top five, and top ten hits.","PeriodicalId":163817,"journal":{"name":"2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116395581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The effect of the distance from the webcam in heart rate estimation from face video images 距离摄像头对人脸视频图像心率估计的影响

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161622

Atefeh Shagholi, M. Charmi, H. Rakhshan

Human facial video captured with a webcam can be processed to extract the heart rate. The objective of this paper is to show that the distance of the person from the webcam is an important parameter in the accuracy of the estimated heart rate. In fact, with increasing this distance, facial region is limited and the resolution of the video image is diminished which eventually leads to decrease in the accuracy of the estimated heart rate. We have carried out experiments on a data set of 12 subjects. The results of experiment have been compared with the heart rates recorded by a fingertip pulse oximeter in a statistical analysis framework. Our comparison reveals that the root mean square error of the difference of the recorded and the estimated heart rates has been increased from 8.91 for one-half meter away from the webcam to 16.77 for three meters away from the webcam.

通过网络摄像头拍摄的人脸视频可以提取心率。本文的目的是表明人与网络摄像头的距离是估计心率准确性的一个重要参数。实际上，随着这个距离的增加，人脸区域受到限制，视频图像的分辨率降低，最终导致估计心率的准确性下降。我们对12个受试者的数据集进行了实验。在统计分析框架中，将实验结果与指尖脉搏血氧仪记录的心率进行了比较。我们的比较表明，记录的心率和估计的心率之差的均方根误差已经从离摄像头半米的8.91增加到离摄像头三米的16.77。

引用次数: 7

Fast and adaptive license plate recognition algorithm for Persian plates 波斯车牌快速自适应车牌识别算法

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161638

Sina Moayed Baharlou, Saeed Hemayat, A. Saberkari, Saber Yaghoobi

A new Persian license plate recognition algorithm is presented. These operations are highly susceptible to error, especially where the image consists of large amount of either vehicle's linked components or the other existing objects. Although the proposed character recognition procedure is highly optimized for Persian plates, the localization parts can be employed for all types of vehicles. Minimum rectangle bounding box is replaced the common bounding box methods, compensating normal bounding box's inherent flaws. License plate possibility ratio (LPPR) is a robust method proposed here to localize the plate. New method of finding plate's location out of so many rectangles, considering “Sensitive to angle” criterions for characters has also been presented. It should be noted that the process is regardless of the plate's location. Different approach on thresholding namely: “Dynamic Thresholding” is used to overcome the probable drawbacks caused by inappropriate lighting. From OCR point of view, a graph, consisting of two specifications will be formed and a set of rules will be defined to capture the character's label. An automated harassment section is added as the denoising filter, in order to omit the grinning ramifications. Presenting the best percent accuracy (95.33%) among relevant well-known algorithms in localization procedure with 25ms run time of the program, and also the outstanding results with over 97% of percent accuracy in character recognition of Persian plates with 30ms run time of the program on Linux and also average of 90ms on Android, can be listed as strong proofs of algorithm's efficiency.

提出了一种新的波斯语车牌识别算法。这些操作非常容易出错，特别是当图像由大量车辆的链接组件或其他现有对象组成时。虽然所提出的字符识别程序对波斯车牌进行了高度优化，但定位部分可以用于所有类型的车辆。用最小矩形包围盒代替了常用的包围盒方法，弥补了常规包围盒的固有缺陷。车牌可能性比(LPPR)是一种鲁棒的车牌定位方法。本文还提出了一种考虑字符“角度敏感”准则的从大量矩形中寻找板位置的新方法。应该注意的是，这个过程是与印版的位置无关的。不同的阈值处理方法，即“动态阈值处理”，用于克服不适当的照明可能造成的缺陷。从OCR的角度来看，将形成一个由两个规范组成的图形，并定义一组规则来捕获字符的标签。一个自动骚扰部分被添加为去噪过滤器，以省略咧嘴笑的后果。在相关的知名算法中，在25ms的程序运行时间下，在定位过程中准确率最高(95.33%)，在Linux上，程序运行时间为30ms，在Android上平均运行时间为90ms，在波斯语字符识别中准确率超过97%，这些都是算法效率的有力证明。

{"title":"Fast and adaptive license plate recognition algorithm for Persian plates","authors":"Sina Moayed Baharlou, Saeed Hemayat, A. Saberkari, Saber Yaghoobi","doi":"10.1109/PRIA.2015.7161638","DOIUrl":"https://doi.org/10.1109/PRIA.2015.7161638","url":null,"abstract":"A new Persian license plate recognition algorithm is presented. These operations are highly susceptible to error, especially where the image consists of large amount of either vehicle's linked components or the other existing objects. Although the proposed character recognition procedure is highly optimized for Persian plates, the localization parts can be employed for all types of vehicles. Minimum rectangle bounding box is replaced the common bounding box methods, compensating normal bounding box's inherent flaws. License plate possibility ratio (LPPR) is a robust method proposed here to localize the plate. New method of finding plate's location out of so many rectangles, considering “Sensitive to angle” criterions for characters has also been presented. It should be noted that the process is regardless of the plate's location. Different approach on thresholding namely: “Dynamic Thresholding” is used to overcome the probable drawbacks caused by inappropriate lighting. From OCR point of view, a graph, consisting of two specifications will be formed and a set of rules will be defined to capture the character's label. An automated harassment section is added as the denoising filter, in order to omit the grinning ramifications. Presenting the best percent accuracy (95.33%) among relevant well-known algorithms in localization procedure with 25ms run time of the program, and also the outstanding results with over 97% of percent accuracy in character recognition of Persian plates with 30ms run time of the program on Linux and also average of 90ms on Android, can be listed as strong proofs of algorithm's efficiency.","PeriodicalId":163817,"journal":{"name":"2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125742096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

A binary-segmentation algorithm based on shearlet transform and eigenvectors 基于shearlet变换和特征向量的二值分割算法

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161618

Ladan Sharafyan Cigaroudy, N. Aghazadeh

In this paper, we illustrate an iterative algorithm for extraction of object with tubular structure specially vessel extraction. For this aim, we segment image to reach binary image in which the pixels of purpose object is found. In our segmentation method, we use Gaussian scale-space technique to compute discrete gradient of image for pre-segmenting. Also, in order to denoise, we use tight frame of shearlet transform. This algorithm has an iterative part based on iterative part of TFA [2], but we use eigenvectors of Hessian matrix of image for improving this part. Theoretical properties of this method are presented. The experimental results show that in our algorithm distinguishing homogeneous vessels is done efficiently.

本文给出了一种用于管状结构物体提取的迭代算法，特别是容器的提取。为此，我们对图像进行分割，得到找到目标物体像素的二值图像。在我们的分割方法中，我们使用高斯尺度空间技术计算图像的离散梯度进行预分割。同时，为了对图像进行降噪，我们采用了紧框架剪切波变换。该算法在TFA迭代部分的基础上有一个迭代部分[2]，但我们使用图像Hessian矩阵的特征向量对这部分进行改进。给出了该方法的理论性质。实验结果表明，该算法能够有效地识别均匀血管。

引用次数: 4

Separation of multiplicative image components by Bayesian Independent Component Analysis 用贝叶斯独立分量分析分离相乘图像分量

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161648

Arash Mehrjou, Babak Nadjar Araabi, Reshad Hosseini

The ability to decompose superimposed images to their basic components has a fundamental importance in machine vision applications. Segmentation Algorithms consider an image composed of several regions each with a particular gray level, texture or color and try to extract those regions which are not covering each other. However, in this paper, we propose a method for decomposing an image to its superimposed components. Taking prior assumptions into account requires Bayesian framework which is well adapted to this application. Also, a profound mathematical theory called Variational Method is used here which makes us capable of calculating intractable integrals and marginal posteriors. In this paper, situations where superimposed images are to be recovered are discussed and a thorough framework is suggested which is basically founded on the ground of Blind Source Separation (BSS) and Independent Component Analysis (ICA). The main idea of this paper is exerted on some synthetic images to verify its applicability.

将叠加图像分解为其基本组件的能力在机器视觉应用中具有重要的基础意义。分割算法考虑由几个区域组成的图像，每个区域具有特定的灰度，纹理或颜色，并试图提取那些不相互覆盖的区域。然而，在本文中，我们提出了一种将图像分解为其叠加分量的方法。考虑到先前的假设需要贝叶斯框架，它很好地适应了这种应用。此外，这里还使用了一种叫做变分法的深奥数学理论，它使我们能够计算难以处理的积分和边际后验。本文讨论了叠加图像恢复的情况，并提出了一个基本建立在盲源分离(BSS)和独立分量分析(ICA)基础上的完整框架。将本文的主要思想应用于一些合成图像，验证了其适用性。

引用次数: 0

Investigation of error propagation and measurement error for 2D block method in Electrical Impedance Tomography 电阻抗层析二维块法误差传播及测量误差研究

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161642

Saeed Zaravi, R. Amirfattahi, B. Vahdat

2D block method (2D BM) is a new approach to solve inverse problem as one of the most challenging case in Electrical Impedance Tomography (EIT). In this method, a tissue is modeled by some blocks to construct a medical image of a body limb. Recently, a non-iterative linear inverse solution is introduced to solve the inverse problem in the 2D BM. But effect of measurement error has not been considered for non-iterative linear inverse solution yet. In this paper, an appropriate method is proposed to investigate the error propagation in 2D BM. The effect of measurement error is considered as well through different examples. Results show that the BM is very sensitive to the measurement error and fault propagation depends mainly on the type of tissue conductivity distribution. It also can be show that the error increases exponentially in each calculation step.

二维块法(2D BM)是电阻抗层析成像(EIT)中最具挑战性的问题之一，是一种求解逆问题的新方法。在该方法中，用一些块对组织进行建模，以构建身体肢体的医学图像。最近，引入了一种非迭代线性逆解来求解二维BM的逆问题。但对于非迭代线性逆解，还没有考虑测量误差的影响。本文提出了一种合适的方法来研究二维BM中的误差传播。通过不同的算例，考虑了测量误差的影响。结果表明，BM对测量误差非常敏感，故障传播主要取决于组织电导率分布的类型。结果表明，误差在每个计算步骤中呈指数增长。

引用次数: 1

Retinal vessel segmentation using system fuzzy and DBSCAN algorithm 采用系统模糊和DBSCAN算法分割视网膜血管

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161643

Negar Riazifar, Ehsan Saghapour

Retinal vessel segmentation used for the early diagnosis of retinal diseases such as hypertension, diabetes and glaucoma. There exist several methods for segmenting blood vessels from retinal images. The aim of this paper is to analyze the retinal vessel segmentation based on the clustering algorithm DBSCAN relying on a density-based notion of clusters which is designed to discover clusters of arbitrary shape. DBSCAN requires only one input parameter and a value for this parameter is suggested to the user. The performance of algorithm is compared and analyzed using a number of measures which include sensitivity and specificity. The specificity and sensitivity of this method is 5.36 and 3.82 respectively.

视网膜血管分割用于高血压、糖尿病、青光眼等视网膜疾病的早期诊断。目前存在几种从视网膜图像中分割血管的方法。本文的目的是分析基于聚类算法DBSCAN的视网膜血管分割，该算法基于基于密度的聚类概念，旨在发现任意形状的聚类。DBSCAN只需要一个输入参数，并建议用户使用该参数的值。用灵敏度和特异性等指标对算法的性能进行了比较和分析。该方法特异性为5.36，敏感性为3.82。

引用次数: 3

CGSR features: Toward RGB-D image matching using color gradient description of geometrically stable regions CGSR特征:利用几何稳定区域的颜色梯度描述对RGB-D图像进行匹配

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161627

A. Rahimi, A. Harati

Image local feature extraction and description is one of the basic problems in computer vision and robotics. However it has still many challenges. On the other hand, in recent years, after the appearance of novel sensors like Kinect camera, RGB-D images are easily available. So it is necessary to extend feature extraction and description methods to be applicable on RGB-D images. In this paper we propose a new approach to feature extraction and description for RGB-D images: Color Gradient Description of Geometrically Stable Regions. The proposed method, first finds smooth regions with uniform changes in surface normal vectors. The process in this stage is inspired from MSER algorithm. Each region then is normalized to a fixed size circle and is rotated toward its dominant orientation to make description affine, scale, and rotation invariant. Finally, color gradients log-polar histogram of normalized regions is used for description. Experimental results show that CGSR features have good performance in illumination and viewpoint changes and outperform state of the art techniques such as SURF and BRAND in matching precision and robustness.

图像局部特征提取与描述是计算机视觉和机器人技术的基本问题之一。然而，它仍然面临许多挑战。另一方面，近年来，随着Kinect相机等新型传感器的出现，RGB-D图像很容易获得。因此，有必要对特征提取和描述方法进行扩展，使其适用于RGB-D图像。本文提出了一种新的RGB-D图像特征提取和描述方法:几何稳定区域的颜色梯度描述。该方法首先寻找表面法向量变化均匀的光滑区域。这一阶段的过程受到了MSER算法的启发。然后将每个区域归一化为固定大小的圆，并向其主导方向旋转，使描述仿射，缩放和旋转不变。最后，采用归一化区域的颜色梯度对数极直方图进行描述。实验结果表明，CGSR特征在光照和视点变化方面具有良好的性能，在匹配精度和鲁棒性方面优于SURF和BRAND等技术。

{"title":"CGSR features: Toward RGB-D image matching using color gradient description of geometrically stable regions","authors":"A. Rahimi, A. Harati","doi":"10.1109/PRIA.2015.7161627","DOIUrl":"https://doi.org/10.1109/PRIA.2015.7161627","url":null,"abstract":"Image local feature extraction and description is one of the basic problems in computer vision and robotics. However it has still many challenges. On the other hand, in recent years, after the appearance of novel sensors like Kinect camera, RGB-D images are easily available. So it is necessary to extend feature extraction and description methods to be applicable on RGB-D images. In this paper we propose a new approach to feature extraction and description for RGB-D images: Color Gradient Description of Geometrically Stable Regions. The proposed method, first finds smooth regions with uniform changes in surface normal vectors. The process in this stage is inspired from MSER algorithm. Each region then is normalized to a fixed size circle and is rotated toward its dominant orientation to make description affine, scale, and rotation invariant. Finally, color gradients log-polar histogram of normalized regions is used for description. Experimental results show that CGSR features have good performance in illumination and viewpoint changes and outperform state of the art techniques such as SURF and BRAND in matching precision and robustness.","PeriodicalId":163817,"journal":{"name":"2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115137993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Facial expression recognition using high order directional derivative local binary patterns 基于高阶方向导数局部二值模式的面部表情识别

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

Pub Date : 2015-03-11 DOI: 10.1109/PRIA.2015.7161619

S. M. Tabatabaei, Abdollah Chalechale, Shekoofeh Moghimi

The most expressive manner which human can reveal his emotional states is facial expression. Automatic facial expression recognition is an emerging field of study having extensive applications among which the human-computer interaction (HCI) has received lots of attentions in recent years. The features extracted from facial images, in order to recognize facial expressions, play an essential role in effectiveness of the facial image descriptors. Local binary pattern (LBP) texture descriptors have been known as simple, yet efficient descriptors which are noticeably used for extracting facial patterns from images. Recently, a generalized form of local binary pattern has been introduced which can offer a more precise image description than simple LBP descriptors. Consequently, it would be expected that taking the advantage of using these new LBP texture descriptors will produce more promising results in comparison with use of simple local binary pattern descriptors. In this paper, a novel method has been proposed for image feature extraction using these new image texture descriptors (generalized LBP); then, the obtained results have been compared to the results produced when applying simple LBP descriptors. Furthermore, K-NN and SVM have been used as classifiers in the proposed approach. Finally, a comparison between the proposed method and the existing local binary pattern algorithms for facial expression recognition concludes the superiority of the proposed algorithm over its existing counterparts.

人类最能表达自己情绪的方式是面部表情。面部表情自动识别是一门应用广泛的新兴研究领域，其中人机交互(HCI)是近年来备受关注的研究领域。从人脸图像中提取的特征，对人脸图像描述符的有效性起着至关重要的作用。局部二值模式(Local binary pattern, LBP)纹理描述符是一种简单而高效的描述符，被广泛用于人脸图像的纹理提取。近年来，引入了一种广义的局部二值模式，它能比简单的LBP描述符提供更精确的图像描述。因此，与使用简单的局部二元模式描述符相比，利用这些新的LBP纹理描述符的优势将产生更有希望的结果。本文提出了一种利用这些新的图像纹理描述符提取图像特征的新方法(广义LBP);然后，将得到的结果与应用简单LBP描述符时产生的结果进行比较。此外，在该方法中使用K-NN和SVM作为分类器。最后，将所提方法与现有的局部二值模式人脸识别算法进行比较，得出所提算法优于现有算法的结论。

{"title":"Facial expression recognition using high order directional derivative local binary patterns","authors":"S. M. Tabatabaei, Abdollah Chalechale, Shekoofeh Moghimi","doi":"10.1109/PRIA.2015.7161619","DOIUrl":"https://doi.org/10.1109/PRIA.2015.7161619","url":null,"abstract":"The most expressive manner which human can reveal his emotional states is facial expression. Automatic facial expression recognition is an emerging field of study having extensive applications among which the human-computer interaction (HCI) has received lots of attentions in recent years. The features extracted from facial images, in order to recognize facial expressions, play an essential role in effectiveness of the facial image descriptors. Local binary pattern (LBP) texture descriptors have been known as simple, yet efficient descriptors which are noticeably used for extracting facial patterns from images. Recently, a generalized form of local binary pattern has been introduced which can offer a more precise image description than simple LBP descriptors. Consequently, it would be expected that taking the advantage of using these new LBP texture descriptors will produce more promising results in comparison with use of simple local binary pattern descriptors. In this paper, a novel method has been proposed for image feature extraction using these new image texture descriptors (generalized LBP); then, the obtained results have been compared to the results produced when applying simple LBP descriptors. Furthermore, K-NN and SVM have been used as classifiers in the proposed approach. Finally, a comparison between the proposed method and the existing local binary pattern algorithms for facial expression recognition concludes the superiority of the proposed algorithm over its existing counterparts.","PeriodicalId":163817,"journal":{"name":"2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115286126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2015 2nd International Conference on Pattern Recognition and Image Analysis (IPRIA)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀