首页 > 最新文献

2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)最新文献

英文 中文
Detecting earthquake damage levels using adaptive boosting 利用自适应增强检测地震破坏程度
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779989
Mona Peyk Herfeh, A. Shahbahrami, Farshad Parhizkar Miandehi
When an earthquake happens, the image-based techniques are influential tools for detection and classification of damaged buildings. Obtaining precise and exhaustive information about the condition and state of damaged buildings after an earthquake is basis of disaster management. Today's using satellite imageries such Quickbird is becoming more significant data for disaster management. In this paper, a method for detecting and classifying of damaged buildings using satellite imageries and digital map is proposed. In this method after extracting buildings position from digital map, they are located in the pre-event and post-event images of Bam earthquake. After generating features, genetic algorithm applied for obtaining optimal features. For classification, Adaptive boosting is used and compared with neural networks. Experimental results show that total accuracy of adaptive boosting for detecting and classifying of collapsed buildings is about 84 percent.
当地震发生时,基于图像的技术是检测和分类受损建筑物的重要工具。在地震发生后,获得有关受损建筑物状况和状态的准确而详尽的信息是灾害管理的基础。今天使用卫星图像,如Quickbird,正在成为灾害管理的更重要的数据。本文提出了一种利用卫星图像和数字地图对受损建筑进行检测和分类的方法。该方法从数字地图中提取建筑物位置后,将建筑物定位在巴姆地震的震前和震后图像中。在生成特征后,应用遗传算法获得最优特征。在分类方面,采用自适应增强方法,并与神经网络进行了比较。实验结果表明,自适应增强对建筑物倒塌检测和分类的总准确率约为84%。
{"title":"Detecting earthquake damage levels using adaptive boosting","authors":"Mona Peyk Herfeh, A. Shahbahrami, Farshad Parhizkar Miandehi","doi":"10.1109/IRANIANMVIP.2013.6779989","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779989","url":null,"abstract":"When an earthquake happens, the image-based techniques are influential tools for detection and classification of damaged buildings. Obtaining precise and exhaustive information about the condition and state of damaged buildings after an earthquake is basis of disaster management. Today's using satellite imageries such Quickbird is becoming more significant data for disaster management. In this paper, a method for detecting and classifying of damaged buildings using satellite imageries and digital map is proposed. In this method after extracting buildings position from digital map, they are located in the pre-event and post-event images of Bam earthquake. After generating features, genetic algorithm applied for obtaining optimal features. For classification, Adaptive boosting is used and compared with neural networks. Experimental results show that total accuracy of adaptive boosting for detecting and classifying of collapsed buildings is about 84 percent.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131158679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A new approach to apply texture features in minerals identification in petrographic thin sections using ANNs 利用人工神经网络将纹理特征应用于岩相薄片矿物识别的新方法
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779990
H. Izadi, J. Sadri, Nosrat-Agha Mehran
Identification of minerals in petrographic thin sections using intelligent methods is very complex and challenging task which, mineralogists and computer scientists are faced with it. Textural features have very important role to identify minerals, and undoubtedly without using these features, recognition minerals in thin sections yield to many miss classification results. Thin sections have been studied applying plane-polarized and cross-polarized lights. In this paper, in order to extract textural features of minerals in thin section, co-occurrence matrix is used, and six features as Entropy, Homogeneity, Energy, Correlation and Maximum Probability are extracted from each image. Then, ANNs are used for identifying in complex situation and experimental results have shown that using textural features in mineral identification, significant improve classification result in petrographic thin sections.
利用智能方法识别岩石薄片中的矿物是矿物学家和计算机科学家所面临的一项非常复杂和具有挑战性的任务。纹理特征在矿物识别中具有非常重要的作用,如果不使用这些特征,在薄片中识别矿物无疑会导致许多分类结果的缺失。用平面偏振光和交叉偏振光对薄片进行了研究。为了提取薄片矿物的纹理特征,本文采用共生矩阵,从每张图像中提取熵、均匀性、能量、相关性和最大概率6个特征。然后,将人工神经网络用于复杂情况下的识别,实验结果表明,利用纹理特征进行矿物识别,可以显著改善岩相薄片的分类效果。
{"title":"A new approach to apply texture features in minerals identification in petrographic thin sections using ANNs","authors":"H. Izadi, J. Sadri, Nosrat-Agha Mehran","doi":"10.1109/IRANIANMVIP.2013.6779990","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779990","url":null,"abstract":"Identification of minerals in petrographic thin sections using intelligent methods is very complex and challenging task which, mineralogists and computer scientists are faced with it. Textural features have very important role to identify minerals, and undoubtedly without using these features, recognition minerals in thin sections yield to many miss classification results. Thin sections have been studied applying plane-polarized and cross-polarized lights. In this paper, in order to extract textural features of minerals in thin section, co-occurrence matrix is used, and six features as Entropy, Homogeneity, Energy, Correlation and Maximum Probability are extracted from each image. Then, ANNs are used for identifying in complex situation and experimental results have shown that using textural features in mineral identification, significant improve classification result in petrographic thin sections.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121485961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Offline handwritten Farsi cursive text recognition using hidden Markov models 离线手写波斯语草书文本识别使用隐马尔可夫模型
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779953
Z. Imani, A. Ahmadyfard, A. Zohrevand, Mohamad Alipour
In this paper we address the problem of recognizing Farsi handwritten words. We extract two types of features from vertical stripes on word images: chain-code of word boundary and distribution of foreground density across the image word. The extracted feature vectors are coded using self organizing vector quantization. The result codes are used for training the model of each word in the database. Each word is modeled using discrete hidden Markov models (HMM). In order to evaluate the performance of the proposed system we conducted an experiment using new prepared database FARSA. We tested the proposed method using 198 word classes in this database. The result of experiment in compare with the existing methods is very promising.
本文主要研究波斯语手写文字的识别问题。我们从单词图像上的垂直条纹中提取两种特征:单词边界链码和前景密度在图像单词上的分布。提取的特征向量采用自组织矢量量化编码。结果代码用于训练数据库中每个单词的模型。每个单词使用离散隐马尔可夫模型(HMM)建模。为了评估所提出的系统的性能,我们使用新准备的数据库FARSA进行了实验。我们使用该数据库中的198个词类对提出的方法进行了测试。实验结果与现有方法进行了比较。
{"title":"Offline handwritten Farsi cursive text recognition using hidden Markov models","authors":"Z. Imani, A. Ahmadyfard, A. Zohrevand, Mohamad Alipour","doi":"10.1109/IRANIANMVIP.2013.6779953","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779953","url":null,"abstract":"In this paper we address the problem of recognizing Farsi handwritten words. We extract two types of features from vertical stripes on word images: chain-code of word boundary and distribution of foreground density across the image word. The extracted feature vectors are coded using self organizing vector quantization. The result codes are used for training the model of each word in the database. Each word is modeled using discrete hidden Markov models (HMM). In order to evaluate the performance of the proposed system we conducted an experiment using new prepared database FARSA. We tested the proposed method using 198 word classes in this database. The result of experiment in compare with the existing methods is very promising.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123550749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Image encryption using genetic algorithm 使用遗传算法进行图像加密
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780026
Roza Afarin, S. Mozaffari
This paper presents a new method for image encryption using Genetic algorithm (GA). First, rows and columns of the input image are dislocated randomly. Then, the obtained image is divided into four equal sized sub-images. After selecting one of these sub-images accidentally, two pixels are chosen from it as GA initial population. Cross-over and mutation operations are applied on the binary values of the selected pixels. Then the image is reconstructed in the reverse manner. If entropy of the result image increases, the current sub-image is utilized for the next step. Otherwise, another sub-images is chosen randomly and the same process is applied. Randomness of the encrypted image is measured by entropy, correlation coefficients and histogram analysis. Experimental results show that the proposed method can be used effectively for image encryption.
提出了一种基于遗传算法的图像加密新方法。首先,输入图像的行和列随机错位。然后,将得到的图像分成四个大小相等的子图像。在随机选取其中一个子图像后,从中选取两个像素作为遗传算法的初始种群。交叉和突变操作应用于所选像素的二进制值。然后以相反的方式重构图像。如果结果图像的熵增加,则利用当前子图像进行下一步。否则,随机选择另一个子图像,并应用相同的过程。通过熵、相关系数和直方图分析来衡量加密图像的随机性。实验结果表明,该方法可以有效地用于图像加密。
{"title":"Image encryption using genetic algorithm","authors":"Roza Afarin, S. Mozaffari","doi":"10.1109/IRANIANMVIP.2013.6780026","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780026","url":null,"abstract":"This paper presents a new method for image encryption using Genetic algorithm (GA). First, rows and columns of the input image are dislocated randomly. Then, the obtained image is divided into four equal sized sub-images. After selecting one of these sub-images accidentally, two pixels are chosen from it as GA initial population. Cross-over and mutation operations are applied on the binary values of the selected pixels. Then the image is reconstructed in the reverse manner. If entropy of the result image increases, the current sub-image is utilized for the next step. Otherwise, another sub-images is chosen randomly and the same process is applied. Randomness of the encrypted image is measured by entropy, correlation coefficients and histogram analysis. Experimental results show that the proposed method can be used effectively for image encryption.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126070285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
An automated vessel segmentation algorithm in retinal images using 2D Gabor wavelet 基于二维Gabor小波的视网膜血管自动分割算法
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779967
Pouya Nazari, H. Pourghassem
This paper proposes a novel method to extract blood vessels in retinal images. We also present a new effective preprocessing to reduce the effect of non-uniformly illumination using red and green channels of these images. The vessels finally have been extracted using 2D Gabor filter bank followed by thresholding on grayscale and thresholding based on structural properties of labeled vessel candidates, to extract large and thin vessels. The proposed algorithm is evaluated on DRIVE database, which is publically available. The results show that presented algorithm achieved accuracy rate of 94.81% along with True Positive Fraction (TPF) of 71.12% and False Positive Fraction (FPF) of 2.84%.
提出了一种提取视网膜图像中血管的新方法。我们还提出了一种新的有效的预处理方法,利用这些图像的红绿通道来减少非均匀光照的影响。最后利用二维Gabor滤波器组对血管进行提取,然后对标记的候选血管进行灰度阈值和基于结构属性的阈值提取,分别提取大血管和细血管。在公开的DRIVE数据库上对该算法进行了评估。结果表明,该算法准确率为94.81%,真阳性分数(TPF)为71.12%,假阳性分数(FPF)为2.84%。
{"title":"An automated vessel segmentation algorithm in retinal images using 2D Gabor wavelet","authors":"Pouya Nazari, H. Pourghassem","doi":"10.1109/IRANIANMVIP.2013.6779967","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779967","url":null,"abstract":"This paper proposes a novel method to extract blood vessels in retinal images. We also present a new effective preprocessing to reduce the effect of non-uniformly illumination using red and green channels of these images. The vessels finally have been extracted using 2D Gabor filter bank followed by thresholding on grayscale and thresholding based on structural properties of labeled vessel candidates, to extract large and thin vessels. The proposed algorithm is evaluated on DRIVE database, which is publically available. The results show that presented algorithm achieved accuracy rate of 94.81% along with True Positive Fraction (TPF) of 71.12% and False Positive Fraction (FPF) of 2.84%.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129243509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Accelerating GPU implementation of contourlet transform 加速GPU实现contourlet变换
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780005
Majid Mohrekesh, Shekoofeh Azizi, S. Samavi
The widespread usage of the contourlet-transform (CT) and today's real-time needs demand faster execution of CT. Solutions are available, but due to lack of portability or computational intensity, they are disadvantageous in real-time applications. In this paper we take advantage of modern GPUs for the acceleration purpose. GPU is well-suited to address data-parallel computation applications such as CT. The convolution part of CT, which is the most computational intensive step, is reshaped for parallel processing. Then the whole transform is transported into GPU to avoid multiple time consuming migrations between the host and device. Experimental results show that with existing GPUs, CT execution achieves more than 19x speedup as compared to its non-parallel CPU-based method. It takes approximately 40ms to compute the transform of a 512×512 image, which should be sufficient for real-time applications.
轮廓变换(contourlet-transform, CT)的广泛应用和当今的实时性需求要求CT的执行速度更快。解决方案是可用的,但由于缺乏可移植性或计算强度,它们在实时应用中是不利的。在本文中,我们利用现代gpu的加速目的。GPU非常适合处理数据并行计算应用,如CT。CT的卷积部分是计算量最大的步骤,它被重构为并行处理。然后将整个变换传输到GPU中,避免了主机和设备之间多次耗时的迁移。实验结果表明,在现有的gpu上,CT的执行速度比基于非并行cpu的方法提高了19倍以上。计算512×512图像的变换大约需要40毫秒,这对于实时应用程序来说应该足够了。
{"title":"Accelerating GPU implementation of contourlet transform","authors":"Majid Mohrekesh, Shekoofeh Azizi, S. Samavi","doi":"10.1109/IRANIANMVIP.2013.6780005","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780005","url":null,"abstract":"The widespread usage of the contourlet-transform (CT) and today's real-time needs demand faster execution of CT. Solutions are available, but due to lack of portability or computational intensity, they are disadvantageous in real-time applications. In this paper we take advantage of modern GPUs for the acceleration purpose. GPU is well-suited to address data-parallel computation applications such as CT. The convolution part of CT, which is the most computational intensive step, is reshaped for parallel processing. Then the whole transform is transported into GPU to avoid multiple time consuming migrations between the host and device. Experimental results show that with existing GPUs, CT execution achieves more than 19x speedup as compared to its non-parallel CPU-based method. It takes approximately 40ms to compute the transform of a 512×512 image, which should be sufficient for real-time applications.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"18 789 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129351078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spectral library pruning based on classification techniques 基于分类技术的谱库剪枝
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779966
H. Fayyazi, H. Dehghani, M. Hosseini
Spectral unmixing is an active research area in remote sensing. The direct use of the spectral libraries in spectral unmixing is increased by increasing the availability of the libraries. In this way, the spectral unmixing problem is converted into a sparse regression problem that is time-consuming. This is due to the existence of irrelevant spectra in the library. So these spectra should be removed in some way. In this paper, a machine learning approach for spectral library pruning is introduced. At first, the spectral library is clustered based on a simple and efficient new feature space. Then the training data needed to learn a classifier are extracted by adding different noise levels to the clustered spectra. The label of the training data is determined based on the results of spectral library clustering. After learning the classifier, each pixel of the image is classified using it. For pruning the library, the spectra with the labels that none of the image pixels belong to, are removed. We use three classifiers, decision tree, neural networks and k-nearest neighbor to determine the effect of applying different classifiers. The results compared here show that the proposed method works well in noisy images.
光谱分解是遥感领域的一个活跃研究领域。通过增加谱库的可用性,增加了谱库在光谱解混中的直接使用。这样,将光谱解混问题转化为一个耗时的稀疏回归问题。这是由于库中存在不相关的光谱。所以这些光谱应该以某种方式去除。本文介绍了一种用于谱库剪枝的机器学习方法。首先,基于简单高效的新特征空间对光谱库进行聚类。然后通过在聚类光谱中加入不同的噪声水平来提取学习分类器所需的训练数据。根据谱库聚类结果确定训练数据的标签。学习分类器后,使用它对图像的每个像素进行分类。为了对库进行修剪,将带有不属于图像像素的标签的光谱去除。我们使用决策树、神经网络和k近邻三种分类器来确定应用不同分类器的效果。实验结果表明,该方法能较好地处理噪声图像。
{"title":"Spectral library pruning based on classification techniques","authors":"H. Fayyazi, H. Dehghani, M. Hosseini","doi":"10.1109/IRANIANMVIP.2013.6779966","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779966","url":null,"abstract":"Spectral unmixing is an active research area in remote sensing. The direct use of the spectral libraries in spectral unmixing is increased by increasing the availability of the libraries. In this way, the spectral unmixing problem is converted into a sparse regression problem that is time-consuming. This is due to the existence of irrelevant spectra in the library. So these spectra should be removed in some way. In this paper, a machine learning approach for spectral library pruning is introduced. At first, the spectral library is clustered based on a simple and efficient new feature space. Then the training data needed to learn a classifier are extracted by adding different noise levels to the clustered spectra. The label of the training data is determined based on the results of spectral library clustering. After learning the classifier, each pixel of the image is classified using it. For pruning the library, the spectra with the labels that none of the image pixels belong to, are removed. We use three classifiers, decision tree, neural networks and k-nearest neighbor to determine the effect of applying different classifiers. The results compared here show that the proposed method works well in noisy images.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131384404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Real-time dynamic hand gesture recognition using hidden Markov models 基于隐马尔可夫模型的实时动态手势识别
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779977
M. M. Gharasuie, Hadi Seyedarabi
The goal of interaction between human and computer is to find a way to treat it like human-human interaction. Gestures play an important role in human's daily life in order to transfer data and human emotions. The gestures are results of part of body movement in which hand movement is the most widely used one that is known as dynamic hand gesture. So it is very important to follow and recognize hand motion to provide multi-purpose use. In this paper, we propose a system that recognizes hand gestures from continuous hand motion for English numbers from 0 to 9 in real-time, based on Hidden Markov Models (HMMs). There are two kinds of gestures, key gestures and link gestures. The link gestures are used to separate the key gestures from other hand motion trajectories (gesture path) that are called spotting. This type of spotting is a heuristic-based method that identifies start and end points of the key gestures. Then gesture path between these two points are given to HMMs for classification. Experimental results show that the proposed system can successfully recognize the key gestures with recognition rate of 93.84%and work in complex situations very well.
人与计算机交互的目标是找到一种方法来对待它,就像人与人之间的交互一样。手势在人类的日常生活中扮演着重要的角色,以传递数据和人类的情感。手势是身体部分运动的结果,其中手的运动是最广泛使用的一种,被称为动态手势。因此,跟踪和识别手部动作以提供多用途是非常重要的。在本文中,我们提出了一个基于隐马尔可夫模型(hmm)的系统,该系统可以实时识别从0到9的英文数字的连续手势。有两种手势,键手势和链接手势。链接手势用于将关键手势与其他称为定位的手部运动轨迹(手势路径)分开。这种类型的定位是一种基于启发式的方法,用于识别关键手势的起点和终点。然后将这两点之间的手势路径交给hmm进行分类。实验结果表明,该系统能够成功识别关键手势,识别率为93.84%,在复杂的情况下也能很好地工作。
{"title":"Real-time dynamic hand gesture recognition using hidden Markov models","authors":"M. M. Gharasuie, Hadi Seyedarabi","doi":"10.1109/IRANIANMVIP.2013.6779977","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779977","url":null,"abstract":"The goal of interaction between human and computer is to find a way to treat it like human-human interaction. Gestures play an important role in human's daily life in order to transfer data and human emotions. The gestures are results of part of body movement in which hand movement is the most widely used one that is known as dynamic hand gesture. So it is very important to follow and recognize hand motion to provide multi-purpose use. In this paper, we propose a system that recognizes hand gestures from continuous hand motion for English numbers from 0 to 9 in real-time, based on Hidden Markov Models (HMMs). There are two kinds of gestures, key gestures and link gestures. The link gestures are used to separate the key gestures from other hand motion trajectories (gesture path) that are called spotting. This type of spotting is a heuristic-based method that identifies start and end points of the key gestures. Then gesture path between these two points are given to HMMs for classification. Experimental results show that the proposed system can successfully recognize the key gestures with recognition rate of 93.84%and work in complex situations very well.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"142 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116350177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Improving the performance of skin segmentation in quasi-skin regions via multiple classifier system 利用多分类器系统改进准皮肤区域的皮肤分割性能
Pub Date : 2013-09-01 DOI: 10.1109/IranianMVIP.2013.6780006
Mohamad Fatahi, Mohsen Nadjafi, S.V. Al-Din Makki
This paper presents a skin segmentation method based on multiple classifier system strategy in order to improve the performance of classification especially in quasi-skin regions. Quasi-skin regions in digital images are non-skin patches which have characteristics like the human skin and are known as a basic origin of misclassification error in skin segmentation. To cope with this problem, we have designed an algorithmic architecture by combining four prominent classifiers to construct a synergy to conceal their weaknesses and amplify their strengths. Participant classifiers in our approach include cellular learning automaton, likelihood, Gaussian and Support Vector Machines in which decision making performs via a conditional voting step. The accuracy and specificity were employed to evaluate the performance. Experiments on a collected test-set database including 142 challenging images demonstrate that the proposed skin detector is able to improve the accuracy and specificity up to 1.92% and 0.83%, respectively, than the best of individual classifier.
为了提高准皮肤区域的分类性能,提出了一种基于多分类器系统策略的皮肤分割方法。数字图像中的准皮肤区域是具有人体皮肤特征的非皮肤斑块,是皮肤分割中误分类误差的基本来源。为了解决这个问题,我们设计了一个算法架构,将四个突出的分类器组合在一起,构建一个协同效应,以掩盖它们的弱点,放大它们的优势。我们方法中的参与者分类器包括细胞学习自动机、似然、高斯和支持向量机,其中决策通过条件投票步骤执行。采用准确性和特异性来评价其性能。在包含142张挑战性图像的测试集数据库上进行的实验表明,与最佳的单个分类器相比,该皮肤检测器的准确率和特异性分别提高了1.92%和0.83%。
{"title":"Improving the performance of skin segmentation in quasi-skin regions via multiple classifier system","authors":"Mohamad Fatahi, Mohsen Nadjafi, S.V. Al-Din Makki","doi":"10.1109/IranianMVIP.2013.6780006","DOIUrl":"https://doi.org/10.1109/IranianMVIP.2013.6780006","url":null,"abstract":"This paper presents a skin segmentation method based on multiple classifier system strategy in order to improve the performance of classification especially in quasi-skin regions. Quasi-skin regions in digital images are non-skin patches which have characteristics like the human skin and are known as a basic origin of misclassification error in skin segmentation. To cope with this problem, we have designed an algorithmic architecture by combining four prominent classifiers to construct a synergy to conceal their weaknesses and amplify their strengths. Participant classifiers in our approach include cellular learning automaton, likelihood, Gaussian and Support Vector Machines in which decision making performs via a conditional voting step. The accuracy and specificity were employed to evaluate the performance. Experiments on a collected test-set database including 142 challenging images demonstrate that the proposed skin detector is able to improve the accuracy and specificity up to 1.92% and 0.83%, respectively, than the best of individual classifier.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125437033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Automatic brain hemorrhage segmentation and classification in CT scan images 脑出血CT扫描图像的自动分割与分类
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780031
Bahareh Shahangian, H. Pourghassem
Brain hemorrhage detection and classification is a major help to physicians to rescue patients in an early stage. In this paper, we have tried to introduce an automatic detection and classification method to improve and accelerate the process of physicians' decision-making. To achieve this purpose, at first we have used a simple and effective segmentation method to detect and separate the hemorrhage regions from other parts of the brain, and then we have extracted a number of features from each detected hemorrhage region. We selected some of convenient features by using a Genetic Algorithm (GA)-based feature selection algorithm. Eventually, we have classified the different types of hemorrhages. Our algorithm is evaluated on a perfect set of CT-scan images and the segmentation accuracy for three major types of hemorrhages (EDH, ICH and SDH) obtained 96.22%, 95.14% and 90.04%, respectively. In the classification step, multilayer neural network could be more successful than the KNN classifier because of its higher accuracy (93.3%). Finally, we achieved the accuracy rate of more than 90% for the detection and classification of brain hemorrhages.
脑出血的检测和分类是医生早期抢救患者的重要帮助。在本文中,我们试图引入一种自动检测和分类方法,以改善和加快医生的决策过程。为了达到这一目的,我们首先使用一种简单有效的分割方法来检测和分离出血区域与大脑的其他部分,然后我们从每个检测到的出血区域中提取一些特征。采用基于遗传算法(GA)的特征选择算法选择方便的特征。最后,我们区分了不同类型的出血。我们的算法在一组完美的ct扫描图像上进行了评估,对EDH、ICH和SDH三种主要出血类型的分割准确率分别为96.22%、95.14%和90.04%。在分类步骤中,多层神经网络比KNN分类器具有更高的准确率(93.3%)。最终,我们对脑出血的检测和分类准确率达到90%以上。
{"title":"Automatic brain hemorrhage segmentation and classification in CT scan images","authors":"Bahareh Shahangian, H. Pourghassem","doi":"10.1109/IRANIANMVIP.2013.6780031","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780031","url":null,"abstract":"Brain hemorrhage detection and classification is a major help to physicians to rescue patients in an early stage. In this paper, we have tried to introduce an automatic detection and classification method to improve and accelerate the process of physicians' decision-making. To achieve this purpose, at first we have used a simple and effective segmentation method to detect and separate the hemorrhage regions from other parts of the brain, and then we have extracted a number of features from each detected hemorrhage region. We selected some of convenient features by using a Genetic Algorithm (GA)-based feature selection algorithm. Eventually, we have classified the different types of hemorrhages. Our algorithm is evaluated on a perfect set of CT-scan images and the segmentation accuracy for three major types of hemorrhages (EDH, ICH and SDH) obtained 96.22%, 95.14% and 90.04%, respectively. In the classification step, multilayer neural network could be more successful than the KNN classifier because of its higher accuracy (93.3%). Finally, we achieved the accuracy rate of more than 90% for the detection and classification of brain hemorrhages.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116894446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
期刊
2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1