首页 > 最新文献

2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)最新文献

英文 中文
Comparison of classification and dimensionality reduction methods used in fMRI decoding fMRI解码中分类与降维方法的比较
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779973
N. Alamdari, E. Fatemizadeh
In the last few years there has been growing interest in the use of functional Magnetic Resonance Imaging (fMRI) for brain mapping. To decode brain patterns in fMRI data, we need reliable and accurate classifiers. Towards this goal, we compared performance of eleven popular pattern recognition methods. Before performing pattern recognition, applying the dimensionality reduction methods can improve the classification performance; therefore, seven methods in region of interest (RDI) have been compared to answer the following question: which dimensionality reduction procedure performs best? In both tasks, in addition to measuring prediction accuracy, we estimated standard deviation of accuracies to realize more reliable methods. According to all results, we suggest using support vector machines with linear kernel (C-SVM and v-SVM), or random forest classifier on low dimensional subsets, which is prepared by Active or maxDis feature selection method to classify brain activity patterns more efficiently.
在过去的几年里,人们对使用功能性磁共振成像(fMRI)来绘制大脑图谱越来越感兴趣。为了解码fMRI数据中的脑模式,我们需要可靠和准确的分类器。为了实现这个目标,我们比较了11种流行的模式识别方法的性能。在进行模式识别之前,应用降维方法可以提高分类性能;因此,对感兴趣区域(RDI)的七种方法进行了比较,以回答以下问题:哪种降维方法性能最好?在这两项任务中,除了测量预测精度外,我们还估计了精度的标准差,以实现更可靠的方法。根据所有结果,我们建议使用线性核支持向量机(C-SVM和v-SVM),或在Active或maxDis特征选择方法制备的低维子集上使用随机森林分类器对大脑活动模式进行更有效的分类。
{"title":"Comparison of classification and dimensionality reduction methods used in fMRI decoding","authors":"N. Alamdari, E. Fatemizadeh","doi":"10.1109/IRANIANMVIP.2013.6779973","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779973","url":null,"abstract":"In the last few years there has been growing interest in the use of functional Magnetic Resonance Imaging (fMRI) for brain mapping. To decode brain patterns in fMRI data, we need reliable and accurate classifiers. Towards this goal, we compared performance of eleven popular pattern recognition methods. Before performing pattern recognition, applying the dimensionality reduction methods can improve the classification performance; therefore, seven methods in region of interest (RDI) have been compared to answer the following question: which dimensionality reduction procedure performs best? In both tasks, in addition to measuring prediction accuracy, we estimated standard deviation of accuracies to realize more reliable methods. According to all results, we suggest using support vector machines with linear kernel (C-SVM and v-SVM), or random forest classifier on low dimensional subsets, which is prepared by Active or maxDis feature selection method to classify brain activity patterns more efficiently.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133269325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
SM3D studio: A 3D model constructor SM3D工作室:一个3D模型构造器
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779941
V. Soleimani, F. H. Vincheh, Ehsan Zare
In this paper we describe designing and implementation of a powerful, fast and compact simple 3D modeler (SM3D). In addition to saving cost and time (due to high processing speed), 3D objects can be created with minimum system resources by using this application. Easy learning and using are other strengths of this application. Modularity using classification and applying Dynamic-Link Library files are noted aspects that are regarded in writing the source code and this causes separation of main part and user interface, so the application can be easily expanded in the future. Ability to create primary objects and also applying advanced transformations and modifiers have been considered. Moreover, ability to select points of an object and move them is another prominent feature. Working with the camera, its settings and creating desired viewpoints are other professional features. Also, saving and loading object's information from a file and export objects to other popular types of files are included in this application.
本文介绍了一个功能强大、速度快、结构紧凑的简易三维建模器(SM3D)的设计与实现。除了节省成本和时间(由于高处理速度)之外,使用此应用程序还可以用最少的系统资源创建3D对象。易于学习和使用是这个应用程序的另一个优点。使用分类的模块化和应用动态链接库文件是编写源代码时需要注意的两个方面,这使得主要部分和用户界面分离,便于将来的应用扩展。能够创建主要对象,并应用高级转换和修改器已被考虑。此外,能够选择一个对象的点并移动它们是另一个突出的特征。与相机一起工作,它的设置和创建所需的视点是其他专业功能。此外,该应用程序还包括从文件中保存和加载对象的信息以及将对象导出到其他流行类型的文件。
{"title":"SM3D studio: A 3D model constructor","authors":"V. Soleimani, F. H. Vincheh, Ehsan Zare","doi":"10.1109/IRANIANMVIP.2013.6779941","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779941","url":null,"abstract":"In this paper we describe designing and implementation of a powerful, fast and compact simple 3D modeler (SM3D). In addition to saving cost and time (due to high processing speed), 3D objects can be created with minimum system resources by using this application. Easy learning and using are other strengths of this application. Modularity using classification and applying Dynamic-Link Library files are noted aspects that are regarded in writing the source code and this causes separation of main part and user interface, so the application can be easily expanded in the future. Ability to create primary objects and also applying advanced transformations and modifiers have been considered. Moreover, ability to select points of an object and move them is another prominent feature. Working with the camera, its settings and creating desired viewpoints are other professional features. Also, saving and loading object's information from a file and export objects to other popular types of files are included in this application.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130184930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Brain MRI segmentation using the mixture of FCM and RBF neural network 基于FCM和RBF神经网络的脑MRI分割
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780023
M. Rostami, R. Ghaderi, M. Ezoji, J. Ghasemi
One of the most commonly used methods for Magnetic Resonance Imaging (MRI) segmentation is Fuzzy C-Means (FCM). This method in comparison with other methods preserves more information of the images. Because of using the intensity of pixels as a key feature for clustering, Standard FCM is sensitive to noise. In this study in addition to intensity, mean of neighbourhood of pixels and largest singular value of neighbourhood of pixels are used as features. Also a method for segmenting MRI images is presented which uses both FCM and Radial Basis Function (RBF) neural network and partly decreases the limitation of standard FCM.
磁共振成像(MRI)分割中最常用的方法之一是模糊c均值(FCM)。与其他方法相比,该方法保留了更多的图像信息。由于使用像素的强度作为聚类的关键特征,标准FCM对噪声很敏感。在本研究中,除强度外,还使用像素的邻域均值和像素的邻域最大奇异值作为特征。提出了一种结合FCM和RBF神经网络的MRI图像分割方法,在一定程度上降低了标准FCM的局限性。
{"title":"Brain MRI segmentation using the mixture of FCM and RBF neural network","authors":"M. Rostami, R. Ghaderi, M. Ezoji, J. Ghasemi","doi":"10.1109/IRANIANMVIP.2013.6780023","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780023","url":null,"abstract":"One of the most commonly used methods for Magnetic Resonance Imaging (MRI) segmentation is Fuzzy C-Means (FCM). This method in comparison with other methods preserves more information of the images. Because of using the intensity of pixels as a key feature for clustering, Standard FCM is sensitive to noise. In this study in addition to intensity, mean of neighbourhood of pixels and largest singular value of neighbourhood of pixels are used as features. Also a method for segmenting MRI images is presented which uses both FCM and Radial Basis Function (RBF) neural network and partly decreases the limitation of standard FCM.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115079502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Scene matching NCC value improvement based on contrast matching 基于对比度匹配的场景匹配NCC值改进
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779998
A. Pourmohammad, S. Poursajadi, S. Karimifar
Geometrical and radiometrical corrections are important for scene matching applications. We suppose the applications that there are no geometrical errors based on using 3D-Inertial sensors for geometrical corrections. In these cases, Normalized Cross-Correlation (NCC) is commonly used method for scene matching. The problem of matching a pattern image (mask) to an image in these cases needs to correction of radiometrical errors as illumination (contrast) variations. In this paper we show that correlation between a mask and a histogram matched image instead of using that raw version, improves the correlation value. First we match histogram function of the image to histogram function of the mask in order to have two closed contrast images, and then correlate those together using NCC and root mean square error (RMSE) methods. Simulation results confirm that according to using NCC and RMSE simultaneously, not only this method is a fast and real time method, but also according to matching histogram function of the received image to histogram function of the mask, it improves the correlation value. Also we show that using the edge detected version of the mask and histogram matched image, lead us to have the best results.
几何校正和辐射校正对于场景匹配应用非常重要。我们假设在没有几何误差的情况下,使用三维惯性传感器进行几何校正。在这种情况下,归一化互相关(NCC)是常用的场景匹配方法。在这些情况下,将图案图像(掩模)与图像匹配的问题需要校正照明(对比度)变化带来的辐射误差。在本文中,我们证明了掩码与直方图匹配的图像之间的相关性,而不是使用原始版本,提高了相关值。首先,我们将图像的直方图函数与掩模的直方图函数匹配,以获得两个封闭的对比度图像,然后使用NCC和均方根误差(RMSE)方法将它们关联在一起。仿真结果表明,该方法同时使用NCC和RMSE,不仅是一种快速、实时的方法,而且根据接收图像的直方图函数与掩模的直方图函数的匹配,提高了相关值。同时我们也证明了使用边缘检测版的蒙版和直方图进行图像匹配,使我们得到了最好的效果。
{"title":"Scene matching NCC value improvement based on contrast matching","authors":"A. Pourmohammad, S. Poursajadi, S. Karimifar","doi":"10.1109/IRANIANMVIP.2013.6779998","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779998","url":null,"abstract":"Geometrical and radiometrical corrections are important for scene matching applications. We suppose the applications that there are no geometrical errors based on using 3D-Inertial sensors for geometrical corrections. In these cases, Normalized Cross-Correlation (NCC) is commonly used method for scene matching. The problem of matching a pattern image (mask) to an image in these cases needs to correction of radiometrical errors as illumination (contrast) variations. In this paper we show that correlation between a mask and a histogram matched image instead of using that raw version, improves the correlation value. First we match histogram function of the image to histogram function of the mask in order to have two closed contrast images, and then correlate those together using NCC and root mean square error (RMSE) methods. Simulation results confirm that according to using NCC and RMSE simultaneously, not only this method is a fast and real time method, but also according to matching histogram function of the received image to histogram function of the mask, it improves the correlation value. Also we show that using the edge detected version of the mask and histogram matched image, lead us to have the best results.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132219500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Extracting local reliable text regions to segment complex handwritten textlines 提取局部可靠的文本区域,分割复杂的手写文本行
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779952
M. Ziaratban, F. Bagheri
Textline segmentation is an important preprocess before trying to recognize words. Handwritten texts include complex lines such as connected/overlapped, multi skewed, and curved textlines. In the proposed approach, to overcome these problems, local reliable text regions are locally extracted for each block of a handwritten text. Text image is first filtered by a set of directional 2D filters and filtered images are divided to a number of overlapping blocks. The filtered block with the highest contrast is selected to be used for text region detection. Experiments show that our proposed method accurately segments complex handwritten textlines.
文本线分割是识别单词前的重要预处理。手写文本包括复杂的线条,如连接/重叠,多重倾斜和弯曲的文本线。在该方法中,为了克服这些问题,对手写文本的每个块局部提取局部可靠文本区域。文本图像首先通过一组定向二维滤波器进行过滤,过滤后的图像被划分为多个重叠的块。选择对比度最高的过滤块用于文本区域检测。实验表明,该方法能准确分割复杂手写文本行。
{"title":"Extracting local reliable text regions to segment complex handwritten textlines","authors":"M. Ziaratban, F. Bagheri","doi":"10.1109/IRANIANMVIP.2013.6779952","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779952","url":null,"abstract":"Textline segmentation is an important preprocess before trying to recognize words. Handwritten texts include complex lines such as connected/overlapped, multi skewed, and curved textlines. In the proposed approach, to overcome these problems, local reliable text regions are locally extracted for each block of a handwritten text. Text image is first filtered by a set of directional 2D filters and filtered images are divided to a number of overlapping blocks. The filtered block with the highest contrast is selected to be used for text region detection. Experiments show that our proposed method accurately segments complex handwritten textlines.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134537540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Applying specific region frequency and texture features on content-based image retrieval 特定区域频率和纹理特征在基于内容的图像检索中的应用
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779997
Amin Abdullahzadeh, F. Mohanna
In this paper, a specific region called affine noisy invariant region is extracted from a query and database images to help accurate retrieval on different attacks. Then, only a 64×1 codebook based feature vector is obtained from this specific region applying vector quantization and codebook generation based on the Linde-Buzo-Gray algorithm, which reduces retrieval feature comparison calculations. Also a number of texture and frequency domain based features are computed and established for the region. Finally combination of these two groups of feature vectors improves the retrieval system efficiency. Besides, in order to optimize weighting combination coefficients of the feature vectors, the particle swarm optimization algorithm is applied. The experimental results show a real-time content-based image retrieval system with higher accuracy and acceptable retrieval time.
本文从查询和数据库图像中提取一个特定的区域,称为仿射噪声不变量区域,以帮助在不同攻击下准确检索。然后,通过矢量量化和基于Linde-Buzo-Gray算法的码本生成,在该特定区域只得到一个基于64×1码本的特征向量,减少了检索特征比较的计算。此外,还计算并建立了一些基于纹理和频域的区域特征。最后,两组特征向量的结合提高了检索系统的效率。此外,为了优化特征向量的加权组合系数,采用了粒子群优化算法。实验结果表明,基于内容的实时图像检索系统具有较高的检索精度和可接受的检索时间。
{"title":"Applying specific region frequency and texture features on content-based image retrieval","authors":"Amin Abdullahzadeh, F. Mohanna","doi":"10.1109/IRANIANMVIP.2013.6779997","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779997","url":null,"abstract":"In this paper, a specific region called affine noisy invariant region is extracted from a query and database images to help accurate retrieval on different attacks. Then, only a 64×1 codebook based feature vector is obtained from this specific region applying vector quantization and codebook generation based on the Linde-Buzo-Gray algorithm, which reduces retrieval feature comparison calculations. Also a number of texture and frequency domain based features are computed and established for the region. Finally combination of these two groups of feature vectors improves the retrieval system efficiency. Besides, in order to optimize weighting combination coefficients of the feature vectors, the particle swarm optimization algorithm is applied. The experimental results show a real-time content-based image retrieval system with higher accuracy and acceptable retrieval time.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133569767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
High angular resolution diffusion image registration 高角分辨率扩散图像配准
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779985
M. Afzali, E. Fatemizadeh, H. Soltanian-Zadeh
Diffusion Tensor Imaging (DTI) is a common method for the investigation of brain white matter. In this method, it is assumed that diffusion of water molecules is Gaussian and so, it fails in fiber crossings where this assumption does not hold. High Angular Resolution Diffusion Imaging (HARDI) allows more accurate investigation of microstructures of the brain white matter; it can present fiber crossing in each voxel. HARDI contains complex orientation information of the fibers. Therefore, registration of these images is more complicated than the scalar images. In this paper, we propose a HARDI registration algorithm based on the feature vectors that are extracted from the Orientation Distribution Functions (ODFs) in each voxel. Hammer similarity measure is used to match the feature vectors and thin-plate spline (TPS) based registration is used for spatial registration of the skeleton and its neighbors. A re-orientation strategy is utilized to re-orient the ODFs after spatial registration. Finally, we evaluate our method based on the differences in principal diffusion direction and we will show that utilizing the skeleton as landmark in the registration results in accurate alignment of HARDI data.
弥散张量成像(DTI)是研究脑白质的常用方法。在这种方法中,假设水分子的扩散是高斯的,因此,在光纤交叉处,这种假设不成立,它就失败了。高角分辨率扩散成像(HARDI)可以更准确地研究脑白质的微观结构;它可以在每个体素中呈现纤维交叉。HARDI包含了复杂的纤维取向信息。因此,这些图像的配准比标量图像更复杂。在本文中,我们提出了一种基于从每个体素的方向分布函数(odf)中提取的特征向量的HARDI配准算法。采用锤击相似度测度对特征向量进行匹配,采用薄板样条配准方法对骨架及其相邻区域进行空间配准。在空间配准后,采用重定向策略对odf进行重定向。最后,我们基于主扩散方向的差异评估了我们的方法,我们将证明在配准中使用骨架作为地标可以精确对准HARDI数据。
{"title":"High angular resolution diffusion image registration","authors":"M. Afzali, E. Fatemizadeh, H. Soltanian-Zadeh","doi":"10.1109/IRANIANMVIP.2013.6779985","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779985","url":null,"abstract":"Diffusion Tensor Imaging (DTI) is a common method for the investigation of brain white matter. In this method, it is assumed that diffusion of water molecules is Gaussian and so, it fails in fiber crossings where this assumption does not hold. High Angular Resolution Diffusion Imaging (HARDI) allows more accurate investigation of microstructures of the brain white matter; it can present fiber crossing in each voxel. HARDI contains complex orientation information of the fibers. Therefore, registration of these images is more complicated than the scalar images. In this paper, we propose a HARDI registration algorithm based on the feature vectors that are extracted from the Orientation Distribution Functions (ODFs) in each voxel. Hammer similarity measure is used to match the feature vectors and thin-plate spline (TPS) based registration is used for spatial registration of the skeleton and its neighbors. A re-orientation strategy is utilized to re-orient the ODFs after spatial registration. Finally, we evaluate our method based on the differences in principal diffusion direction and we will show that utilizing the skeleton as landmark in the registration results in accurate alignment of HARDI data.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"319 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116424731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and VLSI implementation of new hardware architectures for image filtering 图像滤波新硬件架构的设计与VLSI实现
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779960
Mohsen Azizabadi, A. Behrad
Nowadays, hardware implementation of image and video processing algorithms is highly attractive. Needing to real-time processing makes hardware implementation of these algorithms inevitable. In most of image and video processing algorithms, pre-processing filters are the first and most important stage of the algorithm. In this paper, we propose new hardware architectures for the implementation of image filters including Gaussian, median and weighted median filters. The proposed architectures aim to optimize the filter implementation for speed and gate usage. The proposed architectures are implemented and synthesized in ASIC with 65 nm technology and different specification of the implementation such as maximum clock frequency and IC area are reported.
目前,图像和视频处理算法的硬件实现非常有吸引力。由于需要实时处理,这些算法的硬件实现是不可避免的。在大多数图像和视频处理算法中,预处理滤波器是算法的第一个也是最重要的阶段。在本文中,我们提出了新的硬件架构来实现图像滤波器,包括高斯滤波器,中值滤波器和加权中值滤波器。所提出的架构旨在优化滤波器实现的速度和栅极使用。采用65nm技术在ASIC上实现和合成了所提出的架构,并报道了不同的实现规格,如最大时钟频率和集成电路面积。
{"title":"Design and VLSI implementation of new hardware architectures for image filtering","authors":"Mohsen Azizabadi, A. Behrad","doi":"10.1109/IRANIANMVIP.2013.6779960","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779960","url":null,"abstract":"Nowadays, hardware implementation of image and video processing algorithms is highly attractive. Needing to real-time processing makes hardware implementation of these algorithms inevitable. In most of image and video processing algorithms, pre-processing filters are the first and most important stage of the algorithm. In this paper, we propose new hardware architectures for the implementation of image filters including Gaussian, median and weighted median filters. The proposed architectures aim to optimize the filter implementation for speed and gate usage. The proposed architectures are implemented and synthesized in ASIC with 65 nm technology and different specification of the implementation such as maximum clock frequency and IC area are reported.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121746773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Online failure detection and correction for CAMShift tracking algorithm CAMShift跟踪算法的在线故障检测与校正
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6779974
Ebrahim Emami, M. Fathy, Ehsan Kozegar
Tracking failure is an inevitable problem in any object tracking algorithm. Online evaluation of a tracking algorithm to detect and correct failures is therefore an important task in any object tracking system. In this paper we propose an early tracking failure detection procedure for the Continuously Adaptive Mean-Shift(CAMShift) tracking algorithm. We also propose an algorithm to modify the tracker in order to correct the detected failures. CAMShift is a light-weight tracking algorithm first developed based on mean-shift to track human face as a component in a perceptual user interface, but it easily fails in tracking targets in more complex situations like surveillance applications. With our proposed failure detection and correction algorithm, CAMShift shows promising results in the test video sequences.
跟踪失败是任何目标跟踪算法中不可避免的问题。因此,在线评估跟踪算法以检测和纠正故障是任何目标跟踪系统中的一项重要任务。本文提出了一种连续自适应均值移位(CAMShift)跟踪算法的早期跟踪故障检测方法。我们还提出了一种修改跟踪器的算法,以纠正检测到的故障。CAMShift是一种轻量级的跟踪算法,最初是基于mean-shift来跟踪人脸,作为感知用户界面的一个组成部分,但在监视应用等更复杂的情况下,它很容易失败。利用我们提出的故障检测和校正算法,CAMShift在测试视频序列中显示出令人满意的结果。
{"title":"Online failure detection and correction for CAMShift tracking algorithm","authors":"Ebrahim Emami, M. Fathy, Ehsan Kozegar","doi":"10.1109/IRANIANMVIP.2013.6779974","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6779974","url":null,"abstract":"Tracking failure is an inevitable problem in any object tracking algorithm. Online evaluation of a tracking algorithm to detect and correct failures is therefore an important task in any object tracking system. In this paper we propose an early tracking failure detection procedure for the Continuously Adaptive Mean-Shift(CAMShift) tracking algorithm. We also propose an algorithm to modify the tracker in order to correct the detected failures. CAMShift is a light-weight tracking algorithm first developed based on mean-shift to track human face as a component in a perceptual user interface, but it easily fails in tracking targets in more complex situations like surveillance applications. With our proposed failure detection and correction algorithm, CAMShift shows promising results in the test video sequences.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128856359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Robust head pose estimation using contourletSD transform and GLCM 基于contourletSD变换和GLCM的鲁棒头姿估计
Pub Date : 2013-09-01 DOI: 10.1109/IRANIANMVIP.2013.6780014
Gelareh Meydanipour, K. Faez
Head pose estimation is an important preprocessing step in many computer vision and pattern recognition systems such as face recognition. Compared to face detection and recognition which have been wildly used in computer vision systems, head pose estimation has fewer proposed systems and generic solutions. In this paper we propose a novel approach for robust human head pose estimation using contourletSD transform. At first we apply contourletSD transform on images, then we create feature vector by computing gray-level co-occurrence matrix (GLCM) from each contourlet sub-band. Linear discriminant analysis (LDA) is used for dimensionality reduction of feature vector. Finally, we classify obtained feature vectors using Support Vector Machine (SVM), K-nearest Neighbor (KNN) and hierarchical decision tree (HDT) classifiers, separately. Experimental results on FERET database demonstrate robustness of the proposed method than previous methods in human head pose estimation.
在人脸识别等计算机视觉和模式识别系统中,头部姿态估计是一个重要的预处理步骤。与计算机视觉系统中广泛应用的人脸检测和识别相比,头部姿态估计的系统和通用解决方案较少。本文提出了一种基于contourletSD变换的鲁棒头部姿态估计方法。首先对图像进行contourlet sd变换,然后通过计算每个contourlet子带的灰度共生矩阵(GLCM)来生成特征向量。采用线性判别分析(LDA)对特征向量进行降维。最后,我们分别使用支持向量机(SVM)、k近邻(KNN)和层次决策树(HDT)分类器对得到的特征向量进行分类。在FERET数据库上的实验结果表明,该方法在人体头部姿态估计方面具有较好的鲁棒性。
{"title":"Robust head pose estimation using contourletSD transform and GLCM","authors":"Gelareh Meydanipour, K. Faez","doi":"10.1109/IRANIANMVIP.2013.6780014","DOIUrl":"https://doi.org/10.1109/IRANIANMVIP.2013.6780014","url":null,"abstract":"Head pose estimation is an important preprocessing step in many computer vision and pattern recognition systems such as face recognition. Compared to face detection and recognition which have been wildly used in computer vision systems, head pose estimation has fewer proposed systems and generic solutions. In this paper we propose a novel approach for robust human head pose estimation using contourletSD transform. At first we apply contourletSD transform on images, then we create feature vector by computing gray-level co-occurrence matrix (GLCM) from each contourlet sub-band. Linear discriminant analysis (LDA) is used for dimensionality reduction of feature vector. Finally, we classify obtained feature vectors using Support Vector Machine (SVM), K-nearest Neighbor (KNN) and hierarchical decision tree (HDT) classifiers, separately. Experimental results on FERET database demonstrate robustness of the proposed method than previous methods in human head pose estimation.","PeriodicalId":297204,"journal":{"name":"2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124520840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1