首页 > 最新文献

2006 IEEE 14th Signal Processing and Communications Applications最新文献

英文 中文
Fusion of MFCC and MPEG-7 Attributes for Speaker Verification 基于MFCC和MPEG-7属性的说话人验证
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659766
H. Altınçay, C. Ergun, T. Çiloglu
Following the anticipation that some of the MPEG-7 audio descriptors hold glottal information differing than those MFCCs hold, possible contribution of MPEG-7 descriptors to speaker verification has been investigated. Both feature level and score level fusion of MFCCs and MPEG-7 descriptors have been studied. Results indicate improvements up to 18 % compared to those obtained by using MFCCs alone; a justification of the anticipation and a novel indication to the community
由于预期某些MPEG-7音频描述符所包含的声门信息与mfc所包含的声门信息不同,因此研究了MPEG-7描述符对说话人验证的可能贡献。研究了mfccc和MPEG-7描述符的特征级融合和分数级融合。结果表明,与单独使用mfc相比,改善幅度高达18%;这是对预期的证明,也是对社区的一种新的指示
{"title":"Fusion of MFCC and MPEG-7 Attributes for Speaker Verification","authors":"H. Altınçay, C. Ergun, T. Çiloglu","doi":"10.1109/SIU.2006.1659766","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659766","url":null,"abstract":"Following the anticipation that some of the MPEG-7 audio descriptors hold glottal information differing than those MFCCs hold, possible contribution of MPEG-7 descriptors to speaker verification has been investigated. Both feature level and score level fusion of MFCCs and MPEG-7 descriptors have been studied. Results indicate improvements up to 18 % compared to those obtained by using MFCCs alone; a justification of the anticipation and a novel indication to the community","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116567374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Interpretation of Uroflow Graphs with Artificial Neural Networks 用人工神经网络解释尿流图
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659698
S. Altunay, Z. Telatar, O. Eroğul, E. Aydur
Uroflowmetry is a measuring method, which provides numerical and graphical information about patient's lower urinary tract dynamics by measuring and plotting the rate of change of voided urine volume. The main purpose of the study is to evaluate uroflowmetric data using artificial neural networks (ANN) and provide a pre-diagnostic result for urology specialists. The ANN is trained using back-propagation method and the inputs of ANN are the results of a special feature extraction algorithm, which is designed with the suggestions of urology specialists. System's success is monitored with a set of data, which was already diagnosed by specialists. The outputs of ANN are classified into three groups, namely, "healthy", "possible pathologic" and "pathologic"
尿流法是一种测量方法,通过测量和绘制空尿量变化率,提供患者下尿路动力学的数值和图形信息。该研究的主要目的是利用人工神经网络(ANN)评估尿流量数据,并为泌尿科专家提供预诊断结果。神经网络采用反向传播方法进行训练,神经网络的输入是一种特殊的特征提取算法的结果,该算法是根据泌尿外科专家的建议设计的。系统的成功是通过一组数据来监测的,这些数据已经被专家诊断出来。人工神经网络的输出分为“健康”、“可能病理”和“病理”三组。
{"title":"Interpretation of Uroflow Graphs with Artificial Neural Networks","authors":"S. Altunay, Z. Telatar, O. Eroğul, E. Aydur","doi":"10.1109/SIU.2006.1659698","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659698","url":null,"abstract":"Uroflowmetry is a measuring method, which provides numerical and graphical information about patient's lower urinary tract dynamics by measuring and plotting the rate of change of voided urine volume. The main purpose of the study is to evaluate uroflowmetric data using artificial neural networks (ANN) and provide a pre-diagnostic result for urology specialists. The ANN is trained using back-propagation method and the inputs of ANN are the results of a special feature extraction algorithm, which is designed with the suggestions of urology specialists. System's success is monitored with a set of data, which was already diagnosed by specialists. The outputs of ANN are classified into three groups, namely, \"healthy\", \"possible pathologic\" and \"pathologic\"","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122191491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Tracking of Speech Formant Frequencies 语音峰频率跟踪
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659794
I. Ozbek, M. Demirekler
This study is about tracking of formant frequencies using Kalman filtering. Assuming that the formant frequencies are changing in time slowly, it is possible to model their behaviors as outputs of a dynamic system and then track them using Kalman filtering approach. Tracking system considers also the orders and possible intervals of each formant frequency
本研究是关于使用卡尔曼滤波跟踪形成峰频率。假设形成峰频率随时间缓慢变化,可以将其行为建模为动态系统的输出,然后使用卡尔曼滤波方法对其进行跟踪。跟踪系统还考虑了各波峰频率的阶数和可能的间隔
{"title":"Tracking of Speech Formant Frequencies","authors":"I. Ozbek, M. Demirekler","doi":"10.1109/SIU.2006.1659794","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659794","url":null,"abstract":"This study is about tracking of formant frequencies using Kalman filtering. Assuming that the formant frequencies are changing in time slowly, it is possible to model their behaviors as outputs of a dynamic system and then track them using Kalman filtering approach. Tracking system considers also the orders and possible intervals of each formant frequency","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122315891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A New Method For Signal Interpolation 一种新的信号插值方法
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659817
T. E. Tuncer
Interpolation is an important problem of signal processing. Even though there are several methods for interpolation, it is still an open problem. In this paper, we propose a new method for interpolation with certain advantages compared to the previous methods. The proposed method is based on the least squares error optimum design of the interpolating filter. Interpolating filter is chosen as the Kaiser filter since it can be configured in a variety of shapes by the appropriate choice of cut-off and shape parameters. Proposed method can be seen as the generalization of the spline interpolator by employing a Kaiser filter. It is shown that the proposed interpolator performs much better than any of its competitors when the signal is at least approximately bandlimited
插值是信号处理中的一个重要问题。尽管有几种插值方法,但它仍然是一个开放的问题。本文提出了一种新的插值方法,与以往的插值方法相比具有一定的优势。提出了基于最小二乘误差的插值滤波器优化设计方法。选择插值滤波器作为凯撒滤波器,因为它可以通过适当选择截止和形状参数配置为各种形状。所提出的方法可以看作是采用凯撒滤波器的样条插值器的推广。结果表明,当信号至少近似带宽受限时,所提出的插值器的性能比任何竞争者都要好得多
{"title":"A New Method For Signal Interpolation","authors":"T. E. Tuncer","doi":"10.1109/SIU.2006.1659817","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659817","url":null,"abstract":"Interpolation is an important problem of signal processing. Even though there are several methods for interpolation, it is still an open problem. In this paper, we propose a new method for interpolation with certain advantages compared to the previous methods. The proposed method is based on the least squares error optimum design of the interpolating filter. Interpolating filter is chosen as the Kaiser filter since it can be configured in a variety of shapes by the appropriate choice of cut-off and shape parameters. Proposed method can be seen as the generalization of the spline interpolator by employing a Kaiser filter. It is shown that the proposed interpolator performs much better than any of its competitors when the signal is at least approximately bandlimited","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129880959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Object Detection Using Haar Feature Selection Optimization 基于Haar特征选择优化的目标检测
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659787
C. Demirkir, B. Sankur
Object detection in still images is one of the common problems which is needed to be solved in a robust and reliable manner. Main focus on this work is the designing of classifiers based on Haar like simple features to obtain a good and efficient detection performance. This problem corresponds to the so called feature selection problem which is common in the pattern classifier systems. Classifiers used to detect objects are based on the simple Haar like features and these features are selected using systematic and general evolutionary based algorithm. The objective is to build a set of classifiers which respond stronger to the features present in object patterns than to non-object patterns, thereby improving the class discrimination between these two classes. This approach combines the classifier design with feature selection by using a genetic algorithm (GA). In the feature selection part of the algorithm a GA algorithm which the Haar features are encoded using their parameters in a single chromosome and optimized using genetic operators. During optimization the features which show similar characteristics in the parameter space are selected using a cluster based partitioning algorithm and thereby redundancy in the features is eliminated and a more compact Haar feature set can be obtained. Performances of the resulting chromosomes are measured using a fitness measure which is based on the separation of the two classes samples over a validation set. The resulting object detection structure is tested for near frontal face images in the cluttered background images
静止图像中的目标检测是一个常见的问题,需要以鲁棒可靠的方式解决。本工作的重点是设计基于Haar类简单特征的分类器,以获得良好高效的检测性能。这个问题对应于模式分类器系统中常见的特征选择问题。用于检测目标的分类器基于简单的Haar类特征,并使用系统的和通用的基于进化的算法选择这些特征。目标是构建一组分类器,这些分类器对对象模式中存在的特征的响应强于对非对象模式的响应,从而改进这两个类之间的类区分。该方法利用遗传算法将分类器设计与特征选择相结合。在算法的特征选择部分,采用遗传算子对Haar特征在单个染色体上的参数进行编码并进行优化的GA算法。在优化过程中,采用基于聚类的划分算法选择在参数空间中表现出相似特征的特征,从而消除特征中的冗余,得到更紧凑的Haar特征集。所得到的染色体的性能使用基于验证集上两类样本分离的适应度度量来测量。在杂乱的背景图像中对所得到的近正面人脸图像的目标检测结构进行了测试
{"title":"Object Detection Using Haar Feature Selection Optimization","authors":"C. Demirkir, B. Sankur","doi":"10.1109/SIU.2006.1659787","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659787","url":null,"abstract":"Object detection in still images is one of the common problems which is needed to be solved in a robust and reliable manner. Main focus on this work is the designing of classifiers based on Haar like simple features to obtain a good and efficient detection performance. This problem corresponds to the so called feature selection problem which is common in the pattern classifier systems. Classifiers used to detect objects are based on the simple Haar like features and these features are selected using systematic and general evolutionary based algorithm. The objective is to build a set of classifiers which respond stronger to the features present in object patterns than to non-object patterns, thereby improving the class discrimination between these two classes. This approach combines the classifier design with feature selection by using a genetic algorithm (GA). In the feature selection part of the algorithm a GA algorithm which the Haar features are encoded using their parameters in a single chromosome and optimized using genetic operators. During optimization the features which show similar characteristics in the parameter space are selected using a cluster based partitioning algorithm and thereby redundancy in the features is eliminated and a more compact Haar feature set can be obtained. Performances of the resulting chromosomes are measured using a fitness measure which is based on the separation of the two classes samples over a validation set. The resulting object detection structure is tested for near frontal face images in the cluttered background images","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129978537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Spatial Frequency Components of to the Event Related Brain Potentials (ERP) 事件相关脑电位(ERP)的空间频率分量
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659832
A. Bayram, Erol Yildirim, T. Demiralp, A. Ademoglu
The highest temporal resolution, which is crucial for temporal localization of intracerebral activities, is achieved by ERP, but spatial resolution of scalp topography is low. To overcome the limitation of scalp topography, several current-density estimation techniques were developed whose goal is to find the locations of the three-dimensional (3D) intracerebral activities by solving an inverse problem (such as LORETA). However, scalp topologies constituted by multiple sources which makes the inverse problem complicated. The overall objective of this work is to isolate spatial frequency components of scalp topography by 2-D wavelet transform and to interpret spatial frequency formation via corresponding current-density estimations. Moreover, by achieving less complex scalp maps, obstacle of the inverse problem due to the multiple sources might be lessen. At the first step, main topologies of ERP recordings were investigated by hierarchical clustering algorithm. Secondly, different spatial frequencies of these main topologies were separated by 2-D wavelet transform. Finally, main topological maps and topographic maps of different spatial frequencies derived from them were used to find corresponding cortical activities by LORETA. Assessment of our spatial analysis results was made according to the current density estimation results
ERP可以获得最高的时间分辨率,这对脑内活动的时间定位至关重要,但头皮地形的空间分辨率较低。为了克服头皮地形的限制,开发了几种电流密度估计技术,其目标是通过求解逆问题(如LORETA)来找到三维(3D)脑内活动的位置。然而,头皮拓扑结构由多个源构成,使得逆问题变得复杂。这项工作的总体目标是通过二维小波变换分离头皮地形的空间频率成分,并通过相应的电流密度估计来解释空间频率形成。此外,通过实现更简单的头皮图,可以减少由于多源而导致的反问题障碍。首先,采用层次聚类算法对ERP记录的主要拓扑结构进行研究。其次,利用二维小波变换对各主要拓扑的不同空间频率进行分离;最后,利用主地形图和不同空间频率的地形图,利用LORETA软件寻找相应的皮层活动。根据目前的密度估算结果对我们的空间分析结果进行评估
{"title":"Spatial Frequency Components of to the Event Related Brain Potentials (ERP)","authors":"A. Bayram, Erol Yildirim, T. Demiralp, A. Ademoglu","doi":"10.1109/SIU.2006.1659832","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659832","url":null,"abstract":"The highest temporal resolution, which is crucial for temporal localization of intracerebral activities, is achieved by ERP, but spatial resolution of scalp topography is low. To overcome the limitation of scalp topography, several current-density estimation techniques were developed whose goal is to find the locations of the three-dimensional (3D) intracerebral activities by solving an inverse problem (such as LORETA). However, scalp topologies constituted by multiple sources which makes the inverse problem complicated. The overall objective of this work is to isolate spatial frequency components of scalp topography by 2-D wavelet transform and to interpret spatial frequency formation via corresponding current-density estimations. Moreover, by achieving less complex scalp maps, obstacle of the inverse problem due to the multiple sources might be lessen. At the first step, main topologies of ERP recordings were investigated by hierarchical clustering algorithm. Secondly, different spatial frequencies of these main topologies were separated by 2-D wavelet transform. Finally, main topological maps and topographic maps of different spatial frequencies derived from them were used to find corresponding cortical activities by LORETA. Assessment of our spatial analysis results was made according to the current density estimation results","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131016708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Effect of DC Stimuli on Neuronal Dynamics 直流刺激对神经元动力学的影响
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659750
N. H. Ekmekci, M. Ozer
Voltage-gated ion channels are of great importance in the generation and propagation of electrical signals in the excitable membranes. In this study, we introduce the stochastic version of the Hodgkin-Huxley formalism and investigate the effect of channel noise on neuronal dynamic behaviors based on a model with the Gaussian noise. We show that the channel noise may result in a spiking activity even in the absence of any stimulus for small membrane patches, and that the spontaneous firing dynamics follows more regular patterns when the membrane patch becomes smaller. It is also shown that the stochastic model converges to the deterministic model for very large membrane patches and regularity pattern of fired spikes exhibit resonance behaviour depending on the stimuli strength and the membrane patch area
电压门控离子通道在可激膜电信号的产生和传播中起着重要的作用。在这项研究中,我们引入了霍奇金-赫胥黎形式的随机版本,并基于高斯噪声模型研究了信道噪声对神经元动态行为的影响。我们表明,即使在没有任何刺激的情况下,通道噪声也可能导致小膜斑块的尖峰活动,并且当膜斑块变小时,自发放电动力学遵循更规则的模式。研究还表明,随机模型在很大的膜斑块下收敛于确定性模型,发射峰的规则模式表现出共振行为,这取决于刺激强度和膜斑块面积
{"title":"Effect of DC Stimuli on Neuronal Dynamics","authors":"N. H. Ekmekci, M. Ozer","doi":"10.1109/SIU.2006.1659750","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659750","url":null,"abstract":"Voltage-gated ion channels are of great importance in the generation and propagation of electrical signals in the excitable membranes. In this study, we introduce the stochastic version of the Hodgkin-Huxley formalism and investigate the effect of channel noise on neuronal dynamic behaviors based on a model with the Gaussian noise. We show that the channel noise may result in a spiking activity even in the absence of any stimulus for small membrane patches, and that the spontaneous firing dynamics follows more regular patterns when the membrane patch becomes smaller. It is also shown that the stochastic model converges to the deterministic model for very large membrane patches and regularity pattern of fired spikes exhibit resonance behaviour depending on the stimuli strength and the membrane patch area","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133665407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic Relevance Determination for the Estimation of Relevant Features for Object Recognition 目标识别中相关特征估计的自动关联确定
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659843
ilkay Ulusoy, C. Bishop
Object recognition from 2D images is a highly interesting problem. The final goal is to have a system which can recognize thousands of different categories like human beings do. However, hand labelling the 2D training images in order to segment the foreground (object) from the background is a very tedious job. Because of this reason, in recent years, intelligent systems which can learn object categories from unlabelled image sets have been introduced. In this case, an image is labelled by the objects which are present in the image but the objects are not segmented in the image. The main problem in this case is that the object and the background are used altogether in such unsupervised systems and segmentation must be performed by the system itself. Automatic Relevance Determination (ARD ) is a method which will be investigated in this study in order to segment foreground and background in an unsupervised object category learning system.
从二维图像中识别物体是一个非常有趣的问题。最终目标是建立一个像人类一样可以识别数千种不同类别的系统。然而,为了从背景中分割前景(对象),手工标记2D训练图像是一项非常繁琐的工作。由于这个原因,近年来引入了能够从未标记的图像集中学习对象类别的智能系统。在这种情况下,图像被图像中存在的对象标记,但这些对象在图像中没有被分割。这种情况下的主要问题是,在这种无监督系统中,对象和背景是一起使用的,分割必须由系统自己执行。本文将研究一种用于无监督对象分类学习系统中前景和背景分割的自动关联确定方法。
{"title":"Automatic Relevance Determination for the Estimation of Relevant Features for Object Recognition","authors":"ilkay Ulusoy, C. Bishop","doi":"10.1109/SIU.2006.1659843","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659843","url":null,"abstract":"Object recognition from 2D images is a highly interesting problem. The final goal is to have a system which can recognize thousands of different categories like human beings do. However, hand labelling the 2D training images in order to segment the foreground (object) from the background is a very tedious job. Because of this reason, in recent years, intelligent systems which can learn object categories from unlabelled image sets have been introduced. In this case, an image is labelled by the objects which are present in the image but the objects are not segmented in the image. The main problem in this case is that the object and the background are used altogether in such unsupervised systems and segmentation must be performed by the system itself. Automatic Relevance Determination (ARD ) is a method which will be investigated in this study in order to segment foreground and background in an unsupervised object category learning system.","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"23 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132152922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Fast Algorithm for Vision-Based Hand Gesture Recognition for Robot Control 一种基于视觉的机器人手势识别快速算法
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659822
Asanterabi Malima, E. Ozgur, M. Çetin
We propose a fast algorithm for automatically recognizing a limited set of gestures from hand images for a robot control application. Hand gesture recognition is a challenging problem in its general form. We consider a fixed set of manual commands and a reasonably structured environment, and develop a simple, yet effective, procedure for gesture recognition. Our approach contains steps for segmenting the hand region, locating the fingers, and finally classifying the gesture. The algorithm is invariant to translation, rotation, and scale of the hand. We demonstrate the effectiveness of the technique on real imagery
我们提出了一种快速的算法,用于机器人控制应用程序从手部图像中自动识别有限的手势。一般来说,手势识别是一个具有挑战性的问题。我们考虑了一组固定的手动命令和一个合理的结构化环境,并开发了一个简单而有效的手势识别程序。我们的方法包含了手部区域分割,手指定位,最后对手势进行分类的步骤。该算法对手的平移、旋转和缩放是不变的。我们证明了该技术在真实图像上的有效性
{"title":"A Fast Algorithm for Vision-Based Hand Gesture Recognition for Robot Control","authors":"Asanterabi Malima, E. Ozgur, M. Çetin","doi":"10.1109/SIU.2006.1659822","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659822","url":null,"abstract":"We propose a fast algorithm for automatically recognizing a limited set of gestures from hand images for a robot control application. Hand gesture recognition is a challenging problem in its general form. We consider a fixed set of manual commands and a reasonably structured environment, and develop a simple, yet effective, procedure for gesture recognition. Our approach contains steps for segmenting the hand region, locating the fingers, and finally classifying the gesture. The algorithm is invariant to translation, rotation, and scale of the hand. We demonstrate the effectiveness of the technique on real imagery","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132571068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 274
Fast Mode Decision for Intra Prediction in H.264 H.264帧内预测的快速模式决策
Pub Date : 2006-04-17 DOI: 10.1109/SIU.2006.1659793
O. Alay, G. Akar
In H.264 video coding standard, in order to increase the coding efficiency, several new techniques such as spatial intra prediction, integer transformation have been introduced. However these also increase the computational complexity drastically. In this paper, we propose a new fast intra prediction algorithm to be used instead of the full search algorithm to choose the best mode for spatial intra prediction. Experimental results show that, our algorithm achieves 52% computation reduction on the average while maintaining similar PSNR and an average bitrate increase of 1.5%
在H.264视频编码标准中,为了提高编码效率,引入了空间内预测、整数变换等新技术。然而,这也极大地增加了计算复杂度。在本文中,我们提出了一种新的快速图像内预测算法来代替全搜索算法来选择空间内预测的最佳模式。实验结果表明,在保持相似的PSNR和1.5%的平均比特率增长的情况下,我们的算法平均减少了52%的计算量
{"title":"Fast Mode Decision for Intra Prediction in H.264","authors":"O. Alay, G. Akar","doi":"10.1109/SIU.2006.1659793","DOIUrl":"https://doi.org/10.1109/SIU.2006.1659793","url":null,"abstract":"In H.264 video coding standard, in order to increase the coding efficiency, several new techniques such as spatial intra prediction, integer transformation have been introduced. However these also increase the computational complexity drastically. In this paper, we propose a new fast intra prediction algorithm to be used instead of the full search algorithm to choose the best mode for spatial intra prediction. Experimental results show that, our algorithm achieves 52% computation reduction on the average while maintaining similar PSNR and an average bitrate increase of 1.5%","PeriodicalId":415037,"journal":{"name":"2006 IEEE 14th Signal Processing and Communications Applications","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132708851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2006 IEEE 14th Signal Processing and Communications Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1