首页 > 最新文献

2013 IEEE International Conference on Signal and Image Processing Applications最新文献

英文 中文
Wavelet based Cepstral Coefficients for neural network speech recognition 基于小波倒谱系数的神经网络语音识别
Pub Date : 1900-01-01 DOI: 10.1109/ICSIPA.2013.6708048
T. Adam, M. Salam, T. Gunawan
Traditional cepstral analysis methods are often used as part of feature extraction process in speech recognition. However the cepstral analysis method uses the Discrete Fourier Transform (DFT) in one of its computation process. The DFT uses fixed frame resolution to analyze frames of signal thus it will result in an analysis that would not accurately analyze localized events. This paper investigates the use of the Discrete Wavelet Transform (DWT) for calculating the cepstrum coefficients. Two wavelet types with different decomposition level are experimented to yield the cepstrum which is called the Wavelet Cepstral Coefficient (WCC). To test the WCC speech recognizing task of recognizing 26 English alphabets were conducted. Under same number of feature dimension the WCC outperformed the MFCC with about 20% in terms of recognition rate under both speaker dependent and speaker independent task.
在语音识别中,传统的倒谱分析方法常被用作特征提取的一部分。而倒谱分析方法在其计算过程中使用了离散傅立叶变换(DFT)。DFT使用固定帧分辨率来分析信号的帧,因此它将导致不能准确分析局部事件的分析。本文研究了用离散小波变换(DWT)计算倒谱系数的方法。实验了两种不同分解水平的小波类型,得到倒谱,称为小波倒谱系数。为了测试WCC语音识别,进行了识别26个英文字母的任务。在相同的特征维数下,WCC在依赖说话人任务和独立说话人任务下的识别率都比MFCC高出约20%。
{"title":"Wavelet based Cepstral Coefficients for neural network speech recognition","authors":"T. Adam, M. Salam, T. Gunawan","doi":"10.1109/ICSIPA.2013.6708048","DOIUrl":"https://doi.org/10.1109/ICSIPA.2013.6708048","url":null,"abstract":"Traditional cepstral analysis methods are often used as part of feature extraction process in speech recognition. However the cepstral analysis method uses the Discrete Fourier Transform (DFT) in one of its computation process. The DFT uses fixed frame resolution to analyze frames of signal thus it will result in an analysis that would not accurately analyze localized events. This paper investigates the use of the Discrete Wavelet Transform (DWT) for calculating the cepstrum coefficients. Two wavelet types with different decomposition level are experimented to yield the cepstrum which is called the Wavelet Cepstral Coefficient (WCC). To test the WCC speech recognizing task of recognizing 26 English alphabets were conducted. Under same number of feature dimension the WCC outperformed the MFCC with about 20% in terms of recognition rate under both speaker dependent and speaker independent task.","PeriodicalId":440373,"journal":{"name":"2013 IEEE International Conference on Signal and Image Processing Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115168266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Noise reduction in iris recognition using multiple thresholding 基于多重阈值的虹膜识别降噪方法
Pub Date : 1900-01-01 DOI: 10.1109/ICSIPA.2013.6707992
A. B. Dehkordi, S. Abu-Bakar
Iris recognition is known to be as one of the most accurate biometric modalities. In iris image processing, the unwanted textures in the iris region such as those belong to pupil, eyelashes, eyelids, shadows and light reflections are defined as noises. These unwanted noises have strong gray values which cause wrong threshold value selection and thus, result in reducing the performance of the iris recognition system. In this paper, we proposed a multiple thresholding method for detection of eyelids, eyelash textures and light reflections and pupil pixels. The threshold values related to these noises are selected based on the information obtained from the histogram of the normalized iris image. The proposed method was applied to the CASIA V.3 iris image database, version three, from the institute of automation, Chinese academy of science and has 99.62% recognition rate with 0.04 false rejection rate (FRR).
虹膜识别被认为是最准确的生物识别方式之一。在虹膜图像处理中,虹膜区域中不需要的纹理,如瞳孔、睫毛、眼睑、阴影和光反射等被定义为噪声。这些噪声具有很强的灰度值,会导致阈值选择错误,从而降低虹膜识别系统的性能。本文提出了一种眼睑、睫毛纹理、光反射和瞳孔像素的多重阈值检测方法。根据归一化虹膜图像的直方图信息选择与这些噪声相关的阈值。将该方法应用于中国科学院自动化研究所CASIA V.3虹膜图像数据库第三版,识别率为99.62%,误拒率为0.04。
{"title":"Noise reduction in iris recognition using multiple thresholding","authors":"A. B. Dehkordi, S. Abu-Bakar","doi":"10.1109/ICSIPA.2013.6707992","DOIUrl":"https://doi.org/10.1109/ICSIPA.2013.6707992","url":null,"abstract":"Iris recognition is known to be as one of the most accurate biometric modalities. In iris image processing, the unwanted textures in the iris region such as those belong to pupil, eyelashes, eyelids, shadows and light reflections are defined as noises. These unwanted noises have strong gray values which cause wrong threshold value selection and thus, result in reducing the performance of the iris recognition system. In this paper, we proposed a multiple thresholding method for detection of eyelids, eyelash textures and light reflections and pupil pixels. The threshold values related to these noises are selected based on the information obtained from the histogram of the normalized iris image. The proposed method was applied to the CASIA V.3 iris image database, version three, from the institute of automation, Chinese academy of science and has 99.62% recognition rate with 0.04 false rejection rate (FRR).","PeriodicalId":440373,"journal":{"name":"2013 IEEE International Conference on Signal and Image Processing Applications","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128679802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Gait recognition using Local Ternary Pattern (LTP) 基于局部三元模式的步态识别
Pub Date : 1900-01-01 DOI: 10.1109/ICSIPA.2013.6707997
K. B. Low, U. U. Sheikh
Local Ternary Pattern (LTP) is usually applied for texture classification problems. In this work, we propose LTP for human gait characterization for the purpose of human identification. Our proposed method is based on the Gait Energy Image (GEI) whereby edge information over a complete gait cycle is extracted. However, GEI does not contain enough human body structure information for human recognition purpose. Therefore, LTP is used to extract texture information from all pixels in the human gait region which preserves more discriminative features of the subject. Gait cycle estimation is computed by using the aspect ratio of the subject's bounding box. After that, LTP features are averaged over a full gait cycle and a 2D joint histogram of the LTP is computed. At the end, K nearest-neighbor (k-NN) is used to obtain the final recognition results. The proposed method achieved higher accuracy compared to other methods when tested on the CMU MoBo human gait database. The proposed LTP method is easy to implement and also has the advantage of significantly lower computation time.
局部三元模式(LTP)通常用于纹理分类问题。在这项工作中,我们提出LTP用于人类步态表征,目的是为了识别人类。我们提出的方法是基于步态能量图像(GEI),在一个完整的步态周期提取边缘信息。然而,GEI并没有包含足够的人体结构信息来达到人体识别的目的。因此,使用LTP从人体步态区域的所有像素中提取纹理信息,从而保留了受试者更多的判别特征。步态周期估计是利用被试边界框的纵横比来计算的。然后,在整个步态周期内对LTP特征进行平均,并计算LTP的二维关节直方图。最后利用K近邻算法(K - nn)得到最终的识别结果。在CMU MoBo人体步态数据库上进行了测试,取得了比其他方法更高的准确率。所提出的LTP方法易于实现,并且具有显著降低计算时间的优点。
{"title":"Gait recognition using Local Ternary Pattern (LTP)","authors":"K. B. Low, U. U. Sheikh","doi":"10.1109/ICSIPA.2013.6707997","DOIUrl":"https://doi.org/10.1109/ICSIPA.2013.6707997","url":null,"abstract":"Local Ternary Pattern (LTP) is usually applied for texture classification problems. In this work, we propose LTP for human gait characterization for the purpose of human identification. Our proposed method is based on the Gait Energy Image (GEI) whereby edge information over a complete gait cycle is extracted. However, GEI does not contain enough human body structure information for human recognition purpose. Therefore, LTP is used to extract texture information from all pixels in the human gait region which preserves more discriminative features of the subject. Gait cycle estimation is computed by using the aspect ratio of the subject's bounding box. After that, LTP features are averaged over a full gait cycle and a 2D joint histogram of the LTP is computed. At the end, K nearest-neighbor (k-NN) is used to obtain the final recognition results. The proposed method achieved higher accuracy compared to other methods when tested on the CMU MoBo human gait database. The proposed LTP method is easy to implement and also has the advantage of significantly lower computation time.","PeriodicalId":440373,"journal":{"name":"2013 IEEE International Conference on Signal and Image Processing Applications","volume":"48 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133457018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Performance of LMS, NLMS and LMF algorithms in tracking time-varying UWB channels LMS、NLMS和LMF算法在时变UWB信道跟踪中的性能
Pub Date : 1900-01-01 DOI: 10.1109/ICSIPA.2013.6708024
S. Nunoo, U. Chude-Okonkwo, R. Ngah
There has been extensive discussions on the performance of the LMS algorithm in nonstationary environment. In some instances its performance in such environments has been compared with that of other adaptive algorithms. However, most existing works treated in this respect focus on narrowband channels. In this paper, we investigate the performance of the LMS, NLMS and LMF algorithms in tracking time-varying ultra wideband (UWB) channels. Channel measurements were conducted for an indoor environment and the resultant channel impulse response is used in the analysis. The results show that the LMF and NLMS algorithms outperform the LMS algorithm in time-varying UWB channels with the NLMS providing the best performance.
关于LMS算法在非平稳环境下的性能已经有了广泛的讨论。在某些情况下,将其在这种环境下的性能与其他自适应算法进行了比较。然而,在这方面的大多数现有工作都集中在窄带信道上。在本文中,我们研究了LMS、NLMS和LMF算法在跟踪时变超宽带(UWB)信道中的性能。通道测量是在室内环境下进行的,所得到的通道脉冲响应用于分析。结果表明,LMF算法和NLMS算法在时变UWB信道中的性能优于LMS算法,其中NLMS算法的性能最好。
{"title":"Performance of LMS, NLMS and LMF algorithms in tracking time-varying UWB channels","authors":"S. Nunoo, U. Chude-Okonkwo, R. Ngah","doi":"10.1109/ICSIPA.2013.6708024","DOIUrl":"https://doi.org/10.1109/ICSIPA.2013.6708024","url":null,"abstract":"There has been extensive discussions on the performance of the LMS algorithm in nonstationary environment. In some instances its performance in such environments has been compared with that of other adaptive algorithms. However, most existing works treated in this respect focus on narrowband channels. In this paper, we investigate the performance of the LMS, NLMS and LMF algorithms in tracking time-varying ultra wideband (UWB) channels. Channel measurements were conducted for an indoor environment and the resultant channel impulse response is used in the analysis. The results show that the LMF and NLMS algorithms outperform the LMS algorithm in time-varying UWB channels with the NLMS providing the best performance.","PeriodicalId":440373,"journal":{"name":"2013 IEEE International Conference on Signal and Image Processing Applications","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122043759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Voronoi cell geometry based dynamic Fractional Frequency Reuse for OFDMA cellular networks 基于Voronoi单元几何的OFDMA蜂窝网络动态分数频率复用
Pub Date : 1900-01-01 DOI: 10.1109/ICSIPA.2013.6708046
R. Ullah, N. Fisal, Hashim Safdar, W. Maqbool, Z. Khalid, A. Khan
Interference Management (IM) is one of the major challenges of next generation wireless communication. Fractional Frequency Reuse (FFR) has been acknowledged as an efficient IM technique, which offers significant capacity enhancement and improve cell edge coverage with low complexity. In literature, FFR has been analyzed mostly with cellular networks described by Hexagon Grid Model, which is neither tractable nor scalable to the dense deployment of next generation wireless networks. Moreover, the perfect geometry based grid model tends to overestimate the system performance and not able to reflect the reality. In this paper, we use the stochastic geometry approach, FFR is analyzed with cellular network modeled by homogeneous Poisson Point Process (PPP). A dynamic frequency allocation scheme is proposed which take into account the randomness of the cell coverage area describe by Voronoi tessellation. It is shown that the proposed scheme outperforms the traditional fixed frequency allocation schemes in terms of per user capacity and capacity density.
干扰管理(IM)是下一代无线通信面临的主要挑战之一。分数频率复用(FFR)是一种高效的即时通信技术,具有显著的容量增强和低复杂度的小区边缘覆盖能力。在文献中,对FFR的分析主要是用六边形网格模型描述的蜂窝网络,这种模型既不易于处理,也无法扩展到下一代无线网络的密集部署。此外,基于完美几何的网格模型容易高估系统性能,不能反映实际情况。本文采用随机几何方法,用齐次泊松点过程(PPP)建模的蜂窝网络对FFR进行了分析。提出了一种考虑Voronoi细分描述的小区覆盖区域随机性的动态频率分配方案。结果表明,该方案在单位用户容量和容量密度方面都优于传统的固定频率分配方案。
{"title":"Voronoi cell geometry based dynamic Fractional Frequency Reuse for OFDMA cellular networks","authors":"R. Ullah, N. Fisal, Hashim Safdar, W. Maqbool, Z. Khalid, A. Khan","doi":"10.1109/ICSIPA.2013.6708046","DOIUrl":"https://doi.org/10.1109/ICSIPA.2013.6708046","url":null,"abstract":"Interference Management (IM) is one of the major challenges of next generation wireless communication. Fractional Frequency Reuse (FFR) has been acknowledged as an efficient IM technique, which offers significant capacity enhancement and improve cell edge coverage with low complexity. In literature, FFR has been analyzed mostly with cellular networks described by Hexagon Grid Model, which is neither tractable nor scalable to the dense deployment of next generation wireless networks. Moreover, the perfect geometry based grid model tends to overestimate the system performance and not able to reflect the reality. In this paper, we use the stochastic geometry approach, FFR is analyzed with cellular network modeled by homogeneous Poisson Point Process (PPP). A dynamic frequency allocation scheme is proposed which take into account the randomness of the cell coverage area describe by Voronoi tessellation. It is shown that the proposed scheme outperforms the traditional fixed frequency allocation schemes in terms of per user capacity and capacity density.","PeriodicalId":440373,"journal":{"name":"2013 IEEE International Conference on Signal and Image Processing Applications","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117171171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Characterization and parameterization of a class of multivariable non-summable stochastic processes with bounded stochastic trends 一类具有有界随机趋势的多变量不可和随机过程的表征和参数化
Pub Date : 1900-01-01 DOI: 10.1109/ICSIPA.2013.6708026
U. Chude-Okonkwo, S. Nunoo, R. Ngah, Chollette C. Chude-Olisah, T. A. Rahman, Anthony A. Okafor
In some applications, multivariable stochastic processes that are composed of sequentially arranged independent weakly-stationary processes, may arise. Such multivariable process can be categorized as a class of non-summable processes with very complex probability density function. In this paper, we present the formal definition of such non-summable process, and provide a method of parameterizing and defining the statistical trend associated with the process. The illustration of a typical example of a multivariable non-summable process and how a bounded statistical trend can be obtained for the process is presented. The typical example is obtained from the simulation of a time-varying wideband wireless channel.
在某些应用中,可能会出现由顺序排列的独立弱平稳过程组成的多变量随机过程。这种多变量过程可以归类为一类具有非常复杂概率密度函数的不可和过程。本文给出了这种不可和过程的形式化定义,并给出了一种参数化和定义与此过程相关的统计趋势的方法。给出了一个多变量不可和过程的典型例子,以及如何得到该过程的有界统计趋势。通过对时变宽带无线信道的仿真,得到了一个典型的实例。
{"title":"Characterization and parameterization of a class of multivariable non-summable stochastic processes with bounded stochastic trends","authors":"U. Chude-Okonkwo, S. Nunoo, R. Ngah, Chollette C. Chude-Olisah, T. A. Rahman, Anthony A. Okafor","doi":"10.1109/ICSIPA.2013.6708026","DOIUrl":"https://doi.org/10.1109/ICSIPA.2013.6708026","url":null,"abstract":"In some applications, multivariable stochastic processes that are composed of sequentially arranged independent weakly-stationary processes, may arise. Such multivariable process can be categorized as a class of non-summable processes with very complex probability density function. In this paper, we present the formal definition of such non-summable process, and provide a method of parameterizing and defining the statistical trend associated with the process. The illustration of a typical example of a multivariable non-summable process and how a bounded statistical trend can be obtained for the process is presented. The typical example is obtained from the simulation of a time-varying wideband wireless channel.","PeriodicalId":440373,"journal":{"name":"2013 IEEE International Conference on Signal and Image Processing Applications","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132526332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2013 IEEE International Conference on Signal and Image Processing Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1