Expert System for Speaker Identification Using Lip Features with PCA

Anuj Mehra, Mahender Kumawat, R. Ranjan, B. Pandey, Sushil Ranjan, A. Shukla, R. Tiwari
{"title":"Expert System for Speaker Identification Using Lip Features with PCA","authors":"Anuj Mehra, Mahender Kumawat, R. Ranjan, B. Pandey, Sushil Ranjan, A. Shukla, R. Tiwari","doi":"10.1109/IWISA.2010.5473241","DOIUrl":null,"url":null,"abstract":"Biometric authentication techniques such as lips, face, and eyes are more reliable and efficient than conventional authentication techniques such as password authentication, token, cards, personal identification number, etc. In this research paper, the emphasis has been laid on the speaker identification based on lip features. In this study, we have presented a detailed comparative analysis for speaker identification by using lip features, Principal Component Analysis (PCA), and neural network classifiers. PCA has been used for feature extraction from the six geometric lip features which are height of the outer corners of the mouth, width of the outer corners of the mouth, height of the inner corners of the mouth, width of the inner corners of the mouth, height of the upper lip, and height of the lower lip. These features are then used for training of the network by using different neural network classifiers such as Back Propagation (BP), Radial Basis Function (RBF) and Learning Vector Quantization (LVQ). These approaches are incorporated on \"TULIPS1 database, (Movellan, 1995)\" which is a small audiovisual database of 12 subjects saying the first 4 digits in English. After the detailed analysis and evaluation a maximum of 91.07% accuracy in speaker recognition is obtained using PCA and RBF. Speaker identification has a wide range of applications such as Audio Processing, Medical data, Finance, Array processing, etc.","PeriodicalId":298764,"journal":{"name":"2010 2nd International Workshop on Intelligent Systems and Applications","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 2nd International Workshop on Intelligent Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWISA.2010.5473241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Biometric authentication techniques such as lips, face, and eyes are more reliable and efficient than conventional authentication techniques such as password authentication, token, cards, personal identification number, etc. In this research paper, the emphasis has been laid on the speaker identification based on lip features. In this study, we have presented a detailed comparative analysis for speaker identification by using lip features, Principal Component Analysis (PCA), and neural network classifiers. PCA has been used for feature extraction from the six geometric lip features which are height of the outer corners of the mouth, width of the outer corners of the mouth, height of the inner corners of the mouth, width of the inner corners of the mouth, height of the upper lip, and height of the lower lip. These features are then used for training of the network by using different neural network classifiers such as Back Propagation (BP), Radial Basis Function (RBF) and Learning Vector Quantization (LVQ). These approaches are incorporated on "TULIPS1 database, (Movellan, 1995)" which is a small audiovisual database of 12 subjects saying the first 4 digits in English. After the detailed analysis and evaluation a maximum of 91.07% accuracy in speaker recognition is obtained using PCA and RBF. Speaker identification has a wide range of applications such as Audio Processing, Medical data, Finance, Array processing, etc.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于PCA的说话人唇特征识别专家系统
唇、脸、眼睛等生物识别技术比密码、令牌、卡片、个人识别号码等传统认证技术更可靠、更高效。本文的研究重点是基于唇形特征的说话人识别。在这项研究中,我们提出了一个详细的比较分析使用唇特征,主成分分析(PCA)和神经网络分类器说话人识别。利用主成分分析法对嘴角外角高度、嘴角外角宽度、嘴角内角高度、嘴角内角宽度、上唇高度和下唇高度这6个唇形几何特征进行特征提取。然后使用不同的神经网络分类器,如反向传播(BP)、径向基函数(RBF)和学习向量量化(LVQ),将这些特征用于网络的训练。这些方法被纳入“TULIPS1数据库,(Movellan, 1995)”,这是一个小型视听数据库,由12个受试者说英语的前4位数字。经过详细的分析和评价,采用主成分分析和RBF方法识别说话人的准确率最高可达91.07%。说话人识别在音频处理、医疗数据、金融、阵列处理等领域有着广泛的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
How to Display the Data from Database by ListView on Android An Improved Genetic Algorithm and Its Blending Application with Neural Network A Study for Important Criteria of Feature Selection in Text Categorization A Hierarchical Classification Model Based on Granular Computing A Study of Improving Apriori Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1