{"title":"Speaker identification using feature vector reduction of row mean of different transforms","authors":"H. B. Kekre, V. Kulkarni","doi":"10.1109/ICCICT.2012.6398100","DOIUrl":null,"url":null,"abstract":"In this paper a novel approach to text dependent speaker identification based on feature vector reduction technique of the row mean is proposed. Five different Orthogonal Transform Techniques: Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT), Discrete Sine Transform (DST), Discrete Hartley Transform (DHT) and Walsh Hadamard Transform (WHT) are applied on the framed speech signal. Feature extraction in the testing and matching phases has been done by using feature vector reduction technique applied on the row mean vector of the magnitude of the transformed speech signal. Two similarity measures Euclidean distance and Manhattan distance are used for feature matching. The results indicate that the accuracy using both the similarity measures remains steady up to certain reduction in feature vector permitting to reduce feature vector size. This algorithm is tested using two databases: a locally created database and CSLU Database. It is observed that, DFT allows maximum percentage of feature vector reduction. It out performs other Transforms with a big margin.","PeriodicalId":319467,"journal":{"name":"2012 International Conference on Communication, Information & Computing Technology (ICCICT)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Communication, Information & Computing Technology (ICCICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCICT.2012.6398100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper a novel approach to text dependent speaker identification based on feature vector reduction technique of the row mean is proposed. Five different Orthogonal Transform Techniques: Discrete Fourier Transform (DFT), Discrete Cosine Transform (DCT), Discrete Sine Transform (DST), Discrete Hartley Transform (DHT) and Walsh Hadamard Transform (WHT) are applied on the framed speech signal. Feature extraction in the testing and matching phases has been done by using feature vector reduction technique applied on the row mean vector of the magnitude of the transformed speech signal. Two similarity measures Euclidean distance and Manhattan distance are used for feature matching. The results indicate that the accuracy using both the similarity measures remains steady up to certain reduction in feature vector permitting to reduce feature vector size. This algorithm is tested using two databases: a locally created database and CSLU Database. It is observed that, DFT allows maximum percentage of feature vector reduction. It out performs other Transforms with a big margin.