A. Mahmood, M. Alsulaiman, G. Muhammad, S. Selouani
{"title":"MDLF-Mavg: A new speech feature with a voice print","authors":"A. Mahmood, M. Alsulaiman, G. Muhammad, S. Selouani","doi":"10.1109/IEEEGCC.2013.6705847","DOIUrl":null,"url":null,"abstract":"A new feature extraction method is presented in this paper. It is modification to our previous work where we have extracted Multidimentional local features (MDLF). We name the new feature as Multi-Directional Local Feature with moving average(MDLF-Mavg). MDLF-Mavg is based on three-point linear regression and three point moving average. Linear regression is applied on horizontal (time axis) that captures phoneme onset and offset and vertical (frequency axis) which capture formant contours whereas modified moving average is applied on (45 degree) time-frequency axis and (135 degree) time-frequency axis which capture the voiceprint of speaker. The MDLF-Mavg performance is compared to other speech features in a speaker recognition system. MDLF-Mavg has shown better performance than the other features. In the case of the female only part of the database it achieved 100% recognition rate. We will show that MDLF-Mavg produce what can be looked at as a voice print for each speaker when vocalizing a certain text.","PeriodicalId":316751,"journal":{"name":"2013 7th IEEE GCC Conference and Exhibition (GCC)","volume":"149 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 7th IEEE GCC Conference and Exhibition (GCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEEEGCC.2013.6705847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
A new feature extraction method is presented in this paper. It is modification to our previous work where we have extracted Multidimentional local features (MDLF). We name the new feature as Multi-Directional Local Feature with moving average(MDLF-Mavg). MDLF-Mavg is based on three-point linear regression and three point moving average. Linear regression is applied on horizontal (time axis) that captures phoneme onset and offset and vertical (frequency axis) which capture formant contours whereas modified moving average is applied on (45 degree) time-frequency axis and (135 degree) time-frequency axis which capture the voiceprint of speaker. The MDLF-Mavg performance is compared to other speech features in a speaker recognition system. MDLF-Mavg has shown better performance than the other features. In the case of the female only part of the database it achieved 100% recognition rate. We will show that MDLF-Mavg produce what can be looked at as a voice print for each speaker when vocalizing a certain text.