{"title":"Application of multiple orthogonal window spectrum estimation in speaker recognition","authors":"Bai Jing, Zhang Yiran, Yin Cong","doi":"10.1145/2757384.2757394","DOIUrl":null,"url":null,"abstract":"For speaker recognition systems, short-time spectrum of speech signal is obtained by using windowed discrete Fourier transform (DFT) in feature extraction. Although windowed DFT can reduces spectral leakage, variance of the spectrum estimation remains high, which reduces the stability of spectrum estimation. Multiple orthogonal window spectrum estimation(referred Multitapering) method, which can not only reduces spectral leakage but also reduces the variance of the spectrum estimation, has more stable performance of spectrum estimate, is utilized in this paper. After how number of windows affects performance of spectrum estimation is studied, the performance of speaker recognition system is also tested in noisy environment. The results show that multiple orthogonal spectrum estimation method has more stable performance and better noise robustness than Hamming windowed DFT.","PeriodicalId":330286,"journal":{"name":"Proceedings of the 2015 Workshop on Mobile Big Data","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2015 Workshop on Mobile Big Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2757384.2757394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
For speaker recognition systems, short-time spectrum of speech signal is obtained by using windowed discrete Fourier transform (DFT) in feature extraction. Although windowed DFT can reduces spectral leakage, variance of the spectrum estimation remains high, which reduces the stability of spectrum estimation. Multiple orthogonal window spectrum estimation(referred Multitapering) method, which can not only reduces spectral leakage but also reduces the variance of the spectrum estimation, has more stable performance of spectrum estimate, is utilized in this paper. After how number of windows affects performance of spectrum estimation is studied, the performance of speaker recognition system is also tested in noisy environment. The results show that multiple orthogonal spectrum estimation method has more stable performance and better noise robustness than Hamming windowed DFT.