{"title":"多重正交窗谱估计在说话人识别中的应用","authors":"Bai Jing, Zhang Yiran, Yin Cong","doi":"10.1145/2757384.2757394","DOIUrl":null,"url":null,"abstract":"For speaker recognition systems, short-time spectrum of speech signal is obtained by using windowed discrete Fourier transform (DFT) in feature extraction. Although windowed DFT can reduces spectral leakage, variance of the spectrum estimation remains high, which reduces the stability of spectrum estimation. Multiple orthogonal window spectrum estimation(referred Multitapering) method, which can not only reduces spectral leakage but also reduces the variance of the spectrum estimation, has more stable performance of spectrum estimate, is utilized in this paper. After how number of windows affects performance of spectrum estimation is studied, the performance of speaker recognition system is also tested in noisy environment. The results show that multiple orthogonal spectrum estimation method has more stable performance and better noise robustness than Hamming windowed DFT.","PeriodicalId":330286,"journal":{"name":"Proceedings of the 2015 Workshop on Mobile Big Data","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of multiple orthogonal window spectrum estimation in speaker recognition\",\"authors\":\"Bai Jing, Zhang Yiran, Yin Cong\",\"doi\":\"10.1145/2757384.2757394\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For speaker recognition systems, short-time spectrum of speech signal is obtained by using windowed discrete Fourier transform (DFT) in feature extraction. Although windowed DFT can reduces spectral leakage, variance of the spectrum estimation remains high, which reduces the stability of spectrum estimation. Multiple orthogonal window spectrum estimation(referred Multitapering) method, which can not only reduces spectral leakage but also reduces the variance of the spectrum estimation, has more stable performance of spectrum estimate, is utilized in this paper. After how number of windows affects performance of spectrum estimation is studied, the performance of speaker recognition system is also tested in noisy environment. The results show that multiple orthogonal spectrum estimation method has more stable performance and better noise robustness than Hamming windowed DFT.\",\"PeriodicalId\":330286,\"journal\":{\"name\":\"Proceedings of the 2015 Workshop on Mobile Big Data\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2015 Workshop on Mobile Big Data\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2757384.2757394\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2015 Workshop on Mobile Big Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2757384.2757394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of multiple orthogonal window spectrum estimation in speaker recognition
For speaker recognition systems, short-time spectrum of speech signal is obtained by using windowed discrete Fourier transform (DFT) in feature extraction. Although windowed DFT can reduces spectral leakage, variance of the spectrum estimation remains high, which reduces the stability of spectrum estimation. Multiple orthogonal window spectrum estimation(referred Multitapering) method, which can not only reduces spectral leakage but also reduces the variance of the spectrum estimation, has more stable performance of spectrum estimate, is utilized in this paper. After how number of windows affects performance of spectrum estimation is studied, the performance of speaker recognition system is also tested in noisy environment. The results show that multiple orthogonal spectrum estimation method has more stable performance and better noise robustness than Hamming windowed DFT.