Speech Endpoint Detection Based on EMD and Improved Spectral Subtraction
Jin Wu, Gege Chong, Wenting Pang, Lei Wang
DOI: 10.1109/icnlp58431.2023.00029
Published 2023-03-01, pp. 126-130
To address the low accuracy of speech endpoint detection at low signal-to-noise ratios, this paper proposes an endpoint detection algorithm based on Empirical Mode Decomposition (EMD) and improved spectral subtraction, which applies noise reduction before endpoint detection. After EMD decomposition and reconstruction, the algorithm reduces noise with an improved spectral subtraction based on multi-window spectral estimation, which raises the signal-to-noise ratio of the speech signal, and then detects the endpoints using the Teager energy and the Zero-Crossing Rate (ZCR). Simulation experiments verify the effectiveness and feasibility of the method. The speech signals used in the experiments were recorded in a quiet environment. Compared with the speech endpoint detection algorithm based on empirical mode decomposition and an improved two-threshold method, the proposed algorithm significantly improves the accuracy of endpoint detection.
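The final stage described above, endpoint detection from the Teager energy and ZCR, can be illustrated with a minimal sketch. This is not the authors' implementation: the EMD and spectral-subtraction stages are omitted, and the function names, frame sizes, the threshold factor `k`, and the rule combining the two features are all assumptions chosen for illustration. It uses the standard discrete Teager energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], and estimates thresholds from leading frames that are assumed to contain only noise.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames (one frame per row)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]^2 - x[n-1]*x[n+1]."""
    psi = np.zeros(len(x), dtype=float)
    psi[1:-1] = x[1:-1] ** 2 - x[:-2] * x[2:]
    return psi

def zero_crossing_rate(frames):
    """Fraction of adjacent sample pairs in each frame whose signs differ."""
    signs = np.sign(frames)
    signs[signs == 0] = 1  # treat exact zeros as positive
    return np.mean(signs[:, 1:] != signs[:, :-1], axis=1)

def longest_run(mask):
    """Return [start, end) frame indices of the longest run of True values."""
    best, start = (0, 0), None
    for i, v in enumerate(np.append(mask, False)):
        if v and start is None:
            start = i
        elif not v and start is not None:
            if i - start > best[1] - best[0]:
                best = (start, i)
            start = None
    return best

def detect_endpoints(x, fs, frame_ms=25, hop_ms=10, noise_frames=10, k=5.0):
    """Return (start_s, end_s) of the longest speech-like segment, or None.

    Hypothetical decision rule: a frame counts as speech if its mean absolute
    Teager energy exceeds a high threshold, or exceeds a lower threshold while
    its ZCR is also above the noise baseline.
    """
    frame_len = int(fs * frame_ms / 1000)
    hop = int(fs * hop_ms / 1000)
    # The Teager energy can be negative for noise, so average its magnitude.
    te = frame_signal(np.abs(teager_energy(x)), frame_len, hop).mean(axis=1)
    zcr = zero_crossing_rate(frame_signal(x, frame_len, hop))
    # Thresholds from the leading frames, assumed to be noise only.
    te_mu, te_sd = te[:noise_frames].mean(), te[:noise_frames].std()
    zc_mu, zc_sd = zcr[:noise_frames].mean(), zcr[:noise_frames].std()
    te_hi, te_lo = te_mu + k * te_sd, te_mu + 0.5 * k * te_sd
    speech = (te > te_hi) | ((te > te_lo) & (zcr > zc_mu + k * zc_sd))
    first, last = longest_run(speech)
    if last == first:
        return None
    return first * hop / fs, ((last - 1) * hop + frame_len) / fs

# Usage on a synthetic signal: weak noise with a 200 Hz tone from 0.5 s to 1.0 s.
fs = 8000
rng = np.random.default_rng(0)
t = np.arange(int(1.5 * fs)) / fs
sig = 0.01 * rng.standard_normal(len(t))
sig[(t >= 0.5) & (t < 1.0)] += np.sin(2 * np.pi * 200 * t[(t >= 0.5) & (t < 1.0)])
print(detect_endpoints(sig, fs))
```

Keeping only the longest run of flagged frames is a crude stand-in for the smoothing a practical detector would need. In the pipeline described in the abstract, the input to this stage would be the signal after EMD reconstruction and spectral subtraction rather than the raw recording.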