Low band continuous speech system for voice pathologies identification
Authors: Hugo Cordeiro, C. Meneses
Published in: 2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), 2018-09-01
DOI: 10.23919/SPA.2018.8563393 (https://doi.org/10.23919/SPA.2018.8563393)
This paper describes the impact of signal bandwidth reduction on the identification of voice pathologies. The implemented systems distinguish three classes: healthy subjects, subjects diagnosed with physiological larynx pathologies, and subjects diagnosed with neuromuscular larynx pathologies. Continuous speech signals are down-sampled to 4 kHz, and the extracted spectral parameters are applied to a GMM classifier. No significant change in accuracy occurs, which suggests that the low frequencies contain sufficient information to classify the pathologies. A second objective is to test the effects of suppressing voice activity detection and of increasing the analysis window length. In both cases the accuracy increases. In conclusion, a pathological voice identification system is proposed that is based on signals sampled at 4 kHz, uses no voice activity detection, and has an analysis window length of 40 ms; it achieves 81.8% accuracy. The proposed system also has the advantage of reducing storage memory and processing time.
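The front end described in the abstract (down-sampling to 4 kHz, then analysis over 40 ms windows with no voice activity detection) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the 16 kHz original sampling rate, the 20 ms hop, and the log band energies used as "spectral parameters" are all assumptions, since the abstract does not specify them. The GMM classifier stage is not shown.

```python
from math import gcd

import numpy as np
from scipy.signal import resample_poly

TARGET_RATE = 4000  # Hz, sampling rate stated in the abstract
WINDOW_MS = 40      # ms, analysis window length stated in the abstract


def downsample(signal, orig_rate, target_rate=TARGET_RATE):
    """Resample to target_rate using polyphase filtering (anti-aliased)."""
    g = gcd(int(orig_rate), int(target_rate))
    return resample_poly(signal, target_rate // g, orig_rate // g)


def frame_signal(signal, rate=TARGET_RATE, window_ms=WINDOW_MS, hop_ms=20):
    """Split the whole signal into overlapping frames.

    No voice activity detection is applied, so every frame is kept.
    hop_ms is an assumed value; the abstract does not give the hop size.
    """
    win = rate * window_ms // 1000
    hop = rate * hop_ms // 1000
    n_frames = 1 + (len(signal) - win) // hop
    idx = np.arange(win)[None, :] + hop * np.arange(n_frames)[:, None]
    return signal[idx]


def log_band_energies(frames, n_bands=8):
    """Assumed stand-in for the paper's spectral parameters:
    log energy in n_bands equal-width bands between 0 Hz and Nyquist."""
    windowed = frames * np.hamming(frames.shape[1])
    power = np.abs(np.fft.rfft(windowed, axis=1)) ** 2
    bands = np.array_split(power, n_bands, axis=1)
    return np.log(np.stack([b.sum(axis=1) for b in bands], axis=1) + 1e-12)


# Example: one second of a 100 Hz tone at a hypothetical 16 kHz rate.
orig_rate = 16000
t = np.arange(orig_rate) / orig_rate
tone = np.sin(2 * np.pi * 100 * t)

low_band = downsample(tone, orig_rate)   # 4000 samples
frames = frame_signal(low_band)          # shape (49, 160)
features = log_band_energies(frames)     # shape (49, 8)
print(low_band.shape, frames.shape, features.shape)
```

At 4 kHz a 40 ms window holds 160 samples and the usable bandwidth is 2 kHz. Relative to the assumed 16 kHz source, the signal needs a quarter of the storage, which is consistent with the memory and processing savings the abstract mentions. In a full system, the per-frame feature vectors would be passed to one GMM per class, and the class with the highest likelihood would be selected.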