{"title":"The Study of Voice Pathology Detection based on MFCC and SVM","authors":"Yipeng Niu, Jiaming Cao, Fei Shen, Pengling Ren","doi":"10.1145/3444884.3444890","DOIUrl":null,"url":null,"abstract":"Subjective auditory perception evaluation of voice is the most simple and direct method for judgment of the degree of voice lesions and the treatment effect. But it is closely related to the clinical experience of doctors. Recently, some voice automatic diagnosis methods based on voice feature parameters and classification algorithms have been proposed. Mel Frequency Cepstral Coefficient (MFCC) is the most commonly used feature parameter. However, it is not clear the role of MFCC dynamic features in improving diagnosis results. This study adopted the features of MFCC, MFCC + ΔMFCC, and MFCC + ΔMFCC + ΔΔMFCC respectively, combined with the Support Vector Machine (SVM) method to further determine whether adding dynamic MFCC features can improve the accuracy of pathological voice detection. The results showed that no matter whether dynamic features were added or not, the accuracy rate and specificity have not changed significantly. This means the dynamic change of the MFCC characteristic parameters is slight at least for vowel vocalization. This study may provide useful information for pathological voice diagnosis based on vowel vocalization.","PeriodicalId":142206,"journal":{"name":"Proceedings of the 2020 7th International Conference on Biomedical and Bioinformatics Engineering","volume":"325 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 7th International Conference on Biomedical and Bioinformatics Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3444884.3444890","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Subjective auditory perception evaluation of voice is the most simple and direct method for judgment of the degree of voice lesions and the treatment effect. But it is closely related to the clinical experience of doctors. Recently, some voice automatic diagnosis methods based on voice feature parameters and classification algorithms have been proposed. Mel Frequency Cepstral Coefficient (MFCC) is the most commonly used feature parameter. However, it is not clear the role of MFCC dynamic features in improving diagnosis results. This study adopted the features of MFCC, MFCC + ΔMFCC, and MFCC + ΔMFCC + ΔΔMFCC respectively, combined with the Support Vector Machine (SVM) method to further determine whether adding dynamic MFCC features can improve the accuracy of pathological voice detection. The results showed that no matter whether dynamic features were added or not, the accuracy rate and specificity have not changed significantly. This means the dynamic change of the MFCC characteristic parameters is slight at least for vowel vocalization. This study may provide useful information for pathological voice diagnosis based on vowel vocalization.