{"title":"Classification of Cold and Non-Cold Speech Using Vowel-Like Region Segments","authors":"Pankaj Warule, S. Mishra, S. Deb","doi":"10.1109/SPCOM55316.2022.9840775","DOIUrl":null,"url":null,"abstract":"This work uses vowel-like region segments of speech to classify cold and non-cold speech signals. As various articulators are affected by the common cold, speech produced during the common cold gets affected. These changes in a speech during common cold can be used to classify cold and non-cold speech. Vowel-like region (VLR) in speech includes vowels, semi-vowels, and diphthongs phonemes. Vowel-like regions are the dominant part of the speech signal. Hence, we have considered only vowel-like regions for cold and non-cold speech classification. The VLRs are identified by locating the VLR onset point (VLROP) and end point (VLREP). The Hilbert envelope and zero frequency filtering methods are used for detection of VLROPs and VLREPs. Mel frequency cepstral coefficients (MFCCs) feature are extracted from VLRs, and the performance of these features are evaluated using a deep neural network. Features extracted from VLRs give comparable results to features extracted from complete active speech (CAS) signal. Compared to the CAS technique, the number of frames that needs to be processed utilizing VLRs is significantly less.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"159 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPCOM55316.2022.9840775","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
This work uses vowel-like region segments of speech to classify cold and non-cold speech signals. As various articulators are affected by the common cold, speech produced during the common cold gets affected. These changes in a speech during common cold can be used to classify cold and non-cold speech. Vowel-like region (VLR) in speech includes vowels, semi-vowels, and diphthongs phonemes. Vowel-like regions are the dominant part of the speech signal. Hence, we have considered only vowel-like regions for cold and non-cold speech classification. The VLRs are identified by locating the VLR onset point (VLROP) and end point (VLREP). The Hilbert envelope and zero frequency filtering methods are used for detection of VLROPs and VLREPs. Mel frequency cepstral coefficients (MFCCs) feature are extracted from VLRs, and the performance of these features are evaluated using a deep neural network. Features extracted from VLRs give comparable results to features extracted from complete active speech (CAS) signal. Compared to the CAS technique, the number of frames that needs to be processed utilizing VLRs is significantly less.