N. A. Johari, M. Hariharan, A. Saidatul, S. Yaacob
{"title":"基于小波包能量和熵特征的语音多风格分类","authors":"N. A. Johari, M. Hariharan, A. Saidatul, S. Yaacob","doi":"10.1109/STUDENT.2011.6089328","DOIUrl":null,"url":null,"abstract":"Physco-physiological conditions like stress and emotional state may affect human's speech production. Different techniques have been proposed such as facial expressions, speech production variation, and physiological signals to detect the emotional/stressed states of a person. For past 2 decades, the determination of an emotional/stressed state through speech has been undergone substantial research and development. Various techniques are used in the literature to classify emotional/stressed states on the basis of speech. In this paper, a feature extraction method using two different wavelet packet filterbank structures which are based on barkscale and equivalent rectangular bandwidth (ERB) scale for identifying the emotional/stressed states of a person. In this study speech samples are taken from Speech Under Simulated and Actual Stress (SUSAS) database. Linear Discriminant analysis (LDA) based classifier is used to test usefulness of suggested features. Experimental result shows that the suggested methods can be used to identify the emotional/stressed states of a person.","PeriodicalId":247351,"journal":{"name":"2011 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT)","volume":"42 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Multistyle classification of speech under stress using wavelet packet energy and entropy features\",\"authors\":\"N. A. Johari, M. Hariharan, A. Saidatul, S. Yaacob\",\"doi\":\"10.1109/STUDENT.2011.6089328\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Physco-physiological conditions like stress and emotional state may affect human's speech production. Different techniques have been proposed such as facial expressions, speech production variation, and physiological signals to detect the emotional/stressed states of a person. For past 2 decades, the determination of an emotional/stressed state through speech has been undergone substantial research and development. Various techniques are used in the literature to classify emotional/stressed states on the basis of speech. In this paper, a feature extraction method using two different wavelet packet filterbank structures which are based on barkscale and equivalent rectangular bandwidth (ERB) scale for identifying the emotional/stressed states of a person. In this study speech samples are taken from Speech Under Simulated and Actual Stress (SUSAS) database. Linear Discriminant analysis (LDA) based classifier is used to test usefulness of suggested features. Experimental result shows that the suggested methods can be used to identify the emotional/stressed states of a person.\",\"PeriodicalId\":247351,\"journal\":{\"name\":\"2011 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT)\",\"volume\":\"42 6\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/STUDENT.2011.6089328\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STUDENT.2011.6089328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multistyle classification of speech under stress using wavelet packet energy and entropy features
Physco-physiological conditions like stress and emotional state may affect human's speech production. Different techniques have been proposed such as facial expressions, speech production variation, and physiological signals to detect the emotional/stressed states of a person. For past 2 decades, the determination of an emotional/stressed state through speech has been undergone substantial research and development. Various techniques are used in the literature to classify emotional/stressed states on the basis of speech. In this paper, a feature extraction method using two different wavelet packet filterbank structures which are based on barkscale and equivalent rectangular bandwidth (ERB) scale for identifying the emotional/stressed states of a person. In this study speech samples are taken from Speech Under Simulated and Actual Stress (SUSAS) database. Linear Discriminant analysis (LDA) based classifier is used to test usefulness of suggested features. Experimental result shows that the suggested methods can be used to identify the emotional/stressed states of a person.