{"title":"Nonlinear prediction of speech signals using radial basis function networks","authors":"M. Birgmeier","doi":"10.5281/ZENODO.36355","DOIUrl":null,"url":null,"abstract":"In this paper, we compare the capabilities of various forms of radial basis function networks as nonlinear short-term predictors for speech signals representing sustained utterances of German vowels. We use RBF and RBF-AR1 network architectures, trained using a standard algorithm or alternatively the extended Kalman filter (EKF) algorithm, and linear least squares predictors. We also look at cascaded forms of linear/nonlinear predictors. We evaluate both prediction gain and spectral flatness measure of the residual. The results indicate: The RBF-AR structure is the most powerful, EKF training yields better results than standard training for RBF networks, and a non-cascaded RBF-AR predictor produces results superior to cascaded predictors.","PeriodicalId":282153,"journal":{"name":"1996 8th European Signal Processing Conference (EUSIPCO 1996)","volume":"26 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1996 8th European Signal Processing Conference (EUSIPCO 1996)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.36355","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 27
Abstract
In this paper, we compare the capabilities of various forms of radial basis function networks as nonlinear short-term predictors for speech signals representing sustained utterances of German vowels. We use RBF and RBF-AR1 network architectures, trained using a standard algorithm or alternatively the extended Kalman filter (EKF) algorithm, and linear least squares predictors. We also look at cascaded forms of linear/nonlinear predictors. We evaluate both prediction gain and spectral flatness measure of the residual. The results indicate: The RBF-AR structure is the most powerful, EKF training yields better results than standard training for RBF networks, and a non-cascaded RBF-AR predictor produces results superior to cascaded predictors.