{"title":"Adaptive robust speech processing based on acoustic noise estimation and classification","authors":"F. Beritelli, S. Casale, S. Serrano","doi":"10.1109/ISSPIT.2005.1577196","DOIUrl":null,"url":null,"abstract":"The paper presents an adaptive system for speech signal processing in the presence of loud background noise. The validity of the approach is confirmed by implementing a classification system for voiced and unvoiced (V/UV) speech frames. Genetic algorithms were used to select the parameters that offer the best V/UV classification in the presence of 4 different types of background noise and with 5 different SNRs. 20 neural network-based classification systems were then implemented, chosen dynamically frame by frame according to the output of a background noise recognition system and an SNR estimation system. The system was implemented and the tests performed using the TIMIT speech corpus and its phonetic classification. The results were compared with a non-adaptive classification system and the 3 V/UV detectors adopted by three important: LPClO, ITU-T G. 723.1 and ETSI AMR. In all cases the adaptive V/UV classifier clearly outperformed the others, confirming the validity of the adaptive approach","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPIT.2005.1577196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
The paper presents an adaptive system for speech signal processing in the presence of loud background noise. The validity of the approach is confirmed by implementing a classification system for voiced and unvoiced (V/UV) speech frames. Genetic algorithms were used to select the parameters that offer the best V/UV classification in the presence of 4 different types of background noise and with 5 different SNRs. 20 neural network-based classification systems were then implemented, chosen dynamically frame by frame according to the output of a background noise recognition system and an SNR estimation system. The system was implemented and the tests performed using the TIMIT speech corpus and its phonetic classification. The results were compared with a non-adaptive classification system and the 3 V/UV detectors adopted by three important: LPClO, ITU-T G. 723.1 and ETSI AMR. In all cases the adaptive V/UV classifier clearly outperformed the others, confirming the validity of the adaptive approach
提出了一种适用于大背景噪声环境下语音信号处理的自适应系统。通过实现浊音和浊音(V/UV)语音帧的分类系统,验证了该方法的有效性。在4种不同类型的背景噪声和5种不同信噪比的情况下,使用遗传算法选择提供最佳V/UV分类的参数。然后实现了20个基于神经网络的分类系统,根据背景噪声识别系统和信噪比估计系统的输出逐帧动态选择。利用TIMIT语音语料库及其语音分类对系统进行了实现和测试。结果与非自适应分类系统和三种重要的3 V/UV检测器LPClO、ITU-T G. 723.1和ETSI AMR进行了比较。在所有情况下,自适应V/UV分类器明显优于其他分类器,证实了自适应方法的有效性