{"title":"Low cost speech detection using Haar-like filtering for sensornet","authors":"J. Nishimura, T. Kuroda","doi":"10.1109/ICOSP.2008.4697683","DOIUrl":null,"url":null,"abstract":"Haar-like filtering based speech detection is proposed as a new and very low calculation cost method for sensornet applications. The simple Haar-like filters having variable filter width and shift width are trained to learn appropriate filter parameters from the training samples to detect speech. Our method yielded speech/nonspeech classification accuracy of 96.93% for the input length of 0.1s. Compared with high performance feature extraction method MFCC (Mel-frequency cepstrum coefficient), the proposed Haar-like filtering can be approximately 85.77% efficient in terms of the amount of add and multiply calculations while capable of achieving the error rate of only 3.03% relative to MFCC.","PeriodicalId":445699,"journal":{"name":"2008 9th International Conference on Signal Processing","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 9th International Conference on Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2008.4697683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
Haar-like filtering based speech detection is proposed as a new and very low calculation cost method for sensornet applications. The simple Haar-like filters having variable filter width and shift width are trained to learn appropriate filter parameters from the training samples to detect speech. Our method yielded speech/nonspeech classification accuracy of 96.93% for the input length of 0.1s. Compared with high performance feature extraction method MFCC (Mel-frequency cepstrum coefficient), the proposed Haar-like filtering can be approximately 85.77% efficient in terms of the amount of add and multiply calculations while capable of achieving the error rate of only 3.03% relative to MFCC.