{"title":"Acoustic Scene Classification for Bone-Conducted Sound Using Transfer Learning and Feature Fusion","authors":"Sijun Bi, Liang Xu, Shenghui Zhao, Jing Wang","doi":"10.1109/ICICSP55539.2022.10050618","DOIUrl":null,"url":null,"abstract":"The air-conducted (AC) sound is usually used in the task of acoustic scene classification (ASC). Compared with the AC sound, bone-conducted (BC) sound has the unique advantage of shielding background noise. However, the amount of information contained in BC sound is far less than that in the AC sound due to its limited frequency bandwidth. In this paper, an acoustic scene classification method for BC sound is proposed with a small BC dataset. Firstly, the prosodic features are combined with the spectral features to capture more information, and feature fusion is adopted. Secondly, in order to deal with the small BC dataset, transfer learning is used with a large AC dataset. Finally, a deep learning network based on local residual learning is proposed. The experimental results show that the proposed method achieves the superior performance over the reference models.","PeriodicalId":281095,"journal":{"name":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICSP55539.2022.10050618","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The air-conducted (AC) sound is usually used in the task of acoustic scene classification (ASC). Compared with the AC sound, bone-conducted (BC) sound has the unique advantage of shielding background noise. However, the amount of information contained in BC sound is far less than that in the AC sound due to its limited frequency bandwidth. In this paper, an acoustic scene classification method for BC sound is proposed with a small BC dataset. Firstly, the prosodic features are combined with the spectral features to capture more information, and feature fusion is adopted. Secondly, in order to deal with the small BC dataset, transfer learning is used with a large AC dataset. Finally, a deep learning network based on local residual learning is proposed. The experimental results show that the proposed method achieves the superior performance over the reference models.