{"title":"基于迁移学习和特征融合的骨传导声场景分类","authors":"Sijun Bi, Liang Xu, Shenghui Zhao, Jing Wang","doi":"10.1109/ICICSP55539.2022.10050618","DOIUrl":null,"url":null,"abstract":"The air-conducted (AC) sound is usually used in the task of acoustic scene classification (ASC). Compared with the AC sound, bone-conducted (BC) sound has the unique advantage of shielding background noise. However, the amount of information contained in BC sound is far less than that in the AC sound due to its limited frequency bandwidth. In this paper, an acoustic scene classification method for BC sound is proposed with a small BC dataset. Firstly, the prosodic features are combined with the spectral features to capture more information, and feature fusion is adopted. Secondly, in order to deal with the small BC dataset, transfer learning is used with a large AC dataset. Finally, a deep learning network based on local residual learning is proposed. The experimental results show that the proposed method achieves the superior performance over the reference models.","PeriodicalId":281095,"journal":{"name":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Acoustic Scene Classification for Bone-Conducted Sound Using Transfer Learning and Feature Fusion\",\"authors\":\"Sijun Bi, Liang Xu, Shenghui Zhao, Jing Wang\",\"doi\":\"10.1109/ICICSP55539.2022.10050618\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The air-conducted (AC) sound is usually used in the task of acoustic scene classification (ASC). Compared with the AC sound, bone-conducted (BC) sound has the unique advantage of shielding background noise. However, the amount of information contained in BC sound is far less than that in the AC sound due to its limited frequency bandwidth. In this paper, an acoustic scene classification method for BC sound is proposed with a small BC dataset. Firstly, the prosodic features are combined with the spectral features to capture more information, and feature fusion is adopted. Secondly, in order to deal with the small BC dataset, transfer learning is used with a large AC dataset. Finally, a deep learning network based on local residual learning is proposed. The experimental results show that the proposed method achieves the superior performance over the reference models.\",\"PeriodicalId\":281095,\"journal\":{\"name\":\"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICSP55539.2022.10050618\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Information Communication and Signal Processing (ICICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICSP55539.2022.10050618","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Acoustic Scene Classification for Bone-Conducted Sound Using Transfer Learning and Feature Fusion
The air-conducted (AC) sound is usually used in the task of acoustic scene classification (ASC). Compared with the AC sound, bone-conducted (BC) sound has the unique advantage of shielding background noise. However, the amount of information contained in BC sound is far less than that in the AC sound due to its limited frequency bandwidth. In this paper, an acoustic scene classification method for BC sound is proposed with a small BC dataset. Firstly, the prosodic features are combined with the spectral features to capture more information, and feature fusion is adopted. Secondly, in order to deal with the small BC dataset, transfer learning is used with a large AC dataset. Finally, a deep learning network based on local residual learning is proposed. The experimental results show that the proposed method achieves the superior performance over the reference models.