{"title":"Sound Event Detection using Federated Learning","authors":"M. K. Maurya, Mandeep Kumar, Manish Kumar","doi":"10.1109/UPCON56432.2022.9986444","DOIUrl":null,"url":null,"abstract":"The study of sound event detection (SED) in environmental environments has gained popularity recently. However, significant logistical and privacy concerns exist because huge amounts of (private) home or urban audio data are needed. Federated learning (FL), which effectively distributes these duties, is a viable way to use enormous amounts of data without raising privacy issues. Although FL has recently gained much attention, only a few studies have been done on FL for SED. In this paper, we attempted FL for SED to fill this gap and encourage further study. This paper demonstrated the experiments on the URBAN and MNIST datasets to better understand the impact of data heterogeneity, optimizer, client participation, and communication round. Additionally, we run baseline outcomes for deep neural network designs on the datasets in an FL context. The CNN-M model is used for training and testing purposes; two datasets, namely URBAN and MNIST audio datasets, are used.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"326-327 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON56432.2022.9986444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The study of sound event detection (SED) in environmental environments has gained popularity recently. However, significant logistical and privacy concerns exist because huge amounts of (private) home or urban audio data are needed. Federated learning (FL), which effectively distributes these duties, is a viable way to use enormous amounts of data without raising privacy issues. Although FL has recently gained much attention, only a few studies have been done on FL for SED. In this paper, we attempted FL for SED to fill this gap and encourage further study. This paper demonstrated the experiments on the URBAN and MNIST datasets to better understand the impact of data heterogeneity, optimizer, client participation, and communication round. Additionally, we run baseline outcomes for deep neural network designs on the datasets in an FL context. The CNN-M model is used for training and testing purposes; two datasets, namely URBAN and MNIST audio datasets, are used.