{"title":"Robust Multipitch Estimation of Piano Sounds Using Deep Spiking Neural Networks","authors":"Hanxiao Qian, Pengjie Gu, Rui Yan, Huajin Tang","doi":"10.1109/SSCI44817.2019.9003037","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.","PeriodicalId":6729,"journal":{"name":"2019 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"6 1","pages":"2335-2341"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI44817.2019.9003037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we propose a robust multi-label classification system based on deep spiking neural networks to handle multi-pitch estimation tasks. We employ constantQ transform spectrogram as a time-frequency representation. A keypoint detection technique is used for noise suppression and the extraction of relevant information. We also propose a novel biological spiking coding method that fits the expression of musical signals. This coding method can encode time, frequency, intensity information into spatiotemporal spike trains. And the spatio-temporal credit assignment (STCA) algorithm is used to train deep spiking neural networks. We perform the multipitch evaluation on the MAPS data set, and our work compares with the state-of-the-art methods by using the F1-score metric. Experimental results show that the proposed scheme has achieved better performance than other state-of-the-art methods and reveal the system’s robustness to environmental noise.