{"title":"基于深度神经网络模型的印度古典复调乐器音频音调检测","authors":"Ashwini, A. Krishna, V. Mahesh, G. K. Karrthik","doi":"10.1504/IJFE.2020.10037779","DOIUrl":null,"url":null,"abstract":"Identification of tone from a polyphonic audio is quite a challenging task in digital audio processing. When the audio clip is a classical instrumental track the process is even more cumbersome. This paper proposes a novel approach to detect the tone of polyphonic Indian classical instrumental audio using scaled exponential linear unit (SeLu) activated Deep Neural Network (DNN) along with instrument identification which also uses SeLu activated DNN Model. This aims at utilising the same key features which help in instrument detection in real-life situations. The number of features were also reduced from 34 to 26 in comparison with the earlier work by analysing and identifying the redundant features and adding a few more important characteristic features. The proposed Instrument identification model predicts instruments with an accuracy of 84.39% for Carnatic classical and 83.59% for Hindustani classical. The SeLu activated DNN model for tone detection has attained an accuracy of 88.30%.","PeriodicalId":443235,"journal":{"name":"International Journal of Forensic Engineering","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Tone detection for Indian classical polyphonic instrumental audio using DNN model\",\"authors\":\"Ashwini, A. Krishna, V. Mahesh, G. K. Karrthik\",\"doi\":\"10.1504/IJFE.2020.10037779\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Identification of tone from a polyphonic audio is quite a challenging task in digital audio processing. When the audio clip is a classical instrumental track the process is even more cumbersome. This paper proposes a novel approach to detect the tone of polyphonic Indian classical instrumental audio using scaled exponential linear unit (SeLu) activated Deep Neural Network (DNN) along with instrument identification which also uses SeLu activated DNN Model. This aims at utilising the same key features which help in instrument detection in real-life situations. The number of features were also reduced from 34 to 26 in comparison with the earlier work by analysing and identifying the redundant features and adding a few more important characteristic features. The proposed Instrument identification model predicts instruments with an accuracy of 84.39% for Carnatic classical and 83.59% for Hindustani classical. The SeLu activated DNN model for tone detection has attained an accuracy of 88.30%.\",\"PeriodicalId\":443235,\"journal\":{\"name\":\"International Journal of Forensic Engineering\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Forensic Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJFE.2020.10037779\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Forensic Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJFE.2020.10037779","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Tone detection for Indian classical polyphonic instrumental audio using DNN model
Identification of tone from a polyphonic audio is quite a challenging task in digital audio processing. When the audio clip is a classical instrumental track the process is even more cumbersome. This paper proposes a novel approach to detect the tone of polyphonic Indian classical instrumental audio using scaled exponential linear unit (SeLu) activated Deep Neural Network (DNN) along with instrument identification which also uses SeLu activated DNN Model. This aims at utilising the same key features which help in instrument detection in real-life situations. The number of features were also reduced from 34 to 26 in comparison with the earlier work by analysing and identifying the redundant features and adding a few more important characteristic features. The proposed Instrument identification model predicts instruments with an accuracy of 84.39% for Carnatic classical and 83.59% for Hindustani classical. The SeLu activated DNN model for tone detection has attained an accuracy of 88.30%.