{"title":"Insect Sound Recognition Based on Convolutional Neural Network","authors":"X. Dong, Ning Yan, Ying Wei","doi":"10.1109/ICIVC.2018.8492871","DOIUrl":null,"url":null,"abstract":"A novel insect sound recognition system using enhanced spectrogram and convolutional neural network is proposed. Contrast-limit adaptive histogram equalization (CLAHE) is adopted to enhance R-space spectrogram. Traditionally, artificial feature extraction is an essential step of classification, introducing extra noise caused by subjectivity of individual researchers. In this paper, we construct a convolutional neural network (CNN) as classifier, extracting deep feature by machine learning. Mel-Frequency Cepstral Coefficient (MFCC) and chromatic spectrogram have been compared with enhanced R-space spectrogram as feature image. Eventually, 97.8723 % accuracy rate is achieved among 47 types of insect sound from USDA library.","PeriodicalId":173981,"journal":{"name":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","volume":"142 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIVC.2018.8492871","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
A novel insect sound recognition system using enhanced spectrogram and convolutional neural network is proposed. Contrast-limit adaptive histogram equalization (CLAHE) is adopted to enhance R-space spectrogram. Traditionally, artificial feature extraction is an essential step of classification, introducing extra noise caused by subjectivity of individual researchers. In this paper, we construct a convolutional neural network (CNN) as classifier, extracting deep feature by machine learning. Mel-Frequency Cepstral Coefficient (MFCC) and chromatic spectrogram have been compared with enhanced R-space spectrogram as feature image. Eventually, 97.8723 % accuracy rate is achieved among 47 types of insect sound from USDA library.