N. Stefanakis, Konstantinos Psaroulakis, Nikonas Simou, Christos Astaras
{"title":"An Open-Access System for Long-Range Chainsaw Sound Detection","authors":"N. Stefanakis, Konstantinos Psaroulakis, Nikonas Simou, Christos Astaras","doi":"10.23919/eusipco55093.2022.9909629","DOIUrl":null,"url":null,"abstract":"A pipeline for automatic detection of chainsaw events in audio recordings is presented as the means to detect illegal logging activity in a protected natural environment. We propose a two-step process that consists of an activity detector at the front end and a deep neural network (DNN) classifier at the back end. At the front end, we use the Summation or Residual Harmonics method in order to detect patterns with harmonic structure in the audio recording. Active audio segments are consequently fed to the classifier that decides upon the absence or presence of a chainsaw event. As acoustic feature, we propose the widely-used amplitude spectrogram, passing it through the recently proposed Per-Channel Energy Normalization (PCEN) process. Results based on real-field recordings illustrate that the proposed end-to-end system may efficiently detect low-SNR chainsaw events at a very low false detection rate.","PeriodicalId":231263,"journal":{"name":"2022 30th European Signal Processing Conference (EUSIPCO)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 30th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/eusipco55093.2022.9909629","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A pipeline for automatic detection of chainsaw events in audio recordings is presented as the means to detect illegal logging activity in a protected natural environment. We propose a two-step process that consists of an activity detector at the front end and a deep neural network (DNN) classifier at the back end. At the front end, we use the Summation or Residual Harmonics method in order to detect patterns with harmonic structure in the audio recording. Active audio segments are consequently fed to the classifier that decides upon the absence or presence of a chainsaw event. As acoustic feature, we propose the widely-used amplitude spectrogram, passing it through the recently proposed Per-Channel Energy Normalization (PCEN) process. Results based on real-field recordings illustrate that the proposed end-to-end system may efficiently detect low-SNR chainsaw events at a very low false detection rate.