{"title":"Implementation of Audio Event Recognition for The Elderly Home Support Using Convolutional Neural Networks","authors":"A. Ramadhan, A. Wijayanto, H. Oktavianto","doi":"10.1109/IES50839.2020.9231702","DOIUrl":null,"url":null,"abstract":"This paper proposes a development of a smart home system for assisting elderly people by implementing an Audio Event Recognition (AER). By listening to the sound in the environment, the AER recognizes audio events that have been trained and then produces a useful information. There are four pretrained audio events namely door knock, can dropped, kettle sound, and rain sound. The audio in the environment is sampled for 5 seconds. Then, the sampled audio is processed into a spectrogram with a size of 128 x 76 pixels. The spectrogram serves as an input image for the Convolutional Neural Networks (CNN) to be recognized. Finally, after the spectrogram is recognized, the system produces information and transmit it to the cloud to be gathered by a smartphone. The system was implemented using Raspberry Pi 4. The experimental results show an accuracy rate of 97.5 % and 85% with a background noise of less than 40 dB and around 40 - 60 dB, respectively.","PeriodicalId":344685,"journal":{"name":"2020 International Electronics Symposium (IES)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Electronics Symposium (IES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IES50839.2020.9231702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper proposes a development of a smart home system for assisting elderly people by implementing an Audio Event Recognition (AER). By listening to the sound in the environment, the AER recognizes audio events that have been trained and then produces a useful information. There are four pretrained audio events namely door knock, can dropped, kettle sound, and rain sound. The audio in the environment is sampled for 5 seconds. Then, the sampled audio is processed into a spectrogram with a size of 128 x 76 pixels. The spectrogram serves as an input image for the Convolutional Neural Networks (CNN) to be recognized. Finally, after the spectrogram is recognized, the system produces information and transmit it to the cloud to be gathered by a smartphone. The system was implemented using Raspberry Pi 4. The experimental results show an accuracy rate of 97.5 % and 85% with a background noise of less than 40 dB and around 40 - 60 dB, respectively.
本文提出了一种智能家居系统的开发,通过实施音频事件识别(AER)来帮助老年人。通过聆听环境中的声音,AER识别经过训练的音频事件,然后产生有用的信息。有四种预先训练的音频事件,即敲门声、罐子掉落声、水壶声和雨声。环境中的音频采样5秒。然后,将采样的音频处理成大小为128 x 76像素的频谱图。频谱图作为卷积神经网络(CNN)的输入图像进行识别。最后,在光谱图被识别后,系统产生信息并将其传输到云端,由智能手机收集。该系统是在Raspberry Pi 4上实现的。实验结果表明,在背景噪声小于40 dB和40 ~ 60 dB时,该方法的准确率分别为97.5%和85%。