Development of CNN Pruning Method for Earthquake Signal Imagery Classification

2022 14th International Conference on Information Technology and Electrical Engineering (ICITEE) Pub Date : 2022-10-18 DOI:10.1109/ICITEE56407.2022.9954106

Bayu Kusuma Atmaja, I. Mustika, Risanuri Hidayat, Hajar Nimpuno Adi, Ahmad Taufiq Musaddid

{"title":"Development of CNN Pruning Method for Earthquake Signal Imagery Classification","authors":"Bayu Kusuma Atmaja, I. Mustika, Risanuri Hidayat, Hajar Nimpuno Adi, Ahmad Taufiq Musaddid","doi":"10.1109/ICITEE56407.2022.9954106","DOIUrl":null,"url":null,"abstract":"The real-time detection of earthquake occurrences in a seismic wave, drawn by seismograph, is crucial for disaster mitigation. The earlier the earthquake warning, the more lives can be saved. One approach that can monitor and detect earthquake occurrence is binary classification between earthquake and noise signals. The use of deep learning models such as CNN (Convolutional Neural Network), is considered quite accurate to perform an image classification of seismograph signals. Nevertheless, the tendency to use the large CNN model is rated to have better accuracy than smaller models. In fact, the disadvantage of utilizing large model is the inference time and the deployment of a large model to obtain real-time inference is more costly than the smaller model. This paper aims to reduce the size of a CNN model (Resnet50) by pruning the unnecessary filters and neuron on the model architecture without sacrificing the accuracy. The task of the model was to classify two classes (earthquake and noise) of spectrogram images, the dataset is STEAD (Stanford Earthquake Dataset). To prioritize which filter or neuron to be eliminated, L2-norm was calculated on each filter or neuron weights. We assumed that a filter or neuron with the lowest L2-norm had the least significant role in the model. By pruning 90% of the filter and neuron of the model and retraining the pruned model, the inference time was improved from 22. 45ms to 3. 6ms (on NVIDIA GTX 1050) per image with the accuracy of 99.405%.","PeriodicalId":246279,"journal":{"name":"2022 14th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEE56407.2022.9954106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The real-time detection of earthquake occurrences in a seismic wave, drawn by seismograph, is crucial for disaster mitigation. The earlier the earthquake warning, the more lives can be saved. One approach that can monitor and detect earthquake occurrence is binary classification between earthquake and noise signals. The use of deep learning models such as CNN (Convolutional Neural Network), is considered quite accurate to perform an image classification of seismograph signals. Nevertheless, the tendency to use the large CNN model is rated to have better accuracy than smaller models. In fact, the disadvantage of utilizing large model is the inference time and the deployment of a large model to obtain real-time inference is more costly than the smaller model. This paper aims to reduce the size of a CNN model (Resnet50) by pruning the unnecessary filters and neuron on the model architecture without sacrificing the accuracy. The task of the model was to classify two classes (earthquake and noise) of spectrogram images, the dataset is STEAD (Stanford Earthquake Dataset). To prioritize which filter or neuron to be eliminated, L2-norm was calculated on each filter or neuron weights. We assumed that a filter or neuron with the lowest L2-norm had the least significant role in the model. By pruning 90% of the filter and neuron of the model and retraining the pruned model, the inference time was improved from 22. 45ms to 3. 6ms (on NVIDIA GTX 1050) per image with the accuracy of 99.405%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

地震信号图像分类CNN剪枝方法的发展

利用地震仪绘制的地震波实时探测地震发生，对减灾至关重要。地震预警越早，就能挽救越多的生命。一种监测和检测地震发生的方法是对地震信号和噪声信号进行二值分类。使用深度学习模型，如CNN(卷积神经网络)，被认为是相当准确地执行地震仪信号的图像分类。然而，使用大型CNN模型的倾向被认为比较小的模型具有更好的准确性。实际上，利用大型模型的缺点是推理时间，并且部署大型模型以获得实时推理的成本要高于小型模型。本文旨在在不牺牲精度的情况下，通过在模型架构上修剪不必要的滤波器和神经元来减小CNN模型(Resnet50)的尺寸。该模型的任务是对两类(地震和噪声)光谱图图像进行分类，数据集为STEAD (Stanford earthquake dataset)。为了优先考虑要消除哪些滤波器或神经元，对每个滤波器或神经元的权重计算l2范数。我们假设具有最低l2范数的过滤器或神经元在模型中具有最不重要的作用。通过对模型中90%的滤波器和神经元进行剪枝，并对剪枝后的模型进行再训练，将推理时间从22小时提高到22小时。45毫秒到3。每张图像6ms(在NVIDIA GTX 1050上)，准确率为99.405%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 14th International Conference on Information Technology and Electrical Engineering (ICITEE)

自引率

0.00%

发文量