IMCMK-CNN: A lightweight convolutional neural network with Multi-scale Kernels for Image-based Malware Classification

IF 6.2 2区工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY alexandria engineering journal Pub Date : 2024-10-22 DOI:10.1016/j.aej.2024.10.055

Dandan Zhang, Yafei Song, Qian Xiang, Yang Wang

{"title":"IMCMK-CNN: A lightweight convolutional neural network with Multi-scale Kernels for Image-based Malware Classification","authors":"Dandan Zhang, Yafei Song, Qian Xiang, Yang Wang","doi":"10.1016/j.aej.2024.10.055","DOIUrl":null,"url":null,"abstract":"<div><div>Rapid and accurate identification of unknown malware and its variants is the premise and basis for the effective prevention of malicious attacks. However, with the explosive growth of malware variants, the efficiency of manual updating of the sample database is getting worse and worse. It is difficult for the traditional identification methods to effectively capture the sample feature information operated by the confusion method only based on the delayed database information. The research into the direction of malware detection is dedicated to surmounting the limitations of conventional detection methodologies, and delves deeply into the application of cutting-edge technologies such as data visualization, machine learning, and hybrid detection within the realm of malware detection. Through these investigations, our goal is to construct a detection system that is both more precise and efficient, capable of addressing the ever-evolving threats to cybersecurity. Pursuing research in this direction is not only vital for enhancing network security defenses and safeguarding user data, but it will also foster the advancement of related state-of-the-art technologies and further mitigate the economic and societal repercussions of malware attacks. In light of this issue, this paper proposes the Image-based Malware Classification with Multi-scale Kernels (IMCMK), a Convolutional Neural Network (CNN) architecture using multi-scale convolution kernels mixing action to improve malware variants detection capabilities. First, we propose the Multi-scale Kernels (MK) block combining deep large kernel convolution and standard small kernel convolution with shortcuts to improve the accuracy. Furthermore, we propose Multi-scale Kernel Fusion (MKF) to reduce the number of parameters that come with the large kernels. The improved Squeeze-and-Excitation (SE) block can obtain the correlation between different channels to further increase the model performance. Experimental results show that IMCMK outperforms the state-of-the-art methods in malware family classification accuracy, which has achieved 99.25 %.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"111 ","pages":"Pages 203-220"},"PeriodicalIF":6.2000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016824012109","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Rapid and accurate identification of unknown malware and its variants is the premise and basis for the effective prevention of malicious attacks. However, with the explosive growth of malware variants, the efficiency of manual updating of the sample database is getting worse and worse. It is difficult for the traditional identification methods to effectively capture the sample feature information operated by the confusion method only based on the delayed database information. The research into the direction of malware detection is dedicated to surmounting the limitations of conventional detection methodologies, and delves deeply into the application of cutting-edge technologies such as data visualization, machine learning, and hybrid detection within the realm of malware detection. Through these investigations, our goal is to construct a detection system that is both more precise and efficient, capable of addressing the ever-evolving threats to cybersecurity. Pursuing research in this direction is not only vital for enhancing network security defenses and safeguarding user data, but it will also foster the advancement of related state-of-the-art technologies and further mitigate the economic and societal repercussions of malware attacks. In light of this issue, this paper proposes the Image-based Malware Classification with Multi-scale Kernels (IMCMK), a Convolutional Neural Network (CNN) architecture using multi-scale convolution kernels mixing action to improve malware variants detection capabilities. First, we propose the Multi-scale Kernels (MK) block combining deep large kernel convolution and standard small kernel convolution with shortcuts to improve the accuracy. Furthermore, we propose Multi-scale Kernel Fusion (MKF) to reduce the number of parameters that come with the large kernels. The improved Squeeze-and-Excitation (SE) block can obtain the correlation between different channels to further increase the model performance. Experimental results show that IMCMK outperforms the state-of-the-art methods in malware family classification accuracy, which has achieved 99.25 %.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

IMCMK-CNN：用于图像恶意软件分类的多尺度内核轻量级卷积神经网络

快速准确地识别未知恶意软件及其变种是有效防范恶意攻击的前提和基础。然而，随着恶意软件变种的爆炸式增长，人工更新样本数据库的效率越来越低。传统的识别方法仅基于延迟的数据库信息，难以有效捕捉混淆法操作的样本特征信息。恶意软件检测方向的研究致力于克服传统检测方法的局限性，深入探讨数据可视化、机器学习、混合检测等前沿技术在恶意软件检测领域的应用。通过这些研究，我们的目标是构建一个更精确、更高效的检测系统，以应对不断变化的网络安全威胁。朝着这个方向开展研究不仅对加强网络安全防御和保护用户数据至关重要，而且还能促进相关先进技术的发展，进一步减轻恶意软件攻击对经济和社会造成的影响。有鉴于此，本文提出了基于图像的多尺度内核恶意软件分类（Image-based Malware Classification with Multi-scale Kernels，IMCMK），这是一种使用多尺度卷积内核混合作用的卷积神经网络（CNN）架构，旨在提高恶意软件变种的检测能力。首先，我们提出了多尺度内核（MK）区块，将深度大内核卷积和标准小内核卷积与捷径相结合，以提高准确性。此外，我们还提出了多尺度内核融合（MKF），以减少大内核带来的参数数量。改进的挤压激励（SE）块可以获得不同通道之间的相关性，从而进一步提高模型性能。实验结果表明，IMCMK 的恶意软件族分类准确率超过了最先进的方法，达到了 99.25%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

alexandria engineering journal Engineering-General Engineering

CiteScore

11.20

自引率

4.40%

发文量

1015

审稿时长

43 days

期刊介绍： Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification: • Mechanical, Production, Marine and Textile Engineering • Electrical Engineering, Computer Science and Nuclear Engineering • Civil and Architecture Engineering • Chemical Engineering and Applied Sciences • Environmental Engineering