Towards Novel Malicious Packet Recognition: A Few-Shot Learning Approach

arXiv - CS - Cryptography and Security Pub Date : 2024-09-17 DOI:arxiv-2409.11254

Kyle Stein, Andrew A. Mahyari, Guillermo Francia III, Eman El-Sheikh

{"title":"Towards Novel Malicious Packet Recognition: A Few-Shot Learning Approach","authors":"Kyle Stein, Andrew A. Mahyari, Guillermo Francia III, Eman El-Sheikh","doi":"arxiv-2409.11254","DOIUrl":null,"url":null,"abstract":"As the complexity and connectivity of networks increase, the need for novel\nmalware detection approaches becomes imperative. Traditional security defenses\nare becoming less effective against the advanced tactics of today's\ncyberattacks. Deep Packet Inspection (DPI) has emerged as a key technology in\nstrengthening network security, offering detailed analysis of network traffic\nthat goes beyond simple metadata analysis. DPI examines not only the packet\nheaders but also the payload content within, offering a thorough insight into\nthe data traversing the network. This study proposes a novel approach that\nleverages a large language model (LLM) and few-shot learning to accurately\nrecognizes novel, unseen malware types with few labels samples. Our proposed\napproach uses a pretrained LLM on known malware types to extract the embeddings\nfrom packets. The embeddings are then used alongside few labeled samples of an\nunseen malware type. This technique is designed to acclimate the model to\ndifferent malware representations, further enabling it to generate robust\nembeddings for each trained and unseen classes. Following the extraction of\nembeddings from the LLM, few-shot learning is utilized to enhance performance\nwith minimal labeled data. Our evaluation, which utilized two renowned\ndatasets, focused on identifying malware types within network traffic and\nInternet of Things (IoT) environments. Our approach shows promising results\nwith an average accuracy of 86.35% and F1-Score of 86.40% on different malware\ntypes across the two datasets.","PeriodicalId":501332,"journal":{"name":"arXiv - CS - Cryptography and Security","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Cryptography and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

As the complexity and connectivity of networks increase, the need for novel malware detection approaches becomes imperative. Traditional security defenses are becoming less effective against the advanced tactics of today's cyberattacks. Deep Packet Inspection (DPI) has emerged as a key technology in strengthening network security, offering detailed analysis of network traffic that goes beyond simple metadata analysis. DPI examines not only the packet headers but also the payload content within, offering a thorough insight into the data traversing the network. This study proposes a novel approach that leverages a large language model (LLM) and few-shot learning to accurately recognizes novel, unseen malware types with few labels samples. Our proposed approach uses a pretrained LLM on known malware types to extract the embeddings from packets. The embeddings are then used alongside few labeled samples of an unseen malware type. This technique is designed to acclimate the model to different malware representations, further enabling it to generate robust embeddings for each trained and unseen classes. Following the extraction of embeddings from the LLM, few-shot learning is utilized to enhance performance with minimal labeled data. Our evaluation, which utilized two renowned datasets, focused on identifying malware types within network traffic and Internet of Things (IoT) environments. Our approach shows promising results with an average accuracy of 86.35% and F1-Score of 86.40% on different malware types across the two datasets.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

新型恶意数据包识别：少量学习方法

随着网络复杂性和连通性的增加，对新型恶意软件检测方法的需求变得势在必行。面对当今网络攻击的先进战术，传统的安全防御措施变得越来越无效。深度包检测（DPI）已成为加强网络安全的一项关键技术，它能对网络流量进行详细分析，而不仅仅是简单的元数据分析。DPI 不仅能检查包头，还能检查其中的有效载荷内容，从而对穿越网络的数据进行全面深入的分析。本研究提出了一种新方法，即利用大型语言模型（LLM）和少量学习来准确识别新型、未见过的恶意软件类型。我们提出的方法使用对已知恶意软件类型进行预训练的 LLM 从数据包中提取嵌入。然后将这些嵌入信息与未见恶意软件类型的少量标记样本一起使用。这种技术旨在使模型适应不同的恶意软件表征，进一步使其能够为每个训练有素的和未见过的类别生成稳健的嵌入。从 LLM 中提取前缀后，利用少量学习来提高使用最少标记数据的性能。我们的评估利用了两个著名的数据集，重点是识别网络流量和物联网（IoT）环境中的恶意软件类型。我们的方法在两个数据集的不同恶意软件类型上取得了很好的结果，平均准确率为 86.35%，F1-Score 为 86.40%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - CS - Cryptography and Security

自引率

0.00%

发文量