Analysis of Maximum Executable Length for Detecting Text-Based Malware

2008 The 28th International Conference on Distributed Computing Systems Pub Date : 2008-06-17 DOI:10.1109/ICDCS.2008.70

P. K. Manna, S. Ranka, Shigang Chen

{"title":"Analysis of Maximum Executable Length for Detecting Text-Based Malware","authors":"P. K. Manna, S. Ranka, Shigang Chen","doi":"10.1109/ICDCS.2008.70","DOIUrl":null,"url":null,"abstract":"The possibility of using purely text stream (keyboard-enterable) as carrier of malware is under-researched and often under estimated. A text attack can happen at multiple levels, from code-injection attacks at the top level to host-compromising text-based machine code at the lowest level. Since a large number of protocols are text-based, at times the servers based on those protocols use ASCII filters to allow text input only. However, simply applying ASCII filters to weed out the binary data is not enough from the security viewpoint since the assumption that malware are always binary is false. We show that although text is a subset of binary, binary malware detectors cannot always detect text malware. We analyze the MEL (maximum executable length)-based detection schemes, and make two contributions by this analysis. First, although the concept of MEL has been used in various detection schemes earlier, we are the first to provide its underlying mathematical foundation. We show that the threshold value can be calculated from the input character frequencies and that it can be tuned to control the detection sensitivity. Second, we demonstrate the effectiveness of a MEL-based text malware detector by exploiting the specific properties of text streams.","PeriodicalId":240205,"journal":{"name":"2008 The 28th International Conference on Distributed Computing Systems","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 The 28th International Conference on Distributed Computing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDCS.2008.70","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The possibility of using purely text stream (keyboard-enterable) as carrier of malware is under-researched and often under estimated. A text attack can happen at multiple levels, from code-injection attacks at the top level to host-compromising text-based machine code at the lowest level. Since a large number of protocols are text-based, at times the servers based on those protocols use ASCII filters to allow text input only. However, simply applying ASCII filters to weed out the binary data is not enough from the security viewpoint since the assumption that malware are always binary is false. We show that although text is a subset of binary, binary malware detectors cannot always detect text malware. We analyze the MEL (maximum executable length)-based detection schemes, and make two contributions by this analysis. First, although the concept of MEL has been used in various detection schemes earlier, we are the first to provide its underlying mathematical foundation. We show that the threshold value can be calculated from the input character frequencies and that it can be tuned to control the detection sensitivity. Second, we demonstrate the effectiveness of a MEL-based text malware detector by exploiting the specific properties of text streams.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

检测基于文本的恶意软件的最大可执行长度分析

使用纯文本流(键盘可输入)作为恶意软件载体的可能性研究不足，而且往往被低估。文本攻击可以发生在多个级别，从最高级别的代码注入攻击到最低级别的基于文本的危及主机的机器码。由于大量协议是基于文本的，因此基于这些协议的服务器有时使用ASCII过滤器只允许文本输入。然而，从安全的角度来看，仅仅应用ASCII过滤器来清除二进制数据是不够的，因为恶意软件总是二进制的假设是错误的。我们表明，虽然文本是二进制的子集，但二进制恶意软件检测器不能总是检测文本恶意软件。我们分析了基于MEL(最大可执行长度)的检测方案，并通过该分析做出了两个贡献。首先，尽管MEL的概念已经在早期的各种检测方案中使用，但我们是第一个提供其底层数学基础的人。我们表明，阈值可以从输入字符频率计算，并且可以调整以控制检测灵敏度。其次，我们通过利用文本流的特定属性证明了基于mel的文本恶意软件检测器的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2008 The 28th International Conference on Distributed Computing Systems

自引率

0.00%

发文量

期刊最新文献

Relative Network Positioning via CDN Redirections Compiler-Assisted Application-Level Checkpointing for MPI Programs Exploring Anti-Spam Models in Large Scale VoIP Systems Correlation-Aware Object Placement for Multi-Object Operations Probing Queries in Wireless Sensor Networks