A Practical Attack on the TLSH Similarity Digest Scheme

Proceedings of the 18th International Conference on Availability, Reliability and Security. Pub Date: 2023-08-29. DOI: 10.1145/3600160.3600173

Gábor Fuchs, Roland Nagy, L. Buttyán

Abstract

Similarity digest schemes are used in various applications (e.g., digital forensics, spam filtering, malware clustering, and malware detection), which require them to resist attacks that generate semantically similar inputs with very different similarity digest values. In this paper, we show that TLSH, a widely used similarity digest function, is not sufficiently robust against such attacks. More specifically, we propose an automated method for modifying executable files (binaries) such that the modified binary has exactly the same functionality as the original and remains syntactically similar to it, yet the TLSH difference score between the original and the modified binary becomes high. We evaluate our method on a large data set of malware binaries, and we also show that it can be used effectively to generate adversarial samples that evade detection by SIMBIoTA, a recently proposed similarity-based malware detection approach.
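The success criterion described in the abstract can be illustrated with a minimal sketch. It is not the authors' attack or the SIMBIoTA implementation: `toy_digest` and `toy_diff` are hypothetical stand-ins for a similarity digest and its difference score, the threshold value is arbitrary, and appending bytes is only a placeholder for a functionality-preserving modification. The detector model (flag a sample when its difference to any known digest falls below a threshold) is likewise an assumption made for illustration.

```python
from typing import Callable, Iterable, List

Digest = List[int]


def toy_digest(data: bytes, buckets: int = 8) -> Digest:
    """Hypothetical stand-in digest (NOT TLSH): byte counts per bucket."""
    counts = [0] * buckets
    for b in data:
        counts[b % buckets] += 1
    return counts


def toy_diff(a: Digest, b: Digest) -> int:
    """Hypothetical difference score: 0 means identical, larger means less similar."""
    return sum(abs(x - y) for x, y in zip(a, b))


def is_flagged(sample: Digest, known: Iterable[Digest],
               diff: Callable[[Digest, Digest], int], threshold: int) -> bool:
    """Assumed detector model: flag a sample that is close to any known digest."""
    return any(diff(sample, k) < threshold for k in known)


def attack_succeeds(original: bytes, modified: bytes,
                    digest: Callable[[bytes], Digest],
                    diff: Callable[[Digest, Digest], int],
                    threshold: int) -> bool:
    """The modification counts as successful when the difference score
    between the two digests reaches the detection threshold."""
    return diff(digest(original), digest(modified)) >= threshold


original = bytes(range(8)) * 4            # each bucket holds 4 bytes
modified = original + b"\x00" * 20        # placeholder modification, not the paper's method
known = [toy_digest(original)]            # detector already knows the original

print(is_flagged(toy_digest(original), known, toy_diff, threshold=10))
print(is_flagged(toy_digest(modified), known, toy_diff, threshold=10))
print(attack_succeeds(original, modified, toy_digest, toy_diff, threshold=10))
```

Because the digest and difference functions are passed in as parameters, the same check could be run against a real TLSH implementation, for example by supplying `tlsh.hash` and `tlsh.diff` from the py-tlsh package, assuming it is installed.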
