Cryptographic Function Detection in Obfuscated Binaries via Bit-Precise Symbolic Loop Mapping

2017 IEEE Symposium on Security and Privacy (SP). Pub Date: 2017-05-22. DOI: 10.1109/SP.2017.56

Dongpeng Xu, Jiang Ming, Dinghao Wu

Pages: 921-937

Citations: 60

Abstract

Malware developers commonly abuse cryptographic functions to hide malicious behavior, disguise destructive payloads, and bypass network-based firewalls. Now-infamous crypto-ransomware even encrypts a victim's documents until a ransom is paid. Detecting cryptographic functions in binary code is therefore an appealing way to complement existing malware defense and forensics. However, pervasive control and data obfuscation schemes make cryptographic function identification challenging. Existing detection methods are either too brittle to work on obfuscated binaries or ad hoc, in that they can identify only specific cryptographic functions. In this paper, we propose a novel technique called bit-precise symbolic loop mapping to identify cryptographic functions in obfuscated binary code. Our trace-based approach captures the semantics of possible cryptographic algorithms with bit-precise symbolic execution in a loop. We then perform guided fuzzing to efficiently match boolean formulas with known reference implementations. We have developed a prototype called CryptoHunt and evaluated it on a set of obfuscated synthetic examples, well-known cryptographic libraries, and malware. Compared with existing tools, CryptoHunt is a general approach to detecting commonly used cryptographic functions such as TEA, AES, RC4, MD5, and RSA under different combinations of control and data obfuscation schemes.
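To make the matching idea concrete, the following is a minimal illustrative sketch and not CryptoHunt's implementation. It assumes a single TEA loop iteration as the reference, a hand-written "data-obfuscated" variant that replaces additions and XORs with equivalent mixed boolean-arithmetic expressions, and plain random testing as a stand-in for the guided fuzzing step. All function names here (`tea_round_reference`, `tea_round_obfuscated`, `same_io_behavior`, and so on) are hypothetical.

```python
import random

MASK = 0xFFFFFFFF      # all arithmetic is on 32-bit words
DELTA = 0x9E3779B9     # TEA's key-schedule constant


def add32(x, y):
    """Plain 32-bit modular addition."""
    return (x + y) & MASK


def obf_add(x, y):
    """Addition rewritten as x + y == (x ^ y) + 2 * (x & y)."""
    return ((x ^ y) + ((x & y) << 1)) & MASK


def obf_xor(x, y):
    """XOR rewritten as x ^ y == (x | y) - (x & y)."""
    return ((x | y) - (x & y)) & MASK


def tea_round_reference(v0, v1, s, key):
    """One iteration of the TEA encryption loop (reference form)."""
    s = add32(s, DELTA)
    v0 = add32(v0, add32((v1 << 4) & MASK, key[0]) ^ add32(v1, s) ^ add32(v1 >> 5, key[1]))
    v1 = add32(v1, add32((v0 << 4) & MASK, key[2]) ^ add32(v0, s) ^ add32(v0 >> 5, key[3]))
    return v0, v1, s


def tea_round_obfuscated(v0, v1, s, key):
    """Same loop body, with every add/xor replaced by an equivalent rewrite."""
    s = obf_add(s, DELTA)
    t = obf_xor(obf_add((v1 << 4) & MASK, key[0]), obf_add(v1, s))
    v0 = obf_add(v0, obf_xor(t, obf_add(v1 >> 5, key[1])))
    t = obf_xor(obf_add((v0 << 4) & MASK, key[2]), obf_add(v0, s))
    v1 = obf_add(v1, obf_xor(t, obf_add(v0 >> 5, key[3])))
    return v0, v1, s


def not_tea_round(v0, v1, s, key):
    """A look-alike loop body with the shift amounts swapped (not TEA)."""
    s = add32(s, DELTA)
    v0 = add32(v0, add32((v1 << 5) & MASK, key[0]) ^ add32(v1, s) ^ add32(v1 >> 4, key[1]))
    v1 = add32(v1, add32((v0 << 5) & MASK, key[2]) ^ add32(v0, s) ^ add32(v0 >> 4, key[3]))
    return v0, v1, s


def same_io_behavior(f, g, trials=1000, seed=0):
    """Return False as soon as f and g disagree on a random input."""
    rng = random.Random(seed)
    for _ in range(trials):
        v0, v1, s = (rng.getrandbits(32) for _ in range(3))
        key = tuple(rng.getrandbits(32) for _ in range(4))
        if f(v0, v1, s, key) != g(v0, v1, s, key):
            return False
    return True


if __name__ == "__main__":
    print(same_io_behavior(tea_round_reference, tea_round_obfuscated))
    print(same_io_behavior(tea_round_reference, not_tea_round))
```

Random testing of this kind can only refute a candidate match, never prove one, which is one reason an approach based on symbolic formulas over the loop body is attractive: agreement on sampled inputs narrows the candidates, and the formulas can then be checked for equivalence.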


