Billions and billions of constraints: Whitebox fuzz testing in production

2013 35th International Conference on Software Engineering (ICSE) Pub Date : 2013-05-18 DOI:10.1109/ICSE.2013.6606558

E. Bounimova, Patrice Godefroid, D. Molnar

{"title":"Billions and billions of constraints: Whitebox fuzz testing in production","authors":"E. Bounimova, Patrice Godefroid, D. Molnar","doi":"10.1109/ICSE.2013.6606558","DOIUrl":null,"url":null,"abstract":"We report experiences with constraint-based whitebox fuzz testing in production across hundreds of large Windows applications and over 500 machine years of computation from 2007 to 2013. Whitebox fuzzing leverages symbolic execution on binary traces and constraint solving to construct new inputs to a program. These inputs execute previously uncovered paths or trigger security vulnerabilities. Whitebox fuzzing has found one-third of all file fuzzing bugs during the development of Windows 7, saving millions of dollars in potential security vulnerabilities. The technique is in use today across multiple products at Microsoft. We describe key challenges with running whitebox fuzzing in production. We give principles for addressing these challenges and describe two new systems built from these principles: SAGAN, which collects data from every fuzzing run for further analysis, and JobCenter, which controls deployment of our whitebox fuzzing infrastructure across commodity virtual machines. Since June 2010, SAGAN has logged over 3.4 billion constraints solved, millions of symbolic executions, and tens of millions of test cases generated. Our work represents the largest scale deployment of whitebox fuzzing to date, including the largest usage ever for a Satisfiability Modulo Theories (SMT) solver. We present specific data analyses that improved our production use of whitebox fuzzing. Finally we report data on the performance of constraint solving and dynamic test generation that points toward future research problems.","PeriodicalId":322423,"journal":{"name":"2013 35th International Conference on Software Engineering (ICSE)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"172","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 35th International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSE.2013.6606558","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 172

Abstract

We report experiences with constraint-based whitebox fuzz testing in production across hundreds of large Windows applications and over 500 machine years of computation from 2007 to 2013. Whitebox fuzzing leverages symbolic execution on binary traces and constraint solving to construct new inputs to a program. These inputs execute previously uncovered paths or trigger security vulnerabilities. Whitebox fuzzing has found one-third of all file fuzzing bugs during the development of Windows 7, saving millions of dollars in potential security vulnerabilities. The technique is in use today across multiple products at Microsoft. We describe key challenges with running whitebox fuzzing in production. We give principles for addressing these challenges and describe two new systems built from these principles: SAGAN, which collects data from every fuzzing run for further analysis, and JobCenter, which controls deployment of our whitebox fuzzing infrastructure across commodity virtual machines. Since June 2010, SAGAN has logged over 3.4 billion constraints solved, millions of symbolic executions, and tens of millions of test cases generated. Our work represents the largest scale deployment of whitebox fuzzing to date, including the largest usage ever for a Satisfiability Modulo Theories (SMT) solver. We present specific data analyses that improved our production use of whitebox fuzzing. Finally we report data on the performance of constraint solving and dynamic test generation that points toward future research problems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

数以十亿计的约束:生产中的白盒模糊测试

我们报告了在生产环境中使用基于约束的白盒模糊测试的经验，这些测试涵盖了数百个大型Windows应用程序和从2007年到2013年超过500台机器年的计算。白盒模糊测试利用二进制跟踪的符号执行和约束求解来构造程序的新输入。这些输入执行以前未发现的路径或触发安全漏洞。在Windows 7的开发过程中，白盒模糊测试已经发现了三分之一的文件模糊错误，为潜在的安全漏洞节省了数百万美元。如今，微软的多个产品都在使用这种技术。我们描述了在生产环境中运行白盒模糊测试的主要挑战。我们给出了解决这些挑战的原则，并描述了根据这些原则构建的两个新系统:SAGAN，它从每次模糊测试运行中收集数据以进行进一步分析，以及JobCenter，它控制跨商品虚拟机的白盒模糊测试基础设施的部署。自2010年6月以来，SAGAN已经记录了超过34亿个解决的约束，数百万个符号执行，以及生成的数千万个测试用例。我们的工作代表了迄今为止白盒模糊测试的最大规模部署，包括对可满足模理论(SMT)求解器的最大使用。我们提出了具体的数据分析，改进了我们对白盒模糊测试的生产使用。最后，我们报告了约束求解和动态测试生成性能的数据，指出了未来的研究问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 35th International Conference on Software Engineering (ICSE)

自引率

0.00%

发文量