ActGraph: prioritization of test cases based on deep neural network activation graph

IF 2 2区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Automated Software Engineering Pub Date : 2023-08-22 DOI:10.1007/s10515-023-00396-8

Jinyin Chen, Jie Ge, Haibin Zheng

{"title":"ActGraph: prioritization of test cases based on deep neural network activation graph","authors":"Jinyin Chen, Jie Ge, Haibin Zheng","doi":"10.1007/s10515-023-00396-8","DOIUrl":null,"url":null,"abstract":"<div>Widespread applications of deep neural networks (DNNs) benefit from DNN testing to guarantee their quality. In the DNN testing, numerous test cases are fed into the model to explore potential vulnerabilities, but they require expensive manual cost to check the label. Therefore, test case prioritization is proposed to solve the problem of labeling cost, e.g., surprise adequacy-based, uncertainty quantifiers-based and mutation-based prioritization methods. However, most of them suffer from limited scenarios (i.e. high confidence adversarial or false positive cases) and high time complexity. To address these challenges, we propose the concept of the activation graph from the perspective of the spatial relationship of neurons. We observe that the activation graph of cases that triggers the model’s misbehavior significantly differs from that of normal cases. Motivated by it, we design a test case prioritization method based on the activation graph, ActGraph, by extracting the high-order node feature of the activation graph for prioritization. ActGraph explains the difference between the test cases to solve the problem of scenario limitation. Without mutation operations, ActGraph is easy to implement, leading to lower time complexity. Extensive experiments on three datasets and four models demonstrate that ActGraph has the following key characteristics. (i) Effectiveness and generalizability: ActGraph shows competitive performance in all of the natural, adversarial and mixed scenarios, especially in RAUC-100 improvement (\\(\\sim \\times \\)1.40). (ii) Efficiency: ActGraph runs at less time cost (\\(\\sim \\times \\)1/50) than the state-of-the-art method. The code of ActGraph is open-sourced at https://github.com/Embed-Debuger/ActGraph.</div>","PeriodicalId":55414,"journal":{"name":"Automated Software Engineering","volume":"30 2","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2023-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10515-023-00396-8.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Automated Software Engineering","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10515-023-00396-8","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 1

Abstract

Widespread applications of deep neural networks (DNNs) benefit from DNN testing to guarantee their quality. In the DNN testing, numerous test cases are fed into the model to explore potential vulnerabilities, but they require expensive manual cost to check the label. Therefore, test case prioritization is proposed to solve the problem of labeling cost, e.g., surprise adequacy-based, uncertainty quantifiers-based and mutation-based prioritization methods. However, most of them suffer from limited scenarios (i.e. high confidence adversarial or false positive cases) and high time complexity. To address these challenges, we propose the concept of the activation graph from the perspective of the spatial relationship of neurons. We observe that the activation graph of cases that triggers the model’s misbehavior significantly differs from that of normal cases. Motivated by it, we design a test case prioritization method based on the activation graph, ActGraph, by extracting the high-order node feature of the activation graph for prioritization. ActGraph explains the difference between the test cases to solve the problem of scenario limitation. Without mutation operations, ActGraph is easy to implement, leading to lower time complexity. Extensive experiments on three datasets and four models demonstrate that ActGraph has the following key characteristics. (i) Effectiveness and generalizability: ActGraph shows competitive performance in all of the natural, adversarial and mixed scenarios, especially in RAUC-100 improvement (\(\sim \times \)1.40). (ii) Efficiency: ActGraph runs at less time cost (\(\sim \times \)1/50) than the state-of-the-art method. The code of ActGraph is open-sourced at https://github.com/Embed-Debuger/ActGraph.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

ActGraph:基于深度神经网络激活图的测试用例优先级排序

深度神经网络（DNN）的广泛应用得益于DNN测试以保证其质量。在DNN测试中，许多测试用例被输入到模型中，以探索潜在的漏洞，但它们需要昂贵的手动成本来检查标签。因此，为了解决标签成本问题，提出了测试用例优先级排序方法，如基于惊喜充分性的、基于不确定性量词的和基于变异的优先级排序方法。然而，它们中的大多数都存在有限的场景（即高置信度对抗性或假阳性案例）和高时间复杂性。为了应对这些挑战，我们从神经元的空间关系的角度提出了激活图的概念。我们观察到，触发模型不当行为的案例的激活图与正常案例的激活图显着不同。受此启发，我们设计了一种基于激活图ActGraph的测试用例优先级排序方法，通过提取激活图的高阶节点特征进行优先级排序。ActGraph解释了测试用例之间的差异，以解决场景限制的问题。在没有突变操作的情况下，ActGraph易于实现，从而降低了时间复杂性。在三个数据集和四个模型上进行的大量实验表明，ActGraph具有以下关键特性。（i）有效性和可推广性：ActGraph在所有自然、对抗性和混合场景中都显示出竞争性能，尤其是在RAUC-100改进中（\（\sim\times\）1.40）。ActGraph的代码开源于https://github.com/Embed-Debuger/ActGraph.

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Automated Software Engineering 工程技术-计算机：软件工程

CiteScore

4.80

自引率

11.80%

发文量

审稿时长

>12 weeks

期刊介绍： This journal details research, tutorial papers, survey and accounts of significant industrial experience in the foundations, techniques, tools and applications of automated software engineering technology. This includes the study of techniques for constructing, understanding, adapting, and modeling software artifacts and processes. Coverage in Automated Software Engineering examines both automatic systems and collaborative systems as well as computational models of human software engineering activities. In addition, it presents knowledge representations and artificial intelligence techniques applicable to automated software engineering, and formal techniques that support or provide theoretical foundations. The journal also includes reviews of books, software, conferences and workshops.