Graph-based explainable vulnerability prediction

Impact factor: 4.3; Zone 2 (Computer Science); Q2 (Computer Science, Information Systems); Information and Software Technology; Pub Date: 2024-08-31; DOI: 10.1016/j.infsof.2024.107566

Hong Quy Nguyen, Thong Hoang, Hoa Khanh Dam, Aditya Ghose

Published in Information and Software Technology, volume 177, Article 107566. Source: https://www.sciencedirect.com/science/article/pii/S095058492400171X

Citations: 0

Abstract

Significant increases in cyberattacks worldwide have threatened the security of organizations, businesses, and individuals. Cyberattacks exploit vulnerabilities in software systems. Recent work has leveraged powerful and complex models, such as deep neural networks, to improve the predictive performance of vulnerability detection models. However, these models are often regarded as “black box” models, making it challenging for software practitioners to understand and interpret their predictions. This lack of explainability has resulted in a reluctance to adopt or deploy these vulnerability prediction models in industry applications. This paper proposes a novel approach, Genetic Algorithm-based Vulnerability Prediction Explainer (herein GAVulExplainer), which generates explanations for vulnerability prediction models based on graph neural networks. GAVulExplainer leverages genetic algorithms to construct a subgraph explanation that represents the crucial factor contributing to the vulnerability. Experimental results show that our proposed approach outperforms baselines in providing concrete reasons for a vulnerability prediction.
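The general idea of searching for a subgraph explanation with a genetic algorithm can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give GAVulExplainer's encoding, fitness function, or genetic operators, so everything below is an assumption made for illustration. In particular, `toy_model_score` is a hypothetical stand-in for a trained graph neural network, the edge-mask encoding, the size penalty, and the tournament, crossover, and mutation settings are all illustrative choices, and the graph is made up.

```python
import random

# Toy program graph (made up): nodes are statement ids, edges are dependencies.
EDGES = [(0, 1), (1, 2), (2, 3), (3, 4), (1, 5), (5, 6), (2, 6), (4, 7)]


def toy_model_score(edges):
    """Hypothetical stand-in for a trained GNN's 'vulnerable' score in [0, 1].

    Assumption for illustration only: the score depends on two planted edges.
    """
    planted = {(1, 2), (2, 6)}
    return len(planted & set(edges)) / len(planted)


def subgraph(mask):
    """Return the edges selected by a 0/1 mask over EDGES."""
    return [edge for edge, keep in zip(EDGES, mask) if keep]


def fitness(mask, size_penalty=0.05):
    """Reward preserving the model's score, penalise large explanations."""
    return toy_model_score(subgraph(mask)) - size_penalty * sum(mask)


def evolve(pop_size=30, generations=40, mutation_rate=0.1, seed=0):
    rng = random.Random(seed)
    n = len(EDGES)
    # Initial population: the full graph plus random edge masks.
    population = [[1] * n] + [
        [rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        next_gen = population[:2]  # elitism: the best two survive unchanged
        while len(next_gen) < pop_size:
            # Tournament selection of two parents.
            parent_a = max(rng.sample(population, 3), key=fitness)
            parent_b = max(rng.sample(population, 3), key=fitness)
            # One-point crossover.
            cut = rng.randint(1, n - 1)
            child = parent_a[:cut] + parent_b[cut:]
            # Bit-flip mutation.
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            next_gen.append(child)
        population = next_gen
    return max(population, key=fitness)


best_mask = evolve()
explanation = subgraph(best_mask)
print("Explanation subgraph edges:", explanation)
```

Because the best individuals are carried over unchanged each generation, the returned mask is never worse under this toy fitness than the full graph it started from. A real explainer would replace `toy_model_score` with a call to the trained model on the candidate subgraph.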

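Source journal: Information and Software Technology (Engineering and Technology; Computer Science: Software Engineering)
CiteScore: 9.10
Self-citation rate: 7.70%
Articles published: 164
Review time: 9.6 weeks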

About the journal: Information and Software Technology is the international archival journal focusing on research and experience that contributes to the improvement of software development practices. The journal's scope includes methods and techniques to better engineer software and manage its development. Articles submitted for review should have a clear component of software engineering or address ways to improve the engineering and management of software development. Areas covered by the journal include:

• Software management, quality and metrics
• Software processes
• Software architecture, modelling, specification, design and programming
• Functional and non-functional software requirements
• Software testing and verification & validation
• Empirical studies of all aspects of engineering and managing software development

Short Communications is a new section dedicated to short papers addressing new ideas, controversial opinions, "Negative" results and much more. Read the Guide for authors for more information. The journal encourages and welcomes submissions of systematic literature studies (reviews and maps) within the scope of the journal. Information and Software Technology is the premier outlet for systematic literature studies in software engineering.

Latest articles from the journal

Test automation with selenium: A survey
AI-gile: Revisiting Agile principles in the era of AI
SEDMR: A spreadsheet error detection approach based on metamorphic testing
Exploring and characterizing cross-service defects in microservice projects
SRSPSQL: A dual-stage Text-to-SQL framework with semantic rewriting and schema pruning