Software Defect Prediction via GIN with Hybrid Graphical Features

Xuanye Wang, Lu Lu, Boye Wang, Yudong Shang, Hao Yang
{"title":"Software Defect Prediction via GIN with Hybrid Graphical Features","authors":"Xuanye Wang, Lu Lu, Boye Wang, Yudong Shang, Hao Yang","doi":"10.1109/QRS-C57518.2022.00066","DOIUrl":null,"url":null,"abstract":"Software defect prediction (SDP) is one of the determining factors in software development cycle, enabling developers to estimate the quality of delivery and thus improve the user experience. Generally, efforts have opted for building the abstract syntax tree (AST). However, AST is limited in its capacity of providing adequate semantic and structural information. This paper proposes a framework, which trains hybrid features combined from AST and control flow graph (CFG) via Graph Isomorphism Network (GIN) (H-GIN): The framework parses the CFG of the source code and converts redundant blocks to a concise representation by extracting AST. In addition, the contextual relationships between each block in the CFG is carefully preserved, after which the graphical representation of the code is obtained and then fed into the GIN to conduct a defect prediction. The experiments were performed on a large python dataset called PyTraceBus with Precision, Recall, and F1-score as evaluation criteria. Our approach outperforms the previous state-of-the-art methods, which clearly demonstrates the effectiveness of the proposed method.","PeriodicalId":183728,"journal":{"name":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/QRS-C57518.2022.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Software defect prediction (SDP) is one of the determining factors in software development cycle, enabling developers to estimate the quality of delivery and thus improve the user experience. Generally, efforts have opted for building the abstract syntax tree (AST). However, AST is limited in its capacity of providing adequate semantic and structural information. This paper proposes a framework, which trains hybrid features combined from AST and control flow graph (CFG) via Graph Isomorphism Network (GIN) (H-GIN): The framework parses the CFG of the source code and converts redundant blocks to a concise representation by extracting AST. In addition, the contextual relationships between each block in the CFG is carefully preserved, after which the graphical representation of the code is obtained and then fed into the GIN to conduct a defect prediction. The experiments were performed on a large python dataset called PyTraceBus with Precision, Recall, and F1-score as evaluation criteria. Our approach outperforms the previous state-of-the-art methods, which clearly demonstrates the effectiveness of the proposed method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于混合图形特征的GIN软件缺陷预测
软件缺陷预测(SDP)是软件开发周期中的决定因素之一,它使开发人员能够估计交付的质量,从而改善用户体验。一般来说,人们选择构建抽象语法树(AST)。然而,AST在提供足够的语义和结构信息方面的能力有限。本文提出了一个框架,该框架通过图同构网络(GIN) (H-GIN)对AST和控制流图(CFG)组合的混合特征进行训练:该框架对源代码的CFG进行解析,通过提取AST将冗余块转换为简洁的表示,并仔细保留CFG中各块之间的上下文关系,获得代码的图形表示,然后将其输入到GIN中进行缺陷预测。实验在一个名为PyTraceBus的大型python数据集上进行,以Precision, Recall和F1-score作为评估标准。我们的方法优于以前的最先进的方法,这清楚地表明了所提出方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Software Bug Prediction based on Complex Network Considering Control Flow A Fault Localization Method Based on Similarity Weighting with Unlabeled Test Cases What Should Abeeha do? an Activity for Phishing Awareness The Real-Time General Display and Control Platform Designing Method based on Software Product Line Code Search Method Based on Multimodal Representation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1