Xuanye Wang, Lu Lu, Boye Wang, Yudong Shang, Hao Yang
{"title":"Software Defect Prediction via GIN with Hybrid Graphical Features","authors":"Xuanye Wang, Lu Lu, Boye Wang, Yudong Shang, Hao Yang","doi":"10.1109/QRS-C57518.2022.00066","DOIUrl":null,"url":null,"abstract":"Software defect prediction (SDP) is one of the determining factors in software development cycle, enabling developers to estimate the quality of delivery and thus improve the user experience. Generally, efforts have opted for building the abstract syntax tree (AST). However, AST is limited in its capacity of providing adequate semantic and structural information. This paper proposes a framework, which trains hybrid features combined from AST and control flow graph (CFG) via Graph Isomorphism Network (GIN) (H-GIN): The framework parses the CFG of the source code and converts redundant blocks to a concise representation by extracting AST. In addition, the contextual relationships between each block in the CFG is carefully preserved, after which the graphical representation of the code is obtained and then fed into the GIN to conduct a defect prediction. The experiments were performed on a large python dataset called PyTraceBus with Precision, Recall, and F1-score as evaluation criteria. Our approach outperforms the previous state-of-the-art methods, which clearly demonstrates the effectiveness of the proposed method.","PeriodicalId":183728,"journal":{"name":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 22nd International Conference on Software Quality, Reliability, and Security Companion (QRS-C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/QRS-C57518.2022.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Software defect prediction (SDP) is one of the determining factors in software development cycle, enabling developers to estimate the quality of delivery and thus improve the user experience. Generally, efforts have opted for building the abstract syntax tree (AST). However, AST is limited in its capacity of providing adequate semantic and structural information. This paper proposes a framework, which trains hybrid features combined from AST and control flow graph (CFG) via Graph Isomorphism Network (GIN) (H-GIN): The framework parses the CFG of the source code and converts redundant blocks to a concise representation by extracting AST. In addition, the contextual relationships between each block in the CFG is carefully preserved, after which the graphical representation of the code is obtained and then fed into the GIN to conduct a defect prediction. The experiments were performed on a large python dataset called PyTraceBus with Precision, Recall, and F1-score as evaluation criteria. Our approach outperforms the previous state-of-the-art methods, which clearly demonstrates the effectiveness of the proposed method.