F. A. Mohamed, Cherif R. Salama, A. Yousef, Ashraf Salem
{"title":"基于不同面向对象度量套件的缺陷类预测通用模型","authors":"F. A. Mohamed, Cherif R. Salama, A. Yousef, Ashraf Salem","doi":"10.1109/NILES50944.2020.9257892","DOIUrl":null,"url":null,"abstract":"Recently, research studies were directed to the construction of a universal defect prediction model. Such models are trained using different projects to have enough training data and be generic. One of the main challenges in the construction of a universal model is the different distributions of metrics in various projects. In this study, we aim to build a universal defect prediction model to predict software defective classes. We also aim to validate the Object-Oriented Cognitive Complexity metrics suite (CC metrics) for its association with fault-proneness. Finally, this study aims to compare the prediction performances of the CC metrics and the Chidamber and Kemerer metrics suite (CK metrics), taking into account the effect of preprocessing techniques. A neural network model is constructed using these 2 metrics suites (CK & CC metrics suites). We apply different preprocessing techniques on these metrics to overcome variations in their distributions. The results show that the CK metrics perform well whether a preprocessing is applied or not, while CC metrics’ performance is significantly affected by different preprocessing techniques. The CC metrics always outperform in the recall, while the CK metrics usually outperform in other performance metrics. Normalization preprocessing results in the highest recall values using either of the two metrics suites.","PeriodicalId":253090,"journal":{"name":"2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Universal Model for Defective Classes Prediction Using Different Object-Oriented Metrics Suites\",\"authors\":\"F. A. Mohamed, Cherif R. Salama, A. Yousef, Ashraf Salem\",\"doi\":\"10.1109/NILES50944.2020.9257892\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, research studies were directed to the construction of a universal defect prediction model. Such models are trained using different projects to have enough training data and be generic. One of the main challenges in the construction of a universal model is the different distributions of metrics in various projects. In this study, we aim to build a universal defect prediction model to predict software defective classes. We also aim to validate the Object-Oriented Cognitive Complexity metrics suite (CC metrics) for its association with fault-proneness. Finally, this study aims to compare the prediction performances of the CC metrics and the Chidamber and Kemerer metrics suite (CK metrics), taking into account the effect of preprocessing techniques. A neural network model is constructed using these 2 metrics suites (CK & CC metrics suites). We apply different preprocessing techniques on these metrics to overcome variations in their distributions. The results show that the CK metrics perform well whether a preprocessing is applied or not, while CC metrics’ performance is significantly affected by different preprocessing techniques. The CC metrics always outperform in the recall, while the CK metrics usually outperform in other performance metrics. Normalization preprocessing results in the highest recall values using either of the two metrics suites.\",\"PeriodicalId\":253090,\"journal\":{\"name\":\"2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NILES50944.2020.9257892\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NILES50944.2020.9257892","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Universal Model for Defective Classes Prediction Using Different Object-Oriented Metrics Suites
Recently, research studies were directed to the construction of a universal defect prediction model. Such models are trained using different projects to have enough training data and be generic. One of the main challenges in the construction of a universal model is the different distributions of metrics in various projects. In this study, we aim to build a universal defect prediction model to predict software defective classes. We also aim to validate the Object-Oriented Cognitive Complexity metrics suite (CC metrics) for its association with fault-proneness. Finally, this study aims to compare the prediction performances of the CC metrics and the Chidamber and Kemerer metrics suite (CK metrics), taking into account the effect of preprocessing techniques. A neural network model is constructed using these 2 metrics suites (CK & CC metrics suites). We apply different preprocessing techniques on these metrics to overcome variations in their distributions. The results show that the CK metrics perform well whether a preprocessing is applied or not, while CC metrics’ performance is significantly affected by different preprocessing techniques. The CC metrics always outperform in the recall, while the CK metrics usually outperform in other performance metrics. Normalization preprocessing results in the highest recall values using either of the two metrics suites.