Optimizing and simplifying software metric models constructed using maximum likelihood methods

29th Annual International Computer Software and Applications Conference (COMPSAC'05) Pub Date : 2005-07-26 DOI:10.1109/COMPSAC.2005.116

Victor K. Y. Chan, W. E. Wong

{"title":"Optimizing and simplifying software metric models constructed using maximum likelihood methods","authors":"Victor K. Y. Chan, W. E. Wong","doi":"10.1109/COMPSAC.2005.116","DOIUrl":null,"url":null,"abstract":"A software metric model can be used to predict a target metric (e.g., the development work effort) for a future release of a software system based on the project's predictor metrics (e.g., the project team size). However, missing or incomplete data often appear in the data samples used to construct the model. So far, the least biased and thus the most recommended software metric models for dealing with the missing/incomplete data are those constructed by using the maximum likelihood methods. It is true that the inclusion of a particular predictor metric in the model construction is initially based on an intuitive or experience-based assumption that the predictor metric impacts significantly the target metric. Nevertheless, this assumption has to be verified. Previous research on metric models constructed by using the maximum likelihood methods simply took this verification for granted. This can result in probable inclusion of superfluous predictor metric(s) and/or unnecessary predictor metric complexity. In this paper, we propose a methodology to optimize and simplify such models based on the results of appropriate hypothesis tests. An experiment is also reported to demonstrate the use of our methodology in trimming redundant predictor metric(s) and/or unnecessary predictor metric complexity.","PeriodicalId":419267,"journal":{"name":"29th Annual International Computer Software and Applications Conference (COMPSAC'05)","volume":"113 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"29th Annual International Computer Software and Applications Conference (COMPSAC'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COMPSAC.2005.116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

A software metric model can be used to predict a target metric (e.g., the development work effort) for a future release of a software system based on the project's predictor metrics (e.g., the project team size). However, missing or incomplete data often appear in the data samples used to construct the model. So far, the least biased and thus the most recommended software metric models for dealing with the missing/incomplete data are those constructed by using the maximum likelihood methods. It is true that the inclusion of a particular predictor metric in the model construction is initially based on an intuitive or experience-based assumption that the predictor metric impacts significantly the target metric. Nevertheless, this assumption has to be verified. Previous research on metric models constructed by using the maximum likelihood methods simply took this verification for granted. This can result in probable inclusion of superfluous predictor metric(s) and/or unnecessary predictor metric complexity. In this paper, we propose a methodology to optimize and simplify such models based on the results of appropriate hypothesis tests. An experiment is also reported to demonstrate the use of our methodology in trimming redundant predictor metric(s) and/or unnecessary predictor metric complexity.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用最大似然方法对软件度量模型进行优化和简化

软件度量模型可用于基于项目的预测度量(例如，项目团队规模)来预测软件系统未来版本的目标度量(例如，开发工作工作量)。然而，用于构建模型的数据样本中经常出现缺失或不完整的数据。到目前为止，用于处理缺失/不完整数据的偏差最小，因此最推荐的软件度量模型是使用最大似然方法构建的模型。确实，在模型构建中包含一个特定的预测指标最初是基于一个直观的或基于经验的假设，即预测指标对目标指标有显著的影响。然而，这一假设必须得到证实。以往使用极大似然方法构建度量模型的研究简单地将这一验证视为理所当然。这可能导致可能包含多余的预测指标和/或不必要的预测指标复杂性。在本文中，我们提出了一种基于适当假设检验结果的方法来优化和简化这些模型。还报告了一个实验来证明我们的方法在修剪冗余预测指标和/或不必要的预测指标复杂性方面的使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

29th Annual International Computer Software and Applications Conference (COMPSAC'05)

自引率

0.00%

发文量