Software Effort Prediction Using Ensemble Learning Methods

Omar H. Alhazmi, Mohammed Zubair Khan
{"title":"Software Effort Prediction Using Ensemble Learning Methods","authors":"Omar H. Alhazmi, Mohammed Zubair Khan","doi":"10.4236/jsea.2020.137010","DOIUrl":null,"url":null,"abstract":"Software Cost Estimation (SCE) is an essential requirement in producing software these days. Genuine accurate estimation requires cost-and-efforts factors in delivering software by utilizing algorithmic or Ensemble Learning Methods (ELMs). Effort is estimated in terms of individual months and length. Overestimation as well as underestimation of efforts can adversely affect software development. Hence, it is the responsibility of software development managers to estimate the cost using the best possible techniques. The predominant cost for any product is the expense of figuring effort. Subsequently, effort estimation is exceptionally pivotal and there is a constant need to improve its accuracy. Fortunately, several efforts estimation models are available; however, it is difficult to determine which model is more accurate on what dataset. Hence, we use ensemble learning bagging with base learner Linear regression, SMOReg, MLP, random forest, REPTree, and M5Rule. We also implemented the feature selection algorithm to examine the effect of feature selection algorithm BestFit and Genetic Algorithm. The dataset is based on 499 projects known as China. The results show that the Mean Magnitude Relative error of Bagging M5 rule with Genetic Algorithm as Feature Selection is 10%, which makes it better than other algorithms.","PeriodicalId":62222,"journal":{"name":"软件工程与应用(英文)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"软件工程与应用(英文)","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.4236/jsea.2020.137010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Software Cost Estimation (SCE) is an essential requirement in producing software these days. Genuine accurate estimation requires cost-and-efforts factors in delivering software by utilizing algorithmic or Ensemble Learning Methods (ELMs). Effort is estimated in terms of individual months and length. Overestimation as well as underestimation of efforts can adversely affect software development. Hence, it is the responsibility of software development managers to estimate the cost using the best possible techniques. The predominant cost for any product is the expense of figuring effort. Subsequently, effort estimation is exceptionally pivotal and there is a constant need to improve its accuracy. Fortunately, several efforts estimation models are available; however, it is difficult to determine which model is more accurate on what dataset. Hence, we use ensemble learning bagging with base learner Linear regression, SMOReg, MLP, random forest, REPTree, and M5Rule. We also implemented the feature selection algorithm to examine the effect of feature selection algorithm BestFit and Genetic Algorithm. The dataset is based on 499 projects known as China. The results show that the Mean Magnitude Relative error of Bagging M5 rule with Genetic Algorithm as Feature Selection is 10%, which makes it better than other algorithms.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用集成学习方法的软件工作量预测
软件成本估算(SCE)是当今软件生产的一项重要要求。真正准确的估计需要通过使用算法或集成学习方法(ELM)来交付软件的成本和努力因素。工作量是根据个别月份和时间长短来估计的。过度估计和低估工作可能会对软件开发产生不利影响。因此,软件开发经理有责任使用尽可能好的技术来估计成本。任何产品的主要成本都是计算工作量的费用。随后,工作量估计异常关键,需要不断提高其准确性。幸运的是,有几种努力估计模型可用;然而,很难确定哪个模型在哪个数据集上更准确。因此,我们将集成学习袋与基础学习器线性回归、SMOReg、MLP、随机森林、REPTree和M5Rule一起使用。我们还实现了特征选择算法来检验特征选择算法BestFit和遗传算法的效果。该数据集基于499个中国项目。结果表明,以遗传算法作为特征选择的Bagging M5规则的平均幅度相对误差为10%,优于其他算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
815
期刊最新文献
Advanced Face Mask Detection Model Using Hybrid Dilation Convolution Based Method Artificial Intelligence and the Sustainable Development Goals: An Exploratory Study in the Context of the Society Domain Guideline of Test Suite Construction for GUI Software Centered on Grey-Box Approach Software Metric Analysis of Open-Source Business Software Research and Implementation of Cancer Gene Data Classification Based on Deep Learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1