Automated trendline generation for accurate software effort estimation

Karthikeyan Ponnalagu, N. Narendra
{"title":"Automated trendline generation for accurate software effort estimation","authors":"Karthikeyan Ponnalagu, N. Narendra","doi":"10.1145/2384716.2384774","DOIUrl":null,"url":null,"abstract":"It is well-known that accurate effort estimation is one of the key factors in deciding the success of a software project. However, as any project manager knows, generating accurate estimates has proven to be extremely difficult in practice. Even well-known estimation techniques such as COCOMO or SLIMare not guaranteed to work all the time. One key issue in estimation is the selection of the appropriate historical project data set as a frame of reference against which the estimation can be generated. In our experience in working with software projects in IBM, we have found this to be the most crucial deciding factor for the success of a software estimate; indeed, choosing the wrong project data set during estimation could be disastrous for the software project in question. This is because the trendlines (charts of effort vis-a-vis size) generated from the historical data determine the estimate for the software project, and wrong trendlines could result in wrong estimates.To that end, in this paper, we present an automated trendline generation technique for improving effort estimation in software projects. Our technique makes use of a novel data structure that we have designed called Estimation Key-Map, which represents project data in a multi-dimensional format. This format enables dynamic analysis and clustering of project data into appropriate subsets that can be selected as historical data for estimation of the software project in question. We present the results of validation of our technique against reported actual data, by evaluating it against a large project data set from IBM; therein, we show how our technique enables the selection of the appropriate trendline, thereby enabling more accurate effort estimates.","PeriodicalId":194590,"journal":{"name":"ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2384716.2384774","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

It is well-known that accurate effort estimation is one of the key factors in deciding the success of a software project. However, as any project manager knows, generating accurate estimates has proven to be extremely difficult in practice. Even well-known estimation techniques such as COCOMO or SLIMare not guaranteed to work all the time. One key issue in estimation is the selection of the appropriate historical project data set as a frame of reference against which the estimation can be generated. In our experience in working with software projects in IBM, we have found this to be the most crucial deciding factor for the success of a software estimate; indeed, choosing the wrong project data set during estimation could be disastrous for the software project in question. This is because the trendlines (charts of effort vis-a-vis size) generated from the historical data determine the estimate for the software project, and wrong trendlines could result in wrong estimates.To that end, in this paper, we present an automated trendline generation technique for improving effort estimation in software projects. Our technique makes use of a novel data structure that we have designed called Estimation Key-Map, which represents project data in a multi-dimensional format. This format enables dynamic analysis and clustering of project data into appropriate subsets that can be selected as historical data for estimation of the software project in question. We present the results of validation of our technique against reported actual data, by evaluating it against a large project data set from IBM; therein, we show how our technique enables the selection of the appropriate trendline, thereby enabling more accurate effort estimates.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
自动化趋势线生成,用于准确的软件工作量估计
众所周知,准确的工作量估算是决定软件项目成功的关键因素之一。然而,正如任何项目经理所知道的那样,在实践中产生准确的估计已经被证明是极其困难的。即使是众所周知的评估技术,如COCOMO或slim,也不能保证一直有效。评估中的一个关键问题是选择适当的历史项目数据集作为可以生成评估的参考框架。根据我们在IBM中处理软件项目的经验,我们发现这是软件评估成功的最关键的决定因素;实际上,在评估期间选择错误的项目数据集可能会对软件项目造成灾难性的影响。这是因为从历史数据生成的趋势线(工作量相对于规模的图表)决定了软件项目的估计,错误的趋势线可能导致错误的估计。为此,在本文中,我们提出了一种自动化趋势线生成技术,用于改进软件项目中的工作量估算。我们的技术使用了我们设计的一种新的数据结构,称为Estimation Key-Map,它以多维格式表示项目数据。这种格式支持对项目数据进行动态分析并将其聚类到适当的子集中,这些子集可以被选择为评估所讨论的软件项目的历史数据。我们通过对来自IBM的大型项目数据集进行评估,提出了针对报告实际数据的技术验证结果;在这里,我们展示了我们的技术如何能够选择合适的趋势线,从而实现更准确的工作量估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A programmatic introduction to Neo4j Advanced debugging techniques to identify concurrency bugs in actor-based programs When importless becomes meaningful DSLDI 2014: second workshop on domain specific languages design and implementation FPW'14: future programming workshop
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1