Automated trendline generation for accurate software effort estimation

ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity Pub Date : 2012-10-19 DOI:10.1145/2384716.2384774

Karthikeyan Ponnalagu, N. Narendra

{"title":"Automated trendline generation for accurate software effort estimation","authors":"Karthikeyan Ponnalagu, N. Narendra","doi":"10.1145/2384716.2384774","DOIUrl":null,"url":null,"abstract":"It is well-known that accurate effort estimation is one of the key factors in deciding the success of a software project. However, as any project manager knows, generating accurate estimates has proven to be extremely difficult in practice. Even well-known estimation techniques such as COCOMO or SLIMare not guaranteed to work all the time. One key issue in estimation is the selection of the appropriate historical project data set as a frame of reference against which the estimation can be generated. In our experience in working with software projects in IBM, we have found this to be the most crucial deciding factor for the success of a software estimate; indeed, choosing the wrong project data set during estimation could be disastrous for the software project in question. This is because the trendlines (charts of effort vis-a-vis size) generated from the historical data determine the estimate for the software project, and wrong trendlines could result in wrong estimates.To that end, in this paper, we present an automated trendline generation technique for improving effort estimation in software projects. Our technique makes use of a novel data structure that we have designed called Estimation Key-Map, which represents project data in a multi-dimensional format. This format enables dynamic analysis and clustering of project data into appropriate subsets that can be selected as historical data for estimation of the software project in question. We present the results of validation of our technique against reported actual data, by evaluating it against a large project data set from IBM; therein, we show how our technique enables the selection of the appropriate trendline, thereby enabling more accurate effort estimates.","PeriodicalId":194590,"journal":{"name":"ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2384716.2384774","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

It is well-known that accurate effort estimation is one of the key factors in deciding the success of a software project. However, as any project manager knows, generating accurate estimates has proven to be extremely difficult in practice. Even well-known estimation techniques such as COCOMO or SLIMare not guaranteed to work all the time. One key issue in estimation is the selection of the appropriate historical project data set as a frame of reference against which the estimation can be generated. In our experience in working with software projects in IBM, we have found this to be the most crucial deciding factor for the success of a software estimate; indeed, choosing the wrong project data set during estimation could be disastrous for the software project in question. This is because the trendlines (charts of effort vis-a-vis size) generated from the historical data determine the estimate for the software project, and wrong trendlines could result in wrong estimates.To that end, in this paper, we present an automated trendline generation technique for improving effort estimation in software projects. Our technique makes use of a novel data structure that we have designed called Estimation Key-Map, which represents project data in a multi-dimensional format. This format enables dynamic analysis and clustering of project data into appropriate subsets that can be selected as historical data for estimation of the software project in question. We present the results of validation of our technique against reported actual data, by evaluating it against a large project data set from IBM; therein, we show how our technique enables the selection of the appropriate trendline, thereby enabling more accurate effort estimates.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

自动化趋势线生成，用于准确的软件工作量估计

众所周知，准确的工作量估算是决定软件项目成功的关键因素之一。然而，正如任何项目经理所知道的那样，在实践中产生准确的估计已经被证明是极其困难的。即使是众所周知的评估技术，如COCOMO或slim，也不能保证一直有效。评估中的一个关键问题是选择适当的历史项目数据集作为可以生成评估的参考框架。根据我们在IBM中处理软件项目的经验，我们发现这是软件评估成功的最关键的决定因素;实际上，在评估期间选择错误的项目数据集可能会对软件项目造成灾难性的影响。这是因为从历史数据生成的趋势线(工作量相对于规模的图表)决定了软件项目的估计，错误的趋势线可能导致错误的估计。为此，在本文中，我们提出了一种自动化趋势线生成技术，用于改进软件项目中的工作量估算。我们的技术使用了我们设计的一种新的数据结构，称为Estimation Key-Map，它以多维格式表示项目数据。这种格式支持对项目数据进行动态分析并将其聚类到适当的子集中，这些子集可以被选择为评估所讨论的软件项目的历史数据。我们通过对来自IBM的大型项目数据集进行评估，提出了针对报告实际数据的技术验证结果;在这里，我们展示了我们的技术如何能够选择合适的趋势线，从而实现更准确的工作量估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity

自引率

0.00%

发文量

期刊最新文献

A programmatic introduction to Neo4j Advanced debugging techniques to identify concurrency bugs in actor-based programs When importless becomes meaningful DSLDI 2014: second workshop on domain specific languages design and implementation FPW'14: future programming workshop