Prediction Bounds for General-Error-Regression Cost-Estimating Relationships

Stephen A. Book
{"title":"Prediction Bounds for General-Error-Regression Cost-Estimating Relationships","authors":"Stephen A. Book","doi":"10.1080/1941658X.2012.682935","DOIUrl":null,"url":null,"abstract":"Estimating the cost of a system under development is essentially trying to predict the future, which means that any such estimate contains uncertainty. When estimating using a costestimating relationship (CER), a portion of this uncertainty arises from the possibility that the cost-estimating form to which regression analysis is applied may be the incorrect one. That is, the data may have been fit to a linear form, but some curvilinear relationship may more appropriately model the data. Assuming the algebraic model being used is the correct one, the CER’s uncertainty is described by its standard error of the estimate (SEE), which is basically the standard deviation of errors made (residuals) in applying that CER to estimate the (known) costs of the systems comprising the historical database. The SEE depends primarily on the extent to which those (known) costs fit the CER that purports to model them. Finally, additional uncertainty associated with a specific CER arises from the location of the particular cost-driver value (x) within or without the range of cost-driver values for programs comprising the historical cost database. For example, if x were located near the center of the range of its historical values, the CER would provide a more precise measure of the element’s cost than if x were located toward the edges or even outside the data range. The total uncertainty of CER-based estimates is a combination of all sources of uncertainty. The first kind of uncertainty mentioned, which questions the particular CER shape involved, cannot be measured without redoing the regression analysis for a wide variety of algebraic and other kinds of CER forms. Once we have decided upon a definite CER form, the SEE, represented by only one number characteristic of the CER, is fairly easy to measure for any CER shape or error model using known algebraic formulas. The second kind of uncertainty associated with a specific CER, which assesses both the CER itself and the value of the cost-driving parameter, is more complicated, and the way to account for it is completely understood only in the case of classical linear regression, i.e., ordinary least squares (OLS). As a result, explicit formulas exist for “prediction intervals” that bound cost estimates based on CERs that have been derived by applying OLS to historical cost data. For CERs, even linear ones, derived by other statistical methods, there appears to be no general method of solution described in the theoretical statistical literature. This report illustrates the application of bootstrap statistical sampling, a 34-year-old statistical process (Casella, 2003), to the problem of estimating prediction bounds for multiplicative-error and other CERs derived by non-OLS methods. After the bootstrap method is shown to be capable of yielding prediction bounds that approximate the known OLS bounds fairly","PeriodicalId":390877,"journal":{"name":"Journal of Cost Analysis and Parametrics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cost Analysis and Parametrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/1941658X.2012.682935","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Estimating the cost of a system under development is essentially trying to predict the future, which means that any such estimate contains uncertainty. When estimating using a costestimating relationship (CER), a portion of this uncertainty arises from the possibility that the cost-estimating form to which regression analysis is applied may be the incorrect one. That is, the data may have been fit to a linear form, but some curvilinear relationship may more appropriately model the data. Assuming the algebraic model being used is the correct one, the CER’s uncertainty is described by its standard error of the estimate (SEE), which is basically the standard deviation of errors made (residuals) in applying that CER to estimate the (known) costs of the systems comprising the historical database. The SEE depends primarily on the extent to which those (known) costs fit the CER that purports to model them. Finally, additional uncertainty associated with a specific CER arises from the location of the particular cost-driver value (x) within or without the range of cost-driver values for programs comprising the historical cost database. For example, if x were located near the center of the range of its historical values, the CER would provide a more precise measure of the element’s cost than if x were located toward the edges or even outside the data range. The total uncertainty of CER-based estimates is a combination of all sources of uncertainty. The first kind of uncertainty mentioned, which questions the particular CER shape involved, cannot be measured without redoing the regression analysis for a wide variety of algebraic and other kinds of CER forms. Once we have decided upon a definite CER form, the SEE, represented by only one number characteristic of the CER, is fairly easy to measure for any CER shape or error model using known algebraic formulas. The second kind of uncertainty associated with a specific CER, which assesses both the CER itself and the value of the cost-driving parameter, is more complicated, and the way to account for it is completely understood only in the case of classical linear regression, i.e., ordinary least squares (OLS). As a result, explicit formulas exist for “prediction intervals” that bound cost estimates based on CERs that have been derived by applying OLS to historical cost data. For CERs, even linear ones, derived by other statistical methods, there appears to be no general method of solution described in the theoretical statistical literature. This report illustrates the application of bootstrap statistical sampling, a 34-year-old statistical process (Casella, 2003), to the problem of estimating prediction bounds for multiplicative-error and other CERs derived by non-OLS methods. After the bootstrap method is shown to be capable of yielding prediction bounds that approximate the known OLS bounds fairly
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一般误差回归成本估计关系的预测界
评估开发中的系统的成本本质上是试图预测未来,这意味着任何这样的评估都包含不确定性。当使用成本估算关系(CER)进行估算时,这种不确定性的一部分源于应用回归分析的成本估算形式可能是不正确的。也就是说,数据可能符合线性形式,但某些曲线关系可能更适合对数据进行建模。假设所使用的代数模型是正确的,CER的不确定性由其估计的标准误差(SEE)来描述,这基本上是应用该CER来估计组成历史数据库的系统的(已知)成本时所产生的误差的标准偏差(残差)。SEE主要取决于这些(已知的)成本在多大程度上符合旨在为其建模的CER。最后,与特定CER相关的额外不确定性来自于特定成本驱动值(x)在包含历史成本数据库的项目的成本驱动值范围内或之外的位置。例如,如果x位于其历史值范围的中心附近,则CER将比x位于边缘甚至数据范围之外提供更精确的元素成本度量。基于cer的估计的总不确定性是所有不确定性来源的组合。提到的第一种不确定性,即对所涉及的特定CER形状提出质疑,如果不重新对各种代数和其他类型的CER形式进行回归分析,就无法测量。一旦我们确定了一个明确的CER形式,SEE,仅由CER的一个数字特征表示,对于任何CER形状或误差模型,使用已知的代数公式都是相当容易测量的。与特定CER相关的第二种不确定性,即评估CER本身和成本驱动参数的值,更为复杂,只有在经典线性回归的情况下,即普通最小二乘(OLS),才能完全理解解释它的方法。因此,存在明确的“预测区间”公式,该公式基于通过将OLS应用于历史成本数据而得出的CERs约束成本估算。对于由其他统计方法导出的cer,即使是线性cer,理论统计文献中似乎没有描述的一般求解方法。本报告说明了应用自举统计抽样,一个34岁的统计过程(Casella, 2003),以估计乘法误差和其他非ols方法得出的CERs的预测界限的问题。在证明了自举法能够产生与已知OLS边界相当近似的预测边界之后
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Editorial Board EOV Multiproduct Cost-Volume-Profit Model: A Resource Reallocation Approach for Decision Making Dynamics of New Building Construction Costs: Implications for Forecasting Escalation Allowances Balancing Expert Opinion and Historical Data: The Case of Baseball Umpires Using Robust Statistical Methodology to Evaluate the Cost Performance of Project Delivery Systems: A Case Study of Horizontal Construction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1