{"title":"The effects of data mining techniques on software cost estimation","authors":"Karen T. Lum, Daniel R. Baker, J. Hihn","doi":"10.1109/IEMCE.2008.4617949","DOIUrl":null,"url":null,"abstract":"Current research at JPL incorporates data mining and machine learning techniques to see whether a better software cost model can be developed. 2CEE is a tool developed for developing new software cost estimation models using data mining techniques. The accuracy of these models has been validated internally through leave-one out cross validation. However, the newly generated models have not been validated to see how well they predict in the real world. Our study seeks to find out how well these machine learning based models perform against standard models for eighteen new flight and ground software projects. The accurate performance of the models against current real world projects is extremely important for practitioners to adapt new techniques.","PeriodicalId":408691,"journal":{"name":"2008 IEEE International Engineering Management Conference","volume":"202 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Engineering Management Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCE.2008.4617949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
Current research at JPL incorporates data mining and machine learning techniques to see whether a better software cost model can be developed. 2CEE is a tool developed for developing new software cost estimation models using data mining techniques. The accuracy of these models has been validated internally through leave-one out cross validation. However, the newly generated models have not been validated to see how well they predict in the real world. Our study seeks to find out how well these machine learning based models perform against standard models for eighteen new flight and ground software projects. The accurate performance of the models against current real world projects is extremely important for practitioners to adapt new techniques.