J. Heidrich, Adam Trendowicz, Jürgen Münch, Yasushi Ishigai, Kenji Yokoyama, Nahomi Kikuchi, Takashi Kawaguchi
{"title":"将数据驱动的成本估算应用于工业数据集的经验教训和结果","authors":"J. Heidrich, Adam Trendowicz, Jürgen Münch, Yasushi Ishigai, Kenji Yokoyama, Nahomi Kikuchi, Takashi Kawaguchi","doi":"10.1109/QUATIC.2007.16","DOIUrl":null,"url":null,"abstract":"The increasing availability of cost-relevant data in industry allows companies to apply data-intensive estimation methods. However, available data are often inconsistent, invalid, or incomplete, so that most of the existing data-intensive estimation methods cannot be applied. Only few estimation methods can deal with imperfect data to a certain extent (e.g., optimized set reduction, OSR). Results from evaluating these methods in practical environments are rare. This article describes a case study on the application of OSR at Toshiba information systems (Japan) corporation. An important result of the case study is that estimation accuracy significantly varies with the data sets used and the way of preprocessing these data. The study supports current results in the area of quantitative cost estimation and clearly illustrates typical problems. Experiences, lessons learned, and recommendations with respect to data preprocessing and data-intensive cost estimation in general are presented.","PeriodicalId":236466,"journal":{"name":"6th International Conference on the Quality of Information and Communications Technology (QUATIC 2007)","volume":"38 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lessons Learned and Results from Applying Data-Driven Cost Estimation to Industrial Data Sets\",\"authors\":\"J. Heidrich, Adam Trendowicz, Jürgen Münch, Yasushi Ishigai, Kenji Yokoyama, Nahomi Kikuchi, Takashi Kawaguchi\",\"doi\":\"10.1109/QUATIC.2007.16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasing availability of cost-relevant data in industry allows companies to apply data-intensive estimation methods. However, available data are often inconsistent, invalid, or incomplete, so that most of the existing data-intensive estimation methods cannot be applied. Only few estimation methods can deal with imperfect data to a certain extent (e.g., optimized set reduction, OSR). Results from evaluating these methods in practical environments are rare. This article describes a case study on the application of OSR at Toshiba information systems (Japan) corporation. An important result of the case study is that estimation accuracy significantly varies with the data sets used and the way of preprocessing these data. The study supports current results in the area of quantitative cost estimation and clearly illustrates typical problems. Experiences, lessons learned, and recommendations with respect to data preprocessing and data-intensive cost estimation in general are presented.\",\"PeriodicalId\":236466,\"journal\":{\"name\":\"6th International Conference on the Quality of Information and Communications Technology (QUATIC 2007)\",\"volume\":\"38 5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"6th International Conference on the Quality of Information and Communications Technology (QUATIC 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/QUATIC.2007.16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"6th International Conference on the Quality of Information and Communications Technology (QUATIC 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/QUATIC.2007.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Lessons Learned and Results from Applying Data-Driven Cost Estimation to Industrial Data Sets
The increasing availability of cost-relevant data in industry allows companies to apply data-intensive estimation methods. However, available data are often inconsistent, invalid, or incomplete, so that most of the existing data-intensive estimation methods cannot be applied. Only few estimation methods can deal with imperfect data to a certain extent (e.g., optimized set reduction, OSR). Results from evaluating these methods in practical environments are rare. This article describes a case study on the application of OSR at Toshiba information systems (Japan) corporation. An important result of the case study is that estimation accuracy significantly varies with the data sets used and the way of preprocessing these data. The study supports current results in the area of quantitative cost estimation and clearly illustrates typical problems. Experiences, lessons learned, and recommendations with respect to data preprocessing and data-intensive cost estimation in general are presented.