{"title":"Optimization driven data mining and credit scoring","authors":"R. Grossman, H. Poor","doi":"10.1109/CIFER.1996.501831","DOIUrl":null,"url":null,"abstract":"An optimization tree approach to the mining of very extensive and complex databases for performance optimizing opportunities is described. This methodology is based on a combination of three innovations: a data management system designed explicitly for data intensive computing; a distributed algorithm for growing classification and regression trees (CART); and a tree based stochastic programming paradigm for the selection of control attributes to optimize a specified objective function. This methodology provides a general technique for optimization in financial applications that is scalable as the number of objects in the database and as the number of attributes per object grow. This scalability allows for a complete data driven analysis of large scale data sets, without the need to restrict attention to sparsely sampled data sets that limits previous methods.","PeriodicalId":378565,"journal":{"name":"IEEE/IAFE 1996 Conference on Computational Intelligence for Financial Engineering (CIFEr)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE/IAFE 1996 Conference on Computational Intelligence for Financial Engineering (CIFEr)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIFER.1996.501831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
An optimization tree approach to the mining of very extensive and complex databases for performance optimizing opportunities is described. This methodology is based on a combination of three innovations: a data management system designed explicitly for data intensive computing; a distributed algorithm for growing classification and regression trees (CART); and a tree based stochastic programming paradigm for the selection of control attributes to optimize a specified objective function. This methodology provides a general technique for optimization in financial applications that is scalable as the number of objects in the database and as the number of attributes per object grow. This scalability allows for a complete data driven analysis of large scale data sets, without the need to restrict attention to sparsely sampled data sets that limits previous methods.