{"title":"利用聚类分析和堆叠集合进行房地产价格预测的混合机器学习模型架构","authors":"Cihan Çılgın, Hadi Gökçen","doi":"10.1007/s10614-024-10703-4","DOIUrl":null,"url":null,"abstract":"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>","PeriodicalId":50647,"journal":{"name":"Computational Economics","volume":"47 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction\",\"authors\":\"Cihan Çılgın, Hadi Gökçen\",\"doi\":\"10.1007/s10614-024-10703-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>\",\"PeriodicalId\":50647,\"journal\":{\"name\":\"Computational Economics\",\"volume\":\"47 1\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Economics\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1007/s10614-024-10703-4\",\"RegionNum\":4,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Economics","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1007/s10614-024-10703-4","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction
Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.
期刊介绍:
Computational Economics, the official journal of the Society for Computational Economics, presents new research in a rapidly growing multidisciplinary field that uses advanced computing capabilities to understand and solve complex problems from all branches in economics. The topics of Computational Economics include computational methods in econometrics like filtering, bayesian and non-parametric approaches, markov processes and monte carlo simulation; agent based methods, machine learning, evolutionary algorithms, (neural) network modeling; computational aspects of dynamic systems, optimization, optimal control, games, equilibrium modeling; hardware and software developments, modeling languages, interfaces, symbolic processing, distributed and parallel processing