利用聚类分析和堆叠集合进行房地产价格预测的混合机器学习模型架构

IF 2.2 4区经济学 Q2 ECONOMICS Computational Economics Pub Date : 2024-08-20 DOI:10.1007/s10614-024-10703-4

Cihan Çılgın, Hadi Gökçen

{"title":"利用聚类分析和堆叠集合进行房地产价格预测的混合机器学习模型架构","authors":"Cihan Çılgın, Hadi Gökçen","doi":"10.1007/s10614-024-10703-4","DOIUrl":null,"url":null,"abstract":"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>","PeriodicalId":50647,"journal":{"name":"Computational Economics","volume":"47 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction\",\"authors\":\"Cihan Çılgın, Hadi Gökçen\",\"doi\":\"10.1007/s10614-024-10703-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>\",\"PeriodicalId\":50647,\"journal\":{\"name\":\"Computational Economics\",\"volume\":\"47 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Economics\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1007/s10614-024-10703-4\",\"RegionNum\":4,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Economics","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1007/s10614-024-10703-4","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

摘要

人口的增长、科技的飞速发展、生活水平的提高、家庭结构和社会经济结构的变化、城市化进程的高速发展以及租房或购房需求的增加，都使房地产市场不断扩大，也使房地产市场更加活跃。房地产市场的这种激烈活动也成正比地加速了房地产价格预测研究。本研究的目的是提出一种模型架构，通过使用混合方法，通过聚类模型作为初步方法，实现对房地产当前市场价值的高精度预测，从而通过使用多种机器学习方法的堆叠集合实现更高的同质性。为了获得同质性更高的子市场，首先将收集到的数据集按照房间数量进行分组，然后通过聚类分析将每个组划分为若干个聚类。通过这种方法，可以获得更多同质的子市场，并提高预测的准确性。然后，针对每个确定的子市场，使用五倍交叉验证对 13 个不同的弱学习器进行训练。对每个弱学习器分别进行了特征选择和参数优化。然后，根据结果最佳的特征和参数集获得的预测结果被用于训练元学习器。整个过程的结果是，最终预测结果由错误率最低的元学习器生成。研究结果表明，即使在许多不同的房地产子市场价格波动较大的时期，也能显示出符合国际标准的高预测性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction

Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computational Economics MATHEMATICS, INTERDISCIPLINARY APPLICATIONS-

CiteScore

4.00

自引率

15.00%

发文量

119

审稿时长

12 months

期刊介绍： Computational Economics, the official journal of the Society for Computational Economics, presents new research in a rapidly growing multidisciplinary field that uses advanced computing capabilities to understand and solve complex problems from all branches in economics. The topics of Computational Economics include computational methods in econometrics like filtering, bayesian and non-parametric approaches, markov processes and monte carlo simulation; agent based methods, machine learning, evolutionary algorithms, (neural) network modeling; computational aspects of dynamic systems, optimization, optimal control, games, equilibrium modeling; hardware and software developments, modeling languages, interfaces, symbolic processing, distributed and parallel processing