利用聚类分析和堆叠集合进行房地产价格预测的混合机器学习模型架构

IF 1.9 4区 经济学 Q2 ECONOMICS Computational Economics Pub Date : 2024-08-20 DOI:10.1007/s10614-024-10703-4
Cihan Çılgın, Hadi Gökçen
{"title":"利用聚类分析和堆叠集合进行房地产价格预测的混合机器学习模型架构","authors":"Cihan Çılgın, Hadi Gökçen","doi":"10.1007/s10614-024-10703-4","DOIUrl":null,"url":null,"abstract":"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>","PeriodicalId":50647,"journal":{"name":"Computational Economics","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction\",\"authors\":\"Cihan Çılgın, Hadi Gökçen\",\"doi\":\"10.1007/s10614-024-10703-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.</p>\",\"PeriodicalId\":50647,\"journal\":{\"name\":\"Computational Economics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Economics\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1007/s10614-024-10703-4\",\"RegionNum\":4,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Economics","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1007/s10614-024-10703-4","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

摘要

人口的增长、科技的飞速发展、生活水平的提高、家庭结构和社会经济结构的变化、城市化进程的高速发展以及租房或购房需求的增加,都使房地产市场不断扩大,也使房地产市场更加活跃。房地产市场的这种激烈活动也成正比地加速了房地产价格预测研究。本研究的目的是提出一种模型架构,通过使用混合方法,通过聚类模型作为初步方法,实现对房地产当前市场价值的高精度预测,从而通过使用多种机器学习方法的堆叠集合实现更高的同质性。为了获得同质性更高的子市场,首先将收集到的数据集按照房间数量进行分组,然后通过聚类分析将每个组划分为若干个聚类。通过这种方法,可以获得更多同质的子市场,并提高预测的准确性。然后,针对每个确定的子市场,使用五倍交叉验证对 13 个不同的弱学习器进行训练。对每个弱学习器分别进行了特征选择和参数优化。然后,根据结果最佳的特征和参数集获得的预测结果被用于训练元学习器。整个过程的结果是,最终预测结果由错误率最低的元学习器生成。研究结果表明,即使在许多不同的房地产子市场价格波动较大的时期,也能显示出符合国际标准的高预测性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Hybrid Machine Learning Model Architecture with Clustering Analysis and Stacking Ensemble for Real Estate Price Prediction

Population growth, rapid developments in technology, increase in living standards, changes in the household structure and economic structure of societies, and the increase in urbanization at very high rates, as well as the increase in the demand for renting or purchasing real estate, have both expanded the real estate market and made it more active. This intense activity in the real estate markets also accelerates real estate price prediction studies in direct proportion. The aim of this study is to present a model architecture that can achieve high accuracy in predicting the current market value of real estates by using a hybrid approach, through clustering models as a preliminary approach, in order to achieve higher homogeneity with stacking ensemble using multiple machine learning methods. In order to obtain more homogeneous submarkets, the collected data set was first grouped according to the number of rooms and then each group was divided into clusters by cluster analysis. In this way, more homogeneous submarkets were obtained and predict accuracy was improved. Then, the training process was carried out for 13 different weak learners using fivefold cross-validation for each determined sub-market. Feature selection and parameter optimization were performed separately for each weak learner. Then, the predictions obtained according to the feature and parameter set that gave the best results were used to train the meta-learner. As a result of this entire process, the final prediction was created with the meta learner that gave the least error rate. As the findings show, high predicting performance at international standards has been demonstrated even in a period of high price fluctuations for many and various sub-markets of real estate.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computational Economics
Computational Economics MATHEMATICS, INTERDISCIPLINARY APPLICATIONS-
CiteScore
4.00
自引率
15.00%
发文量
119
审稿时长
12 months
期刊介绍: Computational Economics, the official journal of the Society for Computational Economics, presents new research in a rapidly growing multidisciplinary field that uses advanced computing capabilities to understand and solve complex problems from all branches in economics. The topics of Computational Economics include computational methods in econometrics like filtering, bayesian and non-parametric approaches, markov processes and monte carlo simulation; agent based methods, machine learning, evolutionary algorithms, (neural) network modeling; computational aspects of dynamic systems, optimization, optimal control, games, equilibrium modeling; hardware and software developments, modeling languages, interfaces, symbolic processing, distributed and parallel processing
期刊最新文献
Assessing the Dual Impact of the Social Media Platforms on Psychological Well-being: A Multiple-Option Descriptive-Predictive Framework Modeling Asset Price Process: An Approach for Imaging Price Chart with Generative Diffusion Models Is the Price of Ether Driven by Demand or Pure Speculation? Iterative Deep Learning Approach to Active Portfolio Management with Sentiment Factors Asset Prices with Investor Protection in the Cross-Sectional Economy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1