基于树的ML模型和IML技术对比利时住宅房地产租金的可预测性

Ian Lenaers, Kris Boudt, Lieven De Moor
{"title":"基于树的ML模型和IML技术对比利时住宅房地产租金的可预测性","authors":"Ian Lenaers, Kris Boudt, Lieven De Moor","doi":"10.1108/ijhma-11-2022-0172","DOIUrl":null,"url":null,"abstract":"\nPurpose\nThe purpose is twofold. First, this study aims to establish that black box tree-based machine learning (ML) models have better predictive performance than a standard linear regression (LR) hedonic model for rent prediction. Second, it shows the added value of analyzing tree-based ML models with interpretable machine learning (IML) techniques.\n\n\nDesign/methodology/approach\nData on Belgian residential rental properties were collected. Tree-based ML models, random forest regression and eXtreme gradient boosting regression were applied to derive rent prediction models to compare predictive performance with a LR model. Interpretations of the tree-based models regarding important factors in predicting rent were made using SHapley Additive exPlanations (SHAP) feature importance (FI) plots and SHAP summary plots.\n\n\nFindings\nResults indicate that tree-based models perform better than a LR model for Belgian residential rent prediction. The SHAP FI plots agree that asking price, cadastral income, surface livable, number of bedrooms, number of bathrooms and variables measuring the proximity to points of interest are dominant predictors. The direction of relationships between rent and its factors is determined with SHAP summary plots. In addition to linear relationships, it emerges that nonlinear relationships exist.\n\n\nOriginality/value\nRent prediction using ML is relatively less studied than house price prediction. In addition, studying prediction models using IML techniques is relatively new in real estate economics. Moreover, to the best of the authors’ knowledge, this study is the first to derive insights of driving determinants of predicted rents from SHAP FI and SHAP summary plots.\n","PeriodicalId":14136,"journal":{"name":"International Journal of Housing Markets and Analysis","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2023-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Predictability of Belgian residential real estate rents using tree-based ML models and IML techniques\",\"authors\":\"Ian Lenaers, Kris Boudt, Lieven De Moor\",\"doi\":\"10.1108/ijhma-11-2022-0172\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\nPurpose\\nThe purpose is twofold. First, this study aims to establish that black box tree-based machine learning (ML) models have better predictive performance than a standard linear regression (LR) hedonic model for rent prediction. Second, it shows the added value of analyzing tree-based ML models with interpretable machine learning (IML) techniques.\\n\\n\\nDesign/methodology/approach\\nData on Belgian residential rental properties were collected. Tree-based ML models, random forest regression and eXtreme gradient boosting regression were applied to derive rent prediction models to compare predictive performance with a LR model. Interpretations of the tree-based models regarding important factors in predicting rent were made using SHapley Additive exPlanations (SHAP) feature importance (FI) plots and SHAP summary plots.\\n\\n\\nFindings\\nResults indicate that tree-based models perform better than a LR model for Belgian residential rent prediction. The SHAP FI plots agree that asking price, cadastral income, surface livable, number of bedrooms, number of bathrooms and variables measuring the proximity to points of interest are dominant predictors. The direction of relationships between rent and its factors is determined with SHAP summary plots. In addition to linear relationships, it emerges that nonlinear relationships exist.\\n\\n\\nOriginality/value\\nRent prediction using ML is relatively less studied than house price prediction. In addition, studying prediction models using IML techniques is relatively new in real estate economics. Moreover, to the best of the authors’ knowledge, this study is the first to derive insights of driving determinants of predicted rents from SHAP FI and SHAP summary plots.\\n\",\"PeriodicalId\":14136,\"journal\":{\"name\":\"International Journal of Housing Markets and Analysis\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2023-04-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Housing Markets and Analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1108/ijhma-11-2022-0172\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"URBAN STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Housing Markets and Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ijhma-11-2022-0172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"URBAN STUDIES","Score":null,"Total":0}
引用次数: 1

摘要

目的目的是双重的。首先,本研究旨在证明基于黑箱树的机器学习(ML)模型比标准线性回归(LR)特征模型具有更好的租金预测性能。其次,它展示了使用可解释机器学习(IML)技术分析基于树的ML模型的附加值。设计/方法/方法收集了比利时住宅租赁物业的数据。基于树的ML模型、随机森林回归和极限梯度增强回归被应用于推导租金预测模型,以将预测性能与LR模型进行比较。使用SHapley加性exPlanations(SHAP)特征重要性(FI)图和SHAP汇总图对租金预测中的重要因素的基于树的模型进行了解释。结果表明,在比利时住宅租金预测中,基于树的模型比LR模型表现更好。SHAP FI图一致认为,要价、地籍收入、地表宜居性、卧室数量、浴室数量和测量兴趣点接近程度的变量是主要预测因素。租金及其影响因素之间的关系方向是通过SHAP汇总图确定的。除了线性关系之外,还存在非线性关系。与房价预测相比,使用ML进行的原创/价值租金预测研究相对较少。此外,使用IML技术研究预测模型在房地产经济学中相对较新。此外,据作者所知,本研究首次从SHAP FI和SHAP汇总图中得出预测租金的驱动决定因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Predictability of Belgian residential real estate rents using tree-based ML models and IML techniques
Purpose The purpose is twofold. First, this study aims to establish that black box tree-based machine learning (ML) models have better predictive performance than a standard linear regression (LR) hedonic model for rent prediction. Second, it shows the added value of analyzing tree-based ML models with interpretable machine learning (IML) techniques. Design/methodology/approach Data on Belgian residential rental properties were collected. Tree-based ML models, random forest regression and eXtreme gradient boosting regression were applied to derive rent prediction models to compare predictive performance with a LR model. Interpretations of the tree-based models regarding important factors in predicting rent were made using SHapley Additive exPlanations (SHAP) feature importance (FI) plots and SHAP summary plots. Findings Results indicate that tree-based models perform better than a LR model for Belgian residential rent prediction. The SHAP FI plots agree that asking price, cadastral income, surface livable, number of bedrooms, number of bathrooms and variables measuring the proximity to points of interest are dominant predictors. The direction of relationships between rent and its factors is determined with SHAP summary plots. In addition to linear relationships, it emerges that nonlinear relationships exist. Originality/value Rent prediction using ML is relatively less studied than house price prediction. In addition, studying prediction models using IML techniques is relatively new in real estate economics. Moreover, to the best of the authors’ knowledge, this study is the first to derive insights of driving determinants of predicted rents from SHAP FI and SHAP summary plots.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.80
自引率
29.40%
发文量
68
期刊最新文献
Comparative analysis of machine learning models in predicting housing prices: a case study of Prishtina's real estate market The exchange rates volatilities impact on the stock and real estate markets in South Africa Housing affordability and housing demand assessment for urban poor in India using the hedonic model Short-run dynamics and long-run effects of monetary policy on residential property prices in South Africa Relationship between housing, oil, gold and stock markets: evidence from UK and Norway
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1