Machine learning applied to estate pricing for residential rentals in dynamic urban markets—The case of São Paulo city

IF 4.2 2区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY Engineering Analysis with Boundary Elements Pub Date : 2024-10-08 DOI:10.1016/j.enganabound.2024.105988
Wesley F. Maia , Sergio A. David
{"title":"Machine learning applied to estate pricing for residential rentals in dynamic urban markets—The case of São Paulo city","authors":"Wesley F. Maia ,&nbsp;Sergio A. David","doi":"10.1016/j.enganabound.2024.105988","DOIUrl":null,"url":null,"abstract":"<div><div>This study conducts a comprehensive investigation into real estate rental pricing in São Paulo city, employing an innovative approach that combines advanced machine learning techniques with geospatial and natural language processing (NLP) analyses. The research analyzed a robust dataset comprising 47,243 rental listings, gathered through web scraping techniques. Following a rigorous data cleaning and preprocessing procedure, the study focused on 35,486 instances, incorporating a variety of variables that go beyond conventional metrics, including textual descriptions and geographic information, enriching the analysis and market understanding. Several regression models were implemented and compared, including linear approaches, Support Vector Machines, and ensemble methods such as Gradient Boosting, LightGBM, and XGBoost. The Blending model, which integrates multiple modeling techniques, stood out as the most accurate, achieving a Root Mean Squared Logarithmic Error (RMSLE) of 0.2923 on the test set. This result emphasizes the superiority of hybrid modeling strategies in complex pricing tasks. The findings of this study have significant practical implications. They provide landlords and tenants with a powerful data-driven tool for informed decision-making, reflecting the nuances and complexity of São Paulo’s real estate market. The practical implementation of the model in an interactive web application not only demonstrates its utility in the real-world scenario but also serves as a model for future applications in real estate analysis. This work contributes to mitigating the waste of time and energy when it comes to searching for and pricing residential rentals in a large city, through the use of machine learning that shows its power and potential in accurately estimating rental prices in dynamic urban markets, allowing that more assertive and economical decisions can be taken within a social-sustainable-technological perspective.</div></div>","PeriodicalId":51039,"journal":{"name":"Engineering Analysis with Boundary Elements","volume":"169 ","pages":"Article 105988"},"PeriodicalIF":4.2000,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Analysis with Boundary Elements","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0955799724004612","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

This study conducts a comprehensive investigation into real estate rental pricing in São Paulo city, employing an innovative approach that combines advanced machine learning techniques with geospatial and natural language processing (NLP) analyses. The research analyzed a robust dataset comprising 47,243 rental listings, gathered through web scraping techniques. Following a rigorous data cleaning and preprocessing procedure, the study focused on 35,486 instances, incorporating a variety of variables that go beyond conventional metrics, including textual descriptions and geographic information, enriching the analysis and market understanding. Several regression models were implemented and compared, including linear approaches, Support Vector Machines, and ensemble methods such as Gradient Boosting, LightGBM, and XGBoost. The Blending model, which integrates multiple modeling techniques, stood out as the most accurate, achieving a Root Mean Squared Logarithmic Error (RMSLE) of 0.2923 on the test set. This result emphasizes the superiority of hybrid modeling strategies in complex pricing tasks. The findings of this study have significant practical implications. They provide landlords and tenants with a powerful data-driven tool for informed decision-making, reflecting the nuances and complexity of São Paulo’s real estate market. The practical implementation of the model in an interactive web application not only demonstrates its utility in the real-world scenario but also serves as a model for future applications in real estate analysis. This work contributes to mitigating the waste of time and energy when it comes to searching for and pricing residential rentals in a large city, through the use of machine learning that shows its power and potential in accurately estimating rental prices in dynamic urban markets, allowing that more assertive and economical decisions can be taken within a social-sustainable-technological perspective.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
将机器学习应用于动态城市市场住宅租赁的房地产定价--圣保罗市的案例
本研究采用创新方法,将先进的机器学习技术与地理空间和自然语言处理(NLP)分析相结合,对圣保罗市的房地产租赁定价进行了全面调查。该研究分析了一个强大的数据集,其中包括通过网络刮擦技术收集的 47,243 份租房信息。经过严格的数据清理和预处理程序后,研究重点放在 35,486 个实例上,并纳入了各种超出传统指标的变量,包括文本描述和地理信息,从而丰富了分析和市场理解。实施并比较了几种回归模型,包括线性方法、支持向量机以及梯度提升、LightGBM 和 XGBoost 等集合方法。整合了多种建模技术的混合模型最为准确,在测试集上的均方根对数误差 (RMSLE) 为 0.2923。这一结果凸显了混合建模策略在复杂定价任务中的优越性。这项研究的结果具有重要的现实意义。它们为业主和租户提供了一个强大的数据驱动工具,用于做出明智的决策,反映了圣保罗房地产市场的细微差别和复杂性。该模型在交互式网络应用程序中的实际应用不仅证明了其在现实世界中的实用性,还为未来的房地产分析应用树立了典范。这项工作通过使用机器学习,展示了其在动态城市市场中准确估算租金价格的能力和潜力,有助于减少在大城市中搜索和定价住宅租金时的时间和精力浪费,从而可以从社会可持续发展技术的角度做出更加果断和经济的决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Engineering Analysis with Boundary Elements
Engineering Analysis with Boundary Elements 工程技术-工程:综合
CiteScore
5.50
自引率
18.20%
发文量
368
审稿时长
56 days
期刊介绍: This journal is specifically dedicated to the dissemination of the latest developments of new engineering analysis techniques using boundary elements and other mesh reduction methods. Boundary element (BEM) and mesh reduction methods (MRM) are very active areas of research with the techniques being applied to solve increasingly complex problems. The journal stresses the importance of these applications as well as their computational aspects, reliability and robustness. The main criteria for publication will be the originality of the work being reported, its potential usefulness and applications of the methods to new fields. In addition to regular issues, the journal publishes a series of special issues dealing with specific areas of current research. The journal has, for many years, provided a channel of communication between academics and industrial researchers working in mesh reduction methods Fields Covered: • Boundary Element Methods (BEM) • Mesh Reduction Methods (MRM) • Meshless Methods • Integral Equations • Applications of BEM/MRM in Engineering • Numerical Methods related to BEM/MRM • Computational Techniques • Combination of Different Methods • Advanced Formulations.
期刊最新文献
Special inclusion elements for thermal analysis of composite materials Optimally shaped nanotubes for field concentration Fluid topology optimization using quadtree-based scaled boundary finite element method Efficient exact quadrature of regular solid harmonics times polynomials over simplices in R3 Modified space-time radial basis function collocation method for solving three-dimensional transient elastodynamic problems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1