Using shrinkage for data-driven automated valuation model specification – a case study from Berlin

IF 2.1 Q2 URBAN STUDIES Journal of Property Research Pub Date : 2021-04-03 DOI:10.1080/09599916.2021.1905690
Nils Hinrichs, Jens Kolbe, A. Werwatz
{"title":"Using shrinkage for data-driven automated valuation model specification – a case study from Berlin","authors":"Nils Hinrichs, Jens Kolbe, A. Werwatz","doi":"10.1080/09599916.2021.1905690","DOIUrl":null,"url":null,"abstract":"ABSTRACT We study whether data-driven AVM specification that combines a flexible-yet-simple regression model with shrinkage estimators considerably improves upon the prediction accuracy of a conventional hedonic model. A rolling window prediction comparison based on all condominium sales in Berlin, Germany, between 1996 and 2013 delivered the following results. The highly parameterised model can result in extreme errors if the flexible model, which employs roughly 3,800 variables, is estimated by OLS and even if shrinkage is applied via Ridge regression. Once the most extreme errors are disregarded, Ridge regression appears as the clear winner of the prediction comparison. It is the only procedure that delivers a considerable reduction in the root mean squared prediction error relative to a parsimonious benchmark model (estimated via OLS). Of the two procedures with variable selection capability, Elastic Net delivers a slightly better prediction performance. Lasso, on the other hand, acts considerably more as a selector and typically sets the bulk of the several thousand coefficients to zero. Both procedures largely agree in terms of which characteristics they frequently select: core characteristics of hedonic pricing such as floor space, building age and location dummies.","PeriodicalId":45726,"journal":{"name":"Journal of Property Research","volume":"38 1","pages":"130 - 153"},"PeriodicalIF":2.1000,"publicationDate":"2021-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/09599916.2021.1905690","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Property Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/09599916.2021.1905690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"URBAN STUDIES","Score":null,"Total":0}
引用次数: 3

Abstract

ABSTRACT We study whether data-driven AVM specification that combines a flexible-yet-simple regression model with shrinkage estimators considerably improves upon the prediction accuracy of a conventional hedonic model. A rolling window prediction comparison based on all condominium sales in Berlin, Germany, between 1996 and 2013 delivered the following results. The highly parameterised model can result in extreme errors if the flexible model, which employs roughly 3,800 variables, is estimated by OLS and even if shrinkage is applied via Ridge regression. Once the most extreme errors are disregarded, Ridge regression appears as the clear winner of the prediction comparison. It is the only procedure that delivers a considerable reduction in the root mean squared prediction error relative to a parsimonious benchmark model (estimated via OLS). Of the two procedures with variable selection capability, Elastic Net delivers a slightly better prediction performance. Lasso, on the other hand, acts considerably more as a selector and typically sets the bulk of the several thousand coefficients to zero. Both procedures largely agree in terms of which characteristics they frequently select: core characteristics of hedonic pricing such as floor space, building age and location dummies.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
将收缩用于数据驱动的自动估价模型规范——来自柏林的案例研究
摘要我们研究了数据驱动的AVM规范,该规范将灵活而简单的回归模型与收缩估计量相结合,是否显著提高了传统特征模型的预测精度。基于1996年至2013年间德国柏林所有公寓销售的滚动窗口预测比较得出了以下结果。如果使用大约3800个变量的灵活模型由OLS估计,即使通过岭回归应用收缩,高度参数化的模型也可能导致极端误差。一旦忽略了最极端的误差,岭回归就成为预测比较的明显赢家。相对于简约基准模型(通过OLS估计),这是唯一一个显著降低均方根预测误差的过程。在具有变量选择功能的两个过程中,Elastic Net提供了略好的预测性能。另一方面,拉索更像是一个选择器,通常会将数千个系数中的大部分设置为零。这两种程序在他们经常选择的特征方面基本一致:享乐定价的核心特征,如占地面积、建筑年龄和位置假人。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
3.80
自引率
5.30%
发文量
13
期刊介绍: The Journal of Property Research is an international journal. The title reflects the expansion of research, particularly applied research, into property investment and development. The Journal of Property Research publishes papers in any area of real estate investment and development. These may be theoretical, empirical, case studies or critical literature surveys.
期刊最新文献
Digitalisation and valuations: an empirical analysis of valuers’ supplemental skills requirements From agriculture to new town: land conversion towards new-build gentrification in the southwest of Jakarta, Indonesia The impact of corporate governance and corporate social responsibility on SA REITs’ performance Do private rental tenants pay for energy efficiency?: The dynamics of green premiums and brown discounts Changes in risk appreciation, and short memory of house buyers when the market is hot, a case study of Christchurch, New Zealand
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1