Valter T. Yoshida Junior , Rafael Schiozer , Alan de Genaro , Toni R.E. dos Santos
{"title":"A novel credit model risk measure: Do more data lead to lower model risk?","authors":"Valter T. Yoshida Junior , Rafael Schiozer , Alan de Genaro , Toni R.E. dos Santos","doi":"10.1016/j.qref.2025.101960","DOIUrl":null,"url":null,"abstract":"<div><div>Large databases and Machine Learning enhance our capacity to develop models with many observations and explanatory variables. While the literature has primarily focused on optimizing classifications, little attention has been given to model risk, especially originating from inadequate use. To address this gap, we introduce a new metric for assessing model risk in credit applications. We test the metric using cross-section LASSO default models, each incorporating 200 thousand loan observations from several banks and more than 100 explanatory variables. The results indicate that models that use loans from a single bank have lower model risk than models using loans from the entire financial system. Therefore, adding loans from different banks to increase the number of observations in a model is suboptimal, challenging the widely accepted assumption that more data leads to better predictions.</div></div>","PeriodicalId":47962,"journal":{"name":"Quarterly Review of Economics and Finance","volume":"100 ","pages":"Article 101960"},"PeriodicalIF":2.9000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quarterly Review of Economics and Finance","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1062976925000018","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Large databases and Machine Learning enhance our capacity to develop models with many observations and explanatory variables. While the literature has primarily focused on optimizing classifications, little attention has been given to model risk, especially originating from inadequate use. To address this gap, we introduce a new metric for assessing model risk in credit applications. We test the metric using cross-section LASSO default models, each incorporating 200 thousand loan observations from several banks and more than 100 explanatory variables. The results indicate that models that use loans from a single bank have lower model risk than models using loans from the entire financial system. Therefore, adding loans from different banks to increase the number of observations in a model is suboptimal, challenging the widely accepted assumption that more data leads to better predictions.
期刊介绍:
The Quarterly Review of Economics and Finance (QREF) attracts and publishes high quality manuscripts that cover topics in the areas of economics, financial economics and finance. The subject matter may be theoretical, empirical or policy related. Emphasis is placed on quality, originality, clear arguments, persuasive evidence, intelligent analysis and clear writing. At least one Special Issue is published per year. These issues have guest editors, are devoted to a single theme and the papers have well known authors. In addition we pride ourselves in being able to provide three to four article "Focus" sections in most of our issues.