Sana Hassan Imam, S. Huhn, Lars Hornu, Rolf Drechsler
{"title":"A Novel Default Risk Prediction and Feature Importance Analysis Technique for Marketplace Lending using Machine Learning","authors":"Sana Hassan Imam, S. Huhn, Lars Hornu, Rolf Drechsler","doi":"10.3790/ccm.56.1.27","DOIUrl":null,"url":null,"abstract":"Marketplace lending has fundamentally changed the relationship between borrowers and lenders in financial markets. As with many other financial products that have emerged in recent years, internet-based investors may be inexperienced in marketplace lending, highlighting the importance of forecasting default rates and evaluating default features such as the loan amount, interest rates, and FICO score. Potential borrowers on marketplace lending platforms may already have been rejected by banks as too risky to lend to, which amplifies the problem of asymmetric information. This paper proposes a holistic data processing flow for the loan status classification of marketplace lending multivariate time series data by using the Bidirectional Long Short-Term Memory model (BiLSTM) to predict “non-default,” “distressed,” and “default” loan status, which outperforms conventional techniques. We adopt the SHapely Additive exPlanations (SHAP) and a four-step ahead model, allowing us to extract the most significant features for default risk assessment. Using our approach, lenders and regulators can identify the most relevant features to enhance the default risk assessment method over time in addition to early risk prediction.","PeriodicalId":36966,"journal":{"name":"Credit and Capital Markets","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Credit and Capital Markets","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3790/ccm.56.1.27","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 0
Abstract
Marketplace lending has fundamentally changed the relationship between borrowers and lenders in financial markets. As with many other financial products that have emerged in recent years, internet-based investors may be inexperienced in marketplace lending, highlighting the importance of forecasting default rates and evaluating default features such as the loan amount, interest rates, and FICO score. Potential borrowers on marketplace lending platforms may already have been rejected by banks as too risky to lend to, which amplifies the problem of asymmetric information. This paper proposes a holistic data processing flow for the loan status classification of marketplace lending multivariate time series data by using the Bidirectional Long Short-Term Memory model (BiLSTM) to predict “non-default,” “distressed,” and “default” loan status, which outperforms conventional techniques. We adopt the SHapely Additive exPlanations (SHAP) and a four-step ahead model, allowing us to extract the most significant features for default risk assessment. Using our approach, lenders and regulators can identify the most relevant features to enhance the default risk assessment method over time in addition to early risk prediction.