Bi在金融领域的应用——基于混合建模的零售贷款信用评分方法

S. Chandrasekhar, B. Tech, .. M.Tec
{"title":"Bi在金融领域的应用——基于混合建模的零售贷款信用评分方法","authors":"S. Chandrasekhar, B. Tech, .. M.Tec","doi":"10.1037/e525192013-023","DOIUrl":null,"url":null,"abstract":"Retail Loans now-a-days form a major proportion of Loan Portfolio. Broadly they can be classified as (i) Loans for Small and medium Sector and (ii) Loans for Individuals. The objective of Credit Scoring is that we use enough of precaution before the sanction of the loan so that the loans do not go bad after disbursement. This will increase to the bottom line of the financial institution and also reduce the Credit Risk. Techniques used to perform Credit Scoring Varies for the above two classes of loans. In this paper, we concentrate on the application of Credit Scoring for individual or so called personal loans like – Auto loan, buying goods like Televisions, Refrigerators etc. Large numbers of loans are being disbursed in these areas. Though the size of the loan may be small, when compared to Small/Medium Scale Industry, if one does not control the defaults, the consequences will be disastrous. From the characteristics of borrower, product characteristics a Credit Score is computed for each applicant. If the Score exceeds a given threshold loan is sanctioned. If it is below the threshold, loan is sanctioned. If it is below the threshold, loan is rejected. In practice a buffer zone is created near the threshold so that those Credit Scores that fall in buffer zone, detailed investigation will be done before a decision is taken. Two broad classes of Scoring Model exists (i) Subjective Scoring and (ii) Statistical Scoring. Subjective Scoring is based on intuitive judgement. Subjective Scoring works but there is scope for improvement one limitation is prediction of risk is person dependent and focuses on few characteristics and may be mistakenly focusing on wrong characteristics. Statistical Scoring uses hardcore data of borrower characteristics, product characteristics and uses mathematical models to predict the risk. The relation is expressed in the form of an equation which finally gets converted to a score. Subjectivity will be reduced and variable(s) that are important to scoring are identified based on strong mathematical foundation. Different Models have been used in Credit Scoring like Regression, Decision Tree, Discriminate Analysis and Logistic  Regression. Most of the times, a single model is used to compute the Credit Score. This method works well when the underlying decision rule is simple and when the rule becomes complex, the accuracy of the model diminishes very fast. In this Research Paper, a combination of Decision Tree and Logistic Regression is used to determine the weights that are to be assigned to different characteristics of the borrower. Decision Tree is used at first level of analysis to narrow down the importance of Variables and overall weights that needs to be assigned. It is also used for optimum groupings of numeric and non-numeric Variables. At second level, Logistic Regression is used to compute odd ratios a variant of probability, which in turn is used to assign weights for an attribute and to individual levels in an attribute. This has been tested on real life data and found to work better compared to methods using a single stage models. An accuracy of around 80% in decision is obtained which is good for any modeling study as there is no model which gives 100% accuracy. The next Section explains the Methodology, Data Used and Results. SPSS Software has been used for Model Building and Data Analysis","PeriodicalId":91079,"journal":{"name":"GSTF international journal on computing","volume":"49 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2014-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"BI APPLICATION IN FINANCIAL SECTOR - CREDIT SCORING OF RETAIL LOANS USING A HYBRID MODELING APPROACH\",\"authors\":\"S. Chandrasekhar, B. Tech, .. M.Tec\",\"doi\":\"10.1037/e525192013-023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Retail Loans now-a-days form a major proportion of Loan Portfolio. Broadly they can be classified as (i) Loans for Small and medium Sector and (ii) Loans for Individuals. The objective of Credit Scoring is that we use enough of precaution before the sanction of the loan so that the loans do not go bad after disbursement. This will increase to the bottom line of the financial institution and also reduce the Credit Risk. Techniques used to perform Credit Scoring Varies for the above two classes of loans. In this paper, we concentrate on the application of Credit Scoring for individual or so called personal loans like – Auto loan, buying goods like Televisions, Refrigerators etc. Large numbers of loans are being disbursed in these areas. Though the size of the loan may be small, when compared to Small/Medium Scale Industry, if one does not control the defaults, the consequences will be disastrous. From the characteristics of borrower, product characteristics a Credit Score is computed for each applicant. If the Score exceeds a given threshold loan is sanctioned. If it is below the threshold, loan is sanctioned. If it is below the threshold, loan is rejected. In practice a buffer zone is created near the threshold so that those Credit Scores that fall in buffer zone, detailed investigation will be done before a decision is taken. Two broad classes of Scoring Model exists (i) Subjective Scoring and (ii) Statistical Scoring. Subjective Scoring is based on intuitive judgement. Subjective Scoring works but there is scope for improvement one limitation is prediction of risk is person dependent and focuses on few characteristics and may be mistakenly focusing on wrong characteristics. Statistical Scoring uses hardcore data of borrower characteristics, product characteristics and uses mathematical models to predict the risk. The relation is expressed in the form of an equation which finally gets converted to a score. Subjectivity will be reduced and variable(s) that are important to scoring are identified based on strong mathematical foundation. Different Models have been used in Credit Scoring like Regression, Decision Tree, Discriminate Analysis and Logistic  Regression. Most of the times, a single model is used to compute the Credit Score. This method works well when the underlying decision rule is simple and when the rule becomes complex, the accuracy of the model diminishes very fast. In this Research Paper, a combination of Decision Tree and Logistic Regression is used to determine the weights that are to be assigned to different characteristics of the borrower. Decision Tree is used at first level of analysis to narrow down the importance of Variables and overall weights that needs to be assigned. It is also used for optimum groupings of numeric and non-numeric Variables. At second level, Logistic Regression is used to compute odd ratios a variant of probability, which in turn is used to assign weights for an attribute and to individual levels in an attribute. This has been tested on real life data and found to work better compared to methods using a single stage models. An accuracy of around 80% in decision is obtained which is good for any modeling study as there is no model which gives 100% accuracy. The next Section explains the Methodology, Data Used and Results. SPSS Software has been used for Model Building and Data Analysis\",\"PeriodicalId\":91079,\"journal\":{\"name\":\"GSTF international journal on computing\",\"volume\":\"49 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GSTF international journal on computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1037/e525192013-023\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GSTF international journal on computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1037/e525192013-023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

如今,零售贷款在贷款组合中占很大比例。大体上可分为(i)中小型部门贷款和(ii)个人贷款。信用评分的目的是我们在贷款批准之前使用足够的预防措施,以便贷款在支付后不会变成坏账。这将提高金融机构的底线,也将降低信用风险。用于执行信用评分的技术对上述两类贷款有所不同。本文主要研究了信用评分在个人贷款中的应用,如汽车贷款、购买电视、冰箱等商品。这些领域正在发放大量贷款。虽然贷款规模可能很小,但与中小企业相比,如果不控制违约,后果将是灾难性的。根据借款人的特征,为每个申请人计算产品特征和信用评分。如果分数超过给定的阈值,则批准贷款。如果低于阈值,则批准贷款。如果低于阈值,则拒绝贷款。在实践中,在阈值附近创建一个缓冲区,以便那些落在缓冲区的信用评分,在做出决定之前将进行详细的调查。评分模型有两大类:(i)主观评分和(ii)统计评分。主观评分是基于直觉判断。主观评分是有效的,但也有改进的余地,一个限制是风险预测依赖于人,关注的特征很少,可能会错误地关注错误的特征。统计评分法利用借款人特征、产品特征等核心数据,运用数学模型对风险进行预测。这种关系以方程的形式表示,最终转化为分数。将减少主观性,并根据强大的数学基础确定对评分重要的变量。信用评分中常用的模型有回归分析、决策树分析、判别分析和逻辑回归等。大多数情况下,使用单一模型来计算信用评分。当底层决策规则较简单时,该方法效果良好,而当决策规则较复杂时,模型的准确性下降很快。本文采用决策树和逻辑回归相结合的方法来确定借款人不同特征的权重。决策树用于第一级分析,以缩小变量的重要性和需要分配的总体权重。它还用于数值和非数值变量的最佳分组。在第二级,逻辑回归用于计算奇数比(概率的一种变体),它又用于为属性和属性中的各个级别分配权重。这已经在现实生活数据上进行了测试,发现与使用单阶段模型的方法相比,效果更好。在决策中获得了80%左右的准确性,这对于任何建模研究都是有益的,因为没有模型可以给出100%的准确性。下一节解释了方法、使用的数据和结果。SPSS软件已用于模型构建和数据分析
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
BI APPLICATION IN FINANCIAL SECTOR - CREDIT SCORING OF RETAIL LOANS USING A HYBRID MODELING APPROACH
Retail Loans now-a-days form a major proportion of Loan Portfolio. Broadly they can be classified as (i) Loans for Small and medium Sector and (ii) Loans for Individuals. The objective of Credit Scoring is that we use enough of precaution before the sanction of the loan so that the loans do not go bad after disbursement. This will increase to the bottom line of the financial institution and also reduce the Credit Risk. Techniques used to perform Credit Scoring Varies for the above two classes of loans. In this paper, we concentrate on the application of Credit Scoring for individual or so called personal loans like – Auto loan, buying goods like Televisions, Refrigerators etc. Large numbers of loans are being disbursed in these areas. Though the size of the loan may be small, when compared to Small/Medium Scale Industry, if one does not control the defaults, the consequences will be disastrous. From the characteristics of borrower, product characteristics a Credit Score is computed for each applicant. If the Score exceeds a given threshold loan is sanctioned. If it is below the threshold, loan is sanctioned. If it is below the threshold, loan is rejected. In practice a buffer zone is created near the threshold so that those Credit Scores that fall in buffer zone, detailed investigation will be done before a decision is taken. Two broad classes of Scoring Model exists (i) Subjective Scoring and (ii) Statistical Scoring. Subjective Scoring is based on intuitive judgement. Subjective Scoring works but there is scope for improvement one limitation is prediction of risk is person dependent and focuses on few characteristics and may be mistakenly focusing on wrong characteristics. Statistical Scoring uses hardcore data of borrower characteristics, product characteristics and uses mathematical models to predict the risk. The relation is expressed in the form of an equation which finally gets converted to a score. Subjectivity will be reduced and variable(s) that are important to scoring are identified based on strong mathematical foundation. Different Models have been used in Credit Scoring like Regression, Decision Tree, Discriminate Analysis and Logistic  Regression. Most of the times, a single model is used to compute the Credit Score. This method works well when the underlying decision rule is simple and when the rule becomes complex, the accuracy of the model diminishes very fast. In this Research Paper, a combination of Decision Tree and Logistic Regression is used to determine the weights that are to be assigned to different characteristics of the borrower. Decision Tree is used at first level of analysis to narrow down the importance of Variables and overall weights that needs to be assigned. It is also used for optimum groupings of numeric and non-numeric Variables. At second level, Logistic Regression is used to compute odd ratios a variant of probability, which in turn is used to assign weights for an attribute and to individual levels in an attribute. This has been tested on real life data and found to work better compared to methods using a single stage models. An accuracy of around 80% in decision is obtained which is good for any modeling study as there is no model which gives 100% accuracy. The next Section explains the Methodology, Data Used and Results. SPSS Software has been used for Model Building and Data Analysis
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Cognitive Computing supported Medical Decision Support System for Patient’s Driving Assessment Propaganda Barometer : A Supportive Tool to Improve Media Literacy Towards Building a Critically Thinking Society A framework for the adoption of bring your own device (BYOD) in the hospital environment On developing adaptive vocabulary learning game for children with an early language delay Stroke Cognitive Medical Assistant (StrokeCMA)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1