基于树的模型:信用违约分类的比较与解释研究

Narayana Darapaneni, Pramod Srinivas, K. Reddy, A. Paduri, Lakshmikanth Kanugovi, Pavithra J, S. B. G., B. S
{"title":"基于树的模型:信用违约分类的比较与解释研究","authors":"Narayana Darapaneni, Pramod Srinivas, K. Reddy, A. Paduri, Lakshmikanth Kanugovi, Pavithra J, S. B. G., B. S","doi":"10.1109/UPCON56432.2022.9986411","DOIUrl":null,"url":null,"abstract":"Most of the real-world problems can be solved with the help of classification. Some problems that might feel to be regression can also be solved using classification by binning the values. But due to the presence of a large number of classification algorithms, it becomes difficult to choose one particular model. SVC, KNN, and Naive Bayes are some of the traditional models available which do the task quite efficiently, but when it comes to the explainability of the outcome these models fail to address them. This is when the real use of tree-based models comes in handy as we can visualize the entire flow of conditions that leads to a particular outcome. In this paper, we have tried to compare the performance and explainability of the Decision tree, Random Forest, AdaBoost, GBoost, and XGBoost for this we have considered the credit default dataset from Kaggle.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Tree Based Models: A Comparative And Explainable Study For Credit Default Classification\",\"authors\":\"Narayana Darapaneni, Pramod Srinivas, K. Reddy, A. Paduri, Lakshmikanth Kanugovi, Pavithra J, S. B. G., B. S\",\"doi\":\"10.1109/UPCON56432.2022.9986411\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Most of the real-world problems can be solved with the help of classification. Some problems that might feel to be regression can also be solved using classification by binning the values. But due to the presence of a large number of classification algorithms, it becomes difficult to choose one particular model. SVC, KNN, and Naive Bayes are some of the traditional models available which do the task quite efficiently, but when it comes to the explainability of the outcome these models fail to address them. This is when the real use of tree-based models comes in handy as we can visualize the entire flow of conditions that leads to a particular outcome. In this paper, we have tried to compare the performance and explainability of the Decision tree, Random Forest, AdaBoost, GBoost, and XGBoost for this we have considered the credit default dataset from Kaggle.\",\"PeriodicalId\":185782,\"journal\":{\"name\":\"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/UPCON56432.2022.9986411\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON56432.2022.9986411","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在分类的帮助下,大多数现实世界的问题都可以得到解决。一些看起来像是回归的问题也可以通过对值进行分类来解决。但是由于存在大量的分类算法,很难选择一个特定的模型。SVC、KNN和朴素贝叶斯是一些传统的模型,它们可以相当有效地完成任务,但是当涉及到结果的可解释性时,这些模型无法解决它们。这是真正使用基于树的模型派上用场的时候,因为我们可以可视化导致特定结果的整个条件流。在本文中,我们试图比较决策树、随机森林、AdaBoost、GBoost和XGBoost的性能和可解释性,为此我们考虑了来自Kaggle的信用违约数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Tree Based Models: A Comparative And Explainable Study For Credit Default Classification
Most of the real-world problems can be solved with the help of classification. Some problems that might feel to be regression can also be solved using classification by binning the values. But due to the presence of a large number of classification algorithms, it becomes difficult to choose one particular model. SVC, KNN, and Naive Bayes are some of the traditional models available which do the task quite efficiently, but when it comes to the explainability of the outcome these models fail to address them. This is when the real use of tree-based models comes in handy as we can visualize the entire flow of conditions that leads to a particular outcome. In this paper, we have tried to compare the performance and explainability of the Decision tree, Random Forest, AdaBoost, GBoost, and XGBoost for this we have considered the credit default dataset from Kaggle.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Mains Interface Circuit Design for Traveling Wave Tube Amplifier A Passive Technique for Detecting Islanding Using Voltage Sequence Component A Unified Framework for Covariance Adaptation with Multiple Source Domains Advance Sensor for Monitoring Electrolyte Leakage in Lithium-ion Batteries for Electric Vehicles A comparative study of survey papers based on energy efficient, coverage-aware, and fault tolerant in static sink node of WSN
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1