使用 ETL 流程和机器学习的自动信用评估框架。

IF 1.1 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING Innovations in Systems and Software Engineering Pub Date : 2022-12-31 DOI:10.1007/s11334-022-00522-x
Neepa Biswas, Anindita Sarkar Mondal, Ari Kusumastuti, Swati Saha, Kartick Chandra Mondal
{"title":"使用 ETL 流程和机器学习的自动信用评估框架。","authors":"Neepa Biswas, Anindita Sarkar Mondal, Ari Kusumastuti, Swati Saha, Kartick Chandra Mondal","doi":"10.1007/s11334-022-00522-x","DOIUrl":null,"url":null,"abstract":"<p><p>In the current business scenario, real-time analysis of enterprise data through Business Intelligence (BI) is crucial for supporting operational activities and taking any strategic decision. The automated ETL (extraction, transformation, and load) process ensures data ingestion into the data warehouse in near real-time, and insights are generated through the BI process based on real-time data. In this paper, we have concentrated on automated credit risk assessment in the financial domain based on the machine learning approach. The machine learning-based classification techniques can furnish a self-regulating process to categorize data. Establishing an automated credit decision-making system helps the lending institution to manage the risks, increase operational efficiency and comply with regulators. In this paper, an empirical approach is taken for credit risk assessment using logistic regression and neural network classification method in compliance with Basel II standards. Here, Basel II standards are adopted to calculate the expected loss. The required data integration for building machine learning models is done through an automated ETL process. We have concluded this research work by evaluating this new methodology for credit risk assessment.</p>","PeriodicalId":44465,"journal":{"name":"Innovations in Systems and Software Engineering","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2022-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803598/pdf/","citationCount":"0","resultStr":"{\"title\":\"Automated credit assessment framework using ETL process and machine learning.\",\"authors\":\"Neepa Biswas, Anindita Sarkar Mondal, Ari Kusumastuti, Swati Saha, Kartick Chandra Mondal\",\"doi\":\"10.1007/s11334-022-00522-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In the current business scenario, real-time analysis of enterprise data through Business Intelligence (BI) is crucial for supporting operational activities and taking any strategic decision. The automated ETL (extraction, transformation, and load) process ensures data ingestion into the data warehouse in near real-time, and insights are generated through the BI process based on real-time data. In this paper, we have concentrated on automated credit risk assessment in the financial domain based on the machine learning approach. The machine learning-based classification techniques can furnish a self-regulating process to categorize data. Establishing an automated credit decision-making system helps the lending institution to manage the risks, increase operational efficiency and comply with regulators. In this paper, an empirical approach is taken for credit risk assessment using logistic regression and neural network classification method in compliance with Basel II standards. Here, Basel II standards are adopted to calculate the expected loss. The required data integration for building machine learning models is done through an automated ETL process. We have concluded this research work by evaluating this new methodology for credit risk assessment.</p>\",\"PeriodicalId\":44465,\"journal\":{\"name\":\"Innovations in Systems and Software Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2022-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803598/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Innovations in Systems and Software Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s11334-022-00522-x\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Innovations in Systems and Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11334-022-00522-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

在当前的商业环境下,通过商业智能(BI)对企业数据进行实时分析,对于支持运营活动和做出任何战略决策都至关重要。自动化 ETL(抽取、转换和加载)流程可确保近乎实时地将数据摄入数据仓库,并通过基于实时数据的 BI 流程产生洞察力。在本文中,我们重点讨论了基于机器学习方法的金融领域信用风险自动评估。基于机器学习的分类技术可以为数据分类提供一个自我调节的过程。建立自动化信贷决策系统有助于贷款机构管理风险、提高运营效率并符合监管机构的要求。本文根据《巴塞尔 II 新资本协议》的标准,采用逻辑回归和神经网络分类方法进行信用风险评估。本文采用《巴塞尔 II 新资本协议》标准来计算预期损失。建立机器学习模型所需的数据整合是通过自动 ETL 流程完成的。我们通过评估这一新的信用风险评估方法,结束了这项研究工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Automated credit assessment framework using ETL process and machine learning.

In the current business scenario, real-time analysis of enterprise data through Business Intelligence (BI) is crucial for supporting operational activities and taking any strategic decision. The automated ETL (extraction, transformation, and load) process ensures data ingestion into the data warehouse in near real-time, and insights are generated through the BI process based on real-time data. In this paper, we have concentrated on automated credit risk assessment in the financial domain based on the machine learning approach. The machine learning-based classification techniques can furnish a self-regulating process to categorize data. Establishing an automated credit decision-making system helps the lending institution to manage the risks, increase operational efficiency and comply with regulators. In this paper, an empirical approach is taken for credit risk assessment using logistic regression and neural network classification method in compliance with Basel II standards. Here, Basel II standards are adopted to calculate the expected loss. The required data integration for building machine learning models is done through an automated ETL process. We have concluded this research work by evaluating this new methodology for credit risk assessment.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Innovations in Systems and Software Engineering
Innovations in Systems and Software Engineering COMPUTER SCIENCE, SOFTWARE ENGINEERING-
CiteScore
3.80
自引率
8.30%
发文量
75
期刊介绍: Innovations in Systems and Software Engineering: A NASA Journal addresses issues and innovations in Systems Engineering, Systems Integration, Software Engineering, Software Development and other related areas that are specifically of interest to NASA. The journal includes peer-reviewed world-class technical papers on topics of research, development and practice related to NASA''s missions and projects, topics of interest to NASA for future use, and topics describing problem areas for NASA together with potential solutions. Papers that do not address issues related to NASA are of course very welcome, provided that they address topics that NASA might like to consider for the future. Papers are solicited from NASA and government employees, contractors, NASA-supported academic and industrial partners, and non-NASA-supported academics and industrialists both in the USA and worldwide. The journal includes updates on NASA innovations, articles on NASA initiatives, papers looking at educational activities, and a State-of-the-Art section that gives an overview of specific topic areas in a comprehensive format written by an expert in the field.
期刊最新文献
A novel self-attention guided deep neural network for bruise segmentation using infrared imaging Why the use of domain-specific modeling in airworthy software requires new methods and how these might look like? (extended version) Detecting mistakes in a domain model: a comparison of three approaches Transitioning towards SysML v2 as a variability modeling language IT2F-SEDNN: an interval type-2 fuzzy logic-based stacked ensemble deep learning approach for early phase software dependability analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1