Jingling Ma , Junqiao Gong , Gang Wang , Xuan Zhang
{"title":"A multi-level sparse attentive fusion network integrating hard and soft information for firm-level loan default prediction","authors":"Jingling Ma , Junqiao Gong , Gang Wang , Xuan Zhang","doi":"10.1016/j.elerap.2025.101479","DOIUrl":null,"url":null,"abstract":"<div><div>Firm-level loan default prediction (FLDP) deserves much attention from both academic and industry. Even a small improvement in the accuracy of FLDP could lead to significant savings by reducing credit risk. While previous studies have utilized deep learning models for FLDP task, they failed to well handle the intra-type ambiguity and inter-type interaction simultaneously facing with combined hard and soft information, thus remaining an area of ongoing development. By this perspective, we seek to design a novel Multi-level Sparse Attention (MLSA) based deep learning fusion framework for FLDP, aiming to fully capture default signals conveyed from both hard and soft information. First, multiple types of information are extracted grounded in 5P theory and LAPP theory, ensuring the sufficiency and rationality of the features. Second, Sparse Attentive MLP (SA-MLP) and Sparse Attentive GRU (SA-GRU) module are proposed to handle the intra-type ambiguity embedded in hard and soft information separately. Further, the Attentive Fusion (AF) module including Differential Enhancive module and Common Selective module is proposed to explore inter-type interaction among hard and soft information. Last, we adopt the focal loss function to mitigate the adverse effects of imbalanced data. The proposed MLSA informs future FLDP research about how to fully exploit the value of hard and soft information by considering their intra-type ambiguity and inter-type interaction. Empirical evaluation of the MLSA on a real-world dataset demonstrates its outperformance of state-of-the-art benchmarks in the FLDP task. Our results also contribute to the growing literature on this topic by highlighting the roles of hard and soft information and improving interpretability.</div></div>","PeriodicalId":50541,"journal":{"name":"Electronic Commerce Research and Applications","volume":"70 ","pages":"Article 101479"},"PeriodicalIF":5.9000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronic Commerce Research and Applications","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1567422325000043","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0
Abstract
Firm-level loan default prediction (FLDP) deserves much attention from both academic and industry. Even a small improvement in the accuracy of FLDP could lead to significant savings by reducing credit risk. While previous studies have utilized deep learning models for FLDP task, they failed to well handle the intra-type ambiguity and inter-type interaction simultaneously facing with combined hard and soft information, thus remaining an area of ongoing development. By this perspective, we seek to design a novel Multi-level Sparse Attention (MLSA) based deep learning fusion framework for FLDP, aiming to fully capture default signals conveyed from both hard and soft information. First, multiple types of information are extracted grounded in 5P theory and LAPP theory, ensuring the sufficiency and rationality of the features. Second, Sparse Attentive MLP (SA-MLP) and Sparse Attentive GRU (SA-GRU) module are proposed to handle the intra-type ambiguity embedded in hard and soft information separately. Further, the Attentive Fusion (AF) module including Differential Enhancive module and Common Selective module is proposed to explore inter-type interaction among hard and soft information. Last, we adopt the focal loss function to mitigate the adverse effects of imbalanced data. The proposed MLSA informs future FLDP research about how to fully exploit the value of hard and soft information by considering their intra-type ambiguity and inter-type interaction. Empirical evaluation of the MLSA on a real-world dataset demonstrates its outperformance of state-of-the-art benchmarks in the FLDP task. Our results also contribute to the growing literature on this topic by highlighting the roles of hard and soft information and improving interpretability.
期刊介绍:
Electronic Commerce Research and Applications aims to create and disseminate enduring knowledge for the fast-changing e-commerce environment. A major dilemma in e-commerce research is how to achieve a balance between the currency and the life span of knowledge.
Electronic Commerce Research and Applications will contribute to the establishment of a research community to create the knowledge, technology, theory, and applications for the development of electronic commerce. This is targeted at the intersection of technological potential and business aims.