{"title":"Constructing a Fraud Risk Early Warning Model for Chinese Listed Companies Driven by Heterogeneous Data","authors":"Yunbai Chen","doi":"10.61935/aedmr.2.1.2024.p122","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a model for early warning of fraud risk among Chinese listed companies, leveraging the capabilities of heterogeneous data analysis. Given the complexity and multidimensionality of corporate fraud, our model integrates multiple data sources including financial reports, market behavior, and the sentiment of annual report texts, representing diverse sets of heterogeneous data. This approach employs advanced data processing techniques to handle and amalgamate heterogeneous data, ensuring robustness and accuracy. Utilizing machine learning algorithms, the model not only detects potential fraud signals but also quantifies the level of risk, providing stakeholders with a dynamic predictive tool. This research offers a comprehensive data-driven approach to fraud detection in the corporate sector, underscoring the importance of meticulous risk assessment using various data streams. It marks a critical step in proactive fraud management in an increasingly complex financial environment.","PeriodicalId":502155,"journal":{"name":"Advances in Economic Development and Management Research","volume":" 6","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Economic Development and Management Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.61935/aedmr.2.1.2024.p122","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose a model for early warning of fraud risk among Chinese listed companies, leveraging the capabilities of heterogeneous data analysis. Given the complexity and multidimensionality of corporate fraud, our model integrates multiple data sources including financial reports, market behavior, and the sentiment of annual report texts, representing diverse sets of heterogeneous data. This approach employs advanced data processing techniques to handle and amalgamate heterogeneous data, ensuring robustness and accuracy. Utilizing machine learning algorithms, the model not only detects potential fraud signals but also quantifies the level of risk, providing stakeholders with a dynamic predictive tool. This research offers a comprehensive data-driven approach to fraud detection in the corporate sector, underscoring the importance of meticulous risk assessment using various data streams. It marks a critical step in proactive fraud management in an increasingly complex financial environment.