首页 > 最新文献

Intelligent Systems in Accounting, Finance and Management最新文献

英文 中文
Pay for Performance: A Comparative Analysis of Machine Learning Models for CEO Compensation Prediction 绩效薪酬:CEO薪酬预测的机器学习模型比较分析
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-12-02 DOI: 10.1002/isaf.70023
Mahfuja Malik, Eman G. Abdelfattah

The objective of our study is to demonstrate the feasibility of predicting chief executive officer (CEO) compensation by exploring various machine learning methods. In our analysis, we examine six models: k-nearest neighbors, random forest, decision tree, extra trees, extreme gradient boosting, and support vector machines regressors. We find that XGBoost, random forest, and extra trees regressors exhibit the highest predictive power with the lowest error. Decision tree feature importance analyses identify firm size, CEO age, tangibility, cash holding, and Tobin's Q as key factors in predicting CEO compensation. We also conduct ordinary least squares regressions and find that the significance levels of the coefficients are comparable to the feature importances from the machine learning analysis. Our feature-grouping analysis shows that firms' financial performance and economic characteristics play the most significant role in determining CEO compensation, followed by board characteristics. The analysis further indicates that the predictive power of random forest and extra trees is stronger for forecasting next year's compensation than for predicting the current year's. These findings are valuable for compensation consultants and stakeholders involved in benchmarking decisions.

我们研究的目的是通过探索各种机器学习方法来证明预测首席执行官(CEO)薪酬的可行性。在我们的分析中,我们检查了六个模型:k近邻、随机森林、决策树、额外树、极端梯度增强和支持向量机回归。我们发现XGBoost、随机森林和额外树回归器具有最高的预测能力和最低的误差。决策树特征重要性分析发现,公司规模、CEO年龄、有形资产、现金持有量和托宾Q值是预测CEO薪酬的关键因素。我们还进行了普通最小二乘回归,发现系数的显著性水平与机器学习分析的特征重要度相当。我们的特征分组分析表明,公司的财务绩效和经济特征在决定CEO薪酬方面发挥了最重要的作用,其次是董事会特征。进一步分析表明,随机森林和额外树对来年补偿的预测能力强于对当年补偿的预测能力。这些发现对参与基准决策的薪酬顾问和利益相关者很有价值。
{"title":"Pay for Performance: A Comparative Analysis of Machine Learning Models for CEO Compensation Prediction","authors":"Mahfuja Malik,&nbsp;Eman G. Abdelfattah","doi":"10.1002/isaf.70023","DOIUrl":"https://doi.org/10.1002/isaf.70023","url":null,"abstract":"<div>\u0000 \u0000 <p>The objective of our study is to demonstrate the feasibility of predicting chief executive officer (CEO) compensation by exploring various machine learning methods. In our analysis, we examine six models: <i>k</i>-nearest neighbors, random forest, decision tree, extra trees, extreme gradient boosting, and support vector machines regressors. We find that XGBoost, random forest, and extra trees regressors exhibit the highest predictive power with the lowest error. Decision tree feature importance analyses identify firm size, CEO age, tangibility, cash holding, and Tobin's Q as key factors in predicting CEO compensation. We also conduct ordinary least squares regressions and find that the significance levels of the coefficients are comparable to the feature importances from the machine learning analysis. Our feature-grouping analysis shows that firms' financial performance and economic characteristics play the most significant role in determining CEO compensation, followed by board characteristics. The analysis further indicates that the predictive power of random forest and extra trees is stronger for forecasting next year's compensation than for predicting the current year's. These findings are valuable for compensation consultants and stakeholders involved in benchmarking decisions.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 4","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145695077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Financial Statement Fraud Detection via Large Language Models 基于大型语言模型的财务报表舞弊检测
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-11-30 DOI: 10.1002/isaf.70021
Zehra Erva Ergun, Emre Sefer

With the widespread adoption of Internet-based AI technologies, addressing financial fraud has become increasingly critical, particularly within the realm of machine learning. In this case, deep learning and natural language processing (NLP) techniques offer powerful means of detecting fraudulent activity by analyzing financial documents, thereby enhancing both the efficiency and precision of such assessments and supporting financial security. In this study, we introduce deep representation learning-based approaches relying mainly on large language models (LLMs) for identifying fraud in financial statements by examining temporal changes in the Management Discussion and Analysis (MD&A) sections of corporate disclosures. Departing from conventional techniques that rely only on word frequency analysis, we propose DeepFraud that combines time-evolving financial LLM embeddings, such as FinBERT, FinLlama, and FinGPT embeddings, of paragraphs and uses long short-term memory (LSTM) to predict frauds via historical textual embeddings. In addition to LLM embeddings, we also integrate (1) time-evolving word frequencies of words relevant to fraud detection, such as those expressing sentiment or uncertainty, and (2) time-evolving financial ratios. Trajectories of paragraph-level embeddings, frequencies, and ratios are used to construct a fraud detection model, which we evaluate against machine learning methods and deep time-series models. Using 30 years of financial report data (from 1995 to 2024), our experiments demonstrate that DeepFraud on average enhances fraud detection performance across a number of scenarios and on average outperforms the competing approaches as well as conventional word frequency approaches. Our framework introduces a novel direction for deep feature engineering in the field of financial statement fraud detection.

随着基于互联网的人工智能技术的广泛采用,解决金融欺诈问题变得越来越重要,尤其是在机器学习领域。在这种情况下,深度学习和自然语言处理(NLP)技术提供了通过分析财务文件来检测欺诈活动的强大手段,从而提高了此类评估的效率和准确性,并支持金融安全。在本研究中,我们引入了基于深度表征学习的方法,主要依赖于大型语言模型(llm),通过检查公司披露的管理讨论和分析(MD&;A)部分的时间变化来识别财务报表中的欺诈行为。与仅依赖词频分析的传统技术不同,我们提出了深度欺诈,该技术结合了段落的随时间变化的金融LLM嵌入,如FinBERT、FinLlama和FinGPT嵌入,并使用长短期记忆(LSTM)通过历史文本嵌入来预测欺诈。除了LLM嵌入之外,我们还整合了(1)与欺诈检测相关的词的随时间变化的词频,例如那些表达情绪或不确定性的词,以及(2)随时间变化的财务比率。我们使用段落级嵌入、频率和比率的轨迹来构建欺诈检测模型,并对机器学习方法和深度时间序列模型进行评估。使用30年的财务报告数据(从1995年到2024年),我们的实验表明,DeepFraud在许多场景中平均提高了欺诈检测性能,并且平均优于竞争方法以及传统的词频方法。我们的框架为财务报表舞弊检测领域的深度特征工程引入了一个新的方向。
{"title":"Financial Statement Fraud Detection via Large Language Models","authors":"Zehra Erva Ergun,&nbsp;Emre Sefer","doi":"10.1002/isaf.70021","DOIUrl":"https://doi.org/10.1002/isaf.70021","url":null,"abstract":"<div>\u0000 \u0000 <p>With the widespread adoption of Internet-based AI technologies, addressing financial fraud has become increasingly critical, particularly within the realm of machine learning. In this case, deep learning and natural language processing (NLP) techniques offer powerful means of detecting fraudulent activity by analyzing financial documents, thereby enhancing both the efficiency and precision of such assessments and supporting financial security. In this study, we introduce deep representation learning-based approaches relying mainly on large language models (LLMs) for identifying fraud in financial statements by examining temporal changes in the Management Discussion and Analysis (MD&amp;A) sections of corporate disclosures. Departing from conventional techniques that rely only on word frequency analysis, we propose D<span>eep</span>F<span>raud</span> that combines time-evolving financial LLM embeddings, such as FinBERT, FinLlama, and FinGPT embeddings, of paragraphs and uses long short-term memory (LSTM) to predict frauds via historical textual embeddings. In addition to LLM embeddings, we also integrate (1) time-evolving word frequencies of words relevant to fraud detection, such as those expressing sentiment or uncertainty, and (2) time-evolving financial ratios. Trajectories of paragraph-level embeddings, frequencies, and ratios are used to construct a fraud detection model, which we evaluate against machine learning methods and deep time-series models. Using 30 years of financial report data (from 1995 to 2024), our experiments demonstrate that D<span>eep</span>F<span>raud</span> on average enhances fraud detection performance across a number of scenarios and on average outperforms the competing approaches as well as conventional word frequency approaches. Our framework introduces a novel direction for deep feature engineering in the field of financial statement fraud detection.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 4","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145695568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying Going Concern Audit Opinions Using Supervised Machine Learning 使用监督式机器学习识别持续经营审计意见
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-10-14 DOI: 10.1002/isaf.70020
Dennis Hedback

This paper evaluates the use of supervised machine learning to automatically identify going concern–modified audit reports. Models based on two different classifiers—logistic regression and extreme gradient boosting—achieve strong classification performance for this task. The same classifiers, along with naïve Bayes, also demonstrate strong performance in the ancillary task of identifying audit report pages in financial reports. These results have practical implications, including the application of the presented methods for timely accounting information retrieval for users, automated peer comparison for auditors, or as a data extraction method for researchers, particularly in settings with limited audit data availability.

本文评估了使用监督机器学习来自动识别持续经营修改的审计报告。基于两种不同分类器的模型-逻辑回归和极端梯度提升-在该任务中实现了较强的分类性能。同样的分类器,连同naïve Bayes,在识别财务报告中的审计报告页面的辅助任务中也表现出了很强的性能。这些结果具有实际意义,包括应用所提出的方法为用户及时检索会计信息,为审计员进行自动同行比较,或作为研究人员的数据提取方法,特别是在审计数据可用性有限的情况下。
{"title":"Identifying Going Concern Audit Opinions Using Supervised Machine Learning","authors":"Dennis Hedback","doi":"10.1002/isaf.70020","DOIUrl":"https://doi.org/10.1002/isaf.70020","url":null,"abstract":"<p>This paper evaluates the use of supervised machine learning to automatically identify going concern–modified audit reports. Models based on two different classifiers—logistic regression and extreme gradient boosting—achieve strong classification performance for this task. The same classifiers, along with naïve Bayes, also demonstrate strong performance in the ancillary task of identifying audit report pages in financial reports. These results have practical implications, including the application of the presented methods for timely accounting information retrieval for users, automated peer comparison for auditors, or as a data extraction method for researchers, particularly in settings with limited audit data availability.</p>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 4","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/isaf.70020","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145316927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balanced Underbagged Ensemble Approach for Classifying Highly Imbalanced Datasets in the Insurance and Financial Sectors 保险和金融部门高度不平衡数据集分类的平衡欠袋集成方法
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-10-13 DOI: 10.1002/isaf.70018
Alberto Gutierrez-Gallego, Oscar Garnica, Daniel Parra, J. Manuel Velasco, J. Ignacio Hidalgo

Data bias is a critical challenge in machine learning applications within the financial and insurance sectors, as it can lead to misleading risk assessments and inaccurate predictive models. A prevalent source of bias in real-world datasets is the imbalanced distribution of classes, which is particularly problematic in fraud detection, credit risk assessment, and claim prediction. Traditional approaches to handling imbalanced data often rely on undersampling or oversampling techniques. However, these methods may generate unrealistic minority class samples or fail to perform effectively when dealing with extreme class imbalances. In this paper, we propose a configurable technique based on the underbagging method, integrated with a classifier for highly imbalanced datasets. Our approach is designed to enhance the predictive accuracy of the minority class while maintaining robust performance for the majority class. We incorporate our methodology into a classification ensemble framework and evaluate its effectiveness by comparing it against 100 combinations of 10 different oversampling and undersampling techniques applied to 10 different machine learning algorithms. The evaluation is conducted on two highly imbalanced real-world datasets: one related to auto insurance claims and another focused on credit card fraud detection. Our statistical analysis demonstrates that Balanced Underbagged Ensemble achieves superior classification performance in terms of recall for both classes, regardless of the base machine learning model used within the ensemble. Furthermore, our method finds an optimal balance between classification performance and computational efficiency.

数据偏差是金融和保险行业机器学习应用中的一个关键挑战,因为它可能导致误导性的风险评估和不准确的预测模型。在现实世界的数据集中,一个普遍的偏差来源是类别分布的不平衡,这在欺诈检测、信用风险评估和索赔预测中尤其有问题。处理不平衡数据的传统方法通常依赖于欠采样或过采样技术。然而,这些方法可能会产生不现实的少数阶级样本,或者在处理极端的阶级不平衡时不能有效地执行。在本文中,我们提出了一种基于underbagging方法的可配置技术,该技术与高度不平衡数据集的分类器相结合。我们的方法旨在提高少数类的预测准确性,同时保持多数类的稳健性能。我们将我们的方法纳入分类集成框架,并通过将其与应用于10种不同机器学习算法的10种不同过采样和欠采样技术的100种组合进行比较来评估其有效性。评估是在两个高度不平衡的真实世界数据集上进行的:一个与汽车保险索赔有关,另一个专注于信用卡欺诈检测。我们的统计分析表明,无论集成中使用的基本机器学习模型如何,平衡的Underbagged集成在两个类的召回率方面都取得了卓越的分类性能。此外,我们的方法在分类性能和计算效率之间找到了最佳平衡。
{"title":"Balanced Underbagged Ensemble Approach for Classifying Highly Imbalanced Datasets in the Insurance and Financial Sectors","authors":"Alberto Gutierrez-Gallego,&nbsp;Oscar Garnica,&nbsp;Daniel Parra,&nbsp;J. Manuel Velasco,&nbsp;J. Ignacio Hidalgo","doi":"10.1002/isaf.70018","DOIUrl":"https://doi.org/10.1002/isaf.70018","url":null,"abstract":"<p>Data bias is a critical challenge in machine learning applications within the financial and insurance sectors, as it can lead to misleading risk assessments and inaccurate predictive models. A prevalent source of bias in real-world datasets is the imbalanced distribution of classes, which is particularly problematic in fraud detection, credit risk assessment, and claim prediction. Traditional approaches to handling imbalanced data often rely on undersampling or oversampling techniques. However, these methods may generate unrealistic minority class samples or fail to perform effectively when dealing with extreme class imbalances. In this paper, we propose a configurable technique based on the underbagging method, integrated with a classifier for highly imbalanced datasets. Our approach is designed to enhance the predictive accuracy of the minority class while maintaining robust performance for the majority class. We incorporate our methodology into a classification ensemble framework and evaluate its effectiveness by comparing it against 100 combinations of 10 different oversampling and undersampling techniques applied to 10 different machine learning algorithms. The evaluation is conducted on two highly imbalanced real-world datasets: one related to auto insurance claims and another focused on credit card fraud detection. Our statistical analysis demonstrates that Balanced Underbagged Ensemble achieves superior classification performance in terms of recall for both classes, regardless of the base machine learning model used within the ensemble. Furthermore, our method finds an optimal balance between classification performance and computational efficiency.</p>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 4","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/isaf.70018","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145272973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lost in the Modeling Stage: A Comparative Analysis of Machine Learning Models for Real Estate Data 迷失在建模阶段:房地产数据机器学习模型的比较分析
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-10-12 DOI: 10.1002/isaf.70019
Ian Lenaers, Lieven De Moor

Machine learning dominates automated property valuation, yet comprehensive comparisons of predictive models remain scarce. This study compares 28 rent prediction models using 79,735 Belgian residential rental properties from 2022. Predictive performance is evaluated with traditional and alternative metrics for train data, test data, and across deciles. The results confirm that tree-based ensemble models outperform others, with stacking and averaging yielding superior results at a higher computational cost. Furthermore, middle-range rents show better predictive accuracy than extremes. Traditional and alternative metrics provide consistent findings. These insights aid real estate stakeholders seeking to enhance their expert systems for real estate price modeling.

机器学习主导着自动房地产估值,但预测模型的全面比较仍然很少。这项研究比较了28个租金预测模型,使用了比利时从2022年开始的79,735个住宅租赁物业。预测性能是用传统的和替代的训练数据、测试数据和跨十分位数的指标来评估的。结果证实,基于树的集成模型优于其他模型,在更高的计算成本下,堆叠和平均产生更好的结果。此外,中档租金的预测准确度高于极值租金。传统和替代度量提供一致的结果。这些见解有助于房地产利益相关者寻求增强他们的房地产价格建模专家系统。
{"title":"Lost in the Modeling Stage: A Comparative Analysis of Machine Learning Models for Real Estate Data","authors":"Ian Lenaers,&nbsp;Lieven De Moor","doi":"10.1002/isaf.70019","DOIUrl":"https://doi.org/10.1002/isaf.70019","url":null,"abstract":"<div>\u0000 \u0000 <p>Machine learning dominates automated property valuation, yet comprehensive comparisons of predictive models remain scarce. This study compares 28 rent prediction models using 79,735 Belgian residential rental properties from 2022. Predictive performance is evaluated with traditional and alternative metrics for train data, test data, and across deciles. The results confirm that tree-based ensemble models outperform others, with stacking and averaging yielding superior results at a higher computational cost. Furthermore, middle-range rents show better predictive accuracy than extremes. Traditional and alternative metrics provide consistent findings. These insights aid real estate stakeholders seeking to enhance their expert systems for real estate price modeling.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 4","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145272747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding Decision-Making to Tackle Complexity in Open Innovation Labs in Government 理解决策以解决政府开放式创新实验室的复杂性
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-09-13 DOI: 10.1002/isaf.70017
Ben De Coninck, Stijn Viaene, Jan Leysen

This article examines the decision-making processes in open innovation labs (OI-labs) in government. Through a qualitative single case study, we explore how the use of causal and effectual reasoning, as dichotomous logics, evolves over time and is manifested in the form of organizational practices to tackle temporal, relational, and cultural complexity. The findings reveal three episodes: the conceptualizing of the lab (predominantly causation), the building of the lab (predominantly effectuation), and the sustaining of the lab (hybrid causation–effectuation). Moreover, shifts in the logic are aimed at addressing different types of complexity, and over time, a hybrid logic emerges.

本文考察了政府开放式创新实验室的决策过程。通过定性的单一案例研究,我们探索因果推理和有效推理的使用,作为二分逻辑,如何随着时间的推移而演变,并以组织实践的形式表现出来,以解决时间、关系和文化的复杂性。研究结果揭示了三个阶段:实验室的概念化(主要是因果关系)、实验室的建设(主要是效果)和实验室的维持(混合因果关系)。此外,逻辑中的转换旨在处理不同类型的复杂性,并且随着时间的推移,出现了混合逻辑。
{"title":"Understanding Decision-Making to Tackle Complexity in Open Innovation Labs in Government","authors":"Ben De Coninck,&nbsp;Stijn Viaene,&nbsp;Jan Leysen","doi":"10.1002/isaf.70017","DOIUrl":"https://doi.org/10.1002/isaf.70017","url":null,"abstract":"<div>\u0000 \u0000 <p>This article examines the decision-making processes in open innovation labs (OI-labs) in government. Through a qualitative single case study, we explore how the use of causal and effectual reasoning, as dichotomous logics, evolves over time and is manifested in the form of organizational practices to tackle temporal, relational, and cultural complexity. The findings reveal three episodes: the conceptualizing of the lab (predominantly causation), the building of the lab (predominantly effectuation), and the sustaining of the lab (hybrid causation–effectuation). Moreover, shifts in the logic are aimed at addressing different types of complexity, and over time, a hybrid logic emerges.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145038289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FinSentiment: Predicting Financial Sentiment Through Transfer Learning 金融情绪:通过迁移学习预测金融情绪
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-09-11 DOI: 10.1002/isaf.70015
Zehra Erva Ergun, Emre Sefer

There is an increasing interest in financial text mining tasks. Significant progress has been made by using deep learning-based models on a generic corpus, which also shows reasonable results on financial text mining tasks such as financial sentiment analysis. However, financial sentiment analysis is still demanding work because of the insufficiency of labeled data for the financial domain and its specialized language. General-purpose deep learning methods are not as effective mainly due to specialized language used in the financial context. In this study, we focus on enhancing the performance of financial text mining tasks by improving the existing pretrained language models via NLP transfer learning. Pretrained language models demand a small quantity of labeled samples, and they could be enhanced to a greater extent by training them on domain-specific corpora instead. We propose an enhanced model FinSentiment, which incorporates enhanced versions of a number of recently proposed pretrained models, such as BERT, XLNet, RoBERTa, GPT, Llama, and T5, to better perform across NLP tasks in financial domain by training these models on financial domain corpora. The corresponding finance-specific models in FinSentiment are called Fin-BERT, Fin-XLNet, Fin-RoBERTa, Fin-GPT, Fin-Llama, and Fin-T5, respectively. We also propose variants of these models jointly trained over financial domain and general corpora. Our finance-specific FinSentiment models, in general, show the best performance across three financial sentiment analysis datasets, even when only a subpart of these models is fine-tuned with a smaller training set. Our results exhibit enhancement for each tested performance criteria on the existing results for these datasets. Extensive experimental results demonstrate the effectiveness and robustness of especially RoBERTa pretrained on financial corpora. Overall, we show that NLP transfer learning techniques are favorable solutions to financial sentiment analysis tasks. Our source code has been deposited at https://github.com/seferlab/finsentiment.

人们对金融文本挖掘任务越来越感兴趣。在通用语料库上使用基于深度学习的模型取得了重大进展,在金融文本挖掘任务(如金融情绪分析)上也显示出合理的结果。然而,由于金融领域的标记数据及其专业语言的不足,金融情绪分析仍然是一项艰巨的工作。通用的深度学习方法不那么有效,主要是由于在金融环境中使用了专门的语言。在本研究中,我们着重于通过NLP迁移学习改进现有的预训练语言模型,从而提高金融文本挖掘任务的性能。预训练的语言模型需要少量的标记样本,并且可以通过在特定领域的语料库上训练来更大程度地增强它们。我们提出了一个增强模型FinSentiment,它结合了许多最近提出的预训练模型(如BERT、XLNet、RoBERTa、GPT、Llama和T5)的增强版本,通过在金融领域语料库上训练这些模型来更好地执行金融领域的NLP任务。FinSentiment中对应的金融特定模型分别为Fin-BERT、Fin-XLNet、Fin-RoBERTa、Fin-GPT、Fin-Llama和Fin-T5。我们还提出了在金融领域和一般语料库上联合训练的这些模型的变体。一般来说,我们的金融特定FinSentiment模型在三个金融情绪分析数据集上表现最佳,即使这些模型只有一小部分使用较小的训练集进行微调。我们的结果显示,在这些数据集的现有结果上,每个测试的性能标准都有增强。大量的实验结果证明了该方法的有效性和鲁棒性,特别是RoBERTa在金融语料库上的预训练。总体而言,我们表明NLP迁移学习技术是金融情绪分析任务的有利解决方案。我们的源代码已存放在https://github.com/seferlab/finsentiment。
{"title":"FinSentiment: Predicting Financial Sentiment Through Transfer Learning","authors":"Zehra Erva Ergun,&nbsp;Emre Sefer","doi":"10.1002/isaf.70015","DOIUrl":"https://doi.org/10.1002/isaf.70015","url":null,"abstract":"<div>\u0000 \u0000 <p>There is an increasing interest in financial text mining tasks. Significant progress has been made by using deep learning-based models on a generic corpus, which also shows reasonable results on financial text mining tasks such as financial sentiment analysis. However, financial sentiment analysis is still demanding work because of the insufficiency of labeled data for the financial domain and its specialized language. General-purpose deep learning methods are not as effective mainly due to specialized language used in the financial context. In this study, we focus on enhancing the performance of financial text mining tasks by improving the existing pretrained language models via NLP transfer learning. Pretrained language models demand a small quantity of labeled samples, and they could be enhanced to a greater extent by training them on domain-specific corpora instead. We propose an enhanced model FinSentiment, which incorporates enhanced versions of a number of recently proposed pretrained models, such as BERT, XLNet, RoBERTa, GPT, Llama, and T5, to better perform across NLP tasks in financial domain by training these models on financial domain corpora. The corresponding finance-specific models in FinSentiment are called Fin-BERT, Fin-XLNet, Fin-RoBERTa, Fin-GPT, Fin-Llama, and Fin-T5, respectively. We also propose variants of these models jointly trained over financial domain and general corpora. Our finance-specific FinSentiment models, in general, show the best performance across three financial sentiment analysis datasets, even when only a subpart of these models is fine-tuned with a smaller training set. Our results exhibit enhancement for each tested performance criteria on the existing results for these datasets. Extensive experimental results demonstrate the effectiveness and robustness of especially RoBERTa pretrained on financial corpora. Overall, we show that NLP transfer learning techniques are favorable solutions to financial sentiment analysis tasks. Our source code has been deposited at https://github.com/seferlab/finsentiment.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145037646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Manual Journal Entry Testing: Integrating Natural Language Processing and Deep Learning 手动日志条目测试:整合自然语言处理和深度学习
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-09-03 DOI: 10.1002/isaf.70016
Qing Huang, Huijue Kelly Duan, Miklos A. Vasarhelyi

This paper presents an innovative approach to comprehensively and systematically evaluate manual journal entries (MJEs) and enhance the control procedures in auditing. The proposed approach combines quantitative and qualitative information to develop various Key Risk Indicators (KRIs) that provide insights into potential risks associated with MJEs. The approach incorporates textual analytics into traditional quantitative measures. Using the data obtained from a multinational company, the application of the proposed testing approach demonstrates its effectiveness in identifying potential high-risk MJEs and improving the company's journal entry testing and monitoring procedures. The findings contribute to current audit practices by offering a more efficient and comprehensive method for evaluating MJEs.

本文提出了一种创新的方法来全面和系统地评估手工日记账(MJEs),并加强审计中的控制程序。建议的方法结合定量和定性信息来开发各种关键风险指标(KRIs),提供与MJEs相关的潜在风险的见解。该方法将文本分析结合到传统的定量测量中。使用从一家跨国公司获得的数据,所提出的测试方法的应用证明了其在识别潜在高风险MJEs和改进公司日记账测试和监控程序方面的有效性。这些发现为当前的审计实践提供了一种更有效、更全面的评估MJEs的方法。
{"title":"Manual Journal Entry Testing: Integrating Natural Language Processing and Deep Learning","authors":"Qing Huang,&nbsp;Huijue Kelly Duan,&nbsp;Miklos A. Vasarhelyi","doi":"10.1002/isaf.70016","DOIUrl":"https://doi.org/10.1002/isaf.70016","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper presents an innovative approach to comprehensively and systematically evaluate manual journal entries (MJEs) and enhance the control procedures in auditing. The proposed approach combines quantitative and qualitative information to develop various Key Risk Indicators (KRIs) that provide insights into potential risks associated with MJEs. The approach incorporates textual analytics into traditional quantitative measures. Using the data obtained from a multinational company, the application of the proposed testing approach demonstrates its effectiveness in identifying potential high-risk MJEs and improving the company's journal entry testing and monitoring procedures. The findings contribute to current audit practices by offering a more efficient and comprehensive method for evaluating MJEs.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144935038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Volatility Using Monetary Rate and GARCH-LSTM Hybrid Model 基于货币利率和GARCH-LSTM混合模型的波动率预测
IF 3.7 Q1 Economics, Econometrics and Finance Pub Date : 2025-08-02 DOI: 10.1002/isaf.70013
Jyoti Ranjan, C. Anirvinna

Predicting volatility is very important for the financial markets as it helps to determine risk and decision-making. Predicting volatilities for such stock indices, which include the Nifty 50, is important for traders, investors, and policymakers. In this study, advanced hybrid models are used to predict the volatility of the Nifty 50 index over intervals of 1, 7, 14, and 21 days. The GJR-GARCH-LSTM and the GARCH-LSTM are two hybrid models that forecast the volatility of the Nifty 50. The effect of including the cash reserve ratio (CRR) in the analysis is also looked at. As the forecast horizon grows, the results show decreased prediction accuracy. The mean squared error (MSE) increased by 0.78% from the 1-day to the 7-day forecast, decreased by 2.63% between the 1-day and 7-day projections, rose by about 55% from the 7-day to the 14-day forecast, and grew by 56% between the 14-day and 21-day projections. The GJR-GARCH-LSTM model had better results compared to the simple GARCH-LSTM hybrid model. The novelty of this study is in building and validating hybrid models, specifically the GJR-GARCH-LSTM, to predict Nifty 50 index volatility and using the CRR as a macroeconomic explanatory variable. Different from current literature, which tends to use hybrid models in a generic sense, this paper adapts the model to the Indian financial environment and shows the additional predictive power of monetary policy determinants such as CRR.

预测波动性对金融市场非常重要,因为它有助于确定风险和决策。预测包括Nifty 50在内的股指的波动对交易员、投资者和政策制定者都很重要。在本研究中,采用先进的混合模型来预测Nifty 50指数在1、7、14和21天的波动率。GJR-GARCH-LSTM和GARCH-LSTM是预测Nifty 50波动率的两个混合模型。本文还探讨了将现金准备金率(CRR)纳入分析的影响。随着预测范围的增大,预测精度逐渐降低。平均平方误差(MSE)从1天预测到7天预测增加0.78%,从1天预测到7天预测减少2.63%,从7天预测到14天预测增加约55%,从14天预测到21天预测增加56%。与简单的GARCH-LSTM混合模型相比,GJR-GARCH-LSTM模型具有更好的效果。本研究的新颖之处在于建立和验证混合模型,特别是GJR-GARCH-LSTM,以预测Nifty 50指数的波动,并使用CRR作为宏观经济解释变量。与当前文献倾向于在一般意义上使用混合模型不同,本文将模型调整到印度金融环境,并显示了货币政策决定因素(如CRR)的额外预测能力。
{"title":"Prediction of Volatility Using Monetary Rate and GARCH-LSTM Hybrid Model","authors":"Jyoti Ranjan,&nbsp;C. Anirvinna","doi":"10.1002/isaf.70013","DOIUrl":"https://doi.org/10.1002/isaf.70013","url":null,"abstract":"<div>\u0000 \u0000 <p>Predicting volatility is very important for the financial markets as it helps to determine risk and decision-making. Predicting volatilities for such stock indices, which include the Nifty 50, is important for traders, investors, and policymakers. In this study, advanced hybrid models are used to predict the volatility of the Nifty 50 index over intervals of 1, 7, 14, and 21 days. The GJR-GARCH-LSTM and the GARCH-LSTM are two hybrid models that forecast the volatility of the Nifty 50. The effect of including the cash reserve ratio (CRR) in the analysis is also looked at. As the forecast horizon grows, the results show decreased prediction accuracy. The mean squared error (MSE) increased by 0.78% from the 1-day to the 7-day forecast, decreased by 2.63% between the 1-day and 7-day projections, rose by about 55% from the 7-day to the 14-day forecast, and grew by 56% between the 14-day and 21-day projections. The GJR-GARCH-LSTM model had better results compared to the simple GARCH-LSTM hybrid model. The novelty of this study is in building and validating hybrid models, specifically the GJR-GARCH-LSTM, to predict Nifty 50 index volatility and using the CRR as a macroeconomic explanatory variable. Different from current literature, which tends to use hybrid models in a generic sense, this paper adapts the model to the Indian financial environment and shows the additional predictive power of monetary policy determinants such as CRR.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 3","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144758617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing an Accounting Virtual Assistant Through Supervised Fine-Tuning (SFT) of a Small Language Model (SLM) 基于小语言模型监督微调(SFT)开发会计虚拟助手
Q1 Economics, Econometrics and Finance Pub Date : 2025-07-28 DOI: 10.1002/isaf.70011
Mario Zupan

The development of an in-house accounting bot—an artificial intelligence (AI) assistant capable of generating internally structured bookkeeping double-entry posting schemes—is explored in this paper. The processes of curating a suitable dataset, selecting, and fine-tuning a seven-billion-parameter language model, categorized as a small language model (SLM) (SLMs typically refer to models with fewer than 10 billion parameters, whereas medium-sized models often have 14B parameters, and large-scale models exceed 70B), are described. A human-evaluated benchmark is also presented to assess model performance. To achieve efficient supervised fine-tuning (SFT), low-rank adaptation (LoRA) was employed, significantly reducing memory requirements by using a small set of trainable parameters while maintaining model expressiveness. The process of backpropagation was further optimized using Unsloth, a high-performance training framework designed for efficient video memory usage and flash attention mechanisms, which accelerates adaptation and reduces memory overhead. The model whose layers were updated is called QwenCoder2.5. It was selected with the presumption that it would be able to learn how to generate and examine bookkeeping patterns generated by accounting information system (AIS) over a 17-year history. This proof of concept aims to support researchers and practitioners exploring the integration of generative AI in accounting by providing insights into both the benefits and challenges of AI-driven automation in bookkeeping tasks. The study demonstrates how an SLM can be fine-tuned on a proprietary dataset of journal posting schemes to assist accountants, auditors, and financial analysts while also facilitating synthetic data generation. Challenges related to AI, data preprocessing, fine-tuning optimization, and evaluation methodology are introduced and examined.

本文探讨了内部会计机器人的开发-一种能够生成内部结构化簿记复式记帐方案的人工智能(AI)助手。描述了管理合适的数据集、选择和微调70亿个参数的语言模型的过程,这些模型被归类为小型语言模型(SLM) (SLM通常指的是参数少于100亿个的模型,而中型模型通常有14B个参数,而大型模型通常有70B个参数)。还提出了一个人类评估的基准来评估模型的性能。为了实现有效的监督微调(SFT),采用了低秩自适应(LoRA),在保持模型表达性的同时,使用少量可训练参数显著降低了内存需求。使用Unsloth进一步优化了反向传播过程,Unsloth是一种高性能训练框架,专为高效的视频内存使用和flash注意机制而设计,可以加速适应并减少内存开销。更新图层的模型称为QwenCoder2.5。选择它的前提是,它将能够学习如何生成和检查会计信息系统(AIS)在17年的历史中生成的簿记模式。这一概念证明旨在通过提供对人工智能驱动的簿记任务自动化的好处和挑战的见解,支持研究人员和实践者探索生成式人工智能在会计中的集成。该研究展示了如何在日志发布方案的专有数据集上微调SLM,以协助会计师、审计师和财务分析师,同时促进合成数据的生成。与人工智能,数据预处理,微调优化和评估方法相关的挑战被介绍和检查。
{"title":"Developing an Accounting Virtual Assistant Through Supervised Fine-Tuning (SFT) of a Small Language Model (SLM)","authors":"Mario Zupan","doi":"10.1002/isaf.70011","DOIUrl":"https://doi.org/10.1002/isaf.70011","url":null,"abstract":"<p>The development of an in-house accounting bot—an artificial intelligence (AI) assistant capable of generating internally structured bookkeeping double-entry posting schemes—is explored in this paper. The processes of curating a suitable dataset, selecting, and fine-tuning a seven-billion-parameter language model, categorized as a small language model (SLM) (SLMs typically refer to models with fewer than 10 billion parameters, whereas medium-sized models often have 14B parameters, and large-scale models exceed 70B), are described. A human-evaluated benchmark is also presented to assess model performance. To achieve efficient supervised fine-tuning (SFT), low-rank adaptation (LoRA) was employed, significantly reducing memory requirements by using a small set of trainable parameters while maintaining model expressiveness. The process of backpropagation was further optimized using Unsloth, a high-performance training framework designed for efficient video memory usage and flash attention mechanisms, which accelerates adaptation and reduces memory overhead. The model whose layers were updated is called QwenCoder2.5. It was selected with the presumption that it would be able to learn how to generate and examine bookkeeping patterns generated by accounting information system (AIS) over a 17-year history. This proof of concept aims to support researchers and practitioners exploring the integration of generative AI in accounting by providing insights into both the benefits and challenges of AI-driven automation in bookkeeping tasks. The study demonstrates how an SLM can be fine-tuned on a proprietary dataset of journal posting schemes to assist accountants, auditors, and financial analysts while also facilitating synthetic data generation. Challenges related to AI, data preprocessing, fine-tuning optimization, and evaluation methodology are introduced and examined.</p>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"32 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/isaf.70011","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144716652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligent Systems in Accounting, Finance and Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1