首页 > 最新文献

Intelligent Systems in Accounting, Finance and Management最新文献

英文 中文
Massive data language models and conversational artificial intelligence: Emerging issues 海量数据语言模型和会话式人工智能:新兴问题
Q1 Economics, Econometrics and Finance Pub Date : 2022-09-16 DOI: 10.1002/isaf.1522
Daniel E. O’Leary

Google’s LaMDA, Open AI’s GPT-3, and Meta’s BlenderBot are artificial intelligence (AI)-based chatbots, that have been trained on billions of documents creating the notion of “massive data.” These systems use human-generated documents to capture words and relationships between words that people use when they communicate. This paper examines some of the similarities of these systems and the emerging issues regarding these massive data language models, including whether they are sentient, the use and impact of scale, information use and ownership, and explanations of discussions and answers. This paper also directly investigates some artifacts of Google’s LaMDA and compares them with Meta’s BlenderBot. Finally, this paper examines emerging issues and questions deriving from our analysis.

b谷歌的LaMDA、Open AI的GPT-3和Meta的blendbot都是基于人工智能(AI)的聊天机器人,它们经过数十亿份文件的训练,创造了“海量数据”的概念。这些系统使用人工生成的文档来捕获人们在交流时使用的单词和单词之间的关系。本文考察了这些系统的一些相似之处,以及关于这些海量数据语言模型的新问题,包括它们是否有感知、规模的使用和影响、信息的使用和所有权,以及对讨论和答案的解释。本文还直接研究了b谷歌的LaMDA的一些工件,并将它们与Meta的BlenderBot进行了比较。最后,本文探讨了从我们的分析中产生的新问题和问题。
{"title":"Massive data language models and conversational artificial intelligence: Emerging issues","authors":"Daniel E. O’Leary","doi":"10.1002/isaf.1522","DOIUrl":"10.1002/isaf.1522","url":null,"abstract":"<div>\u0000 \u0000 <p>Google’s LaMDA, Open AI’s GPT-3, and Meta’s BlenderBot are artificial intelligence (AI)-based chatbots, that have been trained on billions of documents creating the notion of “massive data.” These systems use human-generated documents to capture words and relationships between words that people use when they communicate. This paper examines some of the similarities of these systems and the emerging issues regarding these massive data language models, including whether they are sentient, the use and impact of scale, information use and ownership, and explanations of discussions and answers. This paper also directly investigates some artifacts of Google’s LaMDA and compares them with Meta’s BlenderBot. Finally, this paper examines emerging issues and questions deriving from our analysis.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 3","pages":"182-198"},"PeriodicalIF":0.0,"publicationDate":"2022-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122450525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Effects of classification, feature selection, and resampling methods on bankruptcy prediction of small and medium-sized enterprises 分类、特征选择和重采样方法对中小企业破产预测的影响
Q1 Economics, Econometrics and Finance Pub Date : 2022-09-15 DOI: 10.1002/isaf.1521
Lenka Papíková, Mário Papík

Small and medium-sized enterprises are the pillars of an economy, and their poor performance has a negative impact on living standards of population and country development. This study analyzes real-life data of 89,851 small and medium-sized enterprises, out of which 295 have declared bankruptcy. The analysis is performed via 27 financial ratios. The study framework combines seven classifications and three resampling and seven feature selection methods. Out of all classification methods applied, CatBoost has achieved the best results for all combinations of resampling and feature selection methods. CatBoost surpassed the results of other classification methods for the area under curve parameter, achieving a value of 99.95%. The application of resampling methods on different classification models has not identified a statistically significant level of improvement in any of the resampling methods. This finding has also been observed for feature selection methods. Based on these findings, we assume that individual resampling and feature selection methods do not improve model performance compared with the original imbalanced sample's results. Our results suggest that, even though the data sample may be significantly imbalanced with a minority of bankrupt companies, most classification algorithms can handle this imbalance and achieve interesting results. Moreover, our findings provide broad practical application for all stakeholders who could need to detect bankrupting companies.

中小企业是经济的支柱,中小企业经营不善对人民生活水平和国家发展产生负面影响。这项研究分析了89851家中小企业的真实数据,其中295家已经宣布破产。分析是通过27个财务比率进行的。该研究框架结合了7种分类、3种重采样和7种特征选择方法。在所有应用的分类方法中,CatBoost在所有重采样和特征选择方法的组合中都取得了最好的结果。CatBoost在曲线下面积参数上优于其他分类方法,准确率达到99.95%。重新抽样方法在不同分类模型上的应用并没有发现任何一种重新抽样方法在统计上有显著的改善。这一发现也被观察到特征选择方法。基于这些发现,我们假设与原始不平衡样本的结果相比,个体重采样和特征选择方法并不能提高模型的性能。我们的结果表明,尽管数据样本可能与少数破产公司显著不平衡,但大多数分类算法都可以处理这种不平衡并获得有趣的结果。此外,我们的研究结果为所有可能需要发现破产公司的利益相关者提供了广泛的实际应用。
{"title":"Effects of classification, feature selection, and resampling methods on bankruptcy prediction of small and medium-sized enterprises","authors":"Lenka Papíková,&nbsp;Mário Papík","doi":"10.1002/isaf.1521","DOIUrl":"10.1002/isaf.1521","url":null,"abstract":"<div>\u0000 \u0000 <p>Small and medium-sized enterprises are the pillars of an economy, and their poor performance has a negative impact on living standards of population and country development. This study analyzes real-life data of 89,851 small and medium-sized enterprises, out of which 295 have declared bankruptcy. The analysis is performed via 27 financial ratios. The study framework combines seven classifications and three resampling and seven feature selection methods. Out of all classification methods applied, CatBoost has achieved the best results for all combinations of resampling and feature selection methods. CatBoost surpassed the results of other classification methods for the area under curve parameter, achieving a value of 99.95%. The application of resampling methods on different classification models has not identified a statistically significant level of improvement in any of the resampling methods. This finding has also been observed for feature selection methods. Based on these findings, we assume that individual resampling and feature selection methods do not improve model performance compared with the original imbalanced sample's results. Our results suggest that, even though the data sample may be significantly imbalanced with a minority of bankrupt companies, most classification algorithms can handle this imbalance and achieve interesting results. Moreover, our findings provide broad practical application for all stakeholders who could need to detect bankrupting companies.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 4","pages":"254-281"},"PeriodicalIF":0.0,"publicationDate":"2022-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126923972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Application and performance of data mining techniques in stock market: A review 数据挖掘技术在股票市场中的应用与性能综述
Q1 Economics, Econometrics and Finance Pub Date : 2022-08-31 DOI: 10.1002/isaf.1518
Jasleen Kaur, Khushdeep Dharni

Prediction and the stock market go hand in hand. Due to the inherent limitations of traditional forecasting methods and the pursuit to uncover the hidden patterns in stock market data, stock market prediction using data mining techniques has caught the fancy of academicians, researchers, and investors. Based on a systematic review of more than 143 research studies spanning 25 years, the present paper brings to light the major issues concerning forecasting of stock markets based on data mining techniques, such as usage of data mining techniques in the stock market, input data types, single versus hybrid techniques, instruments and stock markets researched, types of software and algorithms used, measures of forecast accuracy, and performance of various data mining techniques. Emerging patterns related to various dimensions have been critically analyzed by highlighting the existing limitations and suggesting future research paradigms. This analysis can be useful for academicians, researchers and investors looking for futuristic directions in a given research domain.

预测和股市是密切相关的。由于传统预测方法的固有局限性和对揭示股票市场数据隐藏规律的追求,利用数据挖掘技术进行股票市场预测已经引起了学术界、研究人员和投资者的关注。基于对超过143项研究的系统回顾,本文揭示了基于数据挖掘技术的股票市场预测的主要问题,如数据挖掘技术在股票市场中的使用,输入数据类型,单一与混合技术,所研究的工具和股票市场,所使用的软件和算法类型,预测准确性的度量,以及各种数据挖掘技术的性能。通过强调现有的局限性和建议未来的研究范式,批判性地分析了与各个维度相关的新兴模式。这种分析对于在特定研究领域寻找未来方向的学者、研究人员和投资者非常有用。
{"title":"Application and performance of data mining techniques in stock market: A review","authors":"Jasleen Kaur,&nbsp;Khushdeep Dharni","doi":"10.1002/isaf.1518","DOIUrl":"10.1002/isaf.1518","url":null,"abstract":"<div>\u0000 \u0000 <p>Prediction and the stock market go hand in hand. Due to the inherent limitations of traditional forecasting methods and the pursuit to uncover the hidden patterns in stock market data, stock market prediction using data mining techniques has caught the fancy of academicians, researchers, and investors. Based on a systematic review of more than 143 research studies spanning 25 years, the present paper brings to light the major issues concerning forecasting of stock markets based on data mining techniques, such as usage of data mining techniques in the stock market, input data types, single versus hybrid techniques, instruments and stock markets researched, types of software and algorithms used, measures of forecast accuracy, and performance of various data mining techniques. Emerging patterns related to various dimensions have been critically analyzed by highlighting the existing limitations and suggesting future research paradigms. This analysis can be useful for academicians, researchers and investors looking for futuristic directions in a given research domain.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 4","pages":"219-241"},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129104525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Multilayer-neighbor local binary pattern for facial expression recognition 基于多层邻域局部二值模式的面部表情识别
Q1 Economics, Econometrics and Finance Pub Date : 2022-08-10 DOI: 10.1002/isaf.1520
Wei-Yen Hsu, Hsien-Jen Hsu, Yen-Yao Wang, Tawei Wang

Facial expression recognition (FER) has drawn the interest of practitioners and researchers due to its potential in opening new business opportunities. One critical aspect of any successful FER system is a feature extraction method that can efficiently find sufficient facial features and characterize facial expressions. This paper proposes an appearance-based feature extraction method by introducing a local feature descriptor, a multilayer-neighbor local binary pattern (LBP), for recognizing facial expressions. This new LBP operator is an extension of the original one-layer-neighbor LBP to two-layer-neighbor and three-layer-neighbor LBPs. We extract features by comparing new center points with neighborhood points. In addition, based on facial landmark locations, we extract active facial blocks during emotional stimulations. These prominent facial blocks utilize facial symmetry to improve the accuracy and speed of expression recognition. After using principal component analysis to reduce the dimensionality of features, we use a support vector machine to assign expressions to seven categories. We evaluate the proposed method by comparing it with other commonly used methods, and the proposed method is more accurate. Implications for business researchers are discussed.

面部表情识别(FER)因其潜在的商业机会而引起了从业者和研究人员的兴趣。任何成功的人脸识别系统的一个关键方面是能够有效地找到足够的面部特征并表征面部表情的特征提取方法。本文提出了一种基于外观的特征提取方法,该方法通过引入局部特征描述符——多层相邻局部二值模式(LBP)来识别面部表情。这种新的LBP算子是将原有的单层邻居LBP扩展到两层邻居和三层邻居LBP。我们通过比较新的中心点和邻域点来提取特征。此外,基于面部地标位置,我们提取了情绪刺激时活跃的面部块。这些突出的面部块利用面部对称性来提高表情识别的准确性和速度。在使用主成分分析降低特征维数后,我们使用支持向量机将表达式划分为七个类别。通过与其他常用方法的比较,对本文提出的方法进行了评价,结果表明本文提出的方法更加准确。讨论了对商业研究人员的启示。
{"title":"Multilayer-neighbor local binary pattern for facial expression recognition","authors":"Wei-Yen Hsu,&nbsp;Hsien-Jen Hsu,&nbsp;Yen-Yao Wang,&nbsp;Tawei Wang","doi":"10.1002/isaf.1520","DOIUrl":"10.1002/isaf.1520","url":null,"abstract":"<div>\u0000 \u0000 <p>Facial expression recognition (FER) has drawn the interest of practitioners and researchers due to its potential in opening new business opportunities. One critical aspect of any successful FER system is a feature extraction method that can efficiently find sufficient facial features and characterize facial expressions. This paper proposes an appearance-based feature extraction method by introducing a local feature descriptor, a multilayer-neighbor local binary pattern (LBP), for recognizing facial expressions. This new LBP operator is an extension of the original one-layer-neighbor LBP to two-layer-neighbor and three-layer-neighbor LBPs. We extract features by comparing new center points with neighborhood points. In addition, based on facial landmark locations, we extract active facial blocks during emotional stimulations. These prominent facial blocks utilize facial symmetry to improve the accuracy and speed of expression recognition. After using principal component analysis to reduce the dimensionality of features, we use a support vector machine to assign expressions to seven categories. We evaluate the proposed method by comparing it with other commonly used methods, and the proposed method is more accurate. Implications for business researchers are discussed.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 3","pages":"156-168"},"PeriodicalIF":0.0,"publicationDate":"2022-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115427266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Commodity price forecasting via neural networks for coffee, corn, cotton, oats, soybeans, soybean oil, sugar, and wheat 通过神经网络预测咖啡、玉米、棉花、燕麦、大豆、大豆油、糖和小麦的商品价格
Q1 Economics, Econometrics and Finance Pub Date : 2022-08-03 DOI: 10.1002/isaf.1519
Xiaojie Xu, Yun Zhang

Agricultural commodity price forecasting represents a key concern for market participants. We explore the usefulness of neural network modeling for forecasting problems in datasets of daily prices over periods of greater than 50 years for coffee, corn, cotton, oats, soybeans, soybean oil, sugar, and wheat. By investigating different model settings across the algorithm, delay, hidden neuron, and data-splitting ratio, we arrive at models leading to a decent performance for each commodity, with the overall relative root mean square error ranging from 1.70% to 3.19%. These results have small advantages over no-change models due to particular price adjustments in the prices considered here. Our results can be used on a standalone basis or combined with fundamental forecasts in forming perspectives of commodity price trends and conducting policy analysis. Our empirical framework should not be diffucult to implement, which is a critical consideration for many decision-makers and has the potential to be generalized for price forecasts of more commodities.

农产品价格预测是市场参与者关注的一个关键问题。我们探索了神经网络建模在预测咖啡、玉米、棉花、燕麦、大豆、大豆油、糖和小麦超过50年的每日价格数据集中的有用性。通过研究算法、延迟、隐藏神经元和数据分割率的不同模型设置,我们得出了每种商品都有不错表现的模型,总体相对均方根误差在1.70%到3.19%之间。由于这里考虑的价格中有特定的价格调整,这些结果比没有变化的模型有小的优势。我们的结果可以单独使用,也可以与基本面预测结合使用,形成商品价格趋势的观点并进行政策分析。我们的经验框架应该不难实施,这是许多决策者的关键考虑因素,并有可能推广到更多商品的价格预测中。
{"title":"Commodity price forecasting via neural networks for coffee, corn, cotton, oats, soybeans, soybean oil, sugar, and wheat","authors":"Xiaojie Xu,&nbsp;Yun Zhang","doi":"10.1002/isaf.1519","DOIUrl":"10.1002/isaf.1519","url":null,"abstract":"<div>\u0000 \u0000 <p>Agricultural commodity price forecasting represents a key concern for market participants. We explore the usefulness of neural network modeling for forecasting problems in datasets of daily prices over periods of greater than 50 years for coffee, corn, cotton, oats, soybeans, soybean oil, sugar, and wheat. By investigating different model settings across the algorithm, delay, hidden neuron, and data-splitting ratio, we arrive at models leading to a decent performance for each commodity, with the overall relative root mean square error ranging from 1.70% to 3.19%. These results have small advantages over no-change models due to particular price adjustments in the prices considered here. Our results can be used on a standalone basis or combined with fundamental forecasts in forming perspectives of commodity price trends and conducting policy analysis. Our empirical framework should not be diffucult to implement, which is a critical consideration for many decision-makers and has the potential to be generalized for price forecasts of more commodities.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 3","pages":"169-181"},"PeriodicalIF":0.0,"publicationDate":"2022-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129521764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Enhanced financial fraud detection using cost-sensitive cascade forest with missing value imputation 基于缺失价值估算的成本敏感级联森林增强财务欺诈检测
Q1 Economics, Econometrics and Finance Pub Date : 2022-07-28 DOI: 10.1002/isaf.1517
Lukui Huang, Alan Abrahams, Peter Ractham

Financial statement fraud is a global problem for investors, audit firms, regulators, and other stakeholders. Fraud detection can be regarded as a binary classification problem with a false negative being more expensive than a false positive. Although existing studies have made great efforts to detect fraud using various data-mining techniques, the difference in misclassification costs is seldom considered. In this study, we propose a cost-sensitive cascade forest (CSCF) for fraud detection, which places heavy penalty on false negative prediction and self-adjusts the depth of a cascade forest according to the classifier’s recall (i.e. the classifier’s sensitivity). As missing values are ubiquitous in fraud research, we also explore the effect of selected missing data treatments on prediction performance, including complete case analysis, three selected classic statistical mechanisms (zero, mean, and modified mean imputation), and two machine learning (K-nearest neighbor [KNN] and random forest [RF]) approaches. The experimental results show that the proposed CSCF significantly improves the fraud prediction in comparison with one of the latest fraud detection models using the RUSBoost algorithm. Comparing different missing value treatments, even though RUSBoost and CSCF perform well when using complete case analysis, we find that the best performance is achieved when CSCF is used with missing data imputed as zero. Such treatment further improves the performance, and results in an area under curve (AUC) score of 0.82 compared to the highest AUC (0.71) from the baseline model. Supplementary analysis further reveals that the low AUC of complete case analysis for the two examined models persists under different training sizes. Thus, our findings shed light on the potential benefits of missing value imputation for the model’s performance for fraud detection.

对于投资者、审计公司、监管机构和其他利益相关者来说,财务报表欺诈是一个全球性问题。欺诈检测可以看作是一个二值分类问题,假阴性比假阳性代价更大。尽管现有的研究已经利用各种数据挖掘技术做出了很大的努力来检测欺诈行为,但很少考虑错误分类成本的差异。在本研究中,我们提出了一种成本敏感级联森林(CSCF)用于欺诈检测,它对假阴性预测进行重罚,并根据分类器的召回率(即分类器的灵敏度)自调整级联森林的深度。由于缺失值在欺诈研究中无处不在,我们还探讨了选定的缺失数据处理对预测性能的影响,包括完整的案例分析,三种选定的经典统计机制(零、均值和修正均值imputation),以及两种机器学习(k最近邻[KNN]和随机森林[RF])方法。实验结果表明,与使用RUSBoost算法的最新欺诈检测模型相比,所提出的CSCF显著提高了欺诈预测。比较不同的缺失值处理,尽管RUSBoost和CSCF在使用完整的案例分析时表现良好,但我们发现,当CSCF使用缺失数据为零时,可以获得最佳性能。这种处理进一步提高了性能,与基线模型的最高AUC(0.71)相比,曲线下面积(AUC)得分为0.82。补充分析进一步表明,在不同的训练规模下,两种模型的完整案例分析的低AUC仍然存在。因此,我们的研究结果揭示了缺失值估算对模型欺诈检测性能的潜在好处。
{"title":"Enhanced financial fraud detection using cost-sensitive cascade forest with missing value imputation","authors":"Lukui Huang,&nbsp;Alan Abrahams,&nbsp;Peter Ractham","doi":"10.1002/isaf.1517","DOIUrl":"10.1002/isaf.1517","url":null,"abstract":"<div>\u0000 \u0000 <p>Financial statement fraud is a global problem for investors, audit firms, regulators, and other stakeholders. Fraud detection can be regarded as a binary classification problem with a false negative being more expensive than a false positive. Although existing studies have made great efforts to detect fraud using various data-mining techniques, the difference in misclassification costs is seldom considered. In this study, we propose a cost-sensitive cascade forest (CSCF) for fraud detection, which places heavy penalty on false negative prediction and self-adjusts the depth of a cascade forest according to the classifier’s recall (i.e. the classifier’s sensitivity). As missing values are ubiquitous in fraud research, we also explore the effect of selected missing data treatments on prediction performance, including complete case analysis, three selected classic statistical mechanisms (zero, mean, and modified mean imputation), and two machine learning (K-nearest neighbor [KNN] and random forest [RF]) approaches. The experimental results show that the proposed CSCF significantly improves the fraud prediction in comparison with one of the latest fraud detection models using the RUSBoost algorithm. Comparing different missing value treatments, even though RUSBoost and CSCF perform well when using complete case analysis, we find that the best performance is achieved when CSCF is used with missing data imputed as zero. Such treatment further improves the performance, and results in an area under curve (AUC) score of 0.82 compared to the highest AUC (0.71) from the baseline model. Supplementary analysis further reveals that the low AUC of complete case analysis for the two examined models persists under different training sizes. Thus, our findings shed light on the potential benefits of missing value imputation for the model’s performance for fraud detection.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 3","pages":"133-155"},"PeriodicalIF":0.0,"publicationDate":"2022-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122885228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards an early warning system for sovereign defaults leveraging on machine learning methodologies 利用机器学习方法建立主权违约预警系统
Q1 Economics, Econometrics and Finance Pub Date : 2022-06-12 DOI: 10.1002/isaf.1516
Anastasios Petropoulos, Vasilis Siakoulis, Evangelos Stavroulakis

In this study, we address the topic of credit risk stemming from central governments from a technical point of view. First, we explore various econometric and machine learning techniques to build an enhanced sovereign rating system that effectively differentiates the risk of default among countries. Our empirical results indicate that the machine learning method of XGBOOST has a superior out-of-sample and out-of-time predictive performance. Then, we use the models developed to calibrate a sovereign rating system and provide useful insights into the set-up of a parsimonious early warning system. Our results provide a more concise view of the most robust method for classifying countries’ default risk with significant regulatory implications, given that the efficient assessment of sovereign debt is crucial for effective proactive risk measurement.

在本研究中,我们从技术的角度来探讨中央政府的信用风险问题。首先,我们探索了各种计量经济学和机器学习技术,以建立一个增强的主权评级系统,有效区分各国之间的违约风险。我们的实证结果表明,XGBOOST的机器学习方法具有优异的样本外和时间外预测性能。然后,我们使用开发的模型来校准主权评级系统,并为建立节俭的预警系统提供有用的见解。鉴于对主权债务的有效评估对于有效的主动风险测量至关重要,我们的研究结果提供了一种更简洁的观点,即对具有重大监管意义的国家违约风险进行分类的最稳健方法。
{"title":"Towards an early warning system for sovereign defaults leveraging on machine learning methodologies","authors":"Anastasios Petropoulos,&nbsp;Vasilis Siakoulis,&nbsp;Evangelos Stavroulakis","doi":"10.1002/isaf.1516","DOIUrl":"10.1002/isaf.1516","url":null,"abstract":"<div>\u0000 \u0000 <p>In this study, we address the topic of credit risk stemming from central governments from a technical point of view. First, we explore various econometric and machine learning techniques to build an enhanced sovereign rating system that effectively differentiates the risk of default among countries. Our empirical results indicate that the machine learning method of XGBOOST has a superior out-of-sample and out-of-time predictive performance. Then, we use the models developed to calibrate a sovereign rating system and provide useful insights into the set-up of a parsimonious early warning system. Our results provide a more concise view of the most robust method for classifying countries’ default risk with significant regulatory implications, given that the efficient assessment of sovereign debt is crucial for effective proactive risk measurement.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 2","pages":"118-129"},"PeriodicalIF":0.0,"publicationDate":"2022-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132033582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Measuring relative volatility in high-frequency data under the directional change approach 在方向变化方法下测量高频数据的相对波动性
Q1 Economics, Econometrics and Finance Pub Date : 2022-06-02 DOI: 10.1002/isaf.1510
Shengnan Li, Edward P. K. Tsang, John O'Hara

We introduce a new approach in measuring relative volatility between two markets based on the directional change (DC) method. DC is a data-driven approach for sampling financial market data such that the data are recorded when the price changes have reached a significant amplitude rather than recording data under a predetermined timescale. Under the DC framework, we propose a new concept of DC micro-market relative volatility to evaluate relative volatility between two markets. Unlike the time-series method, micro-market relative volatility redefines the timescale based on the frequency of the observed DC data between the two markets. We show that it is useful for measuring the relative volatility in micro-market activities (high-frequency data).

本文提出了一种基于方向性变化(DC)方法来测量两个市场之间相对波动率的新方法。DC是一种数据驱动的方法,用于对金融市场数据进行抽样,当价格变化达到显著幅度时记录数据,而不是在预定的时间尺度下记录数据。在直流框架下,我们提出了直流微观市场相对波动率的新概念来评估两个市场之间的相对波动率。与时间序列方法不同,微观市场相对波动率根据两个市场之间观察到的直流数据的频率重新定义了时间尺度。我们表明,它对于衡量微观市场活动(高频数据)的相对波动性是有用的。
{"title":"Measuring relative volatility in high-frequency data under the directional change approach","authors":"Shengnan Li,&nbsp;Edward P. K. Tsang,&nbsp;John O'Hara","doi":"10.1002/isaf.1510","DOIUrl":"10.1002/isaf.1510","url":null,"abstract":"<p>We introduce a new approach in measuring relative volatility between two markets based on the directional change (DC) method. DC is a data-driven approach for sampling financial market data such that the data are recorded when the price changes have reached a significant amplitude rather than recording data under a predetermined timescale. Under the DC framework, we propose a new concept of DC micro-market relative volatility to evaluate relative volatility between two markets. Unlike the time-series method, micro-market relative volatility redefines the timescale based on the frequency of the observed DC data between the two markets. We show that it is useful for measuring the relative volatility in micro-market activities (high-frequency data).</p>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 2","pages":"86-102"},"PeriodicalIF":0.0,"publicationDate":"2022-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/isaf.1510","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128450432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Forecasting Commodity Market Returns Volatility: A Hybrid Ensemble Learning GARCH-LSTM based Approach 预测商品市场收益波动:一种基于GARCH-LSTM的混合集成学习方法
Q1 Economics, Econometrics and Finance Pub Date : 2022-05-31 DOI: 10.1002/isaf.1515
Kshitij Kakade, Aswini Kumar Mishra, Kshitish Ghate, Shivang Gupta

This study investigates the advantage of combining the forecasting abilities of multiple generalized autoregressive conditional heteroscedasticity (GARCH)-type models, such as the standard GARCH (GARCH), exponential GARCH (eGARCH), and threshold GARCH (tGARCH) models with advanced deep learning methods to predict the volatility of five important metals (nickel, copper, tin, lead, and gold) in the Indian commodity market. This paper proposes integrating the forecasts of one to three GARCH-type models into an ensemble learning-based hybrid long short-term memory (LSTM) model to forecast commodity price volatility. We further evaluate the forecasting performance of these models for standalone LSTM and GARCH-type models using the root mean squared error, mean absolute error, and mean fundamental percentage error. The results highlight that combining the information from the forecasts of multiple GARCH types into a hybrid LSTM model leads to superior volatility forecasting capability. The SET-LSTM, which represents the model that combines forecasts of the GARCH, eGARCH, and tGARCH into the LSTM hybrid, has shown the best overall results for all metals, barring a few exceptions. Moreover, the equivalence of forecasting accuracy is tested using the Diebold–Mariano and Wilcoxon signed-rank tests.

本文研究了将标准GARCH (GARCH)、指数GARCH (eGARCH)和阈值GARCH (tGARCH)模型等多种广义自回归条件异方差(GARCH)模型的预测能力与先进的深度学习方法相结合,对印度商品市场五种重要金属(镍、铜、锡、铅和金)的波动性进行预测的优势。本文提出将一到三个garch型模型的预测整合到一个基于集成学习的混合长短期记忆(LSTM)模型中来预测商品价格波动。我们进一步使用均方根误差、平均绝对误差和平均基本百分比误差来评估这些模型对独立LSTM和garch类型模型的预测性能。结果表明,将多个GARCH类型的预测信息结合到混合LSTM模型中可以获得更好的波动率预测能力。SET-LSTM模型将GARCH、eGARCH和tGARCH的预测结合到LSTM混合模型中,除了少数例外,它对所有金属的预测结果都是最好的。此外,使用Diebold-Mariano和Wilcoxon符号秩检验检验了预测精度的等价性。
{"title":"Forecasting Commodity Market Returns Volatility: A Hybrid Ensemble Learning GARCH-LSTM based Approach","authors":"Kshitij Kakade,&nbsp;Aswini Kumar Mishra,&nbsp;Kshitish Ghate,&nbsp;Shivang Gupta","doi":"10.1002/isaf.1515","DOIUrl":"https://doi.org/10.1002/isaf.1515","url":null,"abstract":"<div>\u0000 \u0000 <p>This study investigates the advantage of combining the forecasting abilities of multiple generalized autoregressive conditional heteroscedasticity (GARCH)-type models, such as the standard GARCH (GARCH), exponential GARCH (eGARCH), and threshold GARCH (tGARCH) models with advanced deep learning methods to predict the volatility of five important metals (nickel, copper, tin, lead, and gold) in the Indian commodity market. This paper proposes integrating the forecasts of one to three GARCH-type models into an ensemble learning-based hybrid long short-term memory (LSTM) model to forecast commodity price volatility. We further evaluate the forecasting performance of these models for standalone LSTM and GARCH-type models using the root mean squared error, mean absolute error, and mean fundamental percentage error. The results highlight that combining the information from the forecasts of multiple GARCH types into a hybrid LSTM model leads to superior volatility forecasting capability. The SET-LSTM, which represents the model that combines forecasts of the GARCH, eGARCH, and tGARCH into the LSTM hybrid, has shown the best overall results for all metals, barring a few exceptions. Moreover, the equivalence of forecasting accuracy is tested using the Diebold–Mariano and Wilcoxon signed-rank tests.</p>\u0000 </div>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 2","pages":"103-117"},"PeriodicalIF":0.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137749873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anti-money laundering and financial fraud detection: A systematic literature review 反洗钱与金融欺诈侦查:系统的文献综述
Q1 Economics, Econometrics and Finance Pub Date : 2022-05-19 DOI: 10.1002/isaf.1509
Lucas Schmidt Goecks, André Luis Korzenowski, Platão Gonçalves Terra Neto, Davenilcio Luiz de Souza, Taciana Mareth

Money laundering has affected the global economy for many years, and there are several methods of solving it presented in the literature. However, when tackling money laundering and financial fraud together there are few methods for solving them. Thus, this study aims to identify methods for anti-money laundering (AML) and financial fraud detection (FFD). A systematic literature review was performed for analysis and research of the methods used, utilizing the SCOPUS and Web of Science databases. Of the 48 articles that aligned with the research theme, 20 used quantitative methods for AML and FFD solution, 13 were literature reviews, 7 used qualitative methods, and 8 used mixed methods. This study contributes by presenting a systematic literature review that fills two research gaps: lack of studies on AML and FFD, and the methods used to solve them. This will assist researchers in identifying gaps and related research.

洗钱已经影响了全球经济多年,有几种方法解决它在文献中提出。然而,当洗钱和金融欺诈一起处理时,解决它们的方法很少。因此,本研究旨在确定反洗钱(AML)和金融欺诈检测(FFD)的方法。利用SCOPUS和Web of Science数据库,对所采用的方法进行了系统的文献综述和分析研究。在符合研究主题的48篇文章中,20篇采用AML和FFD溶液的定量方法,13篇为文献综述,7篇采用定性方法,8篇采用混合方法。本研究通过系统的文献综述填补了两个研究空白:AML和FFD研究的缺乏,以及解决这些问题的方法。这将有助于研究人员确定差距和相关研究。
{"title":"Anti-money laundering and financial fraud detection: A systematic literature review","authors":"Lucas Schmidt Goecks,&nbsp;André Luis Korzenowski,&nbsp;Platão Gonçalves Terra Neto,&nbsp;Davenilcio Luiz de Souza,&nbsp;Taciana Mareth","doi":"10.1002/isaf.1509","DOIUrl":"10.1002/isaf.1509","url":null,"abstract":"<p>Money laundering has affected the global economy for many years, and there are several methods of solving it presented in the literature. However, when tackling money laundering and financial fraud together there are few methods for solving them. Thus, this study aims to identify methods for anti-money laundering (AML) and financial fraud detection (FFD). A systematic literature review was performed for analysis and research of the methods used, utilizing the SCOPUS and Web of Science databases. Of the 48 articles that aligned with the research theme, 20 used quantitative methods for AML and FFD solution, 13 were literature reviews, 7 used qualitative methods, and 8 used mixed methods. This study contributes by presenting a systematic literature review that fills two research gaps: lack of studies on AML and FFD, and the methods used to solve them. This will assist researchers in identifying gaps and related research.</p>","PeriodicalId":53473,"journal":{"name":"Intelligent Systems in Accounting, Finance and Management","volume":"29 2","pages":"71-85"},"PeriodicalIF":0.0,"publicationDate":"2022-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133438268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Intelligent Systems in Accounting, Finance and Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1