首页 > 最新文献

Journal of Finance and Data Science最新文献

英文 中文
Fintech, financial inclusion, digital currency, and CBDC 金融科技、普惠金融、数字货币和 CBDC
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2024.100115
Huining (Henry) Cao, Xiaoyan Zhang, Yi Huang, Yiping Huang, Bernard Yeung
{"title":"Fintech, financial inclusion, digital currency, and CBDC","authors":"Huining (Henry) Cao, Xiaoyan Zhang, Yi Huang, Yiping Huang, Bernard Yeung","doi":"10.1016/j.jfds.2024.100115","DOIUrl":"10.1016/j.jfds.2024.100115","url":null,"abstract":"","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918824000011/pdfft?md5=3ec6a5ae01b92dc8eb68df1265a77962&pid=1-s2.0-S2405918824000011-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139395799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general framework for portfolio construction based on generative models of asset returns 基于资产收益生成模型的投资组合构建总体框架
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2023.100113
Tuoyuan Cheng , Kan Chen

In this paper, we present an integrated approach to portfolio construction and optimization, leveraging high-performance computing capabilities. We first explore diverse pairings of generative model forecasts and objective functions used for portfolio optimization, which are evaluated using performance-attribution models based on least absolute shrinkage and selection operator (LASSO). We illustrate our approach using extensive simulations of crypto-currency portfolios, and we show that the portfolios constructed using the vine-copula generative model and the Sharpe-ratio objective function consistently outperform. To accommodate a wide array of investment strategies, we further investigate portfolio blending and propose a general framework for evaluating and combining investment strategies. We employ an extension of the multi-armed bandit framework and use value models and policy models to construct eclectic blended portfolios based on past performance. We consider similarity and optimality measures for value models and employ probability-matching (“blending”) and a greedy algorithm (“switching”) for policy models. The eclectic portfolios are also evaluated using LASSO models. We show that the value model utilizing cosine similarity and logit optimality consistently delivers robust superior performances. The extent of outperformance by eclectic portfolios over their benchmarks significantly surpasses that achieved by individual generative model-based portfolios over their respective benchmarks.

在本文中,我们提出了一种利用高性能计算能力构建和优化投资组合的综合方法。我们首先探索了用于投资组合优化的生成模型预测和目标函数的各种配对,并使用基于最小绝对收缩和选择算子(LASSO)的性能分配模型对其进行了评估。我们使用大量的加密货币投资组合模拟来说明我们的方法,结果表明,使用葡萄树-科普拉斯生成模型和夏普比率目标函数构建的投资组合始终表现优异。为了适应各种投资策略,我们进一步研究了投资组合混合,并提出了评估和组合投资策略的一般框架。我们采用了多臂匪徒框架的扩展,并使用价值模型和政策模型来构建基于过往业绩的折衷混合投资组合。我们考虑了价值模型的相似性和最优性度量,并对政策模型采用了概率匹配("混合")和贪婪算法("切换")。我们还使用 LASSO 模型对折中投资组合进行了评估。我们发现,利用余弦相似性和对数最优性的价值模型始终具有稳健的卓越表现。折中投资组合优于其基准的程度大大超过了基于生成模型的单个投资组合优于其各自基准的程度。
{"title":"A general framework for portfolio construction based on generative models of asset returns","authors":"Tuoyuan Cheng ,&nbsp;Kan Chen","doi":"10.1016/j.jfds.2023.100113","DOIUrl":"https://doi.org/10.1016/j.jfds.2023.100113","url":null,"abstract":"<div><p>In this paper, we present an integrated approach to portfolio construction and optimization, leveraging high-performance computing capabilities. We first explore diverse pairings of generative model forecasts and objective functions used for portfolio optimization, which are evaluated using performance-attribution models based on least absolute shrinkage and selection operator (LASSO). We illustrate our approach using extensive simulations of crypto-currency portfolios, and we show that the portfolios constructed using the vine-copula generative model and the Sharpe-ratio objective function consistently outperform. To accommodate a wide array of investment strategies, we further investigate portfolio blending and propose a general framework for evaluating and combining investment strategies. We employ an extension of the multi-armed bandit framework and use value models and policy models to construct eclectic blended portfolios based on past performance. We consider similarity and optimality measures for value models and employ probability-matching (“blending”) and a greedy algorithm (“switching”) for policy models. The eclectic portfolios are also evaluated using LASSO models. We show that the value model utilizing cosine similarity and logit optimality consistently delivers robust superior performances. The extent of outperformance by eclectic portfolios over their benchmarks significantly surpasses that achieved by individual generative model-based portfolios over their respective benchmarks.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918823000296/pdfft?md5=d79808b84c71271aae85d6e7e66cc3dd&pid=1-s2.0-S2405918823000296-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138839770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An analysis of conditional mean-variance portfolio performance using hierarchical clustering 利用分层聚类分析条件均值-方差投资组合绩效
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2023.100112
Stephen R. Owen

This paper studies portfolio optimization through improvements of ex-ante conditional covariance estimates. We use the cross-section of stock returns over a 52-year sample to analyze trading performance by implementing the machine learning algorithm of hierarchical clustering. We find that higher out-of-sample risk-adjusted returns are achieved relative to the traditional Markowitz portfolio through hierarchical clustering using a 3-month buy-and-hold, long-only strategy. Additionally, the average change in portfolio weights at each rebalancing period is significantly lower for the portfolio formed using machine learning relative to Markowitz, decreasing investor trading costs. The results are robust to various settings and subsamples.

本文通过改进事前条件协方差估计来研究投资组合优化。我们采用分层聚类的机器学习算法,利用 52 年样本股票收益的横截面来分析交易绩效。我们发现,与传统的马科维茨投资组合相比,通过使用 3 个月买入并持有的只做多策略进行分层聚类,可以获得更高的样本外风险调整回报。此外,相对于马科维茨,使用机器学习形成的投资组合在每个再平衡期的投资组合权重平均变化要低得多,从而降低了投资者的交易成本。这些结果对各种设置和子样本都是稳健的。
{"title":"An analysis of conditional mean-variance portfolio performance using hierarchical clustering","authors":"Stephen R. Owen","doi":"10.1016/j.jfds.2023.100112","DOIUrl":"10.1016/j.jfds.2023.100112","url":null,"abstract":"<div><p>This paper studies portfolio optimization through improvements of ex-ante conditional covariance estimates. We use the cross-section of stock returns over a 52-year sample to analyze trading performance by implementing the machine learning algorithm of hierarchical clustering. We find that higher out-of-sample risk-adjusted returns are achieved relative to the traditional Markowitz portfolio through hierarchical clustering using a 3-month buy-and-hold, long-only strategy. Additionally, the average change in portfolio weights at each rebalancing period is significantly lower for the portfolio formed using machine learning relative to Markowitz, decreasing investor trading costs. The results are robust to various settings and subsamples.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918823000284/pdfft?md5=f08cced7b32b9843b62604332db0b92a&pid=1-s2.0-S2405918823000284-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135714581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CentralBankRoBERTa: A fine-tuned large language model for central bank communications CentralBankRoBERTa:用于中央银行通信的微调大型语言模型
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2023.100114
Moritz Pfeifer , Vincent P. Marohl

Central bank communications are an important tool for guiding the economy and fulfilling monetary policy goals. Natural language processing (NLP) algorithms have been used to analyze central bank communications. These outdated bag-of-words methods often ignore context and cannot distinguish who these sentiments are addressing. Recent research has introduced deep-learning-based NLP algorithms, also known as large language models (LLMs), which take context into account. This study applies LLMs to central bank communications and constructs CentralBankRoBERTa, a state-of-the-art economic agent classifier that distinguishes five basic macroeconomic agents and binary sentiment classifier that identifies the emotional content of sentences in central bank communications. The absence of large-language models in the central bank communications literature may be attributed to a lack of appropriately labeled datasets. To address this gap, we introduce our model, CentralBankRoBERTa, offering an easy-to-use and standardized tool for scholars of central bank communications.

中央银行通信是指导经济和实现货币政策目标的重要工具。自然语言处理 (NLP) 算法一直被用于分析中央银行的通信。这些过时的词袋法往往忽略了上下文,无法区分这些情绪是针对谁的。最近的研究引入了基于深度学习的 NLP 算法,也称为大型语言模型 (LLM),它将上下文考虑在内。本研究将 LLMs 应用于中央银行通信,并构建了 CentralBankRoBERTa,这是一种最先进的经济代理分类器,可区分五种基本宏观经济代理和二元情感分类器,可识别中央银行通信中句子的情感内容。中央银行通信文献中缺乏大型语言模型,这可能是由于缺乏适当标记的数据集。为了填补这一空白,我们推出了我们的模型 CentralBankRoBERTa,为研究中央银行通信的学者提供了一个易于使用的标准化工具。
{"title":"CentralBankRoBERTa: A fine-tuned large language model for central bank communications","authors":"Moritz Pfeifer ,&nbsp;Vincent P. Marohl","doi":"10.1016/j.jfds.2023.100114","DOIUrl":"10.1016/j.jfds.2023.100114","url":null,"abstract":"<div><p>Central bank communications are an important tool for guiding the economy and fulfilling monetary policy goals. Natural language processing (NLP) algorithms have been used to analyze central bank communications. These outdated bag-of-words methods often ignore context and cannot distinguish who these sentiments are addressing. Recent research has introduced deep-learning-based NLP algorithms, also known as large language models (LLMs), which take context into account. This study applies LLMs to central bank communications and constructs CentralBankRoBERTa, a state-of-the-art economic agent classifier that distinguishes five basic macroeconomic agents and binary sentiment classifier that identifies the emotional content of sentences in central bank communications. The absence of large-language models in the central bank communications literature may be attributed to a lack of appropriately labeled datasets. To address this gap, we introduce our model, CentralBankRoBERTa, offering an easy-to-use and standardized tool for scholars of central bank communications.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918823000302/pdfft?md5=f9ee367fba22f338380c708e9da5b5a4&pid=1-s2.0-S2405918823000302-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139012960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asset allocation using a Markov process of clustered efficient frontier coefficients states 利用聚类有效前沿系数状态的马尔可夫过程进行资产配置
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2023.100110
Nolan Alexander , William Scherer , Jamey Thompson

We propose a novel asset allocation model using a Markov process of states defined by clustered efficient frontier coefficients. While most research in Markov models of the market characterize regimes using return and volatility, we instead propose characterizing these states using efficient frontiers, which provide more information on the interactions of underlying assets that comprise the market. Efficient frontiers can be decomposed to their functional form, a square-root second-order polynomial defined by three coefficients, to provide a dimensionality reduction of the return vector and covariance matrix. Each month, the proposed model hierarchically clusters the monthly coefficients data up to the current month, to characterize the market states, then defines a Markov process on the sequence of states. To incorporate these states into portfolio optimization, for each state, we calculate the tangency portfolio using only return data in that state. We then take the expectation of these weights for each state, weighted by the probability of transitioning from the current state to each state. To empirically validate our proposed model, we employ three sets of assets that span the market, and show that our proposed model significantly outperforms benchmark portfolios.

我们提出了一种新颖的资产配置模型,该模型采用了由聚类有效前沿系数定义的马尔可夫状态过程。市场马尔可夫模型的大多数研究都是利用收益率和波动率来描述市场状态的,而我们则建议利用有效前沿来描述这些状态,因为有效前沿提供了更多关于构成市场的基础资产之间相互作用的信息。有效前沿可分解为其函数形式,即由三个系数定义的平方根二阶多项式,从而降低回报向量和协方差矩阵的维度。每个月,所提出的模型都会对截至当月的月度系数数据进行分层聚类,以描述市场状态,然后在状态序列上定义马尔可夫过程。为了将这些状态纳入投资组合优化,对于每个状态,我们仅使用该状态下的收益数据计算切线投资组合。然后,我们根据从当前状态过渡到每种状态的概率,对每种状态的权重进行加权,求出这些权重的期望值。为了对我们提出的模型进行实证验证,我们采用了三组跨市场的资产,结果表明我们提出的模型明显优于基准投资组合。
{"title":"Asset allocation using a Markov process of clustered efficient frontier coefficients states","authors":"Nolan Alexander ,&nbsp;William Scherer ,&nbsp;Jamey Thompson","doi":"10.1016/j.jfds.2023.100110","DOIUrl":"10.1016/j.jfds.2023.100110","url":null,"abstract":"<div><p>We propose a novel asset allocation model using a Markov process of states defined by clustered efficient frontier coefficients. While most research in Markov models of the market characterize regimes using return and volatility, we instead propose characterizing these states using efficient frontiers, which provide more information on the interactions of underlying assets that comprise the market. Efficient frontiers can be decomposed to their functional form, a square-root second-order polynomial defined by three coefficients, to provide a dimensionality reduction of the return vector and covariance matrix. Each month, the proposed model hierarchically clusters the monthly coefficients data up to the current month, to characterize the market states, then defines a Markov process on the sequence of states. To incorporate these states into portfolio optimization, for each state, we calculate the tangency portfolio using only return data in that state. We then take the expectation of these weights for each state, weighted by the probability of transitioning from the current state to each state. To empirically validate our proposed model, we employ three sets of assets that span the market, and show that our proposed model significantly outperforms benchmark portfolios.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918823000260/pdfft?md5=11224414382844d224bb072b3978b6e7&pid=1-s2.0-S2405918823000260-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136152404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Topological tail dependence: Evidence from forecasting realized volatility 拓扑尾依赖:来自预测已实现波动的证据
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2023.100107
Hugo Gobato Souto

This paper proposes a novel theory, coined as Topological Tail Dependence Theory, that links the mathematical theory behind Persistent Homology (PH) and the financial stock market theory. This study also proposes a novel algorithm to measure topological stock market changes as well as the incorporation of these topological changes into forecasting realized volatility (RV) models to improve their forecast performance during turbulent periods. The results of the empirical experimentation of this study provide evidence that the predictions drawn from the Topological Tail Dependence Theory are correct and indicate that the employment of PH information allows nonlinear and neural network models to better forecast RV during a turbulent period.

本文提出了一种新的理论,称为拓扑尾依赖理论,它将持久同调(PH)背后的数学理论与金融股票市场理论联系起来。本文还提出了一种新的算法来衡量股票市场的拓扑变化,并将这些拓扑变化纳入预测已实现波动率(RV)模型,以提高其在动荡时期的预测性能。本研究的实证实验结果证明了拓扑尾依赖理论的预测是正确的,并表明PH信息的使用可以使非线性和神经网络模型更好地预测湍流时期的RV。
{"title":"Topological tail dependence: Evidence from forecasting realized volatility","authors":"Hugo Gobato Souto","doi":"10.1016/j.jfds.2023.100107","DOIUrl":"https://doi.org/10.1016/j.jfds.2023.100107","url":null,"abstract":"<div><p>This paper proposes a novel theory, coined as Topological Tail Dependence Theory, that links the mathematical theory behind Persistent Homology (PH) and the financial stock market theory. This study also proposes a novel algorithm to measure topological stock market changes as well as the incorporation of these topological changes into forecasting realized volatility (RV) models to improve their forecast performance during turbulent periods. The results of the empirical experimentation of this study provide evidence that the predictions drawn from the Topological Tail Dependence Theory are correct and indicate that the employment of PH information allows nonlinear and neural network models to better forecast RV during a turbulent period.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918823000235/pdfft?md5=72e1114c64fc1153368f76b24fe561aa&pid=1-s2.0-S2405918823000235-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91959207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A dynamic partial equilibrium model of capital gains taxation 资本利得税的动态局部均衡模型
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2023.100111
Stephen L. Lenkey, Timothy T. Simin

We analyze a multi-period model of capital gains taxation with endogenous prices. Relative to an economy without taxation, a capital gains tax tends to lower prices and increase returns. Abstracting from tax redistribution policies, we find that a taxable investor's welfare falls, a nontaxable investor's welfare rises, and, depending on the tax rate, social welfare may either rise or fall. The taxable investor's tax-timing option increases social welfare but may either increase or decrease tax revenue. Tax rebates for capital losses have little effect on welfare or tax revenue. Implications for empirical asset pricing are identified.

我们分析了一个具有内生价格的多期资本利得税模型。与不征税的经济相比,资本利得税倾向于降低价格和提高收益。在不考虑税收再分配政策的情况下,我们发现应税投资者的福利下降,非应税投资者的福利上升,而且根据税率的不同,社会福利可能上升或下降。应税投资者的纳税时间选择会增加社会福利,但可能会增加或减少税收。资本损失退税对福利或税收的影响很小。本文还指出了实证资产定价的意义。
{"title":"A dynamic partial equilibrium model of capital gains taxation","authors":"Stephen L. Lenkey,&nbsp;Timothy T. Simin","doi":"10.1016/j.jfds.2023.100111","DOIUrl":"https://doi.org/10.1016/j.jfds.2023.100111","url":null,"abstract":"<div><p>We analyze a multi-period model of capital gains taxation with endogenous prices. Relative to an economy without taxation, a capital gains tax tends to lower prices and increase returns. Abstracting from tax redistribution policies, we find that a taxable investor's welfare falls, a nontaxable investor's welfare rises, and, depending on the tax rate, social welfare may either rise or fall. The taxable investor's tax-timing option increases social welfare but may either increase or decrease tax revenue. Tax rebates for capital losses have little effect on welfare or tax revenue. Implications for empirical asset pricing are identified.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918823000272/pdfft?md5=f46276c217c32e30c56baa96e93c837f&pid=1-s2.0-S2405918823000272-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138769959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research frontiers of the Chinese financial markets 中国金融市场研究前沿
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2024.100116
Hao Zhou
{"title":"Research frontiers of the Chinese financial markets","authors":"Hao Zhou","doi":"10.1016/j.jfds.2024.100116","DOIUrl":"10.1016/j.jfds.2024.100116","url":null,"abstract":"","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918824000023/pdfft?md5=7983cd9c806eed13bf50220414314813&pid=1-s2.0-S2405918824000023-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139455617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning in classifying bitcoin addresses 分类比特币地址的机器学习
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2023.100109
Leonid Garin , Vladimir Gisin

The emergence of the Bitcoin cryptocurrency marked a new era of illegal transactions. Cryptocurrency provides some level of anonymity allowing its users to create an unlimited number of wallets with alias addresses, which makes it challenging to identify the actual user. This is used by criminals for the purpose of making illegal transactions. At the same time, Bitcoin stores and provides information about all committed transactions, which opens up opportunities for identifying suspicious behavior patterns in this network using data mining. The problem of detecting suspicious activity in the Bitcoin network can be solved with sufficiently high accuracy using machine learning methods. The paper provides a comparative study of various machine learning methods to solve the mentioned problem: logistic regression, decision tree, random forest, gradient boosting. Selecting hyper parameters, rebalancing the dataset, and active learning are particularly important. The most important hyperparameters of the algorithms are described. Metrics show that the gradient boosting looks the most promising. In total 38 features of bitcoin addresses were identified. The top features are presented in the paper.

比特币加密货币的出现标志着非法交易的新时代。加密货币提供了一定程度的匿名性,允许其用户使用别名地址创建无限数量的钱包,这使得识别实际用户变得具有挑战性。这是犯罪分子用来进行非法交易的工具。同时,比特币存储并提供有关所有已提交交易的信息,这为使用数据挖掘识别该网络中的可疑行为模式提供了机会。使用机器学习方法可以以足够高的精度解决比特币网络中可疑活动的检测问题。本文对解决上述问题的各种机器学习方法进行了比较研究:逻辑回归、决策树、随机森林、梯度增强。选择超参数、重新平衡数据集和主动学习尤为重要。描述了算法中最重要的超参数。指标显示梯度增强看起来最有希望。总共确定了比特币地址的38个特征。本文给出了该系统的主要特征。
{"title":"Machine learning in classifying bitcoin addresses","authors":"Leonid Garin ,&nbsp;Vladimir Gisin","doi":"10.1016/j.jfds.2023.100109","DOIUrl":"10.1016/j.jfds.2023.100109","url":null,"abstract":"<div><p>The emergence of the Bitcoin cryptocurrency marked a new era of illegal transactions. Cryptocurrency provides some level of anonymity allowing its users to create an unlimited number of wallets with alias addresses, which makes it challenging to identify the actual user. This is used by criminals for the purpose of making illegal transactions. At the same time, Bitcoin stores and provides information about all committed transactions, which opens up opportunities for identifying suspicious behavior patterns in this network using data mining. The problem of detecting suspicious activity in the Bitcoin network can be solved with sufficiently high accuracy using machine learning methods. The paper provides a comparative study of various machine learning methods to solve the mentioned problem: logistic regression, decision tree, random forest, gradient boosting. Selecting hyper parameters, rebalancing the dataset, and active learning are particularly important. The most important hyperparameters of the algorithms are described. Metrics show that the gradient boosting looks the most promising. In total 38 features of bitcoin addresses were identified. The top features are presented in the paper.</p></div>","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918823000259/pdfft?md5=dcd018899c4df905d0f9dda9a9ff1a7a&pid=1-s2.0-S2405918823000259-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136127683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The inaugural Journal of Finance and Data Science Conference was held successfully in Beijing 首届《金融与数据科学杂志》大会在北京成功举办
Q1 Mathematics Pub Date : 2023-11-01 DOI: 10.1016/j.jfds.2024.100119
{"title":"The inaugural Journal of Finance and Data Science Conference was held successfully in Beijing","authors":"","doi":"10.1016/j.jfds.2024.100119","DOIUrl":"https://doi.org/10.1016/j.jfds.2024.100119","url":null,"abstract":"","PeriodicalId":36340,"journal":{"name":"Journal of Finance and Data Science","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2405918824000047/pdfft?md5=4984821b801a870625893c3a19f9f174&pid=1-s2.0-S2405918824000047-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139737164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Finance and Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1