The Journal of Financial Data Science最新文献

英文中文

A Practitioner’s Guide to the Optimal Number of Clusters Algorithm 最佳聚类数算法的从业者指南

The Journal of Financial Data Science

Pub Date : 2023-07-21 DOI: 10.3905/jfds.2023.1.133

M. Andrews

Identifying profitable investment strategies has been a long-standing challenge for finance practitioners. The optimal number of clusters (ONC) algorithm is a reliable tool used to evaluate backtest results affected by multiple testing. The algorithm is necessary to calculate the deflated Sharpe ratio, a popular metric that detects potential false positive investment strategies. These methods are based on the familywise error rate approach, which provides stringent control over the overall error rate, reducing the likelihood of false discoveries and increasing the reliability of findings. The ONC algorithm’s time complexity, however, poses a significant challenge for practitioners. This study proposes a practical solution to reduce the number of clusters tested by the ONC algorithm while maintaining accuracy. Results from simulated datasets demonstrate that the proposed solution significantly reduces the algorithm’s runtime. Additionally, this study addresses the impact of outliers on the ONC algorithm, showing that they can lead to nonoptimal solutions, and provides a simple solution to mitigate their effects. These findings contribute to the literature on finance by enhancing the usability of the ONC algorithm.

确定有利可图的投资策略一直是金融从业者面临的一个长期挑战。最优聚类数(ONC)算法是评估多重测试影响下回测结果的可靠工具。该算法对于计算缩水夏普比率(deflated Sharpe ratio)是必要的，后者是一种检测潜在误报投资策略的流行指标。这些方法基于家庭错误率方法，该方法严格控制了总体错误率，减少了错误发现的可能性，提高了结果的可靠性。然而，ONC算法的时间复杂度给实践者带来了巨大的挑战。本研究提出了一种实用的解决方案，以减少ONC算法测试的聚类数量，同时保持准确性。模拟数据集的结果表明，该方法显著降低了算法的运行时间。此外，本研究解决了异常值对ONC算法的影响，表明它们可能导致非最优解，并提供了一个简单的解决方案来减轻其影响。这些发现通过提高ONC算法的可用性，为金融文献做出了贡献。

{"title":"A Practitioner’s Guide to the Optimal Number of Clusters Algorithm","authors":"M. Andrews","doi":"10.3905/jfds.2023.1.133","DOIUrl":"https://doi.org/10.3905/jfds.2023.1.133","url":null,"abstract":"Identifying profitable investment strategies has been a long-standing challenge for finance practitioners. The optimal number of clusters (ONC) algorithm is a reliable tool used to evaluate backtest results affected by multiple testing. The algorithm is necessary to calculate the deflated Sharpe ratio, a popular metric that detects potential false positive investment strategies. These methods are based on the familywise error rate approach, which provides stringent control over the overall error rate, reducing the likelihood of false discoveries and increasing the reliability of findings. The ONC algorithm’s time complexity, however, poses a significant challenge for practitioners. This study proposes a practical solution to reduce the number of clusters tested by the ONC algorithm while maintaining accuracy. Results from simulated datasets demonstrate that the proposed solution significantly reduces the algorithm’s runtime. Additionally, this study addresses the impact of outliers on the ONC algorithm, showing that they can lead to nonoptimal solutions, and provides a simple solution to mitigate their effects. These findings contribute to the literature on finance by enhancing the usability of the ONC algorithm.","PeriodicalId":199045,"journal":{"name":"The Journal of Financial Data Science","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128993038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Kernel Market Impact Analysis in China A-Share Markets 中国a股市场核心市场影响分析

The Journal of Financial Data Science

Pub Date : 2023-06-18 DOI: 10.3905/jfds.2023.1.131

Hongsong Chou, Jimin Han, Charles Huang, Danny D. Sun

With transaction-level market data for stocks in China A-share markets, the authors construct individual stocks’ kernel functions of market impact and analyze their statistical properties. Attribution analysis of such kernel functions is also performed to understand how market microstructure variables such as bid–ask spread and liquidity distribution in order books can be used to classify different groups of kernel functions. The authors’ analysis shows that stocks in China A-share markets exhibit clear patterns of market impact curves, which is likely due to specific market structure regulations such as constant tick size across different stocks and stock-specific order book dynamics resulting from market participants’ behaviors. The authors also explore the application of kernel functions in forecasting price movement in close-to-reality trading simulators that consider market impact costs at individual trade level.

利用中国a股交易级市场数据，构建了个股的市场影响核函数，并对其统计性质进行了分析。还对这些核函数进行归因分析，以了解如何使用市场微观结构变量(如买卖价差和订单簿中的流动性分布)对不同组的核函数进行分类。作者的分析表明，中国a股市场的股票表现出明显的市场影响曲线模式，这可能是由于特定的市场结构规则，如不同股票之间不变的滴答大小和市场参与者行为导致的特定股票的订单动态。作者还探讨了核函数在接近现实的交易模拟器中预测价格变动的应用，该模拟器考虑了个人交易水平的市场影响成本。

引用次数: 0

Spatio-Temporal Momentum: Jointly Learning Time-Series and Cross-Sectional Strategies 时空动量:时间序列与横截面策略的联合学习

The Journal of Financial Data Science

Pub Date : 2023-06-11 DOI: 10.3905/jfds.2023.1.130

Wee Ling Tan, Stephen Roberts, Stefan Zohren

The authors introduce spatio-temporal momentum strategies, a class of models that unify both time-series and cross-sectional momentum strategies by trading assets based on their cross-sectional momentum features over time. Although both time-series and cross-sectional momentum strategies are designed to systematically capture momentum risk premiums, these strategies are regarded as distinct implementations and do not consider the concurrent relationship and predictability between temporal and cross-sectional momentum features of different assets. They model spatio-temporal momentum with neural networks of varying complexities and demonstrate that a simple neural network with only a single fully connected layer learns to simultaneously generate trading signals for all assets in a portfolio by incorporating both their time-series and cross-sectional momentum features. Back testing on portfolios of 46 actively traded US equities and 12 equity index futures contracts, they demonstrate that the model is able to retain its performance over benchmarks in the presence of high transaction costs of up to 5–10 basis points. In particular, they find that the model when coupled with least absolute shrinkage and turnover regularization results in the best performance over various transaction cost scenarios.

作者介绍了时空动量策略，这是一类统一时间序列和横截面动量策略的模型，通过基于资产随时间的横截面动量特征进行交易。虽然时间序列动量策略和横截面动量策略都是为了系统地捕捉动量风险溢价而设计的，但这些策略被视为不同的实现，没有考虑不同资产的时间和横截面动量特征之间的并发关系和可预测性。他们用不同复杂性的神经网络模拟时空动量，并证明了一个只有一个完全连接层的简单神经网络，通过结合时间序列和横截面动量特征，学会同时为投资组合中的所有资产生成交易信号。他们对46只交易活跃的美国股票和12只股指期货合约的投资组合进行了回测，结果表明，在交易成本高达5-10个基点的情况下，该模型仍能保持相对于基准的表现。特别是，他们发现当模型与最小的绝对收缩和周转正则化相结合时，在各种交易成本场景下的性能最好。

{"title":"Spatio-Temporal Momentum: Jointly Learning Time-Series and Cross-Sectional Strategies","authors":"Wee Ling Tan, Stephen Roberts, Stefan Zohren","doi":"10.3905/jfds.2023.1.130","DOIUrl":"https://doi.org/10.3905/jfds.2023.1.130","url":null,"abstract":"The authors introduce spatio-temporal momentum strategies, a class of models that unify both time-series and cross-sectional momentum strategies by trading assets based on their cross-sectional momentum features over time. Although both time-series and cross-sectional momentum strategies are designed to systematically capture momentum risk premiums, these strategies are regarded as distinct implementations and do not consider the concurrent relationship and predictability between temporal and cross-sectional momentum features of different assets. They model spatio-temporal momentum with neural networks of varying complexities and demonstrate that a simple neural network with only a single fully connected layer learns to simultaneously generate trading signals for all assets in a portfolio by incorporating both their time-series and cross-sectional momentum features. Back testing on portfolios of 46 actively traded US equities and 12 equity index futures contracts, they demonstrate that the model is able to retain its performance over benchmarks in the presence of high transaction costs of up to 5–10 basis points. In particular, they find that the model when coupled with least absolute shrinkage and turnover regularization results in the best performance over various transaction cost scenarios.","PeriodicalId":199045,"journal":{"name":"The Journal of Financial Data Science","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136370719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

The Impact of Technology, Big Data, and Analytics: The Evolving Data-Driven Model of Innovation in the Finance Industry 科技、大数据和分析的影响:金融行业数据驱动创新模式的演变

The Journal of Financial Data Science

Pub Date : 2023-06-10 DOI: 10.3905/jfds.2023.1.129

R. Malhotra, D. Malhotra

The rise of digital technologies in the 21st century has brought about a profound transformation in the way people live their lives, and this shift is having a significant impact on the global economy. As a result, businesses in all sectors are being forced to reevaluate their operations and decision-making processes, including the financial sector. This study investigates the different factors that are compelling financial institutions to rethink their approaches to doing business. The study also presents the emerging data-centric and analytics-driven business model of the finance sector, which is necessary to adapt, survive, and compete in today’s dynamic and highly digitized global market.

21世纪数字技术的兴起，深刻改变了人们的生活方式，并对全球经济产生了重大影响。因此，所有部门的企业都被迫重新评估其运营和决策过程，包括金融部门。本研究调查了迫使金融机构重新思考其经营方式的不同因素。该研究还介绍了新兴的以数据为中心和分析驱动的金融行业商业模式，这是在当今充满活力和高度数字化的全球市场中适应、生存和竞争所必需的。

引用次数: 0

Using Machine Learning to Forecast Market Direction with Efficient Frontier Coefficients 利用机器学习有效前沿系数预测市场方向

The Journal of Financial Data Science

Pub Date : 2023-06-09 DOI: 10.3905/jfds.2023.1.128

Nolan Alexander, William Scherer

The authors propose a novel method to improve estimation of asset returns for portfolio optimization. This approach first performs a monthly directional market forecast using an online decision tree. The decision tree is trained on a novel set of features engineered from portfolio theory: the efficient frontier functional coefficients. Efficient frontiers can be decomposed to their functional form, a square-root second-order polynomial, and the coefficients of this function capture the information of all the constituents that compose the market in the current time period. To make these forecasts actionable, these directional forecasts are integrated to a portfolio optimization framework using expected returns conditional on the market forecast as an estimate for the return vector. This conditional expectation is calculated using the inverse Mills ratio, and the capital asset pricing model is used to translate the market forecast to individual asset forecasts. This novel method outperforms baseline portfolios, as well as other feature sets including technical indicators and the Fama–French factors. To empirically validate the proposed model, the authors employ a set of market sector exchange-traded funds.

提出了一种改进投资组合优化中资产收益估计的新方法。该方法首先使用在线决策树进行月度定向市场预测。决策树是根据组合理论设计的一组新特征进行训练的:有效前沿泛函系数。有效边界可以分解为它们的函数形式，即一个平方根二阶多项式，该函数的系数捕获了当前时间段内构成市场的所有成分的信息。为了使这些预测可行，这些方向预测被整合到一个投资组合优化框架中，使用市场预测的预期回报作为回报向量的估计。这一条件预期是使用Mills逆比计算的，资本资产定价模型用于将市场预测转化为个人资产预测。这种新颖的方法优于基线投资组合，以及其他特征集，包括技术指标和Fama-French因素。为了从经验上验证所提出的模型，作者采用了一组市场部门的交易所交易基金。

{"title":"Using Machine Learning to Forecast Market Direction with Efficient Frontier Coefficients","authors":"Nolan Alexander, William Scherer","doi":"10.3905/jfds.2023.1.128","DOIUrl":"https://doi.org/10.3905/jfds.2023.1.128","url":null,"abstract":"The authors propose a novel method to improve estimation of asset returns for portfolio optimization. This approach first performs a monthly directional market forecast using an online decision tree. The decision tree is trained on a novel set of features engineered from portfolio theory: the efficient frontier functional coefficients. Efficient frontiers can be decomposed to their functional form, a square-root second-order polynomial, and the coefficients of this function capture the information of all the constituents that compose the market in the current time period. To make these forecasts actionable, these directional forecasts are integrated to a portfolio optimization framework using expected returns conditional on the market forecast as an estimate for the return vector. This conditional expectation is calculated using the inverse Mills ratio, and the capital asset pricing model is used to translate the market forecast to individual asset forecasts. This novel method outperforms baseline portfolios, as well as other feature sets including technical indicators and the Fama–French factors. To empirically validate the proposed model, the authors employ a set of market sector exchange-traded funds.","PeriodicalId":199045,"journal":{"name":"The Journal of Financial Data Science","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126380776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Using the Graphics Processing Unit to Evaluate American-Style Derivatives 使用图形处理单元来评估美式衍生品

The Journal of Financial Data Science

Pub Date : 2023-06-08 DOI: 10.3905/jfds.2023.1.127

Leon Xing Li, Ren‐Raw Chen

In this article, the authors apply graphics processing unit (GPU) computation to an American option pricing problem via Monte Carlo (MC) simulations and particle swarm optimization (PSO). Given that computations in both MC and PSO can be vectorized and made independent, the valuation can be readily performed on GPUs. As a result, we can increase the accuracy of the valuation by increasing MC paths and particles without spending more time. For example, with a large number of particles (but allocated to GPUs), convergence can be reached in very few steps. The method introduced in this article can be extended to a wide variety of exotic derivatives or a large portfolio of diverse derivatives (known as an eigen portfolio). This is helpful in both trading and risk management.

本文通过蒙特卡罗(MC)模拟和粒子群优化(PSO)方法，将图形处理单元(GPU)计算应用于一个美式期权定价问题。由于MC和PSO的计算都可以矢量化并独立进行，因此可以很容易地在gpu上进行估值。因此，我们可以通过增加MC路径和粒子来提高估值的准确性，而无需花费更多的时间。例如，使用大量的粒子(但分配给gpu)，收敛可以在很少的步骤中达到。本文介绍的方法可以扩展到各种各样的外来衍生品或各种衍生品的大型投资组合(称为特征投资组合)。这对交易和风险管理都很有帮助。

引用次数: 0

The Virtue and Vice of Complexity in Equity Risk Premium Prediction 股票风险溢价预测中复杂性的利弊

The Journal of Financial Data Science

Pub Date : 2023-06-05 DOI: 10.3905/jfds.2023.1.126

Brian Jacobsen

When forecasting the equity risk premium, simple techniques generate results that are easier to interpret than results from more complex techniques. If complex techniques have better performance, does the virtue of superior performance trump the vice of lack of interpretability? This presumes simpler techniques underperform. Complex does not equate to superior performance. Old and simple techniques like discriminant analysis combine the virtue of performance with the virtue of intelligibility. This article performs a horse race among stepwise quadratic discriminant analysis, classification trees, regression trees, and ridgeless regression. Sometimes, accuracy can be sacrificed in favor of better out-of-sample Sharpe ratios. This article also shows that preprocessing data using rolling percentage ranks can be better than using either an expanding window or Z-scores.

当预测股票风险溢价时，简单的技术产生的结果比更复杂的技术产生的结果更容易解释。如果复杂的技术具有更好的性能，那么性能优越的优点是否胜过缺乏可解释性的缺点?这假定更简单的技术表现不佳。复杂并不等同于卓越的表现。像判别分析这样古老而简单的技术将性能的优点与可理解性的优点结合起来。本文在逐步二次判别分析、分类树、回归树和无脊回归之间进行了一场竞赛。有时，为了获得更好的样本外夏普比率，可以牺牲精度。本文还表明，使用滚动百分比排名预处理数据可能比使用扩展窗口或z分数更好。

引用次数: 0

Managing Editor’s Letter 总编辑的信

The Journal of Financial Data Science

Pub Date : 2023-04-30 DOI: 10.3905/jfds.2023.5.2.001

F. Fabozzi

引用次数: 0

No-Transaction Band Network: A Neural Network Architecture for Efficient Deep Hedging 无交易频带网络:一种高效深度对冲的神经网络结构

The Journal of Financial Data Science

Pub Date : 2023-04-25 DOI: 10.3905/jfds.2023.1.125

Shota Imaki, Kentaro Imajo, Katsuya Ito, Kentaro Minami, Kei Nakagawa

Deep hedging is a versatile framework for computing the optimal hedging strategy of derivatives in incomplete markets. However, it is subject to the action-dependence problem impeding efficient training because the appropriate hedging action at the next step depends on the current action. To overcome this issue, the authors leverage a no-transaction band strategy, an existing technique that provides optimal hedging strategies for European options and exponential utility. The authors theoretically argue this strategy to be optimal for a wider class of utilities and derivatives, including exotics. Based on the result, the authors propose a no-transaction band network, namely, a neural network architecture that facilitates fast training and precise evaluation of the optimal hedging strategy. Moreover, the authors experimentally demonstrate that, for European and lookback options, their architecture rapidly attains a better hedging strategy compared with a standard feed-forward network. The findings thus have important implications for the practical applications of deep hedging.

深度套期保值是计算不完全市场中衍生品最优套期保值策略的通用框架。然而，由于下一步适当的对冲行动依赖于当前的行动，因此它受制于行动依赖问题，阻碍了有效的训练。为了克服这个问题，作者利用无交易波段策略，这是一种现有的技术，为欧洲期权和指数效用提供了最优对冲策略。从理论上讲，作者认为这种策略对更广泛的效用和衍生品(包括外来产品)是最优的。在此基础上，作者提出了一种无交易频带网络，即一种便于快速训练和精确评估最优对冲策略的神经网络架构。此外，作者通过实验证明，对于欧洲期权和回溯期权，与标准前馈网络相比，他们的体系结构迅速获得了更好的对冲策略。因此，研究结果对深度套期保值的实际应用具有重要意义。

引用次数: 6

A Review on Derivative Hedging Using Reinforcement Learning 基于强化学习的衍生品套期保值研究综述

The Journal of Financial Data Science

Pub Date : 2023-03-14 DOI: 10.3905/jfds.2023.1.124

Peng Liu

Hedging is a common trading activity to manage the risk of engaging in transactions that involve derivatives such as options. Perfect and timely hedging, however, is an impossible task in the real market that characterizes discrete-time transactions with costs. Recent years have witnessed reinforcement learning (RL) in formulating optimal hedging strategies. Specifically, different RL algorithms have been applied to learn the optimal offsetting position based on market conditions, offering an automatic risk management solution that proposes optimal hedging strategies while catering to both market dynamics and restrictions. In this article, the author provides a comprehensive review of the use of RL techniques in hedging derivatives. In addition to highlighting the main streams of research, the author provides potential research directions on this exciting and emerging field.

套期保值是一种常见的交易活动，用于管理涉及期权等衍生品交易的风险。然而，在具有成本的离散时间交易的真实市场中，完美和及时的对冲是一项不可能完成的任务。近年来，强化学习(RL)在制定最优对冲策略方面得到了广泛应用。具体而言，不同的强化学习算法已被应用于根据市场条件学习最佳对冲头寸，提供自动风险管理解决方案，在满足市场动态和限制的同时提出最佳对冲策略。在这篇文章中，作者提供了在套期保值衍生品中使用RL技术的全面回顾。在强调研究主流的同时，作者还提出了这一令人兴奋的新兴领域的潜在研究方向。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

The Journal of Financial Data Science

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀