首页 > 最新文献

The Journal of Derivatives最新文献

英文 中文
Cover’s Rebalancing Option with Discrete Hindsight Optimization Cover基于离散后见之明优化的再平衡选择
Pub Date : 2021-04-20 DOI: 10.3905/jod.2021.1.135
Alex Garivaltis
The author studies T. Cover’s rebalancing option (Ordentlich and Cover 1998) under discrete hindsight optimization in continuous time. The payoff in question is equal to the final wealth that would have accrued to an initial deposit of 1 unit of the numéraire into the best of some finite set of (perhaps levered) rebalancing rules determined in hindsight. A rebalancing rule (or fixed-fraction betting scheme) amounts to fixing an asset allocation (i.e., 200% equities and −100% bonds) and then continuously executing rebalancing trades so as to counteract allocation drift. Restricting the hindsight optimization to a small number of rebalancing rules (i.e., 2) has some advantages over the pioneering approach taken by Cover & Company in their theory of universal portfolios (1986, 1991, 1996, 1998), wherein one’s trading performance is benchmarked relative to the final wealth of the best unlevered rebalancing rule (of any kind) in hindsight. Our approach lets practitioners express an a priori view that one of the favored asset allocations (“bets”) in the set {b1, …, bn} will turn out to have performed spectacularly well in hindsight. In limiting our robustness to some discrete set of asset allocations (rather than all possible asset allocations), we reduce the price of the rebalancing option and guarantee that we will achieve a correspondingly higher percentage of the hindsight-optimized wealth at the end of the planning period. A practitioner who lives to delta-hedge this variant of Cover’s rebalancing option through several decades is guaranteed to see the day that his realized compound-annual capital growth rate is very close to that of the best bi in hindsight, hence the point of the rock-bottom option price.
作者研究了连续时间离散后见之明优化下T. Cover的再平衡选择(Ordentlich and Cover 1998)。所讨论的收益等于最初存入1个单位的numsamraire的最终财富,这些财富将累积到一些有限的(可能是杠杆的)后见之明确定的再平衡规则的最佳集合中。再平衡规则(或固定分数投注方案)相当于固定资产配置(即,200%的股票和- 100%的债券),然后不断执行再平衡交易,以抵消分配漂移。将后见之明优化限制在少数再平衡规则(即2)中,比Cover & Company在其通用投资组合理论(1986、1991、1996、1998)中采用的开创性方法有一些优势,其中一个人的交易绩效是相对于后见之明的最佳无杠杆再平衡规则(任何类型)的最终财富进行基准测试的。我们的方法让从业者表达一种先验的观点,即集合{b1,…,bn}中最受青睐的资产配置(“赌注”)之一在事后会表现得非常好。通过将我们的稳健性限制在一些离散的资产配置(而不是所有可能的资产配置)上,我们降低了再平衡选项的价格,并保证在规划期结束时,我们将获得相应更高比例的后见之明优化财富。一个实践者在几十年的时间里通过delta对冲来实现Cover的这种再平衡期权的变体,他肯定会看到有一天,他的已实现的复合年资本增长率非常接近后见之明的最佳水平,因此期权价格才会跌至谷底。
{"title":"Cover’s Rebalancing Option with Discrete Hindsight Optimization","authors":"Alex Garivaltis","doi":"10.3905/jod.2021.1.135","DOIUrl":"https://doi.org/10.3905/jod.2021.1.135","url":null,"abstract":"The author studies T. Cover’s rebalancing option (Ordentlich and Cover 1998) under discrete hindsight optimization in continuous time. The payoff in question is equal to the final wealth that would have accrued to an initial deposit of 1 unit of the numéraire into the best of some finite set of (perhaps levered) rebalancing rules determined in hindsight. A rebalancing rule (or fixed-fraction betting scheme) amounts to fixing an asset allocation (i.e., 200% equities and −100% bonds) and then continuously executing rebalancing trades so as to counteract allocation drift. Restricting the hindsight optimization to a small number of rebalancing rules (i.e., 2) has some advantages over the pioneering approach taken by Cover & Company in their theory of universal portfolios (1986, 1991, 1996, 1998), wherein one’s trading performance is benchmarked relative to the final wealth of the best unlevered rebalancing rule (of any kind) in hindsight. Our approach lets practitioners express an a priori view that one of the favored asset allocations (“bets”) in the set {b1, …, bn} will turn out to have performed spectacularly well in hindsight. In limiting our robustness to some discrete set of asset allocations (rather than all possible asset allocations), we reduce the price of the rebalancing option and guarantee that we will achieve a correspondingly higher percentage of the hindsight-optimized wealth at the end of the planning period. A practitioner who lives to delta-hedge this variant of Cover’s rebalancing option through several decades is guaranteed to see the day that his realized compound-annual capital growth rate is very close to that of the best bi in hindsight, hence the point of the rock-bottom option price.","PeriodicalId":501089,"journal":{"name":"The Journal of Derivatives","volume":"48 5","pages":"8-29"},"PeriodicalIF":0.0,"publicationDate":"2021-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pricing and Hedging Options on Assets with Options on Related Assets 用相关资产期权定价和对冲资产期权
Pub Date : 2021-04-08 DOI: 10.3905/jod.2021.1.132
Dilip B. Madan,King Wang
The question addressed is the pricing of options on the CBOE Skew Index. The option pricing theory developed partially hedges risk by taking positions in the market for options on a related asset. The option is then priced at the cost of this hedge. The theory is applied to pricing Volatility Index (VIX) options hedged by the SPDR S&P 500 ETF Trust (SPY) options and pricing options on JPMorgan hedged by Financial Select Sector SPDR (XLF) options. The approach is then applied to illustrate the pricing of CBOE Skew Index options with a hedge in the market for SPY options. The Skew Index smile is then seen to imply the VIX and SKEW of the Skew Index itself. The pricing of VIX options with SPY as the related asset has the Gaussian copula underpricing options while the t-copula significantly overprices them. The multivariate bilateral gamma models are closer to market. The premia of cross-asset hedge prices over the market price are observed to fall with moneyness and maturity and rise with the level of the VIX. TOPICS:Derivatives, options, exchange-traded funds and applications, quantitative methods, statistical methods, performance measurement Key Findings ▪ Time series data on physical returns may be used to obtain market relevant option prices provided market-relevant hedging costs are incorporated. ▪ Options on the CBOE Skew Index are priced at the cost of an SPY option hedge portfolio. ▪ Residual risk pricing technologies may be applied more widely with market calibrated parameters if desired.
问题是芝加哥期权交易所(CBOE)扭曲指数的期权定价。期权定价理论通过在相关资产的期权市场上建立头寸来部分对冲风险。然后,期权按照对冲的成本定价。该理论应用于由SPDR标准普尔500指数ETF信托(SPY)期权对冲的波动率指数(VIX)定价期权,以及由金融精选板块SPDR (XLF)期权对冲的摩根大通定价期权。然后应用该方法来说明CBOE倾斜指数期权在SPY期权市场上的对冲定价。扭曲指数的微笑暗示了波动率指数和扭曲指数本身。以SPY为相关资产的VIX期权定价存在高斯关联关系,低估期权,而t关联关系显著高估期权。多元双边伽马模型更接近市场。跨资产对冲价格相对于市场价格的溢价随货币规模和期限而下降,随波动率指数水平而上升。主题:衍生品,期权,交易所交易基金和应用,定量方法,统计方法,绩效测量关键发现▪实物回报的时间序列数据可以用来获得市场相关的期权价格,前提是纳入了市场相关的对冲成本。▪芝加哥期权交易所(CBOE)倾斜指数期权的定价是以SPY期权对冲投资组合的成本为基础的。剩余风险定价技术可根据需要更广泛地应用于市场校准参数。
{"title":"Pricing and Hedging Options on Assets with Options on Related Assets","authors":"Dilip B. Madan,King Wang","doi":"10.3905/jod.2021.1.132","DOIUrl":"https://doi.org/10.3905/jod.2021.1.132","url":null,"abstract":"The question addressed is the pricing of options on the CBOE Skew Index. The option pricing theory developed partially hedges risk by taking positions in the market for options on a related asset. The option is then priced at the cost of this hedge. The theory is applied to pricing Volatility Index (VIX) options hedged by the SPDR S&P 500 ETF Trust (SPY) options and pricing options on JPMorgan hedged by Financial Select Sector SPDR (XLF) options. The approach is then applied to illustrate the pricing of CBOE Skew Index options with a hedge in the market for SPY options. The Skew Index smile is then seen to imply the VIX and SKEW of the Skew Index itself. The pricing of VIX options with SPY as the related asset has the Gaussian copula underpricing options while the t-copula significantly overprices them. The multivariate bilateral gamma models are closer to market. The premia of cross-asset hedge prices over the market price are observed to fall with moneyness and maturity and rise with the level of the VIX. TOPICS:Derivatives, options, exchange-traded funds and applications, quantitative methods, statistical methods, performance measurement Key Findings ▪ Time series data on physical returns may be used to obtain market relevant option prices provided market-relevant hedging costs are incorporated. ▪ Options on the CBOE Skew Index are priced at the cost of an SPY option hedge portfolio. ▪ Residual risk pricing technologies may be applied more widely with market calibrated parameters if desired.","PeriodicalId":501089,"journal":{"name":"The Journal of Derivatives","volume":"49 s18","pages":"27-47"},"PeriodicalIF":0.0,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Closed-Form Model-Free Implied Volatility Formula through Delta Families 基于Delta族的闭式无模型隐含波动率公式
Pub Date : 2020-12-26 DOI: 10.3905/jod.2020.1.127
Zhenyu Cui,Justin Kirkby,Duy Nguyen,Stephen Taylor
In this article, we derive a closed-form explicit model-free formula for the (Black-Scholes) implied volatility. The method is based on the novel use of the Dirac Delta function, corresponding delta families, and the change of variable technique. The formula is expressed through either a limit or as an infinite series of elementary functions, and we establish that the proposed formula converges to the true implied volatility value. In numerical experiments, we verify the convergence of the formula, and consider several benchmark cases, for which the data-generating processes are respectively the stochastic volatility inspired model, and the stochastic alpha beta rho model. We also establish an explicit formula for the implied volatility expressed directly in terms of respective model parameters, and use the Heston model to illustrate this idea. The delta family and change of variable technique that we develop are of independent interest and can be used to solve inverse problems arising in other applications. TOPIC: Derivatives Key Findings ▪ A novel closed-form representation of the Black-Scholes implied volatility is developed by utilizing a delta-family technique. ▪ Convergence and error analyses of approximate forms of this representations are presented. ▪ This technique is applied to the parametric SVI and SABR models as well as the stochastic volatility Heston model.
在本文中,我们推导了(Black-Scholes)隐含波动率的一个封闭形式的显式无模型公式。该方法是基于狄拉克函数、相应的函数族和变量变换技术的新颖应用。该公式通过极限或无限初等函数级数表示,并证明了该公式收敛于真实隐含波动率值。在数值实验中验证了公式的收敛性,并考虑了几种基准情况,其中数据生成过程分别是随机波动激励模型和随机α - β - rho模型。我们还建立了直接用各自模型参数表示的隐含波动率的显式公式,并使用Heston模型来说明这一思想。我们开发的delta族和变量变换技术具有独立的意义,可用于解决其他应用中出现的逆问题。主题:衍生工具主要发现▪利用delta族技术开发了一种新的Black-Scholes隐含波动率的封闭形式表示。给出了这种表示的近似形式的收敛性和误差分析。▪该技术可应用于参数SVI和SABR模型以及随机波动Heston模型。
{"title":"A Closed-Form Model-Free Implied Volatility Formula through Delta Families","authors":"Zhenyu Cui,Justin Kirkby,Duy Nguyen,Stephen Taylor","doi":"10.3905/jod.2020.1.127","DOIUrl":"https://doi.org/10.3905/jod.2020.1.127","url":null,"abstract":"In this article, we derive a closed-form explicit model-free formula for the (Black-Scholes) implied volatility. The method is based on the novel use of the Dirac Delta function, corresponding delta families, and the change of variable technique. The formula is expressed through either a limit or as an infinite series of elementary functions, and we establish that the proposed formula converges to the true implied volatility value. In numerical experiments, we verify the convergence of the formula, and consider several benchmark cases, for which the data-generating processes are respectively the stochastic volatility inspired model, and the stochastic alpha beta rho model. We also establish an explicit formula for the implied volatility expressed directly in terms of respective model parameters, and use the Heston model to illustrate this idea. The delta family and change of variable technique that we develop are of independent interest and can be used to solve inverse problems arising in other applications. TOPIC: Derivatives Key Findings ▪ A novel closed-form representation of the Black-Scholes implied volatility is developed by utilizing a delta-family technique. ▪ Convergence and error analyses of approximate forms of this representations are presented. ▪ This technique is applied to the parametric SVI and SABR models as well as the stochastic volatility Heston model.","PeriodicalId":501089,"journal":{"name":"The Journal of Derivatives","volume":"77 1","pages":"111-127"},"PeriodicalIF":0.0,"publicationDate":"2020-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138516946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal Volatility Dependent Derivatives in the Stochastic Volatility Model 随机波动率模型中的最优波动率相关导数
Pub Date : 2020-11-20 DOI: 10.3905/jod.2020.1.122
Artem Dyachenko,Marc Oliver Rieger
We consider derivatives that maximize an investor’s expected utility in the stochastic volatility model. We show that the optimal derivative that depends on the stock and its variance significantly outperforms the optimal derivative that depends on the stock only. Such derivatives yield a much higher certainty equivalent return. This result implies that investors could benefit from structured financial products constructed along these ideas. TOPICS: Derivatives, fixed income and structured finance Key Findings ▪ A derivative is optimal if it maximizes an investor’s expected utility. In the stochastic volatility model, the optimal buy-and-hold derivative with the payoff that depends on the stock price and its volatility incorporates both the market risk premium and the variance risk premium. ▪ The optimal buy-and-hold derivative with the payoff that depends on the stock price and its volatility usually outperforms significantly both the optimal buy-and-hold derivative with the payoff that depends on the stock price only and the optimal buy-and-hold portfolio made up of the stock and the risk-free bond. ▪ Investors could benefit from derivatives with payoffs that depend on the stock price and its volatility.
我们考虑在随机波动模型中使投资者期望效用最大化的衍生品。我们表明,依赖于股票及其方差的最优导数明显优于仅依赖于股票的最优导数。这类衍生品产生的确定性等效回报要高得多。这一结果表明,投资者可以从按照这些理念构建的结构性金融产品中获益。主题:衍生工具,固定收益和结构性金融主要发现▪如果衍生工具能使投资者的预期效用最大化,那么它就是最优的。在随机波动率模型中,收益取决于股票价格及其波动率的最优买入持有衍生品同时包含市场风险溢价和方差风险溢价。▪收益取决于股票价格及其波动性的最佳买入并持有衍生工具,通常明显优于收益仅取决于股票价格的最佳买入并持有衍生工具,以及由股票和无风险债券组成的最佳买入并持有投资组合。▪投资者可以从衍生品中获益,衍生品的收益取决于股价及其波动性。
{"title":"Optimal Volatility Dependent Derivatives in the Stochastic Volatility Model","authors":"Artem Dyachenko,Marc Oliver Rieger","doi":"10.3905/jod.2020.1.122","DOIUrl":"https://doi.org/10.3905/jod.2020.1.122","url":null,"abstract":"We consider derivatives that maximize an investor’s expected utility in the stochastic volatility model. We show that the optimal derivative that depends on the stock and its variance significantly outperforms the optimal derivative that depends on the stock only. Such derivatives yield a much higher certainty equivalent return. This result implies that investors could benefit from structured financial products constructed along these ideas. TOPICS: Derivatives, fixed income and structured finance Key Findings ▪ A derivative is optimal if it maximizes an investor’s expected utility. In the stochastic volatility model, the optimal buy-and-hold derivative with the payoff that depends on the stock price and its volatility incorporates both the market risk premium and the variance risk premium. ▪ The optimal buy-and-hold derivative with the payoff that depends on the stock price and its volatility usually outperforms significantly both the optimal buy-and-hold derivative with the payoff that depends on the stock price only and the optimal buy-and-hold portfolio made up of the stock and the risk-free bond. ▪ Investors could benefit from derivatives with payoffs that depend on the stock price and its volatility.","PeriodicalId":501089,"journal":{"name":"The Journal of Derivatives","volume":"49 3","pages":"24-44"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Premium Reduction of European, American, and Perpetual Log Return Options 欧洲、美国和永久对数回报期权的减溢价
Pub Date : 2020-11-04 DOI: 10.3905/jod.2020.1.115
Stephen Taylor,Jan Vecer
Traditional plain vanilla options may be regarded as contingent claims whose value depends upon the simple returns of an underlying asset. These options have convex payoffs, and as a consequence of Jensen’s inequality, their prices increase as a function of maturity in the absence of interest rates. This results in long-dated call option premia being excessively expensive in relation to the fraction of a corresponding insured portfolio. We show that replacing the simple return payoff with the log return call option payoff leads to substantial premium savings while providing the similar insurance protection. Call options on log returns have favorable prices for very long maturities on the scale of decades. This property enables them to be attractive securities for long-term investors, such as pension funds. TOPICS: Options, pension funds Key Findings ▪ This article develops valuation and risk techniques for a log return payoff option under a Geometric Brownian Motion. ▪ A comparison is made between premium advantages of the log return contract to those of traditional European options. ▪ A pricing and optimal excise boundary formula for perpetual and finite maturity American log return options id derived. ▪ This article examines long-term insurance applications of the new contract that are prohibitively expensive for traditional options.
传统的普通期权可被视为或有债权,其价值取决于标的资产的简单回报。这些期权具有凸收益,并且由于詹森不等式,在没有利率的情况下,它们的价格作为期限的函数而增加。这导致长期看涨期权溢价相对于相应保险投资组合的部分过于昂贵。我们表明,用日志回报看涨期权支付代替简单的回报支付导致大量的保费节省,同时提供类似的保险保护。对数回报的看涨期权对于很长的期限(以几十年为单位)具有有利的价格。这种特性使它们成为对长期投资者(如养老基金)有吸引力的证券。本文发展了几何布朗运动下对数回报支付期权的估值和风险技术。▪比较了日志回报合同与传统欧洲期权的溢价优势。▪推导了永久和有限期限美国对数回报期权的定价和最优附加边界公式。▪本文研究了新合同的长期保险应用,这些应用对于传统期权来说过于昂贵。
{"title":"The Premium Reduction of European, American, and Perpetual Log Return Options","authors":"Stephen Taylor,Jan Vecer","doi":"10.3905/jod.2020.1.115","DOIUrl":"https://doi.org/10.3905/jod.2020.1.115","url":null,"abstract":"Traditional plain vanilla options may be regarded as contingent claims whose value depends upon the simple returns of an underlying asset. These options have convex payoffs, and as a consequence of Jensen’s inequality, their prices increase as a function of maturity in the absence of interest rates. This results in long-dated call option premia being excessively expensive in relation to the fraction of a corresponding insured portfolio. We show that replacing the simple return payoff with the log return call option payoff leads to substantial premium savings while providing the similar insurance protection. Call options on log returns have favorable prices for very long maturities on the scale of decades. This property enables them to be attractive securities for long-term investors, such as pension funds. TOPICS: Options, pension funds Key Findings ▪ This article develops valuation and risk techniques for a log return payoff option under a Geometric Brownian Motion. ▪ A comparison is made between premium advantages of the log return contract to those of traditional European options. ▪ A pricing and optimal excise boundary formula for perpetual and finite maturity American log return options id derived. ▪ This article examines long-term insurance applications of the new contract that are prohibitively expensive for traditional options.","PeriodicalId":501089,"journal":{"name":"The Journal of Derivatives","volume":"48 8","pages":"7-23"},"PeriodicalIF":0.0,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds QLBS: Black-Scholes(-Merton)世界中的Q-Learner
Pub Date : 2020-04-25 DOI: 10.3905/jod.2020.1.108
Igor Halperin
This article presents a discrete-time option pricing model that is rooted in reinforcement learning (RL), and more specifically in the famous Q-Learning method of RL. We construct a risk-adjusted Markov Decision Process for a discrete-time version of the classical Black-Scholes-Merton (BSM) model, where the option price is an optimal Q-function, while the optimal hedge is a second argument of this optimal Q-function, so that both the price and hedge are parts of the same formula. Pricing is done by learning to dynamically optimize risk-adjusted returns for an option replicating portfolio, as in Markowitz portfolio theory. Using Q-Learning and related methods, once created in a parametric setting, the model can go model-free and learn to price and hedge an option directly from data, without an explicit model of the world. This suggests that RL may provide efficient data-driven and model-free methods for the optimal pricing and hedging of options. Once we depart from the academic continuous-time limit, and vice versa, option pricing methods developed in Mathematical Finance may be viewed as special cases of model-based reinforcement learning. Further, due to the simplicity and tractability of our model, which only needs basic linear algebra (plus Monte Carlo simulation, if we work with synthetic data), and its close relationship to the original BSM model, we suggest that our model could be used in the benchmarking of different RL algorithms for financial trading applications. TOPICS: Derivatives, options Key Findings • Reinforcement learning (RL) is the most natural way for pricing and hedging of options that relies directly on data and not on a specific model of asset pricing. • The discrete-time RL approach to option pricing generalizes classical continuous-time methods; enables tracking mis-hedging risk, which disappears in the formal continuous-time limit; and provides a consistent framework for using options for both hedging and speculation. • A simple quadratic reward function, which presents a minimal extension of the classical Black-Scholes framework when combined with the Q-learning method of RL, gives rise to a particularly simple computational scheme where option pricing and hedging are semianalytical, as they amount to multiple uses of a conventional least-squares regression.
本文提出了一个基于强化学习(RL)的离散时间期权定价模型,更具体地说,是基于著名的强化学习(RL)的Q-Learning方法。我们为经典的Black-Scholes-Merton (BSM)模型的离散时间版本构建了一个风险调整的马尔可夫决策过程,其中期权价格是最优q函数,而最优套期保值是最优q函数的第二个参数,因此价格和套期保值都是同一个公式的一部分。定价是通过学习动态优化一个期权复制投资组合的风险调整收益来完成的,正如马科维茨投资组合理论。使用Q-Learning和相关方法,一旦在参数设置中创建,模型就可以脱离模型,学习直接从数据中定价和对冲期权,而无需明确的世界模型。这表明RL可以为期权的最优定价和对冲提供有效的数据驱动和无模型方法。一旦我们离开了学术上的连续时间限制,反之亦然,数学金融中开发的期权定价方法可能被视为基于模型的强化学习的特殊情况。此外,由于我们的模型的简单性和可追溯性,它只需要基本的线性代数(加上蒙特卡罗模拟,如果我们使用合成数据),以及它与原始BSM模型的密切关系,我们建议我们的模型可以用于金融交易应用程序中不同强化学习算法的基准测试。•强化学习(RL)是期权定价和对冲最自然的方法,它直接依赖于数据,而不是特定的资产定价模型。•离散时间RL期权定价方法是经典连续时间期权定价方法的推广;可跟踪误对冲风险,该风险在正式的连续时间限制中消失;并为使用期权进行对冲和投机提供了一致的框架。•一个简单的二次奖励函数,当与RL的Q-learning方法相结合时,它呈现了经典Black-Scholes框架的最小扩展,产生了一个特别简单的计算方案,其中期权定价和套期保值是半分析的,因为它们相当于传统最小二乘回归的多次使用。
{"title":"QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds","authors":"Igor Halperin","doi":"10.3905/jod.2020.1.108","DOIUrl":"https://doi.org/10.3905/jod.2020.1.108","url":null,"abstract":"This article presents a discrete-time option pricing model that is rooted in reinforcement learning (RL), and more specifically in the famous Q-Learning method of RL. We construct a risk-adjusted Markov Decision Process for a discrete-time version of the classical Black-Scholes-Merton (BSM) model, where the option price is an optimal Q-function, while the optimal hedge is a second argument of this optimal Q-function, so that both the price and hedge are parts of the same formula. Pricing is done by learning to dynamically optimize risk-adjusted returns for an option replicating portfolio, as in Markowitz portfolio theory. Using Q-Learning and related methods, once created in a parametric setting, the model can go model-free and learn to price and hedge an option directly from data, without an explicit model of the world. This suggests that RL may provide efficient data-driven and model-free methods for the optimal pricing and hedging of options. Once we depart from the academic continuous-time limit, and vice versa, option pricing methods developed in Mathematical Finance may be viewed as special cases of model-based reinforcement learning. Further, due to the simplicity and tractability of our model, which only needs basic linear algebra (plus Monte Carlo simulation, if we work with synthetic data), and its close relationship to the original BSM model, we suggest that our model could be used in the benchmarking of different RL algorithms for financial trading applications. TOPICS: Derivatives, options Key Findings • Reinforcement learning (RL) is the most natural way for pricing and hedging of options that relies directly on data and not on a specific model of asset pricing. • The discrete-time RL approach to option pricing generalizes classical continuous-time methods; enables tracking mis-hedging risk, which disappears in the formal continuous-time limit; and provides a consistent framework for using options for both hedging and speculation. • A simple quadratic reward function, which presents a minimal extension of the classical Black-Scholes framework when combined with the Q-learning method of RL, gives rise to a particularly simple computational scheme where option pricing and hedging are semianalytical, as they amount to multiple uses of a conventional least-squares regression.","PeriodicalId":501089,"journal":{"name":"The Journal of Derivatives","volume":"95 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138516925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
The Journal of Derivatives
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1