Learning the random variables in Monte Carlo simulations with stochastic gradient descent: Machine learning for parametric PDEs and financial derivative pricing
Sebastian Becker, Arnulf Jentzen, Marvin S. Müller, Philippe von Wurstemberger
{"title":"Learning the random variables in Monte Carlo simulations with stochastic gradient descent: Machine learning for parametric PDEs and financial derivative pricing","authors":"Sebastian Becker, Arnulf Jentzen, Marvin S. Müller, Philippe von Wurstemberger","doi":"10.1111/mafi.12405","DOIUrl":null,"url":null,"abstract":"<p>In financial engineering, prices of financial products are computed approximately many times each trading day with (slightly) different parameters in each calculation. In many financial models such prices can be approximated by means of Monte Carlo (MC) simulations. To obtain a good approximation the MC sample size usually needs to be considerably large resulting in a long computing time to obtain a single approximation. A natural deep learning approach to reduce the computation time when new prices need to be calculated as quickly as possible would be to train an artificial neural network (ANN) to learn the function which maps parameters of the model and of the financial product to the price of the financial product. However, empirically it turns out that this approach leads to approximations with unacceptably high errors, in particular when the error is measured in the <math>\n <semantics>\n <msup>\n <mi>L</mi>\n <mi>∞</mi>\n </msup>\n <annotation>$L^\\infty$</annotation>\n </semantics></math>-norm, and it seems that ANNs are not capable to closely approximate prices of financial products in dependence on the model and product parameters in real life applications. This is not entirely surprising given the high-dimensional nature of the problem and the fact that it has recently been proved for a large class of algorithms, including the deep learning approach outlined above, that such methods are in general not capable to overcome the curse of dimensionality for such approximation problems in the <math>\n <semantics>\n <msup>\n <mi>L</mi>\n <mi>∞</mi>\n </msup>\n <annotation>$L^\\infty$</annotation>\n </semantics></math>-norm. In this article we introduce a new numerical approximation strategy for parametric approximation problems including the parametric financial pricing problems described above and we illustrate by means of several numerical experiments that the introduced approximation strategy achieves a very high accuracy for a variety of high-dimensional parametric approximation problems, even in the <math>\n <semantics>\n <msup>\n <mi>L</mi>\n <mi>∞</mi>\n </msup>\n <annotation>$L^\\infty$</annotation>\n </semantics></math>-norm. A central aspect of the approximation strategy proposed in this article is to combine MC algorithms with machine learning techniques to, roughly speaking, <i>learn the random variables</i> (LRV) in MC simulations. In other words, we employ stochastic gradient descent (SGD) optimization methods not to train parameters of standard ANNs but instead to learn random variables appearing in MC approximations. In that sense, the proposed LRV strategy has strong links to <i>Quasi-Monte Carlo</i> (QMC) methods as well as to the field of algorithm learning. Our numerical simulations strongly indicate that the LRV strategy might indeed be capable to overcome the curse of dimensionality in the <math>\n <semantics>\n <msup>\n <mi>L</mi>\n <mi>∞</mi>\n </msup>\n <annotation>$L^\\infty$</annotation>\n </semantics></math>-norm in several cases where the standard deep learning approach has been proven not to be able to do so. This is not a contradiction to the established lower bounds mentioned above because this new LRV strategy is outside of the class of algorithms for which lower bounds have been established in the scientific literature. The proposed LRV strategy is of general nature and not only restricted to the parametric financial pricing problems described above, but applicable to a large class of approximation problems. In this article we numerically test the LRV strategy in the case of the pricing of European call options in the Black-Scholes model with one underlying asset, in the case of the pricing of European worst-of basket put options in the Black-Scholes model with three underlying assets, in the case of the pricing of European average put options in the Black-Scholes model with three underlying assets and knock-in barriers, as well as in the case of stochastic Lorentz equations. For these examples the LRV strategy produces highly convincing numerical results when compared with standard MC simulations, QMC simulations using Sobol sequences, SGD-trained shallow ANNs, and SGD-trained deep ANNs.</p>","PeriodicalId":49867,"journal":{"name":"Mathematical Finance","volume":"34 1","pages":"90-150"},"PeriodicalIF":1.6000,"publicationDate":"2023-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/mafi.12405","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Finance","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/mafi.12405","RegionNum":3,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 6
Abstract
In financial engineering, prices of financial products are computed approximately many times each trading day with (slightly) different parameters in each calculation. In many financial models such prices can be approximated by means of Monte Carlo (MC) simulations. To obtain a good approximation the MC sample size usually needs to be considerably large resulting in a long computing time to obtain a single approximation. A natural deep learning approach to reduce the computation time when new prices need to be calculated as quickly as possible would be to train an artificial neural network (ANN) to learn the function which maps parameters of the model and of the financial product to the price of the financial product. However, empirically it turns out that this approach leads to approximations with unacceptably high errors, in particular when the error is measured in the -norm, and it seems that ANNs are not capable to closely approximate prices of financial products in dependence on the model and product parameters in real life applications. This is not entirely surprising given the high-dimensional nature of the problem and the fact that it has recently been proved for a large class of algorithms, including the deep learning approach outlined above, that such methods are in general not capable to overcome the curse of dimensionality for such approximation problems in the -norm. In this article we introduce a new numerical approximation strategy for parametric approximation problems including the parametric financial pricing problems described above and we illustrate by means of several numerical experiments that the introduced approximation strategy achieves a very high accuracy for a variety of high-dimensional parametric approximation problems, even in the -norm. A central aspect of the approximation strategy proposed in this article is to combine MC algorithms with machine learning techniques to, roughly speaking, learn the random variables (LRV) in MC simulations. In other words, we employ stochastic gradient descent (SGD) optimization methods not to train parameters of standard ANNs but instead to learn random variables appearing in MC approximations. In that sense, the proposed LRV strategy has strong links to Quasi-Monte Carlo (QMC) methods as well as to the field of algorithm learning. Our numerical simulations strongly indicate that the LRV strategy might indeed be capable to overcome the curse of dimensionality in the -norm in several cases where the standard deep learning approach has been proven not to be able to do so. This is not a contradiction to the established lower bounds mentioned above because this new LRV strategy is outside of the class of algorithms for which lower bounds have been established in the scientific literature. The proposed LRV strategy is of general nature and not only restricted to the parametric financial pricing problems described above, but applicable to a large class of approximation problems. In this article we numerically test the LRV strategy in the case of the pricing of European call options in the Black-Scholes model with one underlying asset, in the case of the pricing of European worst-of basket put options in the Black-Scholes model with three underlying assets, in the case of the pricing of European average put options in the Black-Scholes model with three underlying assets and knock-in barriers, as well as in the case of stochastic Lorentz equations. For these examples the LRV strategy produces highly convincing numerical results when compared with standard MC simulations, QMC simulations using Sobol sequences, SGD-trained shallow ANNs, and SGD-trained deep ANNs.
在金融工程中,金融产品的价格在每个交易日大约要计算多次,每次计算的参数(略有)不同。在许多金融模型中,可以通过蒙特卡罗(MC)模拟来近似计算这些价格。要获得良好的近似值,MC 样本大小通常需要相当大,从而导致获得单个近似值所需的计算时间很长。当需要尽快计算新价格时,减少计算时间的自然深度学习方法是训练人工神经网络(ANN)来学习将模型和金融产品的参数映射到金融产品价格的函数。然而,经验证明,这种方法导致的近似误差高得令人无法接受,特别是当误差以 L ∞ $L^\infty$ -norm来衡量时,人工神经网络似乎无法在实际应用中根据模型和产品参数来近似金融产品的价格。鉴于该问题的高维性质,以及最近对包括上述深度学习方法在内的一大类算法所证明的事实,即对于 L ∞ $L^infty$ -norm中的此类近似问题,此类方法一般无法克服维度诅咒,因此这并不完全令人惊讶。在本文中,我们针对参数逼近问题(包括上述参数金融定价问题)介绍了一种新的数值逼近策略,并通过几个数值实验说明,所介绍的逼近策略即使在 L ∞ $L\infty$ -norm下也能为各种高维参数逼近问题实现非常高的精度。本文提出的近似策略的核心是将 MC 算法与机器学习技术相结合,粗略地说,就是在 MC 模拟中学习随机变量(LRV)。换句话说,我们采用随机梯度下降(SGD)优化方法不是为了训练标准 ANN 的参数,而是为了学习 MC 近似中出现的随机变量。从这个意义上说,所提出的 LRV 策略与准蒙特卡罗(QMC)方法以及算法学习领域有着密切联系。我们的数值模拟有力地表明,在标准深度学习方法已被证明无法克服的 L ∞ $L^\infty$ -norm中,LRV策略可能确实能够克服维度诅咒。这与上述已确定的下限并不矛盾,因为这种新的 LRV 策略不属于科学文献中已确定下限的算法类别。所提出的 LRV 策略具有普遍性,不仅限于上述参数金融定价问题,而且适用于一大类近似问题。在本文中,我们对 Black-Scholes 模型中一种标的资产的欧式看涨期权定价、Black-Scholes 模型中三种标的资产的欧式最差一篮子看跌期权定价、Black-Scholes 模型中三种标的资产和敲入障碍的欧式平均看跌期权定价以及随机洛伦兹方程等情况下的 LRV 策略进行了数值检验。在这些示例中,与标准 MC 仿真、使用 Sobol 序列的 QMC 仿真、SGD 训练的浅层 ANN 和 SGD 训练的深层 ANN 相比,LRV 策略产生了极具说服力的数值结果。
期刊介绍:
Mathematical Finance seeks to publish original research articles focused on the development and application of novel mathematical and statistical methods for the analysis of financial problems.
The journal welcomes contributions on new statistical methods for the analysis of financial problems. Empirical results will be appropriate to the extent that they illustrate a statistical technique, validate a model or provide insight into a financial problem. Papers whose main contribution rests on empirical results derived with standard approaches will not be considered.