首页 > 最新文献

Journal of the Operations Research Society of Japan最新文献

英文 中文
AN EFFICIENT BRANCH-AND-CUT ALGORITHM FOR SUBMODULAR FUNCTION MAXIMIZATION 一种有效的子模函数最大化的分支割算法
Q4 Decision Sciences Pub Date : 2020-04-30 DOI: 10.15807/jorsj.63.41
Naoya Uematsu, S. Umetani, Y. Kawahara
{"title":"AN EFFICIENT BRANCH-AND-CUT ALGORITHM FOR SUBMODULAR FUNCTION MAXIMIZATION","authors":"Naoya Uematsu, S. Umetani, Y. Kawahara","doi":"10.15807/jorsj.63.41","DOIUrl":"https://doi.org/10.15807/jorsj.63.41","url":null,"abstract":"","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44281953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
MONTE CARLO ALGORITHM FOR CALCULATING THE SHAPLEY VALUES OF MINIMUM COST SPANNING TREE GAMES 最小代价生成树对策SHAPLEY值的蒙特卡罗算法
Q4 Decision Sciences Pub Date : 2020-01-31 DOI: 10.15807/jorsj.63.31
Kazutoshi Ando, K. Takase
In this paper, we address a Monte Carlo algorithm for calculating the Shapley values of minimum cost spanning tree games. We provide tighter upper and lower bounds for the marginal cost vector and improve a previous study’s lower bound on the number of permutations required for the output of the algorithm to achieve a given accuracy with a given probability. In addition, we present computational experiments for estimating the lower bound on the number of permutations required by the Monte Carlo algorithm.
在本文中,我们讨论了一种计算最小代价生成树对策的Shapley值的蒙特卡罗算法。我们为边际成本向量提供了更严格的上界和下界,并改进了先前研究中算法输出所需的排列数量的下界,以达到给定概率下的给定精度。此外,我们还提供了计算实验来估计蒙特卡罗算法所需的排列数的下界。
{"title":"MONTE CARLO ALGORITHM FOR CALCULATING THE SHAPLEY VALUES OF MINIMUM COST SPANNING TREE GAMES","authors":"Kazutoshi Ando, K. Takase","doi":"10.15807/jorsj.63.31","DOIUrl":"https://doi.org/10.15807/jorsj.63.31","url":null,"abstract":"In this paper, we address a Monte Carlo algorithm for calculating the Shapley values of minimum cost spanning tree games. We provide tighter upper and lower bounds for the marginal cost vector and improve a previous study’s lower bound on the number of permutations required for the output of the algorithm to achieve a given accuracy with a given probability. In addition, we present computational experiments for estimating the lower bound on the number of permutations required by the Monte Carlo algorithm.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":"63 1","pages":"31-40"},"PeriodicalIF":0.0,"publicationDate":"2020-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41622463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
ONE-WAY TRADING PROBLEMS VIA LINEAR OPTIMIZATION 基于线性优化的单向交易问题
Q4 Decision Sciences Pub Date : 2020-01-31 DOI: 10.15807/jorsj.63.1
H. Fujiwara, Naohiro Araki, Hiroaki Yamamoto
Abstract In the one-way trading problem, we are asked to convert dollars into yen only by unidirectional conversions, while watching the exchange rate that fluctuates along time. The goal is to maximize the amount of yen we finally get, under the assumption that we are not informed of when the game ends. For this problem, an optimal algorithm was proposed by El-Yaniv et al. In this paper we formulate this problem into a linear optimization problem (linear program) and reduce derivation of an optimal algorithm to solving the linear optimization problem. This reveals that the optimality of the algorithm follows from the duality theorem. Our analysis demonstrates how infinite-dimensional linear optimization helps to design algorithms.
在单向交易问题中,我们被要求只通过单向转换将美元兑换成日元,同时观察随时间波动的汇率。我们的目标是在我们不知道游戏何时结束的前提下,最大化我们最终获得的日元数量。针对这一问题,El-Yaniv等人提出了一种最优算法。本文将该问题转化为线性优化问题(线性规划),并简化了求解该问题的最优算法的推导过程。这表明该算法的最优性遵循对偶定理。我们的分析展示了无限维线性优化如何帮助设计算法。
{"title":"ONE-WAY TRADING PROBLEMS VIA LINEAR OPTIMIZATION","authors":"H. Fujiwara, Naohiro Araki, Hiroaki Yamamoto","doi":"10.15807/jorsj.63.1","DOIUrl":"https://doi.org/10.15807/jorsj.63.1","url":null,"abstract":"Abstract In the one-way trading problem, we are asked to convert dollars into yen only by unidirectional conversions, while watching the exchange rate that fluctuates along time. The goal is to maximize the amount of yen we finally get, under the assumption that we are not informed of when the game ends. For this problem, an optimal algorithm was proposed by El-Yaniv et al. In this paper we formulate this problem into a linear optimization problem (linear program) and reduce derivation of an optimal algorithm to solving the linear optimization problem. This reveals that the optimality of the algorithm follows from the duality theorem. Our analysis demonstrates how infinite-dimensional linear optimization helps to design algorithms.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67215450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ON DYNAMIC PATROLLING SECURITY GAMES 关于动态巡逻安全游戏
Q4 Decision Sciences Pub Date : 2019-10-31 DOI: 10.15807/jorsj.62.152
Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori
We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.
我们考虑Stackelberg巡逻安全游戏,在这个游戏中,一名保安和一名入侵者在一个设施周围走动。在这些游戏中,在每个时间点,守卫会根据他们当时的位置获得奖励(入侵者会付出代价)。守卫的目的(责任)。(如入侵者)巡视(侵入)设施,使总奖励最大化(最小化)。我们研究了三种情况:在情形1中,守卫先选择预定路线,然后入侵者在完全观察守卫的选择后选择预定路线。在情形2中,守卫将自己的计划路线随机化,入侵者观察其概率分布,也将自己的计划路线随机化。在情况3中,守卫也随机安排了她的预定路线,但是入侵者会依次观察守卫的位置并改变路线到达他的目标之一。研究表明,在情形1、情形2和情形3中,入侵者的最佳对策问题可以分别表述为最短路径问题和马尔可夫决策过程。此外,每种情况下的平衡问题分别简化为多项式大小的混合整数线性规划、线性规划和双线性规划问题。
{"title":"ON DYNAMIC PATROLLING SECURITY GAMES","authors":"Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori","doi":"10.15807/jorsj.62.152","DOIUrl":"https://doi.org/10.15807/jorsj.62.152","url":null,"abstract":"We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48610628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SEPARABLE CONVEX RESOURCE ALLOCATION PROBLEM WITH L1-DISTANCE CONSTRAINT 具有11 -距离约束的可分离凸资源分配问题
Q4 Decision Sciences Pub Date : 2019-07-31 DOI: 10.15807/JORSJ.62.109
N. Minamikawa, A. Shioura
Separable convex resource allocation problem aims at finding an allocation of a discrete resource to several activities that minimizes a separable convex function representing the total cost or the total loss. In this paper, we consider the separable convex resource allocation problem with an additional constraint that the L1-distance between a given vector and a feasible solution is bounded by a given positive constant. We prove that the simplest separable convex resource allocation problem with the L1-distance constraint can be reformulated as a submodular resource allocation problem. This result implies that the problem can be solved in polynomial time by existing algorithms for the submodular resource allocation problem. We present specialized implementations of the existing algorithms and analyze their running time.
可分离凸资源分配问题的目的是寻找一个离散的资源分配到几个活动,使代表总成本或总损失的可分离凸函数最小。本文考虑了可分离凸资源分配问题,该问题具有一个附加约束,即给定向量与可行解之间的l1距离有一个给定的正常数。证明了具有l1 -距离约束的最简单可分离凸资源分配问题可以重新表述为子模资源分配问题。该结果表明,现有的子模资源分配算法可以在多项式时间内求解该问题。我们给出了现有算法的专门实现,并分析了它们的运行时间。
{"title":"SEPARABLE CONVEX RESOURCE ALLOCATION PROBLEM WITH L1-DISTANCE CONSTRAINT","authors":"N. Minamikawa, A. Shioura","doi":"10.15807/JORSJ.62.109","DOIUrl":"https://doi.org/10.15807/JORSJ.62.109","url":null,"abstract":"Separable convex resource allocation problem aims at finding an allocation of a discrete resource to several activities that minimizes a separable convex function representing the total cost or the total loss. In this paper, we consider the separable convex resource allocation problem with an additional constraint that the L1-distance between a given vector and a feasible solution is bounded by a given positive constant. We prove that the simplest separable convex resource allocation problem with the L1-distance constraint can be reformulated as a submodular resource allocation problem. This result implies that the problem can be solved in polynomial time by existing algorithms for the submodular resource allocation problem. We present specialized implementations of the existing algorithms and analyze their running time.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41432246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CORDON AND AREA ROAD PRICING IN RADIAL-ARC NETWORK 放射形圆弧网中警戒线和区域道路收费
Q4 Decision Sciences Pub Date : 2019-07-31 DOI: 10.15807/JORSJ.62.121
M. Miyagawa
Abstract This paper proposes a continuous network model for determining the size of the toll area and toll level in cordon and area road pricing. Cordon pricing charges a toll to vehicles passing a cordon line surrounding a designated area, whereas area pricing charges a toll to all vehicles driving inside the area. Analytical expressions for the traffic volume and toll revenue are obtained for a circular city with a radial-arc network. The analytical expressions demonstrate how the size of the toll area and toll level affect the traffic volume and toll revenue. Comparing cordon and area pricing shows that area pricing is superior to cordon pricing in both reducing traffic volume in the toll area and generating revenue.
摘要本文提出了一个连续网络模型,用于确定警戒线和区域道路定价中收费区的大小和收费水平。警戒线定价对通过指定区域周围警戒线的车辆收取通行费,而区域定价对在该区域内行驶的所有车辆收取通行费用。得到了具有径向弧形网络的圆形城市的交通量和通行费收入的解析表达式。分析表达式展示了收费区的大小和收费水平如何影响交通量和收费收入。将警戒线和区域定价进行比较表明,区域定价在减少收费区交通量和创收方面都优于警戒线定价。
{"title":"CORDON AND AREA ROAD PRICING IN RADIAL-ARC NETWORK","authors":"M. Miyagawa","doi":"10.15807/JORSJ.62.121","DOIUrl":"https://doi.org/10.15807/JORSJ.62.121","url":null,"abstract":"Abstract This paper proposes a continuous network model for determining the size of the toll area and toll level in cordon and area road pricing. Cordon pricing charges a toll to vehicles passing a cordon line surrounding a designated area, whereas area pricing charges a toll to all vehicles driving inside the area. Analytical expressions for the traffic volume and toll revenue are obtained for a circular city with a radial-arc network. The analytical expressions demonstrate how the size of the toll area and toll level affect the traffic volume and toll revenue. Comparing cordon and area pricing shows that area pricing is superior to cordon pricing in both reducing traffic volume in the toll area and generating revenue.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44949317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
ESTIMATING FORWARD LOOKING DISTRIBUTION WITH THE ROSS RECOVERY THEOREM 用ROSS恢复定理估计前瞻分布
Q4 Decision Sciences Pub Date : 2019-04-25 DOI: 10.15807/JORSJ.62.83
Takuya Kiriu, Norio Hibiki
The payoff of option is determined by the future price of underlying asset and therefore the option prices contain the forward looking information. Implied distribution is a forward looking distribution of the underlying asset derived from option prices. There are a lot of studies estimating implied distribution in the risk neutral probability framework. However, a risk neutral probability generally differs from a real world probability, which represents actual investors view about asset return. Recently, Ross (2015) has showed remarkable theorem, named “Recovery Theorem”. It enables us to estimate the real world probability distribution from option prices under a particular assumption about representative investor's risk preferences. However, it is not easy to derive the appropriate estimators because it is necessary to solve an ill-posed problem in estimation process. This paper discusses about the method to estimate a real world distribution accurately with the Recovery Theorem. The previous studies propose the methods to estimate the real world distribution, whereas they do not investigate on the estimation accuracy. Hence, we test the effectiveness of the Tikhonov method used by Audrino et al. (2015) in the numerical analysis with hypothetical data. We propose a new method to derive the more accurate solution by configuring the regularization term considering prior information and compare it with the Tikhonov method. Moreover, we discuss regularization parameter selection to get the accurate real world distribution. We find the following three points through the numerical analysis. (1) To stabilize the solution by introducing regularization term is an effective method in terms of estimating a real world distribution with the Recovery Theorem. (2) Proposed method can estimate a real world distribution more accurately than the Tikhonov method. (3) We can offer the appropriate solutions even if the number of maturities is less than that of states.
期权的收益是由标的资产的未来价格决定的,因此期权价格包含了前瞻性信息。隐含分布是从期权价格推导出的标的资产的前瞻性分布。在风险中性概率框架下对隐含分布的估计已有大量的研究。然而,风险中性概率通常不同于现实世界的概率,它代表了投资者对资产回报的实际看法。最近,Ross(2015)提出了一个引人注目的定理,命名为“恢复定理”。它使我们能够在一个关于代表性投资者风险偏好的特定假设下,从期权价格估计真实世界的概率分布。然而,由于在估计过程中需要解决一个不适定问题,因此推导出合适的估计量并不容易。本文讨论了用恢复定理准确估计实际分布的方法。以往的研究提出了估计真实世界分布的方法,但没有对估计的精度进行研究。因此,我们使用假设数据测试Audrino et al.(2015)在数值分析中使用的Tikhonov方法的有效性。我们提出了一种新的方法,通过配置考虑先验信息的正则化项来获得更精确的解,并与Tikhonov方法进行了比较。此外,我们还讨论了正则化参数的选择,以获得准确的真实世界分布。通过数值分析,我们发现以下三点。(1)引入正则化项稳定解是利用恢复定理估计实际分布的一种有效方法。(2)与Tikhonov方法相比,该方法可以更准确地估计真实世界的分布。(3)即使期限少于国家,我们也可以提供相应的解决方案。
{"title":"ESTIMATING FORWARD LOOKING DISTRIBUTION WITH THE ROSS RECOVERY THEOREM","authors":"Takuya Kiriu, Norio Hibiki","doi":"10.15807/JORSJ.62.83","DOIUrl":"https://doi.org/10.15807/JORSJ.62.83","url":null,"abstract":"The payoff of option is determined by the future price of underlying asset and therefore the option prices contain the forward looking information. Implied distribution is a forward looking distribution of the underlying asset derived from option prices. There are a lot of studies estimating implied distribution in the risk neutral probability framework. However, a risk neutral probability generally differs from a real world probability, which represents actual investors view about asset return. Recently, Ross (2015) has showed remarkable theorem, named “Recovery Theorem”. It enables us to estimate the real world probability distribution from option prices under a particular assumption about representative investor's risk preferences. However, it is not easy to derive the appropriate estimators because it is necessary to solve an ill-posed problem in estimation process. This paper discusses about the method to estimate a real world distribution accurately with the Recovery Theorem. The previous studies propose the methods to estimate the real world distribution, whereas they do not investigate on the estimation accuracy. Hence, we test the effectiveness of the Tikhonov method used by Audrino et al. (2015) in the numerical analysis with hypothetical data. We propose a new method to derive the more accurate solution by configuring the regularization term considering prior information and compare it with the Tikhonov method. Moreover, we discuss regularization parameter selection to get the accurate real world distribution. We find the following three points through the numerical analysis. (1) To stabilize the solution by introducing regularization term is an effective method in terms of estimating a real world distribution with the Recovery Theorem. (2) Proposed method can estimate a real world distribution more accurately than the Tikhonov method. (3) We can offer the appropriate solutions even if the number of maturities is less than that of states.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44486401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A DYNAMIC PROGRAMMING ALGORITHM FOR OPTIMIZING BASEBALL STRATEGIES 棒球策略优化的动态规划算法
Q4 Decision Sciences Pub Date : 2019-04-25 DOI: 10.15807/JORSJ.62.64
Akifumi Kira, Keisuke Inakawa, Toshiharu Fujita
In this paper, baseball is formulated as a finite Markov game with approximately 6.45 million states. We give an effective dynamic programming algorithm which computes Markov perfect equilibria and the value functions of the game for both teams in 2 second per game. Optimal decision making can be found depending on the situation—for example, for the batting team, whether batting for a hit, stealing a base or sacrifice bunting will maximize their win percentage, or for the fielding team, whether to pitch to or intentionally walk a batter, yields optimal results. In addition, our algorithm makes it possible to compute the optimal batting order, in consideration of strategy optimization such as a sacrifice bunt or a stolen base. The authors believe that this baseball model is also useful as a benchmark instance for evaluating the performances of (multi-agent) Reinforcement Learning methods.
在本文中,棒球被公式化为具有大约645万个状态的有限马尔可夫对策。我们给出了一个有效的动态规划算法,该算法在每场比赛2秒内计算出两支球队的马尔可夫完全均衡和比赛的值函数。最佳决策可以根据情况而定——例如,对于击球队来说,无论是击球击球、盗垒还是牺牲彩旗都会最大限度地提高他们的胜率,或者对于守备队来说,是向击球手投球还是故意保送击球手,都会产生最佳结果。此外,我们的算法使计算最佳击球顺序成为可能,同时考虑到策略优化,如牺牲短棍或盗垒。作者认为,这个棒球模型也可以作为评估(多智能体)强化学习方法性能的基准实例。
{"title":"A DYNAMIC PROGRAMMING ALGORITHM FOR OPTIMIZING BASEBALL STRATEGIES","authors":"Akifumi Kira, Keisuke Inakawa, Toshiharu Fujita","doi":"10.15807/JORSJ.62.64","DOIUrl":"https://doi.org/10.15807/JORSJ.62.64","url":null,"abstract":"In this paper, baseball is formulated as a finite Markov game with approximately 6.45 million states. We give an effective dynamic programming algorithm which computes Markov perfect equilibria and the value functions of the game for both teams in 2 second per game. Optimal decision making can be found depending on the situation—for example, for the batting team, whether batting for a hit, stealing a base or sacrifice bunting will maximize their win percentage, or for the fielding team, whether to pitch to or intentionally walk a batter, yields optimal results. In addition, our algorithm makes it possible to compute the optimal batting order, in consideration of strategy optimization such as a sacrifice bunt or a stolen base. The authors believe that this baseball model is also useful as a benchmark instance for evaluating the performances of (multi-agent) Reinforcement Learning methods.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.15807/JORSJ.62.64","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48615475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
DP-BASED ALGORITHM AND FPTAS FOR THE KNAPSACK SHARING AND RELATED PROBLEMS 基于dp的背包共享算法与fptas及相关问题
Q4 Decision Sciences Pub Date : 2019-01-31 DOI: 10.15807/JORSJ.62.1
S. Kataoka, Takeo Yamada
In the knapsack sharing problem (KSP), formulated previously, we considered a game-theoretic situation in which two or more players (agents) compete for their share of capacity in a knapsack with their respective sets of items. As an extension of this problem, we formulate the extended knapsack sharing problem (XKSP). This is actually a family of KSP-like problems, and we present a dynamic programmingbased (DP-based), pseudo-polynomial time algorithm to solve XKSP to optimality in a unified way. XKSP is shown to be NP-hard, but due to the existence of this pseudo-polynomial time algorithm, it is only weakly NP-hard. Next, we develop an algorithm to solve the problem approximately in polynomial time by decomposing it into a series of subproblems. Furthermore, we introduce a scaling factor into the DP computation to obtain a fully polynomial time approximation scheme (FPTAS) for XKSP with two agents. Extension to the case of more than two agents is discussed, together with a non-DP-based PTAS.
在之前制定的背包共享问题(KSP)中,我们考虑了一个博弈论的情况,其中两个或多个参与者(代理)用各自的物品集竞争他们在背包中的容量份额。作为该问题的推广,我们提出了扩展背包共享问题(XKSP)。这实际上是一类类似于ksp的问题,我们提出了一种基于动态规划(DP-based)的伪多项式时间算法,以统一的方式求解XKSP的最优性。证明XKSP是np困难的,但由于该伪多项式时间算法的存在,它只是弱np困难的。接下来,我们开发了一种算法,通过将问题分解为一系列子问题,在多项式时间内近似地解决问题。此外,我们在DP计算中引入了一个比例因子,以获得具有两个代理的XKSP的全多项式时间近似方案(FPTAS)。扩展到超过两个代理的情况下进行了讨论,以及一个非基于dp的PTAS。
{"title":"DP-BASED ALGORITHM AND FPTAS FOR THE KNAPSACK SHARING AND RELATED PROBLEMS","authors":"S. Kataoka, Takeo Yamada","doi":"10.15807/JORSJ.62.1","DOIUrl":"https://doi.org/10.15807/JORSJ.62.1","url":null,"abstract":"In the knapsack sharing problem (KSP), formulated previously, we considered a game-theoretic situation in which two or more players (agents) compete for their share of capacity in a knapsack with their respective sets of items. As an extension of this problem, we formulate the extended knapsack sharing problem (XKSP). This is actually a family of KSP-like problems, and we present a dynamic programmingbased (DP-based), pseudo-polynomial time algorithm to solve XKSP to optimality in a unified way. XKSP is shown to be NP-hard, but due to the existence of this pseudo-polynomial time algorithm, it is only weakly NP-hard. Next, we develop an algorithm to solve the problem approximately in polynomial time by decomposing it into a series of subproblems. Furthermore, we introduce a scaling factor into the DP computation to obtain a fully polynomial time approximation scheme (FPTAS) for XKSP with two agents. Extension to the case of more than two agents is discussed, together with a non-DP-based PTAS.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.15807/JORSJ.62.1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43247876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
CHOICE-BASED SEATING POSITION MODEL WITH UNDISTINGUISHED MULTI-LINES IN REVENUE MANAGEMENT 基于选择的座位位置模型与未区分的多线收入管理
Q4 Decision Sciences Pub Date : 2019-01-31 DOI: 10.15807/JORSJ.62.37
Yu Ogasawara, Masamichi Kon
In revenue management, there are models which aim to maximize revenue by controlling policy for uncertain demands throughout a booking horizon. The models are called dynamic models. One of the applications of the dynamic models is reservation system which offers available seats for customers’ requests. Recently, the system has allowed us to choose our booking seat position. However, the dynamic models in revenue management have not been included customers’ selection behavior for seating position. This paper proposes choice-based seating position model with undistinguished multi-lines that is a dynamic model considered with the customers’ selection behavior for seating positions. Approximate solutions for this model are calculated by Choice-based Deterministic Linear Programming (CDLP) and decomposition approximation which are used in choice-based network revenue management models. This paper suggests that CDLP is more effective than decomposition approximation for the choice-based seating position model, even through some reports in revenue management suggested that decomposition approximation could derive higher revenue than CDLP in their models.
在收益管理中,有一些模型旨在通过在整个预订范围内控制不确定需求的策略来实现收益最大化。这些模型被称为动态模型。动态模型的应用之一是预定系统,它根据顾客的要求提供可用的座位。最近,系统允许我们选择我们的预订座位位置。然而,收益管理中的动态模型并没有考虑顾客对座位位置的选择行为。本文提出了一种考虑顾客座位选择行为的基于选择的无区分多线座位模型。利用基于选择的网络收益管理模型中使用的基于选择的确定性线性规划(CDLP)和分解近似计算了该模型的近似解。本文认为,对于基于选择的座位位置模型,CDLP比分解近似更有效,甚至通过一些收益管理报告表明,在他们的模型中,分解近似比CDLP可以获得更高的收益。
{"title":"CHOICE-BASED SEATING POSITION MODEL WITH UNDISTINGUISHED MULTI-LINES IN REVENUE MANAGEMENT","authors":"Yu Ogasawara, Masamichi Kon","doi":"10.15807/JORSJ.62.37","DOIUrl":"https://doi.org/10.15807/JORSJ.62.37","url":null,"abstract":"In revenue management, there are models which aim to maximize revenue by controlling policy for uncertain demands throughout a booking horizon. The models are called dynamic models. One of the applications of the dynamic models is reservation system which offers available seats for customers’ requests. Recently, the system has allowed us to choose our booking seat position. However, the dynamic models in revenue management have not been included customers’ selection behavior for seating position. This paper proposes choice-based seating position model with undistinguished multi-lines that is a dynamic model considered with the customers’ selection behavior for seating positions. Approximate solutions for this model are calculated by Choice-based Deterministic Linear Programming (CDLP) and decomposition approximation which are used in choice-based network revenue management models. This paper suggests that CDLP is more effective than decomposition approximation for the choice-based seating position model, even through some reports in revenue management suggested that decomposition approximation could derive higher revenue than CDLP in their models.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44413760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the Operations Research Society of Japan
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1