Dynamic Routing and Wavelength Assignment with Reinforcement Learning

INFORMS journal on optimization Pub Date : 2023-06-08 DOI:10.1287/ijoo.2023.0092

Peyman Kafaei, Quentin Cappart, Nicolas Chapados, H. Pouya, Louis-Martin Rousseau

引用次数: 0

Abstract

With the rapid developments in communication systems, and considering their dynamic nature, all-optical networks are becoming increasingly complex. This study proposes a novel method based on deep reinforcement learning for the routing and wavelength assignment problem in all-optical wavelength-decision-multiplexing networks. We consider dynamic incoming requests, in which their arrival and holding times are not known in advance. The objective is to devise a strategy that minimizes the number of rejected packages due to the lack of resources in the long term. We use graph neural networks to capture crucial latent information from the graph-structured input to develop the optimal strategy. The proposed deep reinforcement learning algorithm selects a route and a wavelength simultaneously for each incoming traffic connection as they arrive. The results demonstrate that the learned agent outperforms the methods used in practice and can be generalized on network topologies that did not participate in training.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于强化学习的动态路由和波长分配

随着通信系统的快速发展，考虑到其动态特性，全光网络变得越来越复杂。提出了一种基于深度强化学习的全光波长决策复用网络路由和波长分配问题的新方法。我们考虑动态传入请求，其中它们的到达和保持时间是未知的。目标是设计一种策略，以尽量减少由于长期缺乏资源而被拒绝的包的数量。我们使用图神经网络从图结构输入中捕获关键的潜在信息，以制定最优策略。提出的深度强化学习算法在每个进入的流量连接到达时同时选择路由和波长。结果表明，学习后的智能体优于实践中使用的方法，可以在未参与训练的网络拓扑上进行泛化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

INFORMS journal on optimization

自引率

0.00%

发文量

期刊最新文献

A Stochastic Inexact Sequential Quadratic Optimization Algorithm for Nonlinear Equality-Constrained Optimization Scenario-Based Robust Optimization for Two-Stage Decision Making Under Binary Uncertainty On the Hardness of Learning from Censored and Nonstationary Demand Temporal Bin Packing with Half-Capacity Jobs Editorial Board