arXiv

Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level Dynamics

Authors: Boltres, Andreas; Freymuth, Niklas; Jahnke, Patrick; Karl, Holger; Neumann, Gerhard

Affiliations: Autonomous Learning Robots, Karlsruhe Institute of Technology; SAP SE, Germany; Turba AI, United States; Internet-Technology and Softwarization, Hasso-Plattner-Institut, Potsdam, Germany

Publication: arXiv

Year/Volume/Issue: 2024

Subject: Reinforcement learning

Abstract: Finding efficient routes for data packets is an essential task in computer networking. The optimal routes depend greatly on the current network topology, state and traffic demand, and they can change within milliseconds. Reinforcement Learning can help to learn network representations that provide routing decisions for possibly novel situations. So far, this has commonly been done using fluid network models. We investigate their suitability for millisecond-scale adaptations with a range of traffic mixes and find that packet-level network models are necessary to capture true dynamics, in particular in the presence of TCP traffic. To this end, we present PackeRL, the first packet-level Reinforcement Learning environment for routing in generic network topologies. Our experiments confirm that learning-based strategies that have been trained in fluid environments do not generalize well to this more realistic, but more challenging setup. Hence, we also introduce two new algorithms for learning sub-second Routing Optimization. We present M-Slim, a dynamic shortest-path algorithm that excels at high traffic volumes but is computationally hard to scale to large network topologies, and FieldLines, a novel next-hop policy design that re-optimizes routing for any network topology within milliseconds without requiring any re-training. Both algorithms outperform current learning-based approaches as well as commonly used static baseline protocols in scenarios with high traffic volumes. All findings are backed by extensive experiments in realistic network conditions in our fast and versatile training and evaluation framework. © 2024, CC BY.
