The network has become a bottleneck for generative artificial intelligence (GAI) jobs. Accelerating GAI jobs in edge data centers using hybrid electrical/optical switch is considered a promising solution. This archite...
详细信息
The network has become a bottleneck for generative artificial intelligence (GAI) jobs. Accelerating GAI jobs in edge data centers using hybrid electrical/optical switch is considered a promising solution. This architecture optimizes bandwidth utilization by enabling demand-aware topology reconfiguration through flexible configuration of optical circuit switche optical circuit switches (OCS). However, frequent topology reconfiguration may increase latency. Therefore, there is a balanced relationship between latency and bandwidth utilization. In this article, we propose a multigranularityadaptive interleaved algorithm for service scheduling in edge data centers. First, different degrees of time slot shifts are introduced based on the latency sensitivity of jobs, where large bandwidth GAI jobs are transmitted in a single hop by configuring a demand-aware topology. Additionally, when the reconfiguration threshold is met, low-priority ports are prioritized for reconfiguration to ensure latency requirements are met. This approach effectively resolves the tradeoff between bandwidth utilization and latency by decoupling them from each other. Simulation results show that this approach can effectively reduce the latency and improve the network throughput.
暂无评论