Search Results - Inner Mongolia University Library

FADO: Floorplan-Aware Directive Optimization Based on Synthesis and Analytical Models for High-Level Synthesis Designs on Multi-Die FPGAs

ACM Transactions on Reconfigurable Technology and Systems, 2024, Vol. 17, No. 3, pp. 1-33

Authors: Du, Linfeng; Liang, Tingyuan; Zhou, Xiaofeng; Ge, Jinming; Li, Shangkun; Sinha, Sharad; Zhao, Jieru; Xie, Zhiyao; Zhang, Wei
Affiliations: Hong Kong Univ Sci & Technol, Elect & Comp Engn, Kowloon, Hong Kong, Peoples R China; Fudan Univ, Shanghai, Peoples R China; Indian Inst Technol Goa, Comp Sci & Engn, Ponda, Goa, India; Shanghai Jiao Tong Univ, Comp Sci & Engn, Shanghai, Peoples R China

Multi-die FPGAs are widely adopted for large-scale accelerators, but optimizing high-level synthesis designs on these FPGAs faces two challenges. First, the delay caused by die-crossing nets creates an NP-hard floorplanning problem. Second, traditional directive optimization cannot consider resource constraints on each die or the timing issue incurred by the die-crossings. Furthermore, the high algorithmic complexity and the large scale lead to extended runtime for legalizing the floorplan of HLS designs under different directive configurations. To co-optimize the directives and floorplan of HLS designs on multi-die FPGAs, we formulate the co-search based on bin-packing variants and present two iterative optimization flows. The first (FADO 1.0) relies on a pre-built QoR library. It involves a greedy, latency-bottleneck-guided directive search, and an incremental floorplan legalization. Compared with a global floorplanning solution, it takes 693X to 4925X shorter search time and achieves 1.16X to 8.78X better design performance, measured in workload execution time. To remove the time-consuming QoR library generation, the second flow (FADO 2.0) integrates an analytical QoR model and redesigns the directive search to accelerate convergence. Through experiments on mixed dataflow and non-dataflow designs, compared with 1.0, FADO 2.0 further yields a 1.40X better design performance on average after implementation on the Alveo U250 FPGA.

Keywords: High-level synthesis; analytical model; design space exploration; multi-die FPGA; directive optimization; floorplanning
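The abstract mentions a greedy, latency-bottleneck-guided directive search. As a rough illustration of that general idea only (not the authors' implementation), the sketch below assumes each kernel has a hypothetical list of directive choices given as (latency, resource) pairs ordered from cheapest to fastest, and a single scalar resource budget. The name `greedy_directive_search` and the data layout are assumptions made for this example.

```python
def greedy_directive_search(options, budget):
    """Greedy bottleneck-guided search (illustrative sketch).

    options: kernel name -> list of (latency, resource) choices, ordered
             from slowest/cheapest to fastest/most expensive (assumed).
    budget:  total resource available (a single scalar, assumed).
    Returns the chosen option index for each kernel.
    """
    choice = {name: 0 for name in options}
    while True:
        used = sum(options[n][i][1] for n, i in choice.items())
        # The bottleneck is the kernel with the highest current latency.
        bottleneck = max(choice, key=lambda n: options[n][choice[n]][0])
        nxt = choice[bottleneck] + 1
        if nxt >= len(options[bottleneck]):
            return choice  # bottleneck has no faster configuration left
        current = options[bottleneck][choice[bottleneck]][1]
        extra = options[bottleneck][nxt][1] - current
        if used + extra > budget:
            return choice  # upgrading the bottleneck would exceed the budget
        choice[bottleneck] = nxt


options = {
    "a": [(100, 10), (50, 20), (25, 40)],
    "b": [(80, 10), (40, 30)],
}
result = greedy_directive_search(options, budget=70)
```

The search stops as soon as the current bottleneck cannot be improved, since speeding up other kernels would not reduce the overall latency under this simplified model.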

FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

31st ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA)

Authors: Du, Linfeng; Liang, Tingyuan; Sinha, Sharad; Xie, Zhiyao; Zhang, Wei
Affiliations: Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China; Indian Inst Technol Goa, Ponda, Goa, India

ISBN: (print) 9781450394178

Multi-die FPGAs are widely adopted to deploy large-scale hardware accelerators. Two factors impede the performance optimization of high-level synthesis (HLS) designs implemented on multi-die FPGAs. On the one hand, the long delay of nets crossing die boundaries makes it an NP-hard problem to properly floorplan and pipeline an application. On the other hand, traditional automated search flows for HLS directive optimization target single-die FPGAs, and hence cannot consider the resource constraints on each die or the timing issue incurred by the die-crossings. Further, because of the large design scale, legalizing the floorplan of the HLS designs generated under each group of configurations during directive optimization takes an excessively long runtime. To co-optimize the directives and floorplan of HLS designs on multi-die FPGAs, we propose the FADO framework, which formulates the directive-floorplan co-search problem based on multi-choice multi-dimensional bin-packing and solves it using an iterative optimization flow. For each step of directive optimization, a latency-bottleneck-guided greedy algorithm searches for more efficient directive configurations. For floorplanning, instead of repeatedly invoking global floorplanning algorithms, we implement a more efficient incremental floorplan legalization algorithm. It mainly applies the worst-fit strategy from the online bin-packing algorithm to balance the floorplan, together with an offline best-fit-decreasing re-packing step to compact the floorplan, followed by pipelining of the long wires crossing die boundaries. Through experiments on a set of HLS designs mixing dataflow and non-dataflow kernels, FADO not only automates the co-optimization and finishes within a 693X to 4925X shorter runtime, compared with DSE assisted by global floorplanning, but also yields an improvement of 1.16X to 8.78X in overall workflow execution time after implementation on the Xilinx Alveo

Keywords: High-Level Synthesis; Design Space Exploration; Multi-Die FPGA; directive optimization; Floorplanning
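The abstract describes an incremental legalization that combines an online worst-fit placement with an offline best-fit-decreasing re-packing step. The following is a minimal sketch of those two textbook bin-packing heuristics, assuming a single scalar resource per item and identical die capacities; the paper's formulation is multi-dimensional and multi-choice, so this is a simplification. The function names are hypothetical.

```python
def worst_fit_place(loads, capacity, size):
    """Online worst-fit: put one item on the least-loaded die.

    loads is updated in place. Returns the die index, or None if the
    item does not fit even on the emptiest die.
    """
    die = min(range(len(loads)), key=lambda d: loads[d])
    if loads[die] + size > capacity:
        return None
    loads[die] += size
    return die


def best_fit_decreasing(sizes, capacity, num_dies):
    """Offline best-fit-decreasing: re-pack all items, largest first,
    each onto the fullest die that can still hold it.

    Returns a list of per-die item lists, or None if some item cannot
    be placed on any die.
    """
    dies = [[] for _ in range(num_dies)]
    loads = [0] * num_dies
    for size in sorted(sizes, reverse=True):
        fitting = [d for d in range(num_dies) if loads[d] + size <= capacity]
        if not fitting:
            return None
        d = max(fitting, key=lambda i: loads[i])
        dies[d].append(size)
        loads[d] += size
    return dies


loads = [30, 10, 20]
placed = worst_fit_place(loads, capacity=50, size=25)
failed = worst_fit_place(loads, capacity=50, size=35)
packing = best_fit_decreasing([20, 35, 10, 30, 25], capacity=50, num_dies=3)
```

Worst-fit tends to keep the dies balanced, which leaves headroom for later growth, while best-fit-decreasing compacts the packing when a balanced layout no longer admits a new item.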
