Recently, the data-parallel pipeline approach has been widely used to train DNN models on commodity GPU servers. However, three challenges remain for hybrid parallelism on commodity GPU servers: i) a balanced model partition is crucial for efficiency, yet prior works lack a sound solution for generating a balanced partition automatically; ii) an orchestrated device mapping is essential to reduce communication contention, yet prior works ignore server heterogeneity and thereby exacerbate contention; iii) startup overhead is inevitable and especially significant for deep pipelines, making it a major source of pipeline bubbles that severely limits pipeline scalability. We propose AutoPipe-H to address these three problems. It contains i) a pipeline partitioner component that automatically and quickly generates a balanced sub-block partition scheme; ii) a device mapping component that assigns pipeline stages to devices with server heterogeneity in mind, reducing communication contention; and iii) a distributed-training runtime component that reduces pipeline startup overhead by splitting the micro-batch evenly. Experimental results show that AutoPipe-H accelerates training by up to 1.26x over the hybrid parallelism frameworks DAPPLE and Piper, with a 2.73x-12.7x improvement in partition balance and an order-of-magnitude reduction in partition-search time.
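The abstract does not disclose how AutoPipe-H's partitioner works internally, but the balanced-partition problem it targets has a classic formulation: split a sequence of per-layer costs into k contiguous pipeline stages so that the heaviest stage is as light as possible. The sketch below is purely illustrative, not the paper's algorithm; the function names and the integer per-layer cost estimates are hypothetical.

    # Illustrative sketch of balanced contiguous partitioning (NOT AutoPipe-H's
    # published method): binary-search the smallest feasible per-stage cost cap,
    # then greedily cut the layer sequence under that cap.

    def stages_needed(costs, cap):
        """Greedily count stages required if no stage may exceed `cap`."""
        stages, acc = 1, 0
        for c in costs:
            if acc + c > cap:
                stages, acc = stages + 1, c
            else:
                acc += c
        return stages

    def balanced_partition(costs, k):
        """Partition `costs` into at most `k` contiguous stages, minimizing
        the maximum stage cost."""
        lo, hi = max(costs), sum(costs)
        while lo < hi:
            mid = (lo + hi) // 2
            if stages_needed(costs, mid) <= k:
                hi = mid
            else:
                lo = mid + 1
        # Emit the contiguous stages under the found cap `lo`.
        stages, cur, acc = [], [], 0
        for c in costs:
            if acc + c > lo:
                stages.append(cur)
                cur, acc = [c], c
            else:
                cur.append(c)
                acc += c
        stages.append(cur)
        return stages

    if __name__ == "__main__":
        layer_costs = [4, 2, 7, 3, 5, 6, 1, 8]    # made-up per-layer costs
        print(balanced_partition(layer_costs, 3))  # [[4, 2, 7], [3, 5, 6], [1, 8]]

This binary-search-plus-greedy scheme runs in O(n log(sum of costs)) time, which is consistent with the abstract's claim that a fast search procedure matters; the paper's actual search space (sub-block partitions rather than whole layers) is likely richer than this toy version.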
ISBN (digital): 9781728110851
ISBN (print): 9781728110851
Computational storage devices enable in-storage processing of data in place. These devices contain 64-bit application processors and hardware accelerators that can help improve performance and save power by reducing or eliminating data movement between host computers and storage units. This paper proposes a framework, named Stannis, for distributed in-storage training of deep neural networks on clusters of computational storage devices. This in-storage style of training ensures that private data never leaves the storage while the public sharing of data remains fully controlled. The Stannis framework distributes the workload according to the processing power of each worker by determining the proper batch size for each node. Stannis also ensures the availability of input data for all nodes to avoid rank stalls while maximizing utilization and overall processing speed. Experimental results show up to 2.7x speedup and a 69% reduction in energy consumption with no significant loss in accuracy.
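The abstract says Stannis sizes each node's batch according to its processing power but does not give the exact rule. A minimal sketch of one natural choice, proportional allocation with the rounding remainder handed to the fastest node, is shown below; the function name and the throughput figures are assumptions for illustration, not Stannis's published algorithm.

    # Illustrative sketch: size per-node batches in proportion to measured
    # throughput so all workers finish a training step at roughly the same
    # time, avoiding rank stalls in synchronous data parallelism.

    def proportional_batch_sizes(global_batch, throughputs):
        """Split `global_batch` across nodes proportionally to `throughputs`
        (hypothetical samples/sec measured per computational storage device)."""
        total = sum(throughputs)
        sizes = [int(global_batch * t / total) for t in throughputs]
        # Give the integer-rounding remainder to the fastest node so the
        # per-node sizes sum exactly to the global batch size.
        fastest = max(range(len(sizes)), key=lambda i: throughputs[i])
        sizes[fastest] += global_batch - sum(sizes)
        return sizes

    if __name__ == "__main__":
        print(proportional_batch_sizes(256, [90.0, 60.0, 30.0]))  # [129, 85, 42]

Matching step latency across heterogeneous workers is what makes the claimed utilization gains plausible: in synchronous training the step time is set by the slowest node, so equalizing per-node compute time removes idle waiting on the faster devices.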