We propose and analyze threading algorithms for hybrid MPI/OpenMP parallelization of a molecular-dynamics simulation, which are scalable on large multicore clusters. Two data-privatization thread scheduling algorithms via nucleation-growth allocation are introduced: (1) compact-volume allocation scheduling (CVAS); and (2) breadth-first allocation scheduling (BFAS). The algorithms combine fine-grain dynamic load balancing and minimal-memory-footprint data-privatization threading. We show that the computational costs of CVAS and BFAS are bounded by Θ(n^{5/3} p^{-2/3}) and Θ(n), respectively, for p threads working on n particles on a multicore compute node. Memory consumption per node of both algorithms scales as O(n + n^{2/3} p^{1/3}), but CVAS has smaller prefactors due to a geometric effect. Based on these analyses, we derive the selection criterion between the two algorithms in terms of the granularity, n/p. We observe that memory consumption is reduced by 75% for p = 16 and n = 8,192 compared to naive data privatization, while maintaining thread imbalance below 5%. We obtain a strong-scaling speedup of 14.4 with 16-way threading on a node with four quad-core AMD Opteron processors. In addition, our MPI/OpenMP code achieves 2.58x and 2.16x speedups over the MPI-only implementation on 32,768 cores of BlueGene/P for 0.84 and 1.68 million particle systems, respectively.
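The abstract states the granularity criterion without the algebra. As a one-line check of what the two stated cost bounds imply (the prefactors c_CVAS and c_BFAS are hypothetical, not from the paper), equating the bounds gives a constant granularity threshold:

    c_CVAS * n^{5/3} * p^{-2/3} = c_BFAS * n
    => (n/p)^{2/3} = c_BFAS / c_CVAS
    => n/p = (c_BFAS / c_CVAS)^{3/2}

Under this reading, CVAS's scheduling cost is lower for granularities n/p below the threshold and BFAS's for granularities above it.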
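As context for the 75% memory-reduction figure, the following is a minimal sketch of the naive data-privatization baseline the abstract compares against: each thread scatters pair forces into its own full-length private array, so memory grows as O(n*p) rather than the paper's O(n + n^{2/3} p^{1/3}). This is not the authors' CVAS/BFAS code; all identifiers (accumulate_forces, pairs, pair_force) are illustrative, and forces are scalar for brevity.

    /* Naive data-privatization threading for MD force accumulation:
     * p full-length private arrays avoid locking during the scatter,
     * at the cost of O(n*p) memory. */
    #include <stdlib.h>
    #include <omp.h>

    void accumulate_forces(int n, double *force,            /* n shared sums */
                           int npairs, const int (*pairs)[2],
                           const double *pair_force)        /* scalar pair forces */
    {
        int p = omp_get_max_threads();
        double *priv = calloc((size_t)n * p, sizeof *priv); /* p private copies */

        #pragma omp parallel
        {
            double *mine = priv + (size_t)omp_get_thread_num() * n;

            #pragma omp for schedule(dynamic)    /* fine-grain load balancing */
            for (int k = 0; k < npairs; ++k) {
                int i = pairs[k][0], j = pairs[k][1];
                mine[i] += pair_force[k];        /* race-free: private array */
                mine[j] -= pair_force[k];        /* Newton's third law */
            }
            /* Implicit barrier above; now fold private copies back in. */
            #pragma omp for
            for (int i = 0; i < n; ++i)
                for (int t = 0; t < p; ++t)
                    force[i] += priv[(size_t)t * n + i];
        }
        free(priv);
    }

The nucleation-growth allocation schemes in the paper shrink the private storage to only the boundary regions each thread can touch, which is where the sub-linear n^{2/3} p^{1/3} term comes from.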
A hybrid message-passing and shared-memory parallelization technique is presented for improving the scalability of the adaptive integral method (AIM), an FFT-based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional (1-D) slab decomposition of the 3-D auxiliary regular grid and the associated AIM calculations: if there are M processors and T cores per processor, the scheme (i) divides the regular grid into M slabs and MT sub-slabs, (ii) assigns each slab/sub-slab and the associated operations to one of the processors/cores, and (iii) uses MPI for inter-processor data communication and OpenMP for intra-processor data exchange. The MPI/OpenMP-parallel AIM is used to accelerate the solution of the combined-field integral equation pertinent to the analysis of time-harmonic electromagnetic scattering from perfectly conducting surfaces. The scalability of the scheme is investigated theoretically and verified on a state-of-the-art multi-core cluster for benchmark scattering problems. Timing and speedup results on up to 1024 quad-core processors show that the hybrid MPI/OpenMP parallelization of AIM exhibits better strong scalability (fixed-problem-size speedup) than pure MPI parallelization when multiple cores are used on each processor. (C) 2011 Elsevier B.V. All rights reserved.
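The nested slab decomposition can be sketched in a few lines. The following is a minimal illustration under assumed names (Nz, the plane arithmetic, and the printf are placeholders; the authors' AIM kernels and communication patterns are not reproduced): M MPI ranks each own a contiguous slab of grid planes, and each rank's T OpenMP threads subdivide that slab into sub-slabs.

    /* Nested 1-D slab decomposition: M MPI slabs, M*T OpenMP sub-slabs.
     * Build with: mpicc -fopenmp slab.c */
    #include <mpi.h>
    #include <omp.h>
    #include <stdio.h>

    int main(int argc, char **argv)
    {
        MPI_Init(&argc, &argv);
        int M, rank;
        MPI_Comm_size(MPI_COMM_WORLD, &M);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        const int Nz = 1024;               /* grid planes along the slab axis */
        int z0 = rank * Nz / M;            /* this rank's slab: [z0, z1) */
        int z1 = (rank + 1) * Nz / M;

        #pragma omp parallel               /* T threads per rank */
        {
            int T = omp_get_num_threads();
            int t = omp_get_thread_num();
            int planes = z1 - z0;
            int s0 = z0 + t * planes / T;  /* this thread's sub-slab */
            int s1 = z0 + (t + 1) * planes / T;
            /* ... thread t performs its share of the AIM grid work on
             * planes [s0, s1); intra-processor exchange goes through
             * shared memory, inter-processor exchange through MPI. */
            printf("rank %d thread %d: planes [%d,%d)\n", rank, t, s0, s1);
        }
        /* Inter-processor communication (e.g. the transposes a
         * distributed FFT requires) would use MPI calls here. */
        MPI_Finalize();
        return 0;
    }

The design point the paper exploits is that the intra-processor boundary exchanges between sub-slabs need no message passing at all, which is why the hybrid scheme strong-scales better than pure MPI once multiple cores share a processor.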