Search results - Inner Mongolia University Library

Entropic lattice Boltzmann simulation of three-dimensional binary gas mixture flow in packed beds using graphics processors

INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2016, vol. 12, no. 4, pp. 298-310

Authors: Safi, Mohammad Amin; Ashrafizaadeh, Mahmud (TU Dortmund, Inst Appl Math LS 3, Vogelpothsweg 87, D-44227 Dortmund, Germany; Isfahan Univ Technol, Dept Mech Engn, Esfahan ***, Iran)

The lattice Boltzmann method is employed to simulate the flow of a binary oxygen/nitrogen mixture through a highly dense bed of spherical particles. Simulations are based on the most recently proposed entropic lattice Boltzmann model for multi-component flows, using the D3Q27 lattice stencil. The curved solid boundaries of the particles are treated accurately via linear interpolation. To lower the total computational cost and time of the simulations, an implementation on graphics processing units (GPUs) is also presented. Since the workload associated with each iteration is relatively higher than that of conventional 3D LBM simulations, special emphasis is placed on obtaining the best computational performance on GPUs. Performance gains of one order of magnitude over optimised multi-core CPUs are achieved for the complex flow of interest on Fermi generation GPUs. Moreover, the numerical results for a three-dimensional benchmark flow show excellent agreement with the available analytical data.

Keywords: lattice Boltzmann method LBM entropic model binary mixture flow packed beds graphics processing unit GPU optimised parallel algorithm

Graphics processing unit implementation and optimisation of a flexible maximum a-posteriori decoder for synchronisation correction

JOURNAL OF ENGINEERING-JOE, 2014, vol. 2014, no. 6, pp. 284-296

Author: Briffa, Johann A. (Univ Surrey, Dept Comp, Guildford GU2 7XH, Surrey, England)

In this paper, the author presents an optimised parallel implementation of a flexible maximum a-posteriori decoder for synchronisation error correcting codes, supporting a very wide range of code sizes and channel conditions. On mid-range GPUs the author demonstrates decoding speedups of more than two orders of magnitude over a central processing unit implementation of the same optimised algorithm, and more than an order of magnitude over the author's earlier GPU implementation. The prominent challenge is to maintain high parallelisation efficiency over a wide range of code sizes and channel conditions, and across different execution hardware. The author ensures this with a dynamic strategy for choosing parallel execution parameters at run-time. The author also presents a variant that trades off some decoding speed for a significantly reduced memory requirement, with no loss in the decoder's error correction performance. The increased throughput of the implementation and its ability to work with less memory allow the analysis of larger codes and poorer channel conditions, and make practical use of such codes more feasible.

Keywords: graphics processing units error correction codes synchronisation maximum likelihood decoding parallel algorithms graphics processing unit flexible maximum a-posteriori decoder optimisation synchronisation error correcting codes channel conditions code sizes GPUs central processing unit high parallelisation efficiency dynamic strategy parallel execution parameters decoder error correction performance optimised parallel algorithm
