检索结果-内蒙古大学图书馆

IEEE International Conference on Computer Design

作者： Loew, Jason Elwell, Jesse Ponomarev, Dmitry Madden, Patrick H. SUNY Binghamton Dept Comp Sci Binghamton NY 13902 USA

ISBN: (纸本)9781424489350

Many important software applications are dominated by non-trivial serial components: Amdahl's Law places a hard upper bound on possible speedup that can be achieved for these applications. In this paper, we propose an integrated software/hardware approach for accelerating hard serial bottlenecks in data structure heavy algorithms. The key idea is to overlap the processing of the main algorithmic functions and the data structure related operations. We describe the language, compiler, ISA and architectural support for such data structure co-processing (DSCP), and define a clean interface between the software and the hardware. We perform extensive simulations using the popular C++ STL container classes, as well as a detailed implementation of our approach for Dijkstra's single-source shortest path algorithm. We find potential for improvements that are well beyond what can be achieved with more conventional parallel computation methods.

关键词： Amdahl law Dijkstra single source shortest path algorithm ISA compiler coprocessor approach coprocessors data structure intensive algorithms data structures hardware-software codesign parallel computation methods parallel processing program compilers software-hardware approach

来源：评论

学校读者我要写书评

暂无评论

Fast and accurate RCS evaluation via high-performance parallel FDTD simulation

引用

JOURNAL OF ENGINEERING-JOE 2019年第21期2019卷 7322-7325页

作者： Zhou, Xiao Long Wang, Xin Yu Zhang, Jian Feng You, Jian Wei China Ship Dev & Design Ctr Wuhan 430064 Hubei Peoples R China Southeast Univ Sch Informat & Sci Engn Nanjing 210096 Jiangsu Peoples R China

In this study, a fast and accurate method to predict the radar cross-section (RCS) of large-scale and complicated shape targets is proposed based on a high-performance parallel finite difference time-domain (FDTD) numerical method. To this end, several most popular parallel computation methods [including OpenMP, graphics processing unit (GPU), and message-passing interface (MPI)] are discussed first. Based on this discussion, a novel MPI-OpenMP-GPU hybrid parallel computation scheme for FDTD is developed. Moreover, the corresponding load-balance parallel configuration is discussed as well. Since this hybrid parallel scheme combines the merits of existing parallel technologies, the computation performance is remarkably improved. The results show that the computation time of the RCS simulation of a large-scale target can be reduced from 3 days to 0.8 h, that is, similar to 98.9% time saving.

关键词： application program interfaces radar cross-sections parallel algorithms finite difference time-domain analysis message passing parallel processing radar computing MPI high-performance parallel FDTD simulation parallel computation methods large-scale target RCS simulation computation time computation performance parallel technologies hybrid parallel scheme corresponding load-balance parallel configuration novel MPI-OpenMP-GPU hybrid parallel computation scheme message-passing interface high-performance parallel finite difference time-domain numerical method complicated shape targets radar cross-section time 0 8 hour to 3 0 d

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：