检索结果-内蒙古大学图书馆

29th IEEE international parallel and Distributed Processing symposium (IPDPS)

作者： Slota, George M. Rajamanickam, Sivasankaran Madduri, Kamesh Penn State Univ Comp Sci & Engn University Pk PA 16802 USA Sandia Natl Labs Scalable Algorithms Dept Albuquerque NM 87185 USA

ISBN: (纸本)9781479986484

The divergence in the computer architecture landscape has resulted in different architectures being considered mainstream at the same time. For application and algorithm developers, a dilemma arises when one must focus on using underlying architectural features to extract the best performance on each of these architectures, while writing portable code at the same time. We focus on this problem with graph analytics as our target application domain. In this paper, we present an abstraction-based methodology for performance-portable graph algorithm design on manycore architectures. We demonstrate our approach by systematically optimizing algorithms for the problems of breadth-first search, color propagation, and strongly connected components. We use Kokkos, a manycore library and programming model, for prototyping our algorithms. Our portable implementation of the strongly connected components algorithm on the NVIDIA Tesla K40M is up to 3.25x faster than a state-of-the-art parallel CPU implementation on a dual-socket Sandy Bridge compute node.

关键词： graph computations BFS color propagation GPU parallel performance portability

来源：评论

学校读者我要写书评

暂无评论

Approximate string matching using Markovian distance

Approximate string matching using Markovian distance

引用

international symposium on parallel architectures, algorithms, and programming

作者： Katsumata, Akifumi Miura, Takao Shioya, Isamu Dept.of Elect. and Elect. Engineering HOSEI University Kajinocho 3-7-2 Koganei Tokyo Japan Dept. of Managament and Informatics SANNO University Kamikasuya 1672 Isehara kanagawa Japan

ISBN: (纸本)9780769543123

In this work we examine a new technique for approximate string matching using Markovian distance. Here we assume each character appears in a probabilistic way. By means of this idea, we introduce a notion of dissimilarity using text corpus. Then we propose our sophisticated algorithm based on dynamic programming. vVe show some experimental results to see how the approach works well. © 2010 IEEE.

关键词： Dynamic programming

来源：评论

学校读者我要写书评

暂无评论

Special Section on PAR-CAD: parallel CAD algorithms and CAD for parallel architectures/Systems

引用

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS 2012年第1期31卷 7-8页

作者： Marculescu, Diana Li, Peng Carnegie Mellon Univ Dept Elect & Comp Engn Pittsburgh PA 15213 USA Texas A&M Univ Dept Elect & Comp Engn College Stn TX 77843 USA

The five papers in this special section on PAR-CAD: parallel CAD algorithms and CAD for parallel architectures/systems.

关键词： Special issues and sections Design automation parallel architectures

来源：评论

学校读者我要写书评

暂无评论

A parallel subspace iteration method for generalized eigenvalue problems based on multi-core platform

A parallel subspace iteration method for generalized eigenva...

引用

2011 4th international symposium on parallel architectures, algorithms and programming, PAAP 2011

作者： Wang, Shunxu School of Science Huaihai Institute of Technology Lianyungang Jiangsu 222005 China

ISBN: (纸本)9780769545752

A parallel subspace iteration method for solving eigenvalue problem of based on multi-core platform is presented, which can solve several extreme eigenpair in parallel. Compared with Jacobi-Davidson method, the dimension number of the subspace in the method keeps unchanged, which makes it easier for the programming implementation. Numerical experiments are performed with a quad-core computer under the joint programming environment of Intel Fortran and OpenMp. The computation of the plane wing frequency and aircraft pylon for a real model airplane is taken as an example. As a result, the first 10 frequencies of a plane wing and an aircraft pylon are provided which shown the efficiency and applicability of our parallel computation algorithm. © 2011 IEEE.

关键词： Iterative methods

来源：评论

学校读者我要写书评

暂无评论

parallelizing Optimal Multiple Sequence Alignment by Dynamic programming

Parallelizing Optimal Multiple Sequence Alignment by Dynamic...

引用

IEEE international symposium on parallel and Distributed Processing with Applications

作者： Helal, Manal El-Gindy, Hossam Mullin, Lenore Gaeta, Bruno Univ New S Wales Sch Engn & Comp Sci Fac Engn Sydney NSW Australia Natl Sci Fdn Washington DC USA

ISBN: (纸本)9780769534718

Optimal multiple sequence alignment by dynamic programming, like many highly dimensional scientific computing problems, has failed to benefit from the improvements in computing performance brought about by multi-processor systems, due to the lack of suitable scheme to manage partitioning and dependencies. A scheme for parallel implementation of the dynamic programming multiple sequence alignment is presented, based on a peer to peer design and a multidimensional array indexing method. This design results in up to 5-fold improvement compared to a previously described master/slave design, and scales favourably with the number of processors used. This study demonstrates an approach for parallelising multi-dimensional dynamic programming and similar algorithms utilizing multi-processor architectures.

关键词： Sequence Alignment dynamic programming Multiprocessor computational performance PROCESSOR Processor architectures

来源：评论

学校读者我要写书评

暂无评论

parallel local alignment algorithm for multiple sequences on heterogeneous cluster systems

Parallel local alignment algorithm for multiple sequences on...

引用

international symposium on parallel architectures, algorithms, and programming

作者： Cui, Xin Zhong, Cheng Lu, Xiang-Yan School of Computer and Electronics and Information Guangxi University Nanning China School of Information Engineering DaLian University DaLian China

ISBN: (纸本)9780769543123

By taking into account communication startup overhead and the assigned processor distribution order and by applying hashing technique, a novel sequence distribution strategy is presented and the parallel local alignment algorithm for multiple sequences is designed on the heterogeneous cluster system that the computing nodes have different computing speeds and communication capabilities based on divisible load principle. The experimental results on the cluster system with heterogeneous personal computers show that, compared with the parallel algorithm with the average sequence distribution approach, the parallel local alignment algorithm for multiple sequences with the presented sequence distribution strategy can decrease the execution time of 13%∼35%, and it can obtain good speedup and scalability. © 2010 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel point-multiplication for conic curves cryptosystem

Parallel point-multiplication for conic curves cryptosystem

引用

international symposium on parallel architectures, algorithms, and programming

作者： Li, Yongnan Xiao, Limin State Key Laboratory of Software Development Environment Beijing 100191 China School of Computer Science and Engineering Beihang University Beijing 100191 China

ISBN: (纸本)9780769543123

Cryptosystem on conic curves, which is a new developing cryptography, becomes more widespread in these days. It is important to explore fast parallel algorithms to both encrypt and decrypt information in conic curves cryptosystem. Point-multiplication is the key operation for constructing security protocol in conic curves cryptosystem. There is no existing research focused on paralleling point-multiplication for conic curves cryptosystem. This paper presents parallel computation of point-multiplication for conic curves cryptosystem over finite field Fp and ring Zn. Research in this paper is based on our previous works about several parallel algorithms for conic curves cryptosystem. The parallel technique of point-multiplication is computing point-addition and pointdouble respectively. The performance evaluation demonstrates that our methodology could improve efficiency for conic curves cryptosystem over finite field Fp and ring Zn. © 2010 IEEE.

关键词： Cryptography

来源：评论

学校读者我要写书评

暂无评论

Accelerating CUDA Graph algorithms at Maximum Warp 11

Accelerating CUDA Graph Algorithms at Maximum Warp

引用

16th ACM symposium on Principles and Practice of parallel programming

作者： Hong, Sungpack Kim, Sang Kyun Oguntebi, Tayo Olukotun, Kunle Stanford Univ Comp Syst Lab Stanford CA 94305 USA

ISBN: (纸本)9781450301190

Graphs are powerful data representations favored in many computational domains. Modern GPUs have recently shown promising results in accelerating computationally challenging graph problems but their performance suffers heavily when the graph structure is highly irregular, as most real-world graphs tend to be. In this study, we first observe that the poor performance is caused by work imbalance and is an artifact of a discrepancy between the GPU programming model and the underlying GPU architecture. We then propose a novel virtual warp-centric programming method that exposes the traits of underlying GPU architectures to users. Our method significantly improves the performance of applications with heavily imbalanced workloads, and enables trade-offs between workload imbalance and ALU underutilization for fine-tuning the performance. Our evaluation reveals that our method exhibits up to 9x speedup over previous GPU algorithms and 12x over single thread CPU execution on irregular graphs. When properly configured, it also yields up to 30% improvement over previous GPU algorithms on regular graphs. In addition to performance gains on graph algorithms, our programming method achieves 1.3x to 15.1x speedup on a set of GPU benchmark applications. Our study also confirms that the performance gap between GPUs and other multi-threaded CPU graph implementations is primarily due to the large difference in memory bandwidth.

关键词： algorithms Performance parallel graph algorithms CUDA GPGPU

来源：评论

学校读者我要写书评

暂无评论

parallel Architecture, Algorithm and programming 1st ed. 2017

引用

丛书名： Communications in Computer and Information Science

2017年

作者： Guoliang Chen Hong Shen Mingrui Chen

ISBN: (数字)9789811064425

ISBN: (纸本)9789811064418

This book constitutes the refereed proceedings of the 8th international symposium on parallel Architecture, Algorithm and programming, PAAP 2017, held in Haikou, China, in June 2017. The 50 revised full papers and 7 revised short papers presented were carefully reviewed and selected from 192 submissions. The papers deal with research results and development activities in all aspects of parallel architectures, algorithms and programming techniques.

关键词：

来源：评论

学校读者我要写书评

暂无评论

programming with transactions and chemical abstract machine

Programming with transactions and chemical abstract machine

引用

Proceedings of the 1996 2nd international symposium on parallel architectures, algorithms, and Networks, I-SPAN

作者： Ma, Wanli Johnson, Christopher W. Brent, Richard P. Australian Natl Univ Canberra

The coordination style programming language T-Cham extends chemical abstract machine (Cham) with transactions. The Cham is an interactive computational model based on chemical reaction metaphor, where a computation proceeds as a succession of chemical reactions. A transaction is a piece of sequentially executed codes and could be written in any language, such as C, Pascal, or Fortran etc., as long as it satisfies its pre-condition and post-condition. Every transaction begins its execution whenever its execution condition is satisfied. A T-Cham program can be executed in a parallel, distributed, or sequential manner based on the available computer resources.

关键词： Computer programming languages

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：