ISBN (print): 9781605584133
The convergence of cluster and grid computing, two popular trends in today's high-performance computation, has created an imperative need for efficient utilization of the available resources. In this paper we present the concept, design and implementation of the Pleiad platform. Rooted in the idea of distributed shared memory (DSM), Pleiad is a cluster middleware that provides a shared-memory abstraction enabling transparent multithreaded execution across cluster nodes. It belongs to a new generation of cluster middleware that, beyond demonstrating proof of concept for unifying the cluster's memory resources, aims to achieve satisfactory levels of performance and scalability for a broad range of multithreaded applications. First results from the performance evaluation of Pleiad are encouraging and are presented in comparison with an efficient implementation of MPI for the Java platform.
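To make the shared-memory abstraction concrete, below is a minimal C++ sketch of the page-based DSM idea that middleware such as Pleiad builds on: a read miss fetches a page from its current owner, and a write first invalidates remote copies so every node keeps seeing one coherent heap. All names here (DsmRuntime, DsmPage, fetch_page) are hypothetical illustrations, not Pleiad's actual API, and the network operations are stubbed out.

#include <cstddef>
#include <cstdint>
#include <unordered_map>

constexpr std::size_t kPageSize = 4096;

enum class PageState { Invalid, ReadOnly, ReadWrite };

struct DsmPage {
    PageState state = PageState::Invalid;
    int owner = -1;                      // node currently holding the master copy
    std::uint8_t data[kPageSize] = {};
};

class DsmRuntime {
public:
    explicit DsmRuntime(int node_id) : node_id_(node_id) {}

    // A read miss fetches the page from its owner; a write additionally
    // invalidates remote copies, so every node keeps seeing one shared heap.
    std::uint8_t* access(std::size_t page_no, bool for_write) {
        DsmPage& p = pages_[page_no];
        if (p.state == PageState::Invalid)
            fetch_page(p);                     // network transfer (stubbed)
        if (for_write && p.state != PageState::ReadWrite) {
            invalidate_remote_copies(page_no); // coherence action (stubbed)
            p.state = PageState::ReadWrite;
            p.owner = node_id_;
        }
        return p.data;
    }

private:
    void fetch_page(DsmPage& p) { p.state = PageState::ReadOnly; }
    void invalidate_remote_copies(std::size_t) {}

    int node_id_;
    std::unordered_map<std::size_t, DsmPage> pages_;
};

int main() {
    DsmRuntime node(0);
    std::uint8_t* page = node.access(42, /*for_write=*/true);
    page[0] = 1;   // looks like an ordinary memory write to the application
}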
ISBN (print): 9781479987191
In shared-memory multicore architectures, handling a write cache operation is more complicated than in single-processor systems. A cache line may be present in more than one private L1 cache, so any cache wishing to write that line must inform all the other sharers. A cache coherence protocol is therefore necessary for multicore architectures. At present, directory-based protocols are the popular choice in both industry and academia because they generate less coherence traffic than snooping protocols, at the expense of an indirection. The write policy, write-through or write-back, is crucial in the protocol design. The write-through policy reduces the available bandwidth, since it increases the write traffic in the interconnection network, and it also increases energy consumption; however, it can efficiently solve the false-sharing problem via write updates. In this paper, we introduce a new way to reduce the write traffic of a write-through coherence protocol by combining write-through coherence with a write-back policy for non-coherent lines. The baseline write-through protocol used as reference is a scalable hybrid invalidate/update protocol. Simulation results show that our enhanced protocol reduces the write traffic in the interconnection network by at least 50% and gains up to 20% in performance compared with the baseline write-through protocol.
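As a rough illustration of the hybrid policy described above, the C++ sketch below dispatches a write hit per line: lines marked shared by the directory stay write-through (propagating an update), while private, non-coherent lines are handled write-back by merely setting a dirty bit, deferring traffic to eviction. The structures and the stubbed directory call are illustrative assumptions, not the paper's actual hardware design.

#include <cstdint>
#include <iostream>

struct CacheLine {
    std::uint64_t tag = 0;
    bool valid = false;
    bool dirty = false;    // meaningful only under write-back
    bool shared = false;   // set by the directory when more than one sharer exists
};

void send_update_to_directory(const CacheLine&) {}  // network message (stubbed)

// Decide what a store that hits in the L1 does under the hybrid policy.
void handle_write_hit(CacheLine& line) {
    if (line.shared) {
        // Shared lines stay write-through: propagate the new value via the
        // directory so other sharers are updated (avoids false sharing).
        send_update_to_directory(line);
    } else {
        // Private ("non-coherent") lines are write-back: set the dirty bit
        // and defer all network traffic until eviction.
        line.dirty = true;
    }
}

int main() {
    CacheLine private_line; private_line.valid = true;
    CacheLine shared_line;  shared_line.valid = true; shared_line.shared = true;
    handle_write_hit(private_line);
    handle_write_hit(shared_line);
    std::cout << "private dirty: " << private_line.dirty
              << ", shared dirty: " << shared_line.dirty << '\n';
}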
ISBN (print): 9783642155819
We present the project of parallelising the computational algebra system GAP. Our design aims to make concurrency facilities available to GAP users while preserving as much of the existing codebase (about one million lines of code) as possible, with as few changes as possible, and without requiring users (a large percentage of whom are domain experts in their fields without necessarily having a background in parallel programming) to learn complicated parallel programming techniques. To this end, we preserve the appearance of sequentiality on a per-thread basis by containing each thread within its own data space. Parallelism is made possible through the notion of migrating objects out of one thread's data space into that of another, allowing threads to interact.
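The data-space discipline can be pictured with the short C++ sketch below: each object records an owning thread, only the owner may read it, and migration transfers ownership before another thread touches it. Object, read and migrate are illustrative stand-ins for the idea only; GAP's actual region implementation differs.

#include <cassert>
#include <future>
#include <thread>

struct Object {
    std::thread::id owner = std::this_thread::get_id();  // born in its creator's data space
    int payload = 0;
};

// Only the owning thread may touch an object, preserving the appearance
// of sequential execution inside each data space.
int read(const Object& o) {
    assert(o.owner == std::this_thread::get_id());
    return o.payload;
}

// Migration hands the object over to another thread's data space; the
// sender must not access it afterwards.
void migrate(Object& o, std::thread::id new_owner) {
    assert(o.owner == std::this_thread::get_id());
    o.owner = new_owner;
}

int main() {
    Object obj;
    obj.payload = 41;

    std::promise<void> handed_off;
    std::future<void> ready = handed_off.get_future();
    std::thread worker([&] {
        ready.wait();        // block until ownership has arrived
        int v = read(obj);   // legal now: this thread owns obj
        (void)v;
    });

    migrate(obj, worker.get_id());  // main gives the object away
    handed_off.set_value();
    worker.join();
}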
ISBN (print): 9781450363211
The still increasing number of transistors per chip offered by Moore's law, together with the Post-Dennard scaling era shifted the performance gain from frequency increase to multi-core processing. Consequently, the support of parallel execution of applications is becoming mandatory. Furthermore, the need for efficient parallel models and languages is more critical for the embedded domain, due to power consumption and memory constraints, among others. This work focuses on parallelizing an embedded speaker recognition application, which is a biometric technique for identification. While a lot of work has been done for speech recognition, fewer efforts have focused on recognizing who the speaker is. In this paper, we analyze two implementations for speaker recognition applications (SRA), namely dataflow and shared memory programming models. More precisely, we use Process Networks (PNs) as a dataflow representation, which is an intuitive way to design streaming applications. We use the language "C for Process Networks" for the dataflow implementation and OpenMP for the sharedmemory one. For two different target platforms, we compared two implementations using OpenMP (exploring data-level parallelism only and with pipelining) against a dataflow-based compiled implementation that allows for functional optimization. Despite faster communication over sharedmemory, we show that the dataflow model is superior in terms of performance (up to twice as fast).
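For the shared-memory variant, the data-level parallelism mentioned above boils down to scoring independent feature frames concurrently, as in the hedged OpenMP/C++ sketch below. The 39-dimensional frames and score_frame() are stand-ins for real acoustic features and speaker-model scoring, not code from the paper.

// build: g++ -fopenmp sra_omp.cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Hypothetical per-frame scoring against one speaker model (stand-in for
// a real GMM log-likelihood evaluation).
double score_frame(const std::vector<double>& frame) {
    double s = 0.0;
    for (double x : frame) s += std::log1p(x * x);
    return s;
}

int main() {
    // Stand-in for 1000 frames of 39-dimensional acoustic features.
    std::vector<std::vector<double>> frames(1000, std::vector<double>(39, 0.5));
    double total = 0.0;

    // Frames are scored independently, so a parallel-for with a reduction
    // captures the data-level parallelism discussed above.
    #pragma omp parallel for reduction(+ : total)
    for (long i = 0; i < static_cast<long>(frames.size()); ++i)
        total += score_frame(frames[i]);

    std::printf("total log-likelihood: %f\n", total);
}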
ISBN (print): 9781450375887
Due to the slowdown of Moore's Law, system designers have begun integrating non-cache-coherent heterogeneous computing elements in order to continue scaling performance. Programming such systems has traditionally been difficult: developers were forced to use programming models that exposed multiple memory regions, requiring them to manually maintain memory consistency. Previous works proposed distributed shared memory (DSM) as a way to achieve high programmability in such systems. However, past DSM systems were plagued by low-bandwidth networking and utilized complex memory consistency protocols, which limited their adoption. Recently, new networking technologies have begun to change the assumptions about which components are the bottlenecks in the system. Additionally, many popular shared-memory programming models utilize memory consistency semantics similar to those proposed for DSM, leading to widespread adoption in mainstream programming. In this work, we argue that it is time to revive DSM as a means for achieving good programmability and performance on non-cache-coherent systems. We explore optimizing an existing DSM protocol by relaxing memory consistency semantics and exposing new cross-node barrier primitives. We integrate the new mechanisms into an existing OpenMP runtime, allowing developers to leverage cross-node execution without changing a single line of code. When evaluated on an x86 server connected to an ARMv8 server via InfiniBand, the DSM optimizations achieve an average of 11% (up to 33%) improvement versus the baseline DSM implementation.
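One way to picture the relaxed-consistency optimization is the C++ sketch below: under release-style semantics, stores go into a local write buffer and are only flushed to their home nodes at a synchronization point, here a cross-node barrier. The DsmNode interface and the stubbed network calls are assumptions for illustration, not the paper's actual runtime or its OpenMP integration.

#include <cstdint>
#include <unordered_map>

class DsmNode {
public:
    // Ordinary stores only reach a local write buffer, so no network
    // traffic is paid on the critical path.
    void store(std::uintptr_t addr, std::uint64_t value) {
        write_buffer_[addr] = value;
    }

    // The cross-node barrier acts as the release point: flush buffered
    // writes to their home nodes, then wait for every node (both stubbed).
    void barrier() {
        for (const auto& [addr, value] : write_buffer_)
            send_to_home_node(addr, value);
        write_buffer_.clear();
        wait_for_all_nodes();
    }

private:
    void send_to_home_node(std::uintptr_t, std::uint64_t) {}
    void wait_for_all_nodes() {}
    std::unordered_map<std::uintptr_t, std::uint64_t> write_buffer_;
};

int main() {
    DsmNode node;
    node.store(0x1000, 7);  // buffered locally, invisible to other nodes
    node.barrier();         // becomes visible cluster-wide only here
}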
This work presents a novel parallel branch-and-bound algorithm that, to the best of our knowledge, solves to optimality a set of instances of the multi-objective flexible job shop scheduling problem for the first time. It uses the well-known NSGA-II algorithm to initialize its upper bound. The algorithm is implemented for shared-memory architectures and, among its main features, incorporates a grid representation of the solution space and a concurrent priority queue to store and dispatch the pending sub-problems to be solved. We report the previously unknown optimal Pareto fronts of thirteen well-known instances from the literature, which will be very useful to the scientific community for measuring the performance of their algorithms more accurately. Indeed, we carefully analyze the performance of NSGA-II on these instances, comparing its results against the optimal ones computed in this work. Extensive computational experiments show that the proposed algorithm achieves a speedup of 15.64x with an efficiency of 65.20% using 24 cores.
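The core of the shared-memory design, a pool of workers fed by a concurrent priority queue, can be sketched in C++ as below. The sketch simplifies to a single scalar bound (the real algorithm maintains a Pareto front) and omits proper termination detection; Subproblem, the seed bound and the pruning test are illustrative placeholders.

#include <atomic>
#include <mutex>
#include <queue>
#include <thread>
#include <vector>

struct Subproblem {
    int lower_bound = 0;  // scalar stand-in; the real algorithm keeps a Pareto front
    // ... a partial schedule would live here ...
};

// Order the queue so the most promising (lowest-bound) node is popped first.
bool operator<(const Subproblem& a, const Subproblem& b) {
    return a.lower_bound > b.lower_bound;
}

std::priority_queue<Subproblem> pending;  // shared pool of pending sub-problems
std::mutex pending_mtx;                   // makes the queue "concurrent"
std::atomic<int> best_upper{100};         // incumbent bound, e.g. seeded by NSGA-II

void worker() {
    for (;;) {
        Subproblem sp;
        {
            std::lock_guard<std::mutex> lk(pending_mtx);
            if (pending.empty()) return;  // simplified: no termination detection
            sp = pending.top();
            pending.pop();
        }
        if (sp.lower_bound >= best_upper.load()) continue;  // prune dominated node
        // branch(sp): push children back into the queue, tighten best_upper...
    }
}

int main() {
    pending.push({10});
    pending.push({50});
    std::vector<std::thread> pool;
    for (int i = 0; i < 4; ++i) pool.emplace_back(worker);
    for (auto& t : pool) t.join();
}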
With the emergence of accelerators like GPUs, MICs and FPGAs, the availability of domain-specific libraries (like MKL) and the ease of parallelization associated with CUDA and OpenMP-based shared-memory programming, node-based parallelization has recently become a popular choice among developers in the field of scientific computing. This is evident from the large volume of recently published work in various domains of scientific computing where shared-memory programming and accelerators have been used to accelerate applications. Although these approaches are suitable for small problem sizes, several issues must be addressed before they can be applied to larger input domains. First, the primary focus of these works has been to accelerate the core kernel; acceleration of input/output operations is seldom considered. Many operations in scientific computing operate on large matrices, both sparse and dense, that are read from and written to external files. These input/output operations become bottlenecks and significantly affect the overall application time. Second, node-based parallelization prevents a developer from distributing the computation beyond a single node without learning an additional programming paradigm such as MPI. Third, the problem size that a node can effectively handle is limited by the memory of the node and its accelerator. In this paper, an Asynchronous Multi-node Execution (AMNE) approach is presented that uses a unique combination of the shared file system and pseudo-replication to extend node-based algorithms to a distributed multi-node implementation with minimal changes to the original node-based code. We demonstrate this approach by applying it to GEMM, a popular kernel in dense linear algebra, and show that the presented methodology significantly advances the state of the art in parallelization and scientific computing.
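Applied to GEMM, the shared-file-system-plus-pseudo-replication idea can be pictured with the C++ sketch below: every node reads the full A and B from the shared file system, computes only its assigned row-block of C with the unchanged node-local kernel, and writes that slice back. The rank/count environment variables and the stubbed I/O helpers are assumptions for illustration, not the paper's interface.

#include <cstddef>
#include <cstdlib>
#include <vector>

// Stubbed I/O helpers over the shared file system; a real run would
// read and write binary matrix files visible to every node.
std::vector<double> read_matrix(const char* /*path*/, int rows, int cols) {
    return std::vector<double>(static_cast<std::size_t>(rows) * cols, 1.0);
}
void write_block(const char* /*path*/, const std::vector<double>&,
                 int /*first_row*/, int /*rows*/, int /*cols*/) {}

int main() {
    const int n = 256;  // kept small for the sketch
    const char* r = std::getenv("NODE_RANK");
    const char* c = std::getenv("NODE_COUNT");
    const int rank  = r ? std::atoi(r) : 0;
    const int nodes = c ? std::atoi(c) : 1;

    // "Pseudo-replication": every node reads the full inputs from the
    // shared file system instead of exchanging them over MPI.
    std::vector<double> A = read_matrix("A.bin", n, n);
    std::vector<double> B = read_matrix("B.bin", n, n);

    // Each node owns one contiguous row-block of C (n divisible by the
    // node count is assumed here for brevity).
    const int rows = n / nodes;
    const int first = rank * rows;

    std::vector<double> C(static_cast<std::size_t>(rows) * n, 0.0);
    for (int i = 0; i < rows; ++i)          // unchanged node-local kernel
        for (int k = 0; k < n; ++k)
            for (int j = 0; j < n; ++j)
                C[i * n + j] += A[(first + i) * n + k] * B[k * n + j];

    write_block("C.bin", C, first, rows, n);
}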