检索结果-内蒙古大学图书馆

Proceedings of the 1995 2nd International conference on programming models for massively parallel Computers

作者： O'Boyle, M.F.P. Bull, J.M. Univ of Manchester Manchester United Kingdom

In this paper we show that, under different circumstances, data scheduling and loop scheduling are both useful models for parallel programs executing on shared virtual memory (SVM) systems. We therefore propose a unified programming model that permits both types of scheduling. We show that, given affine array references, a program segment which is parallel under loop scheduling can always be transformed to make it parallel under data scheduling and vice-versa, and hence that the two types of scheduling are equally powerful at exploiting parallelism. We review existing Fortran dialects for SVM and propose compiler directives that allow program segments to be data scheduled.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Retargeting sequential image-processing programs for data parallel execution

引用

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 2005年第2期31卷 116-136页

作者： Baumstark, LB Wills, LM State Univ W Georgia Dept Comp Sci Carrollton GA 30118 USA Georgia Inst Technol Sch Elect & Comp Engn Atlanta GA 30332 USA

New compact, low-power implementation technologies for processors and imaging arrays can enable a new generation of portable video products. However, software compatibility with large bodies of existing applications written in C prevents more efficient, higher performance data parallel architectures from being used in these embedded products. If this software could be automatically retargeted explicitly for data parallel execution, product designers could incorporate these architectures into embedded products. The key challenge is exposing the parallelism that is inherent in these applications but that is obscured by artifacts imposed by sequential programming languages. This paper presents a recognition-based approach for automatically extracting a data parallel program model from sequential image processing code and retargeting it to data parallel execution mechanisms. The explicitly parallel model presented, called multidimensional data flow ( MDDF), captures a model of how operations on data regions ( e. g., rows, columns, and tiled blocks) are composed and interact. To extract an MDDF model, a partial recognition technique is used that focuses on identifying array access patterns in loops, transforming only those program elements that hinder parallelization, while leaving the core algorithmic computations intact. The paper presents results of retargeting a set of production programs to a representative data parallel processor array to demonstrate the capacity to extract parallelism using this technique. The retargeted applications yield a potential execution throughput limited only by the number of processing elements, exceeding thousands of instructions per cycle in massively parallel implementations.

关键词： Reengineering SIMD processors data-level parallelization explicitly parallel program representation program recognition

来源：评论

学校读者我要写书评

暂无评论

Integrated Route Assignment and Traffic Simulation System with a massively parallel Computing Architecture.

Integrated Route Assignment and Traffic Simulation System wi...

引用

Proceedings of the Pacific Rim Trans Tech conference

作者： Chang, G. Junchaya, T. Zhuang, L. Univ of Maryland College Park United States

ISBN: (纸本)0872629163

This research presents two critical issues in the development of an integrated route assignment and traffic simulation system of ATMS-ATIS applications. The first issue addresses the conceptual and algorithmic aspects of the models. the conceptual aspects of the models. The second concerns with computation and implementation efficiency which lead to the exploration of using advanced parallel computing architecture. We propose an integrated system that has been implemented on a massively parallel computing architecture. This paper presents the structure of the proposed system, along with a brief description of each component.

关键词： Traffic control

来源：评论

学校读者我要写书评

暂无评论

Load balancing hybrid programming models for SMP clusters and fully permutable loops

Load balancing hybrid programming models for SMP clusters an...

引用

34th International conference on parallel Processing (ICPP)

作者： Drosinos, N Koziris, N Natl Tech Univ Athens Sch Elect & Comp Engn Comp Syst Lab Athens Greece

ISBN: (纸本)0769523811

This paper emphasizes on load balancing issues associated with hybrid programming models for the parallelization of fully permutable nested loops onto SMP clusters. Hybrid parallel programming models usually suffer from intrinsic load imbalance between threads, mainly because most existing message passing libraries generally provide limited multi-threading support, allowing only the master thread to perform inter-node message passing communication. In order to mitigate this effect, we propose a generic method for the application of static load balancing on the coarse-grain hybrid model for the appropriate distribution of the computational load to the working threads. We experimentally evaluate the efficiency of the proposed scheme against a micro-kernel benchmark, and demonstrate the potential of such load balancing schemes for the extraction of maximum performance out of hybrid parallel programs.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Solving nonlinear financial planning problems with 109 decision variables on massively parallel architectures

Solving nonlinear financial planning problems with 109 decis...

引用

2nd International conference on Computational Finance and its Applications, COMPUTATIONAL FINANCE 2006, CF06

作者： Gondzio, J. Grothey, A. School of Mathematics University of Edinburgh

ISBN: (纸本)1845641744

Multistage stochastic programming is a popular technique to deal with uncertainty in optimization models. However, the need to adequately capture the underlying distributions leads to large problems that are usually beyond the scope of general purpose solvers. Dedicated methods exist but pose restrictions on the type of model they can be applied to. parallelism makes these problems potentially tractable, but is generally not exploited in today's general purpose solvers. We apply a structure-exploiting parallel primal-dual interior-point solver for linear, quadratic and nonlinear programming problems. The solver efficiently exploits the structure of these models. Its design relies on object-oriented programming principles, treating each substructure of the problem as an object carrying its own dedicated linear algebra routines. We demonstrate its effectiveness on a wide range of financial planning problems, resulting in linear, quadratic or non-linear formulations. Also coarse grain parallelism is exploited in a generic way that is efficient on any parallel architecture from ethernet linked PCs to massively parallel computers. On a 1280-processor machine with a peak performance of 6.2 TFlops we can solve a quadratic financial planning problem exceeding 109 decision variables.

关键词： Financial data processing

来源：评论

学校读者我要写书评

暂无评论

A Component Based Graphical parallel programming Approach for Numerical Simulation Development 2

A Component Based Graphical Parallel Programming Approach fo...

引用

2nd IEEE International conference on High Performance and Smart Computing (IEEE HPSC)

作者： Liao Li Mo Zeyao Zhang Aiqing Inst Appl Phys & Computat Math CAEP Software Ctr High Performance Numer Simulat High Performance Comp Ctr Beijing Peoples R China

ISBN: (纸本)9781509024032

Building massively parallel numerical simulations is not easy due to lasting changes of parallel programming models and various software technologies needed. We develop a component based graphical parallel programming approach to lower the difficulties of coding applications in scientific and engineering computing and support rapid development of large scale simulations basing on a domain specific framework. parallel applications can be constructed simply by configuring components and assembling them in predefined flowcharts interactively. Large part of codes is auto generated from the graphical configuration for an application. The approach facilitates the rapid design and development of parallel numerical simulations by shielding many knowledge and technologies required from domain experts. Real applications demonstrate that the approach for developing complex numerical is both practical and efficient.

关键词： scientific computing domain-specific framework component-based programming parallel application GUI

来源：评论

学校读者我要写书评

暂无评论

Toward a tool for automatic implementation of parallel image processing applications 1

Toward a tool for automatic implementation of parallel image...

引用

1st International conference on massively parallel Computing Systems: The Challenges of General-Purpose and Special-Purpose Computing, MPCS 1994

作者： Cartier, S. Fiorini, P. Hoeltzener-Douarin, B. Pissaloux, E. Etablissement Technique Central de L'Armement 16bis Avenue Prieur de la Côte d'Or ArcueilF-94114 France

ISBN: (纸本)0818663227

This paper describes a research proposal related to the design of parallel software for image processing. The proposal focuses on the design of a tool, which generates the best implementation of a given application taking into account different parallel programming models available on different parallel (heterogeneous) computers. All steps needed for the automatic mapping of applications are presented. © 1994 IEEE.

关键词： Image processing

来源：评论

学校读者我要写书评

暂无评论

massively parallel Landscape-Evolution Modelling using General Purpose Graphical Processing Units

Massively Parallel Landscape-Evolution Modelling using Gener...

引用

19th International conference on High Performance Computing (HiPC)

作者： McGough, A. S. Liang, S. Rapoportas, M. Grey, R. Vinod, G. Kumar Maddy, D. Trueman, A. Wainwright, J. Newcastle Univ Sch Comp Sci Newcastle Upon Tyne NE1 7RU Tyne & Wear England Newcastle Univ Sch Geog Polit & Sociol Newcastle Upon Tyne Turkey Univ Durham Dept Geog Durham England Newcastle Univ Newcastle Upon Tyne Tyne & Wear England

ISBN: (纸本)9781467323703;9781467323727

As our expectations of what computer systems can do and our ability to capture data improves, the desire to perform ever more computationally intensive tasks increases. Often these tasks, comprising vast numbers of repeated computations, are highly interdependent on each other - a closely coupled problem. The process of Landscape-Evolution Modelling is an example of such a problem. In order to produce realistic models it is necessary to process landscapes containing millions of data points over time periods extending up to millions of years. This leads to non-tractable execution times, often in the order of years. Researchers therefore seek multiple orders of magnitude reduction in the execution time of these models. The massively parallel programming environment offered through General Purpose Graphical Processing Units offers the potential for multiple orders of magnitude speedup in code execution times. In this paper we demonstrate how the time dominant parts of a Landscape-Evolution Model can be recoded for a massively parallel architecture providing two orders of magnitude reduction in execution time.

关键词： parallel LEM GPU CUDA

来源：评论

学校读者我要写书评

暂无评论

Scaling Analysis of Solving Algorithms for Canonical Problem of Dispatching in the Context of Dynamic programming 7

Scaling Analysis of Solving Algorithms for Canonical Problem...

引用

7th International conference Internet Technologies and Applications (ITA)

作者： Fedosenko, Yuriy S. Reznikov, Mikhail B. Plekhov, Aleksandr S. Chakirov, Roustiam Houlden, Nigel Volga State Univ Water Transportat 5 Nesterov St Nizhnii Novgorod 603005 Russia Alekseev Nizhny Novgorod State Tech Univ 24 Minin St Nizhnii Novgorod 603155 Russia Bonn Rhein Sieg Univ Appl Sci 20 Grantham Allee D-53757 St Augustin Germany Glyndwr Univ Mold Rd Wrexham LL11 2AW Wales

ISBN: (纸本)9781509048151

The paper analyses computational model based on dynamic programming for platforms with multicore processors and heterogeneous architectures with FPGA. The models are applied for solving a canonical problem of dispatching where the computation time significantly depends on the problem scale factor. The parallel algorithms of NP-hard problem of dispatching are complicate and require intensive RAM data exchange. In order to reduce the computation time, it is suggested to use FPGA as a coprocessor providing massively parallel computation and increase the operational performance of the system in one order.

关键词： massively parallel calculations dynamic programming dispatching problem calculations modeling discrete optimisation

来源：评论

学校读者我要写书评

暂无评论

View-oriented parallel programming on multi-core clusters

View-oriented parallel programming on multi-core clusters

引用

6th New Zealand Computer Science Research Student conference, NZCSRSC 2008

作者： Huang, Qihang Department of Computer Science University of Otago Dunedin New Zealand

Driven by the ever-growing demand for computing power, computers are becoming more and more powerful. However, in recent years, due to the physical limitations, this increased computing power does not come in the form of increased CPU clock speed, but in the form of more cores (processors) in a single chip die. Computer industry has started to use this new multi-core technology to massively produce systems for both stand-alone desktop PCs and high-end servers. In the near future, multi-core cluster will become one of the most economic supercomputer architectures. In order to utilize the full power of multi-core systems, some kind of parallel computing is necessary. However, parallel programming is notoriously known as a challenge job. This paper analyzes different parallel programming models, compares their strengths and weaknesses on multi-core based systems, and introduces an on-going project on providing a better parallel programming environment based on a novel View-Oriented parallel programming (VOPP) model. Copyright is held by the author/owner(s).

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：