Search Results - Inner Mongolia University Library

International Symposium on Parallel Processing

Authors: J.L. Sobral; A.J. Proenca. Departamento de Informática, Universidade do Minho, Braga, Portugal

This paper presents the SCOOPP (SCalable Object Oriented Parallel Programming) approach to support the design and execution of scalable parallel applications. The SCOOPP programming model aims at portability, dynamic scalability and efficiency of parallel applications. SCOOPP is a hybrid compile-time and run-time system, which can perform parallelism extraction, supports explicit parallelism and performs dynamic granularity control at run-time. The mechanism that supports dynamic grain-size adaptation is presented and its performance is evaluated on two parallel systems. The measured results show the feasibility of the proposed dynamic grain-size adaptation and a scalability improvement of parallel applications over static parallel OO environments, which suggests cost benefits in developing scalable parallel applications to run on multiple platforms.

Keywords: Dynamic programming; Parallel programming; Parallel processing; Concurrent computing; Programming profession; Degradation; Communication system traffic control; Electrical capacitance tomography; Object oriented modeling; Costs

Portable parallel programming for the dynamic load balancing of unstructured grid applications

International Symposium on Parallel Processing

Authors: R. Biswas; S.K. Das; D. Harvey; L. Oliker. NASA Ames Research Center, MRJ Technology Solutions Inc., CA, USA; Department of Computer Sciences, University of North Texas, Denton, TX, USA; NASA Ames Research Center, RIACS, CA, USA

The ability to dynamically adapt an unstructured grid (or mesh) is a powerful tool for solving computational problems with evolving physical features; however, an efficient parallel implementation is rather difficult, particularly from the viewpoint of portability on various multiprocessor platforms. We address this problem by developing PLUM, an automatic and architecture-independent framework for adaptive numerical computations in a message-passing environment. Portability is demonstrated by comparing performance on an SP2, an Origin2000, and a T3E, without any code modifications. We also present a general-purpose load balancer that utilizes symmetric broadcast networks (SBN) as the underlying communication pattern, with the goal of providing a global view of system loads across processors. Experiments on an SP2 and an Origin2000 demonstrate the portability of our approach, which achieves superb load balance at the cost of minimal extra overhead.

Keywords: Parallel programming; Dynamic programming; Load management; Physics computing; Concurrent computing; Runtime; Application software; NASA; Broadcasting; Parallel processing

DP: A paradigm for anonymous remote computation and communication for cluster computing

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, vol. 12, no. 10, pp. 1052-1065

Authors: Johnson, BK; Karthikeyan, R; Ram, DJ. Indian Inst Technol, Dept Comp Sci & Engn, Distributed & Object Syst Grp, Madras 600036, Tamil Nadu, India

This paper explores the transparent programmability of communicating parallel tasks in a Network of Workstations (NOW). Programs which are tied up with specific machines will not be resilient to the changing conditions of a NOW. The Distributed Pipes (DP) model enables location independent intertask communication among processes across machines. This approach enables migration of communicating parallel tasks according to runtime conditions. A transparent programming model for a parallel solution to Iterative Grid Computations using DP is also proposed. Programs written using the model are resilient to the heterogeneity of nodes and changing conditions in the NOW. They are also devoid of any network related code. The design of runtime support and function library support are presented. An engineering problem, namely, the Steady State Equilibrium Problem, is studied over the model. The performance analysis shows the speedup due to parallel execution and scaled down memory requirements. We present a case where the effect of communication overhead can be nullified to achieve a linear to super-linear speedup. The analysis discusses performance resilience of Iterative Grid Computations and characterizes synchronization delay among subtasks, and the effect of network overhead and load fluctuations on performance. The performance saturation characteristics of such applications are also studied.

Keywords: parallel programming; data parallelism; task parallelism; network of workstations; loosely coupled distributed systems; distributed problem solving; distributed pipes; steady state distribution

Imagine: Media processing with streams

IEEE MICRO, 2001, vol. 21, no. 2, pp. 35-46

Authors: Khailany, B; Dally, WJ; Kapasi, UJ; Mattson, P; Namkoong, J; Owens, JD; Towles, B; Chang, A; Rixner, S. Stanford Univ, Comp Syst Lab, Stanford, CA 94305, USA; Rice Univ, Houston, TX 77251, USA

The power-efficient Imagine stream processor achieves performance densities comparable to those of special-purpose embedded processors. Executing programs mapped to streams and kernels, a single Imagine processor is ...

Keywords: Streaming media; Kernel; Cameras; Computer architecture; Bandwidth; Application software; Parallel programming; Data mining; Arithmetic; Logic

Towards fully parallel aerospace simulations on unstructured meshes

ENGINEERING COMPUTATIONS, 2001, vol. 18, no. 3-4, pp. 347-375

Authors: Weatherill, NP; Hassan, O; Morgan, K; Jones, JW; Larwood, B. Univ Wales, Dept Civil Engn, Swansea, W Glam, Wales

A general philosophy is presented in which all the modules within the computational cycle are parallelised and executed on parallel computer hardware, thereby avoiding the creation of computational bottlenecks. In particular, unstructured mesh generation with adaption, computational fluid dynamics and computational electromagnetic solvers and the visualisation of grid and solution data are all performed in parallel. In addition, all these modules are embedded within a parallel problem solving environment. This paper will provide an overview of these developments. In particular, details of the parallel mesh generator, which has been used to generate meshes in excess of 100 million elements, will be given. A brief overview will be presented of the approach used to parallelise the solvers and how large data sets are interrogated and visualised on distributed computer platforms. Details of the parallel adaption algorithm will be presented. These parallel component modules are linked using CORBA communications to provide an integrated parallel approach for large scale simulations. Several examples are given of the approach applied to the simulation of large aerospace calculations in the field of aerodynamics and electromagnetics.

Keywords: parallel programming; mesh generation; problem solving; simulation; aerospace

Object-oriented analysis and design of the Message Passing Interface

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2001, vol. 13, no. 4, pp. 245-292

Authors: Skjellum, A; Wooley, DG; Lu, ZY; Wolf, M; Bangalore, PV; Lumsdaine, A; Squyres, JM; McCandless, B. Mississippi State Univ, Engn Res Ctr, Mississippi State, MS 39762, USA; Mississippi State Univ, Dept Comp Sci, Mississippi State, MS 39762, USA; Univ Notre Dame, Dept Comp Sci & Engn, Comp Sci Lab, Notre Dame, IN 46556, USA

The major contribution of this paper is the application of modern analysis techniques to the important Message Passing Interface standard, work done in order to obtain information useful in designing both application programmer interfaces for object-oriented languages, and message passing systems. Recognition of 'Design Patterns' within MPI is an important discernment of this work. A further contribution is a comparative discussion of the design and evolution of three actual object-oriented designs for the Message Passing Interface (MPI-1) application programmer interface (API), two of which have influenced the standardization of C++ explicit parallel programming with MPI-2, and which strongly indicate the value of a priori object-oriented design and analysis of such APIs. Knowledge of design patterns is assumed herein. Discussion provided here includes systems developed at Mississippi State University (MPI++), the University of Notre Dame (OOMPI), and the merger of these systems that results in a standard binding within the MPI-2 standard. Commentary concerning additional opportunities for further object-oriented analysis and design of message passing systems and APIs, such as MPI-2 and MPI/RT, are mentioned in conclusion. Connection of modern software design and engineering principles to high performance computing programming approaches is a new and important further contribution of this work. Copyright (C) 2001 John Wiley & Sons, Ltd.

Keywords: message passing; parallel programming; object-oriented application programmer interface; design patterns; MPI-1; MPI-2

An approach to the specification and verification of a hardware compilation scheme

JOURNAL OF SUPERCOMPUTING, 2001, vol. 19, no. 1, pp. 23-39

Authors: Bowen, JP; He, JF. S Bank Univ, Ctr Appl Formal Methods, Sch Comp Informat Syst & Math, Borough Rd, London SE1 0AA, England; UN Univ, Int Inst Software Technol, Macau, Macao, Peoples R China

The use of Field Programmable Gate Arrays (FPGA) to produce custom hardware circuits rapidly using a completely software-based process is becoming increasingly widespread. Specialized Hardware Description Languages (HDL) are used to describe and develop the required circuits. In this paper, we advocate using an even more general purpose programming language, based on Occam, for the automatic compilation of high-level programs to low-level circuits. The parallel constructs of Occam can map directly to hardware as conveniently as to software, with potentially dramatic speed-up of highly parallel algorithms. We demonstrate that the compilation process can be verified using algebraic refinement laws, increasing the confidence in its correctness. Verification is particularly important in high-integrity systems where safety or security is paramount. A prototype compiler has also been produced very directly from the theorems using the logic programming language Prolog.

Keywords: digital systems; formal specification; hardware compilation; parallel programming; programmable hardware; refinement; verification

FATCOP 2.0: Advanced features in an opportunistic mixed integer programming solver

ANNALS OF OPERATIONS RESEARCH, 2001, vol. 103, no. 1-4, pp. 17-32

Authors: Chen, Q; Ferris, MC; Linderoth, J. Univ Wisconsin, Dept Comp Sci, Madison, WI 53706, USA; Oracle Corp, Portland Dev Ctr, Portland, OR 97204, USA; Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439, USA

We describe FATCOP 2.0, a new parallel mixed integer program solver that works in an opportunistic computing environment provided by the Condor resource management system. We outline changes to the search strategy of FATCOP 1.0 that are necessary to improve resource utilization, together with new techniques to exploit heterogeneous resources. We detail several advanced features in the code that are necessary for successful solution of a variety of mixed integer test problems, along with the different usage schemes that are pertinent to our particular computing environment. Computational results demonstrating the effects of the changes are provided and used to generate effective default strategies for the FATCOP solver.

Keywords: integer programming; Condor; PVM; parallel programming

Parallelization of an ecological landscape model by functional decomposition

ECOLOGICAL MODELLING, 2001, vol. 144, no. 1, pp. 13-20

Authors: Cornwell, CF; Wille, LT; Wu, YG; Sklar, FH. Florida Atlantic Univ, Dept Phys, Boca Raton, FL 33431, USA; S Florida Water Management Dist, Everglades Syst Res Div, W Palm Beach, FL 33416, USA

A functional scheme is described to parallelize computer simulations of grid-based ecological landscape models. The method is implemented using the Message Passing Interface protocol and is applied to the Everglades Landscape Vegetation Model. On a two-processor system, the speed-up is satisfactory and the overall performance of the program is competitive with traditional parallelization techniques such as geometrical decomposition. The method is discussed, timing information is provided for three different parallel machines, and some further developments are indicated. (C) 2001 Elsevier Science B.V. All rights reserved.

Keywords: parallel programming; functional decomposition; vegetation landscape model

Short-term priming, concurrent processing, and saccade curvature during a target selection task in the monkey

VISION RESEARCH, 2001, vol. 41, no. 6, pp. 785-800

Authors: McPeek, RM; Keller, EL. Smith Kettlewell Eye Res Inst, San Francisco, CA 94115, USA

In human subjects, two mechanisms for improving the efficiency of saccades in visual search have recently been described: color priming and concurrent processing of two saccades. Since the monkey provides an important model for understanding the neural underpinnings of target selection in visual search, we sought to explore the degree to which the saccadic system of monkeys uses these same mechanisms. Therefore, we recorded the eye movements of rhesus monkeys performing a simple color-oddity pop-out search task, similar to that used previously with human subjects. The monkeys were rewarded for making a saccade to the odd-colored target, which was presented with an array of three distracters. The target and distracters were randomly chosen to be red or green in each trial. Similar to what was previously observed for humans, we found that monkeys show the influence of a cumulative, short-term priming mechanism which facilitates saccades when the color of the search target happens to repeat from trial to trial. Furthermore, we found that, like humans, when monkeys make an erroneous initial saccade to a distracter, they are capable of executing a second saccade to the target after a very brief inter-saccadic interval, suggesting that the two saccades have been programmed concurrently (i.e. in parallel). These results demonstrate a close similarity between human and monkey performance. We also made a new observation: we found that when monkeys make such two-saccade responses, the trajectory of the initial saccade tends to curve toward the goal of the subsequent saccade. This provides evidence that the two saccade goals are simultaneously represented on a common motor map, supporting the idea that the movements are processed concurrently. It also indicates that concurrent processing is not limited to brain areas involved in higher-level planning; rather, such parallel programming apparently occurs at a low enough level in the saccadic system that it can affect saccade traj

Keywords: saccade; visual search; priming; parallel programming
