Programming a distributed memory parallel machine generally entails a high degree of complexity. Load balancing in particular is a demanding task. If high efficiency is to be maintained, this task cannot be solved by a distributed operating system alone, but must involve the application programmer. Instead of the underlying message passing architecture being shielded from the programmer, it should be explicitly modeled. Three key concepts of a parallel operating system (dual, mobile, and reactive objects) are presented. They provide simple but efficient mechanisms that can be easily utilized for such complex tasks as load balancing, i.e., initial placement and migration of application entities. To illustrate the applicability of these concepts, a simple VR application, geoview, was implemented on a message passing architecture and serves as an example throughout the paper.
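The abstract does not give the system's actual interface, but the flavor of explicit initial placement and migration can be sketched as follows. The C++ types and functions here (MobileObject, Node, place, migrate_once) are purely illustrative assumptions, not the paper's API; placement goes to the least-loaded node and a migration step moves one object from the most-loaded to the least-loaded node.

```cpp
// Hypothetical sketch of "mobile objects" for explicit load balancing.
// Names and policies are illustrative, not taken from the paper.
#include <algorithm>
#include <iostream>
#include <memory>
#include <string>
#include <vector>

struct MobileObject {
    std::string id;
    double load;   // estimated work carried by this application entity
    MobileObject(std::string i, double l) : id(std::move(i)), load(l) {}
};

struct Node {
    int rank = 0;
    std::vector<std::shared_ptr<MobileObject>> objects;
    double load() const {
        double s = 0;
        for (const auto& o : objects) s += o->load;
        return s;
    }
};

// Initial placement: assign each object to the currently least-loaded node.
void place(const std::vector<std::shared_ptr<MobileObject>>& objs,
           std::vector<Node>& nodes) {
    for (const auto& o : objs) {
        auto it = std::min_element(nodes.begin(), nodes.end(),
            [](const Node& a, const Node& b) { return a.load() < b.load(); });
        it->objects.push_back(o);
    }
}

// Migration: move one object from the most-loaded node to the least-loaded one.
void migrate_once(std::vector<Node>& nodes) {
    auto src = std::max_element(nodes.begin(), nodes.end(),
        [](const Node& a, const Node& b) { return a.load() < b.load(); });
    auto dst = std::min_element(nodes.begin(), nodes.end(),
        [](const Node& a, const Node& b) { return a.load() < b.load(); });
    if (src != dst && !src->objects.empty()) {
        dst->objects.push_back(src->objects.back());
        src->objects.pop_back();
    }
}

int main() {
    std::vector<Node> nodes(2);
    nodes[0].rank = 0;
    nodes[1].rank = 1;
    std::vector<std::shared_ptr<MobileObject>> objs;
    for (int i = 0; i < 6; ++i)
        objs.push_back(std::make_shared<MobileObject>("obj" + std::to_string(i), 1.0 + i));
    place(objs, nodes);     // initial placement by current node load
    migrate_once(nodes);    // one migration step to rebalance
    for (const auto& n : nodes)
        std::cout << "node " << n.rank << " carries load " << n.load() << "\n";
}
```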
A current limitation of compilers for shared memory parallel languages is their restricted use of traditional code-improving transformations, such as constant propagation and dead code elimination. A major problem lies in the lack of data flow analysis techniques for programs with user-specified parallelism. The authors demonstrate how data flow analysis remains quite viable in a compiler for shared memory parallel programs in a structured distributed shared memory environment, in which a shared space of tuples is accessed by properly synchronized methods. They demonstrate standard intraprocess data flow analysis performed in the midst of tuplespace communication statements, and present improvements to the precision of the analysis in the presence of these statements. They present a data flow system to compute reaching definitions across process boundaries, and a technique to improve the precision of this interprocess analysis. Lastly, some transformations enabled by this analysis are presented.
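As a point of reference for the analysis described above, here is a minimal, self-contained sketch of iterative reaching-definitions analysis over a small control-flow graph, using the standard data-flow equations IN[b] = union of OUT over predecessors and OUT[b] = GEN[b] ∪ (IN[b] − KILL[b]). It covers only the classical intraprocess case, none of the tuplespace-aware extensions from the paper, and the example CFG and definition numbering are invented for illustration.

```cpp
// Minimal iterative reaching-definitions analysis on a tiny CFG (intraprocess
// only; the paper's interprocess/tuplespace extensions are not shown).
#include <bitset>
#include <cstdio>
#include <vector>

constexpr int MAX_DEFS = 64;
using DefSet = std::bitset<MAX_DEFS>;

struct Block {
    DefSet gen, kill;          // definitions generated / killed in the block
    std::vector<int> preds;    // predecessor block indices
};

int main() {
    // Hypothetical 4-block CFG: 0 -> 1 -> 2 -> 3, with a back edge 2 -> 1.
    std::vector<Block> cfg(4);
    cfg[1].preds = {0, 2};
    cfg[2].preds = {1};
    cfg[3].preds = {2};
    cfg[0].gen.set(0);                        // d0 defined in block 0
    cfg[1].gen.set(1); cfg[1].kill.set(0);    // d1 redefines the same variable
    cfg[2].gen.set(2);

    std::vector<DefSet> in(cfg.size()), out(cfg.size());
    bool changed = true;
    while (changed) {                         // iterate to a fixed point
        changed = false;
        for (size_t b = 0; b < cfg.size(); ++b) {
            DefSet newIn;
            for (int p : cfg[b].preds) newIn |= out[p];           // IN = union of preds' OUT
            DefSet newOut = cfg[b].gen | (newIn & ~cfg[b].kill);  // OUT = GEN | (IN - KILL)
            if (newIn != in[b] || newOut != out[b]) {
                in[b] = newIn; out[b] = newOut; changed = true;
            }
        }
    }
    for (size_t b = 0; b < cfg.size(); ++b)
        std::printf("block %zu: IN=%s OUT=%s\n", b,
                    in[b].to_string().substr(MAX_DEFS - 4).c_str(),
                    out[b].to_string().substr(MAX_DEFS - 4).c_str());
}
```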
ISBN (print): 9781450328210
A data-graph computation — popularized by such programming systems as Galois, Pregel, GraphLab, PowerGraph, and GraphChi — is an algorithm that performs local updates on the vertices of a graph. During each round of a data-graph computation, an update function atomically modifies the data associated with a vertex as a function of the vertex's prior data and that of adjacent vertices. A dynamic data-graph computation updates only an active subset of the vertices during a round, and those updates determine the set of active vertices for the next round. This paper introduces PRISM, a chromatic-scheduling algorithm for executing dynamic data-graph computations. PRISM uses a vertex-coloring of the graph to coordinate updates performed in a round, precluding the need for mutual-exclusion locks or other nondeterministic data synchronization. A multibag data structure is used by PRISM to maintain a dynamic set of active vertices as an unordered set partitioned by color. We analyze PRISM using work-span analysis. Let G = (V, E) be a degree-Δ graph colored with χ colors, and suppose that Q ⊆ V is the set of active vertices in a round. Define size(Q) = |Q| + Σ_{v∈Q} deg(v), which is proportional to the space required to store the vertices of Q using a sparse-graph layout. We show that a P-processor execution of PRISM performs the updates in Q using O(χ(lg(Q/χ) + lg Δ) + lg P) span and Θ(size(Q) + χ + P) work. These theoretical guarantees are matched by good empirical performance. We modified GraphLab to incorporate PRISM and studied seven application benchmarks on a 12-core multicore machine. PRISM executes the benchmarks 1.2–2.1 times faster than GraphLab's nondeterministic lock-based scheduler while providing deterministic behavior. This paper also presents PRISM-R, a variation of PRISM that executes dynamic data-graph computations deterministically even when updates modify global variables with associative operations. PRISM-R satisfies the same theoretical bounds as PRISM, but its implementation is
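A compact sketch of the chromatic-scheduling idea is given below, assuming a precomputed vertex coloring and a made-up "average of neighbors" update rule. It mirrors the round structure described above (bucket active vertices by color, then update one color class at a time), but it is not PRISM's implementation: the multibag, the parallel runtime, and the PRISM-R machinery are all omitted.

```cpp
// Hedged sketch of chromatic scheduling for a dynamic data-graph computation.
#include <cstdio>
#include <vector>

struct Graph {
    int n;                                 // number of vertices
    std::vector<std::vector<int>> adj;     // adjacency lists
    std::vector<int> color;                // a valid vertex coloring
};

// Execute one round on the active set and return the next round's active set.
std::vector<int> run_round(const Graph& g, const std::vector<int>& active,
                           std::vector<double>& data, int num_colors) {
    // Bucket active vertices by color (a simplified stand-in for the multibag).
    std::vector<std::vector<int>> bags(num_colors);
    for (int v : active) bags[g.color[v]].push_back(v);

    std::vector<char> changed(g.n, 0);
    for (int c = 0; c < num_colors; ++c) {
        // Vertices of one color share no edge, so these updates are independent
        // and could be executed in parallel without locks.
        for (int v : bags[c]) {
            double avg = 0;
            for (int u : g.adj[v]) avg += data[u];
            if (!g.adj[v].empty()) avg /= g.adj[v].size();
            if (avg > data[v]) { data[v] = avg; changed[v] = 1; }
        }
    }

    // Every neighbor of a vertex whose value changed becomes active next round.
    std::vector<char> activate(g.n, 0);
    for (int v = 0; v < g.n; ++v)
        if (changed[v]) for (int u : g.adj[v]) activate[u] = 1;
    std::vector<int> next;
    for (int v = 0; v < g.n; ++v) if (activate[v]) next.push_back(v);
    return next;
}

int main() {
    // A 4-cycle 0-1-2-3-0, properly colored with 2 colors.
    Graph g{4, {{1, 3}, {0, 2}, {1, 3}, {0, 2}}, {0, 1, 0, 1}};
    std::vector<double> data{0.0, 1.0, 2.0, 3.0};
    std::vector<int> active{0, 1, 2, 3};
    for (int round = 0; round < 3 && !active.empty(); ++round)
        active = run_round(g, active, data, 2);
    for (int v = 0; v < g.n; ++v) std::printf("v%d = %.2f\n", v, data[v]);
}
```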
ISBN (print): 9781728129877
This work is devoted to the problem of detecting and handling faults of computing nodes during the execution of parallel programs on distributed computing systems. The fault tolerance tools of PBS/TORQUE are considered. A functional model for optimizing fault handling is proposed.
Parallelism is a suitable approach for speeding up the massive computations of applications, but parallel programming is still difficult. An algorithmic skeleton is a parallel programming model that provides a high level of abstraction for programmers. This approach uses pre-defined components to facilitate easier parallel programming. Divide and conquer (DC) is an appropriate parallel pattern to implement as a skeleton: the solution of the original problem is obtained by dividing it into smaller sub-problems and solving them in parallel. Today, the graphics processing unit (GPU) is an attractive computational processor for performing tasks in parallel, because it has a large number of processing units. In this paper, a divide and conquer skeleton on the GPU, named DC_GPU, is proposed. DC_GPU is a divide and conquer skeleton implemented on the GPU that uses a consistent programming interface in C++ for easier parallel programming. The performance of this skeleton has been evaluated with mergesort and Sobel edge detection. The results show that the speedup obtained with this skeleton on the GPU is more than 2.
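The abstract does not show the skeleton's interface, so the following is a hedged, CPU-only sketch of what a C++ divide-and-conquer skeleton of this kind typically looks like; the template divide_and_conquer and the summation example are assumptions for illustration, not DC_GPU's actual API or its GPU implementation.

```cpp
// CPU-only sketch of a divide-and-conquer skeleton interface in C++.
#include <functional>
#include <iostream>
#include <vector>

template <typename Problem, typename Result>
Result divide_and_conquer(
    const Problem& p,
    const std::function<bool(const Problem&)>& is_base,
    const std::function<Result(const Problem&)>& solve_base,
    const std::function<std::vector<Problem>(const Problem&)>& divide,
    const std::function<Result(const std::vector<Result>&)>& combine) {
    if (is_base(p)) return solve_base(p);
    std::vector<Result> partial;
    for (const Problem& sub : divide(p))     // sub-problems are independent and
        partial.push_back(                   // could be solved in parallel (e.g. on a GPU)
            divide_and_conquer<Problem, Result>(sub, is_base, solve_base, divide, combine));
    return combine(partial);
}

int main() {
    using Problem = std::vector<int>;
    // Instantiate the skeleton for a simple reduction (summation).
    auto is_base = [](const Problem& p) { return p.size() <= 2; };
    auto solve_base = [](const Problem& p) {
        int s = 0; for (int x : p) s += x; return s;
    };
    auto divide = [](const Problem& p) {
        std::size_t mid = p.size() / 2;
        return std::vector<Problem>{Problem(p.begin(), p.begin() + mid),
                                    Problem(p.begin() + mid, p.end())};
    };
    auto combine = [](const std::vector<int>& r) {
        int s = 0; for (int x : r) s += x; return s;
    };
    Problem input{1, 2, 3, 4, 5, 6, 7, 8};
    std::cout << divide_and_conquer<Problem, int>(input, is_base, solve_base,
                                                  divide, combine) << "\n";
}
```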
In this paper we analyze the teaching and learning of parallel processing through performance analysis using a software tool called Prober. This tool is a functional and performance analyzer of parallel programs that we proposed and developed during an undergraduate research project. Our teaching and learning approach consists of a practical class where students receive explanations about some concepts of parallel processing and the use of the tool. They carry out simple, guided performance tests on parallel programs and analyze the results using Prober as the sole supporting tool. Finally, students answer a self-assessment questionnaire about their educational background, their knowledge of parallel processing concepts, and the usability of Prober. Our main goal is to show that students can learn concepts of parallel processing in a clearer, faster and more efficient way using our approach.
The computation of geodesic distances is an important research topic in Geometry Processing and 3D Shape Analysis as it is a basic component of many methods used in these areas. In this work, we present a minimalistic...
This paper discusses the implementation and evaluation of the reduction of a dense matrix to bidiagonal form on the Trident processor. The standard Golub and Kahan Householder bidiagonalization algorithm, which is rich in matrix-vector operations, and the LAPACK subroutine GEBRD, which is rich in a mixture of vector, matrix-vector, and matrix operations, are simulated on the Trident processor. We show how to use the Trident parallel execution units, ring, and communication registers to effectively perform the vector, matrix-vector, and matrix operations needed for bidiagonalizing a matrix. The number of clock cycles per FLOP is used as a metric to evaluate the performance of the Trident processor. Our results show that increasing the number of Trident lanes proportionally decreases the number of cycles needed per FLOP. On a 32K × 32K matrix and 128 Trident lanes, the speedup of using matrix-vector operations in the standard Golub and Kahan algorithm is around 1.5 times over using vector operations. However, using matrix operations in the GEBRD subroutine gives a speedup of around 3 times over vector operations, and 2 times over using matrix-vector operations in the standard Golub and Kahan algorithm.
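To make the structure of the algorithm concrete, the following plain C++ sketch performs Golub-Kahan Householder bidiagonalization of a small dense matrix using only vector and matrix-vector style loops; the mapping onto Trident lanes, the ring, and the communication registers discussed in the paper is not reproduced, and the routine names are illustrative.

```cpp
// Plain C++ sketch of Golub-Kahan Householder bidiagonalization (m >= n).
#include <cmath>
#include <cstdio>
#include <vector>

using Matrix = std::vector<std::vector<double>>;

// Left reflector: zero A(k+1:m, k) by applying H = I - 2 v v^T / (v^T v).
static void house_left(Matrix& A, int k, int m, int n) {
    double norm = 0;
    for (int i = k; i < m; ++i) norm += A[i][k] * A[i][k];
    norm = std::sqrt(norm);
    if (norm == 0) return;
    double alpha = (A[k][k] > 0 ? -norm : norm);
    std::vector<double> v(m, 0.0);
    for (int i = k; i < m; ++i) v[i] = A[i][k];
    v[k] -= alpha;
    double vtv = 0;
    for (int i = k; i < m; ++i) vtv += v[i] * v[i];
    if (vtv == 0) return;
    for (int j = k; j < n; ++j) {             // matrix-vector style update
        double s = 0;
        for (int i = k; i < m; ++i) s += v[i] * A[i][j];
        s = 2.0 * s / vtv;
        for (int i = k; i < m; ++i) A[i][j] -= s * v[i];
    }
}

// Right reflector: zero A(k, k+2:n) by applying the reflector from the right.
static void house_right(Matrix& A, int k, int m, int n) {
    double norm = 0;
    for (int j = k + 1; j < n; ++j) norm += A[k][j] * A[k][j];
    norm = std::sqrt(norm);
    if (norm == 0) return;
    double alpha = (A[k][k + 1] > 0 ? -norm : norm);
    std::vector<double> v(n, 0.0);
    for (int j = k + 1; j < n; ++j) v[j] = A[k][j];
    v[k + 1] -= alpha;
    double vtv = 0;
    for (int j = k + 1; j < n; ++j) vtv += v[j] * v[j];
    if (vtv == 0) return;
    for (int i = k; i < m; ++i) {
        double s = 0;
        for (int j = k + 1; j < n; ++j) s += A[i][j] * v[j];
        s = 2.0 * s / vtv;
        for (int j = k + 1; j < n; ++j) A[i][j] -= s * v[j];
    }
}

int main() {
    Matrix A{{4, 1, 2}, {2, 3, 1}, {1, 2, 5}, {3, 1, 4}};   // 4x3 example
    const int m = 4, n = 3;
    for (int k = 0; k < n; ++k) {
        house_left(A, k, m, n);
        if (k < n - 2) house_right(A, k, m, n);
    }
    for (int i = 0; i < m; ++i) {            // result is upper bidiagonal
        for (int j = 0; j < n; ++j) std::printf("%8.4f ", A[i][j]);
        std::printf("\n");
    }
}
```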
ISBN (print): 0780373715
The recent popularity of the Java programming language has brought automatic dynamic memory management (a.k.a. garbage collection) into the mainstream. Traditional garbage collectors suffer from long garbage collection pauses (stop-the-world mark-sweep algorithms) or an inability to collect cyclic garbage (reference counting approaches). Generational garbage collection, however, is based only on the weak generational hypothesis that most objects die young. In this paper, the performance evaluation of a new multithreaded concurrent generational garbage collector (MCGC) based on mark-sweep with the assistance of reference counting is reported. The MCGC can take advantage of multiple CPUs in an SMP system and of the merits of lightweight processes. Furthermore, the long garbage collection pause can be reduced and garbage collection efficiency can be enhanced. Measurement results indicate that the MCGC improves the garbage collection pause time by up to 96.75% over the traditional stop-the-world mark-sweep garbage collector. Moreover, the MCGC incurs minimal time and space penalties, as shown by the reported total execution time, memory footprint, and sticky reference count rate.
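For contrast with the concurrent collector evaluated above, here is a toy stop-the-world mark-sweep pass over a hand-built object graph. It is only meant to make the baseline's mark and sweep phases concrete (including the fact that it reclaims cyclic garbage, unlike pure reference counting); it has none of the MCGC's multithreading or generational machinery, and all names are illustrative.

```cpp
// Toy stop-the-world mark-sweep collector over a hand-built object graph.
#include <cstdio>
#include <vector>

struct Obj {
    bool marked = false;
    std::vector<Obj*> refs;          // outgoing references
};

void mark(Obj* o) {                  // mark phase: trace reachable objects
    if (o == nullptr || o->marked) return;
    o->marked = true;
    for (Obj* r : o->refs) mark(r);
}

// Sweep phase: reclaim every unmarked object in the heap, clear marks.
void sweep(std::vector<Obj*>& heap) {
    std::vector<Obj*> live;
    for (Obj* o : heap) {
        if (o->marked) { o->marked = false; live.push_back(o); }
        else delete o;
    }
    heap.swap(live);
}

int main() {
    std::vector<Obj*> heap;
    for (int i = 0; i < 4; ++i) heap.push_back(new Obj);
    heap[0]->refs.push_back(heap[1]);   // root -> 1 stays reachable
    heap[2]->refs.push_back(heap[3]);   // 2 and 3 are unreachable garbage,
    heap[3]->refs.push_back(heap[2]);   // even though they form a cycle
    Obj* root = heap[0];
    mark(root);                          // the whole mutator is paused here
    sweep(heap);
    std::printf("live objects: %zu\n", heap.size());
    for (Obj* o : heap) delete o;        // cleanup
}
```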
In order to improve the performance of applications on OpenMP/JIAJIA, we present a new abstraction, Array Relation Vector (ARV), to describe the relation between the data elements of two consistent shared arrays accessed in one computation phase. Based on ARV, we use array grouping to eliminate the pseudo data distribution of small shared data and improve page locality. Experimental results show that ARV-based array grouping can greatly improve the performance of applications with non-contiguous data access and strict access affinity on the OpenMP/JIAJIA cluster. For applications with small shared arrays, array grouping can improve performance noticeably when the number of processors is small.
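The effect of array grouping can be illustrated with a small C++ example: two arrays that a computation phase always indexes together are fused into a single array of records, so each iteration's accesses fall in one contiguous region rather than on two possibly distant pages. The ARV analysis that decides when this transformation applies is not shown, and the example names are assumptions for illustration.

```cpp
// Illustration of array grouping: fuse two arrays accessed with the same index.
#include <cstdio>
#include <vector>

int main() {
    const int N = 8;

    // Before grouping: two separate shared arrays, possibly on distant pages.
    std::vector<double> a(N, 1.0), b(N, 2.0), c1(N);
    for (int i = 0; i < N; ++i) c1[i] = a[i] + b[i];        // a[i] and b[i] far apart

    // After grouping: elements with the same index live next to each other.
    struct Pair { double a, b; };
    std::vector<Pair> ab(N, {1.0, 2.0});
    std::vector<double> c2(N);
    for (int i = 0; i < N; ++i) c2[i] = ab[i].a + ab[i].b;  // one contiguous access

    std::printf("%f %f\n", c1[0], c2[0]);
}
```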