The proceedings contain 27 papers. The special focus in this conference is on Vector and parallel Processing. The topics include: High performance computing in power system applications;scan-directional architectures;...
ISBN:
(纸本)3540628282
The proceedings contain 27 papers. The special focus in this conference is on Vector and parallel Processing. The topics include: High performance computing in power system applications;scan-directional architectures;markov chain based management of large scale distributed computations of earthen dam leakages;control and data flow analysis for parallel program debugging;a vector processor for the efficient estimation of higher-order moments;the use of computational kernels in full and sparse linear solvers, efficient code design on high-performance RISC processors;parallel implementation of a symmetric eigensolver based on the Yau and Lu method;preconditioned conjugate gradient methods for semiconductor device simulation on a CRAY C90 vector processor;a parallel implementation of the general lanczos method on the cray T3D;parallel and distributed computations in a parameter inverse problem;automated optimal design using CFD and high performance computing;parallelization of the discrete-ordinates method;experiences with advanced CFD algorithms on NEC SX-4;parallelization of a CFD code using PVM and domain decomposition techniques;possibilities of parallelcomputing in the finite element analysis of industrial forming processes;preconditioners for nonsymmetric linear systems in domain decomposition applied to a coupled discretization of navler-stokes equations;parallel implementation of non-recurrent neural networks;parallelcomputing of fragment vector in steiner triple systems;stabilizing large control linear systems on multicomputers;an interface based on transputers to simulate the dynamic equation of robot manipulators using parallelcomputing and large scale traffic simulations.
In this paper, an approach using least squares method for Callback implementation is presented. Experiments on Callback approved that Callback was not only effective in high-level metasystems but also in low level het...
详细信息
ISBN:
(纸本)0818678763
In this paper, an approach using least squares method for Callback implementation is presented. Experiments on Callback approved that Callback was not only effective in high-level metasystems but also in low level heterogeneous system.
In this article, we would like to detect boundaries of objects with the help of a multiagent system made up of reactive agents. The reactivity being very important, the agents' behavior is very simple (perception-...
详细信息
ISBN:
(纸本)0819425885
In this article, we would like to detect boundaries of objects with the help of a multiagent system made up of reactive agents. The reactivity being very important, the agents' behavior is very simple (perception-action) : they have the capacity, nevertheless, to adapt locally to what they consider their environment, that is to say the image. An agent can move and has its own position in its environment. The basic behavior for an agent consists of following the highest brightness gradient and inscribing its path, if estimating to be on an edge, in all the agents' shared memory. Its path thus corresponds to edges which are found in the image. Please note that, in order to be noise resistant, the path is actually stored in the shared memory only if it is long enough. The notion of shared memory is very important because it allows the interaction among agents and the coordination of their actions. The agents actually use already found edges for finding new ones or complete those already detected. We have tested this system on different grey scale images scenes, but as well on synthetic scenes allowing analysis of thus obtained results. The results are promising and especially fast. Our multiagent system has been tested on a single-processor computer, and it has been note that the number of agents in a simulation neither affects the quality of the result nor CPU time necessary for segmentation of a given scene. We think that this approach is original in its use of agents and may be used to implement parallel image processing by assigning, for instance, an agent to each processor.
A parallel design and implementation of the Self-Organizing Map (SOM) neural computing model is proposed. The parallel design of SOM is implemented in a parallel virtual machine (PVM) environment of a distributed syst...
详细信息
ISBN:
(纸本)0818678763
A parallel design and implementation of the Self-Organizing Map (SOM) neural computing model is proposed. The parallel design of SOM is implemented in a parallel virtual machine (PVM) environment of a distributed system. A practical realization of SOM algorithm is investigated, the construction of computing module in parallel virtual machine is discussed, the communication methods and an optimization of message passing between multiple processes are proposed, and the parallel programming technique and a PVM implementation of SOM neural computing model are given and discussed in detail.
With the advances of wireless communication technology, using the wireless LAN as a platform to perform distributed network computing becomes feasible. We study the characteristics of end-to-end communication over wir...
详细信息
With the advances of wireless communication technology, using the wireless LAN as a platform to perform distributed network computing becomes feasible. We study the characteristics of end-to-end communication over wireless links. With the advantage of reduced bandwidth competition in each LAN segment separated by the wireless bridges, and with the overlap of wireless and wired communications, an analytical comparison showed that the group communications over wireless links can be more efficient than over a single segment wired LAN. We conducted experiments of running distributed applications and the results showed that with the support of threads, wireless network computing can achieve the same performance as the wired networks. Furthermore, the statistical results from our survey showed that the users cannot tell the difference between wireless and wired settings in terms of the data access speed.
The proceedings contain 37 papers. The special focus in this conference is on High Performance computing. The topics include: The generation of optimized codes using nonzero structure analysis;on the importance of an ...
ISBN:
(纸本)3540637664
The proceedings contain 37 papers. The special focus in this conference is on High Performance computing. The topics include: The generation of optimized codes using nonzero structure analysis;on the importance of an end-to-end view of memory consistency in future computer sysmtems;high performance distributed object systems;instruction cache prefetching using multilevel branch prediction;high performance wireless computing;high-performance computing and applications in image processing and computer vision;evaluation of multithreaded processors and thread-switch policies;a building block for high performance computers;data dependence path reduction with tunneling load instructions;performance estimation of embedded software with pipeline and cache hazard modeling;an implementation and evaluation of a distributed shared-memory system on workstation clusters using fast serial links;designing and optimizing 3-connectivity communication networks using a distributed genetic algorithm;adaptive routing on the recursive diagonal torus;an algorithm for automatic detection of loop indices for communication overlapping;hybrid approach for non-strict dataflow program on commodity machine;scenario-based hypersequential programming;parallelization of space plasma particle simulation;implementing iterative solvers for irregular sparse matrix problems in high performance fortran;cost estimation of coherence protocols of software managed cache on distributed shared memory system;a portable distributed shared memory system on the cluster environment;a method for runtime recognition of collective communication on distributed-memory multiprocessors;efficiency of parallel machine for large-scale simulation in computational physics and a parallelization method for neural networks with weak connection design.
Global climate modeling is one of the grand challenges of computational science, and ocean modeling plays an important role in both understanding the current climatic conditions and predicting the future climate chang...
详细信息
ISBN:
(纸本)0897919858
Global climate modeling is one of the grand challenges of computational science, and ocean modeling plays an important role in both understanding the current climatic conditions and predicting the future climate change. Three-dimensional time-dependent ocean general circulation models (OGCMs) require a large amount of memory and processing time to run realistic simulations. Recent advances in computing hardware have dramatically affected the prospect of studying the global climate. The significant computational resources of massively parallel supercomputers promise to make such studies feasible. In addition to using advanced hardware, designing and implementing a well-optimized parallel ocean code will significantly improve the computational performance and reduce the total research time to complete these studies. In our present work, we chose the most widely used OGCM code as our base code. This OGCM is based on the parallel Ocean Program (POP) developed in FORTRAN 90 on the Los Alamos CM-2 Connection Machine by the Los Alamos ocean modeling research group. During the first half of 1994, the code was ported to the Cray T3D by Cray Research using SHMEM-based message passing. Since the code on the T3D was still time-consuming when large problems were encountered, improving the code performance was considered essential. We have developed several general strategies to optimize the ocean general circulation model on the Cray T3D. These strategies include memory optimization, effective use of arithmetic pipelines, and usage of optimized libraries. The optimized code runs 2 to 2.5 times faster than the original code, which gives significant performance improvements for modeling large scaled ocean flows. Many test runs for both of the original and the optimized code have been carried out on the Cray T3D using various numbers of processors (1-256). Comparisons are made for a variety of real-world problems. A nearly linear scaling performance line is obtained for the optimiz
Current advances in high-speed networks and WWW technologies have made network computing a cost-effective, high-performance computing alternative. New software tools are being developed to utilize efficiently the netw...
详细信息
Current advances in high-speed networks and WWW technologies have made network computing a cost-effective, high-performance computing alternative. New software tools are being developed to utilize efficiently the network computing environment. Our project, called Virtual distributedcomputing Environment (VDCE), is a high-performance computing environment that allows users to write and evaluate networked applications for different hardware and software configurations using a web interface. In this paper we present the software architecture of VDCE by emphasizing application development and specification, scheduling, and execution/runtime aspects.
In this paper, a parallel algorithm for solving tridiagonal equations based on recurrence is presented. Compared with the parallel prefix method (PP) which is also based on the recursive method, the computation cost i...
详细信息
ISBN:
(纸本)0818678763
In this paper, a parallel algorithm for solving tridiagonal equations based on recurrence is presented. Compared with the parallel prefix method (PP) which is also based on the recursive method, the computation cost is reduced by a factor of two while maintaining the same communication cost. The method can be viewed as a modified prefix method or prefix with substructuring. The complexity of the algorithm is analysed using the BSP model (Bulk Synchronous parallel). Experimental results are obtained on a Sun workstation using the Oxford BSP Library.
This paper presents a lifetime-sensitive scheduling method. By shortening lifetimes of variables in scheduling phase, it can lighten register pressure in register allocation phase, lessen spill codes and result in mor...
详细信息
ISBN:
(纸本)0818678763
This paper presents a lifetime-sensitive scheduling method. By shortening lifetimes of variables in scheduling phase, it can lighten register pressure in register allocation phase, lessen spill codes and result in more efficient object codes. The preliminary experimental results show that this method is an effective scheduling method.
暂无评论