The proceedings contain 37 papers. Topics discussed include distributed computer systems, parallel processing systems, network protocols, data handling, algorithms, computer applications, time-sharing systems, multiprogramming, interactive computer systems, virtual reality, computer workstations, supercomputers, dynamic load balancing, data storage, and fault-tolerant computing.
Software distributed shared memory (DSM) platforms on networks of workstations tolerate large network latencies by employing one of several weak memory consistency models. Fully asynchronous parallel iterative algorithms offer an additional degree of freedom to tolerate network latency: they behave correctly when supplied outdated shared data. However, these algorithms can flood the network with messages in the presence of large delays. We propose a method of controlling asynchronous iterative methods wherein the reader of a shared datum imposes an upper bound on its age via use of a blocking Global_Read primitive. This reduces the overall number of iterations executed by the reader, thus controlling the amount of shared updates generated. Experiments for a fully asynchronous linear equation solver running on a network of 10 IBM RS/6000 workstations show that the proposed Global_Read primitive provides significant performance improvement.
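The age bound described above can be illustrated with a small single-machine sketch. It is not the paper's DSM implementation: the class name `SharedDatum`, the method name `global_read`, and the choice to measure age in iterations are assumptions made for illustration, and threads with a condition variable stand in for workstations on a network.

```python
import threading


class SharedDatum:
    """A shared value tagged with the iteration at which it was written.

    Hypothetical illustration only; names and the age definition are assumed.
    """

    def __init__(self, value, iteration=0):
        self._value = value
        self._iteration = iteration
        self._cond = threading.Condition()

    def write(self, value, iteration):
        # Publish a new value and wake any reader waiting for fresher data.
        with self._cond:
            self._value = value
            self._iteration = iteration
            self._cond.notify_all()

    def global_read(self, reader_iteration, max_age):
        # Block until the stored value is at most `max_age` iterations
        # older than the reader's current iteration, then return it.
        with self._cond:
            self._cond.wait_for(
                lambda: reader_iteration - self._iteration <= max_age
            )
            return self._value, self._iteration
```

A reader that is far ahead of the writer blocks inside `global_read` instead of running further iterations on stale data, which is the mechanism the abstract credits with limiting the number of updates generated.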
In this paper, we model and evaluate a new parallel processing system for commercial applications, called SPAX. SPAX cost-effectively overcomes the SMP limitation by providing both scalability of the paralle...
ISBN: 3540249370 (print)
It was recently shown that block-circulant preconditioners applied to a conjugate gradient method used to solve structured sparse linear systems arising from 2D or 3D elliptic problems have good numerical properties and a potential for high parallel efficiency. The asymptotic estimate for their convergence rate is the same as for incomplete factorization methods, but the efficiency of the parallel algorithms based on circulant preconditioners is asymptotically optimal. In this paper, the parallel performance of a circulant block-factorization based preconditioner applied to a 3D model problem is investigated. The aim is to analyze the performance and to report experimental results obtained on shared and distributed memory parallel architectures. A portable parallel code is developed based on the Message Passing Interface (MPI) and OpenMP (Open Multi-Processing) standards. Numerical tests on a wide range of parallel computer systems clearly demonstrate the high parallel efficiency of the developed code.
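One reason circulant preconditioners parallelize well is that a circulant system is diagonalized by the discrete Fourier transform. The sketch below shows only that basic building block for a single circulant matrix, using a naive pure-Python DFT; it is not the circulant block-factorization preconditioner from the paper, and the function names are hypothetical.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (illustrative, not an FFT)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(
            x[m] * cmath.exp(sign * 2j * cmath.pi * m * k / n)
            for m in range(n)
        )
        out.append(s / n if inverse else s)
    return out


def circulant_matvec(c, x):
    """Multiply the circulant matrix with first column c by the vector x."""
    n = len(c)
    return [sum(c[(i - j) % n] * x[j] for j in range(n)) for i in range(n)]


def circulant_solve(c, b):
    """Solve C x = b for real c and b, where C is circulant with first column c.

    The DFT of c gives the eigenvalues of C, so the solve reduces to an
    elementwise division in Fourier space. Assumes no eigenvalue is zero.
    """
    eigenvalues = dft(c)
    b_hat = dft(b)
    x_hat = [bh / ev for bh, ev in zip(b_hat, eigenvalues)]
    return [v.real for v in dft(x_hat, inverse=True)]
```

With a fast Fourier transform in place of the naive `dft`, the elementwise division is independent per frequency, which is the kind of structure that distributes well across processors.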
ISBN: 0769528759 (print)
At present, in the design and analysis of discrete event systems, a parallel or distributed simulation mechanism is adopted so that the models run concurrently, which makes the problem easier to solve; the validation of these models is often addressed via simulation. Lookahead computation is a good method for improving the distributed simulation performance of Timed Transition Petri Nets (TTPN). Extended Timed Petri Nets (ETTPN) extend TTPN in time and meet the Lookahead requirements. Lookahead is used to analyze the ETTPN model to find its concurrency and blocking structure, and thereby to determine the logical processes (LPs). Based on the properties of Lookahead, it is applied in parallel simulation, and a sufficient condition for the existence of concurrency in the ETTPN model is obtained. According to this condition, a parallel program can be partitioned into several logical processes, and a partition algorithm for distributed processing is available. In order to reduce the error rate, an improved mapping algorithm is presented.
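The role of Lookahead in conservative parallel simulation can be sketched generically. The example assumes the common convention that an upstream logical process with local clock T and lookahead L promises to send no message with a timestamp earlier than T + L; it does not model Petri nets or the ETTPN analysis itself, and the function names are hypothetical.

```python
def safe_time(upstream):
    """Earliest timestamp at which a new message could still arrive.

    `upstream` is a list of (clock, lookahead) pairs, one per input LP.
    """
    return min(clock + lookahead for clock, lookahead in upstream)


def safe_events(pending, upstream):
    """Return the pending event timestamps that can be processed now.

    Only events strictly earlier than the safe time are returned, so no
    message arriving later can precede them.
    """
    bound = safe_time(upstream)
    return sorted(t for t in pending if t < bound)
```

Larger lookahead values raise the bound, so each logical process can advance further before it has to wait for its neighbours.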
ISBN: 9781467325851; 9781467325837 (print)
Human pose estimation is the process of approximating the configuration of the body's underlying skeletal articulation in one or more frames. The curve-skeleton of an object is a line-like representation that preserves topological and geometrical information. Finding the curve-skeleton of a volume corresponding to the person is a good starting point for approximating the underlying skeletal structure. In this paper, a GPU implementation of a fully parallel thinning algorithm based on the critical kernels framework is presented. The algorithm is compared to another state-of-the-art thinning method, and while both are shown to achieve real-time frame rates, the proposed algorithm yields superior accuracy and robustness when used in a pose estimation context. The GPU implementation is more than 8 times faster than a sequential version, and the positions of the four extremities are estimated with an RMS error of approximately 6 cm, with approximately 98% of frames correctly labelled.
ISBN: 9781467347211 (print)
Declarative SQL queries are a way of abstracting away the underlying complexity of wireless sensor-actuator networks (WSAN). The current SQL syntax for actuation tasks misleads the user because it expresses those tasks in an artificial way, which defeats the purpose of a declarative language. In particular, for distributed in-network actuation tasks that involve multiple sensor nodes and actuator nodes, the user may be required to write multiple queries in the current actuation query syntax, with a negative impact on performance. We address these issues in the existing declarative layer by modifying the virtual data table of the abstraction; a new syntax is suggested and implemented.
ISBN: 9781509063529 (print)
In this paper, a Graphics Processing Unit (GPU) implementation of a Successive-Cancellation List (SCL) decoder for polar codes is proposed. Based on the Compute Unified Device Architecture (CUDA), a strategy for parallelizing the decoding procedure is designed to reduce latency. Moreover, the optimal design of data structures for several kinds of intermediate information is presented. To reduce global memory accesses in list pruning, a GPU-adapted lazy-copy strategy is elaborated. By means of the parallel computing capabilities of GPUs, the proposed decoder achieves a high throughput of 41 Mbps on an NVIDIA GTX 980 and 65 Mbps on a TITAN X when decoding a code with a length of 1024 bits and 32 lists.
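The list-pruning step mentioned above can be sketched on the CPU as follows. This is an illustration only, not the paper's CUDA kernel or its lazy-copy scheme: it assumes a path metric where lower is better, and the function name `prune_paths` and its arguments are hypothetical. Survivors are returned as references to their parent path rather than as copied path states, which is the general idea that makes deferred copying possible.

```python
import heapq


def prune_paths(metrics, penalties, list_size):
    """Extend every path by one bit and keep the `list_size` best candidates.

    metrics:    current metric of each surviving path (lower is better,
                an assumed convention).
    penalties:  penalties[i] is a pair (cost of appending 0, cost of
                appending 1) for path i.
    Returns a list of (new_metric, parent_index, bit) tuples sorted by
    metric. Only the parent index is recorded, so path memory need not be
    copied during pruning.
    """
    candidates = [
        (metric + penalties[i][bit], i, bit)
        for i, metric in enumerate(metrics)
        for bit in (0, 1)
    ]
    return heapq.nsmallest(list_size, candidates)
```

For example, with `metrics = [0.0, 1.0]`, `penalties = [(0.0, 2.5), (0.2, 0.0)]` and `list_size = 2`, the survivors are path 0 extended by bit 0 and path 1 extended by bit 1.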
ISBN: 9781424421961 (print)
The natural immune system is a robust and powerful information processing system that demonstrates features such as distributed control, parallel processing and adaptation. Artificial Immune Systems (AIS) are machine-learning algorithms that embody some of these principles and attempt to exploit the benefits of natural immune systems in tackling complex problem domains. Using the artificial immune algorithm in GPS data processing makes it possible to solve the nonlinear GPS models directly and to avoid ill-conditioning. The principle and algorithm steps of AIS are introduced, and its application to GPS single-frequency precise point positioning (PPP) is discussed in detail. Numerical results show that the single-frequency PPP model solved by AIS can reach sub-meter accuracy.
In-memory storage techniques provide cloud applications with cheap, fast and large-scale RAM-based storage. By replicating data and providing adequate consistency control mechanisms, in-memory storage can simplify the...