With the advancement of Internet technologies and the rapid increase of World Wide Web applications, there has been tremendous growth in the volume of digital data. This takes the digital world into a new era of big d...
详细信息
We have designed and implemented a software facility, called Agora, that supports the development of parallel applications written in multiple languages. At the core of Agora there is a mechanism that allows concurren...
详细信息
Various computer methods are sourced in parallel programming. Advances in methods and techniques with their appropriate usage are beneficial for multimedia applications. parallelization can significantly decrease the ...
详细信息
Multi-core processors offer a growing potential of parallelism but pose a challenge of program development for achieving high performance in applications. This pape r presents a comparison of the five parallel program...
详细信息
Multi-core processors offer a growing potential of parallelism but pose a challenge of program development for achieving high performance in applications. This pape r presents a comparison of the five parallel programming models for implementing parallel programs in C++ on multi -core computer systems. The models under consideration are Intel ® 's Thread Building Blocks (TBB), OpenMPI, Intel ® 's Cilk™ Plus, OpenMP and Pthreads. For demonstration purposes multiple parallel implementations of an algorithm for matrix multiplication suitable for parallelization were created. The main goal of this paper is a comprehensive comparison of chosen models with respect to the following criteria: performance and coding effort required.
An approach for efficiently mapping parallel applications onto parallel MIMD machine architectures is introduced. The applicability of this approach to uniform memory-access multiprocessors is demonstrated. It is show...
详细信息
An approach for efficiently mapping parallel applications onto parallel MIMD machine architectures is introduced. The applicability of this approach to uniform memory-access multiprocessors is demonstrated. It is shown that an intermediate layer of abstraction between the application level and the parallel architecture level is conducive not only to a better software productivity but also to performance efficient programs. The intermediate layer consists of a set of commonly used parallel programming paradigms (implementation machines). A mathematical representation and a pragmatic representation are provided for each implementation machine (IM). The user maps the application onto one or a set of IMs and the system implements the IMs efficiently on the underlying parallel machine.< >
A novel linear genetic programming (LGP) paradigm called genetic parallel programming (GPP) has been proposed to evolve parallel programs based on a multi-ALU processor. It is found that GPP can evolve parallel progra...
详细信息
ISBN:
(纸本)0780378040
A novel linear genetic programming (LGP) paradigm called genetic parallel programming (GPP) has been proposed to evolve parallel programs based on a multi-ALU processor. It is found that GPP can evolve parallel programs for data classification problems. In this paper, five binary-class UCI machine learning repository databases are used to test the effectiveness of the proposed GPP-classifier. The main advantages of employing GPP for data classification are: 1) speeding up evolutionary process by parallel hardware fitness evaluation; and 2) discovering parallel algorithms automatically. Experimental results show that the GPP-classifier evolves simple classification programs with good generalization performance. The accuracies of these evolved classifiers are comparable to other existing classification algorithms.
A parallel algorithm for solving TSP(traveling salesman problem) is presented in this paper. Combining 2-opt local search optimization with genetic algorithm is the main ideal of this algorithm. In this paper, MPI+TBB...
详细信息
A parallel algorithm for solving TSP(traveling salesman problem) is presented in this paper. Combining 2-opt local search optimization with genetic algorithm is the main ideal of this algorithm. In this paper, MPI+TBB hybrid parallel programming model is employed in implement of our algorithm. Numerical results indicate that it is possible to arrive at high quality solutions in reasonable time. With the increase in the scale of solving problem, the speedup of parallel algorithm is improved. Moreover, with the growth in the number of cores, the speedup of the parallel algorithm presents nearly linear growth.
The authors discuss methods for reducing the computation time for 3-D PET (positron emission tomography) reconstruction through the use of fast computer hardware, vector and parallel programming techniques, and algori...
详细信息
The authors discuss methods for reducing the computation time for 3-D PET (positron emission tomography) reconstruction through the use of fast computer hardware, vector and parallel programming techniques, and algorithm optimization. The strengths and weaknesses of i860 microprocessor based workstation accelerator boards are investigated in implementations of 3-D PET reconstruction. One key issue in achieving speed on the SuperCard 2 has been minimizing memory accesses, and eliminating redundant calculations through approximations and look-up tables. This has allowed the calculation time to decrease from 20 minutes on the MicroVax 3200 to 146 seconds on a dual i860 system.< >
The design and implementation of a high-level parallel programming language interface for distributed-memory MIMD architectures is described. This interface is called HyperPascal and is based on a Computation Graph De...
详细信息
The design and implementation of a high-level parallel programming language interface for distributed-memory MIMD architectures is described. This interface is called HyperPascal and is based on a Computation Graph Decomposition model of parallel programming. HyperPascal closely resembles the Pascal language but in addition provides constructs that allow for the high-level specification of process parallelism. HyperPascal provides a machine-independent view of parallel programming, simplifies parallel program design and development and facilitates portability of parallel programs. The current version of HyperPascal has been implemented on the Intel iPSC/860 Hypercube.
暂无评论