ISBN: 9780769546919 (print)
The proceedings contain 137 papers. The topics discussed include: design and implementation of portable and efficient non-blocking collective communication; productive parallel linear algebra programming with unstructured topology adaption; user transparent data and task parallel multimedia computing with pyxis-DT; fine-grained access control in the chirp distributed file system; a map-reduce based framework for heterogeneous processing element cluster environments; workflow scheduling to minimize data movement using multi-constraint graph partitioning; decentralized orchestration of data-centric workflows using the object modeling system; evaluating dynamics and bottlenecks of memory collaboration in cluster systems; GPU performance enhancement via communication cost reduction: case studies of radix sort and WSN relay node placement problem; and client classification policies for SLA enforcement in shared cloud datacenters.
ISBN: 9783642342882 (print)
This paper introduces an architecture for parallel remote sensing image processing software whose advantages include high scalability, platform independence, and language independence. It helps achieve high-performance computing in this field. MPI is used as the fundamental distributed message passing protocol. An object-oriented wrapper, *** library, is used in the software to manipulate MPI. Open-source libraries such as GDAL and OpenCV are studied in this paper to help develop detailed image processing programs and implement classification algorithms. A number of experiments are conducted to test the parallel classification programs. The results indicate that in most cases performance improves significantly, especially for multi-spectral remote sensing image classification, where the highest speed-up of 3.92 is reached.
The JEM-EUSO space observatory will be launched and attached to the Japanese module of the International Space Station (ISS) in 2016. Its aim is to observe UV photon tracks produced by Ultra High Energy Cosmic Rays (...
Energy will be a major limiting factor in future multi-core architectures, so optimizing performance per watt should be a key driver for next generation massive-core architectures. Recent studies show that heterogeneo...
Chapel is a parallel programming language designed to improve the productivity and ease of use of conventional and parallel computers. This language currently delivers suboptimal performance when executing codes that...
Spatial decomposition is a popular basis for parallelising code. Cast in the frame of task parallelism, calculations on a spatial domain can be treated as a task. If neighbouring domains interact and share results, ac...
Author: Biscani, Francesco
European Space Agency, Advanced Concepts Team, 2200 AG Noordwijk, Netherlands
ISBN: 9781450312691 (print)
We present a high performance algorithm for the parallel multiplication of sparse multivariate polynomials on modern computer architectures. The algorithm is built on three main concepts: a cache-friendly hash table implementation for the storage of polynomial terms in distributed form, a statistical method for the estimation of the size of the multiplication result, and the use of Kronecker substitution as a homomorphic hash function. The algorithm achieves high performance by promoting data access patterns that favour temporal and spatial locality of reference. We present benchmarks comparing our algorithm to routines of other computer algebra systems, both in sequential and parallel mode. Copyright 2012 ACM.
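As an illustration of the Kronecker substitution idea mentioned in the abstract, the following is a minimal sequential sketch, not the authors' implementation. It packs each exponent vector into a single integer so that multiplying two monomials reduces to adding their integer codes. The function names (`kronecker_encode`, `kronecker_decode`, `multiply`), the single shared `radix`, and the use of a plain Python `dict` in place of the paper's cache-friendly hash table are all assumptions made for this example. The encoding is only valid when `radix` exceeds every per-variable degree that can occur in the product.

```python
from collections import defaultdict


def kronecker_encode(exponents, radix):
    """Pack an exponent vector into one integer (mixed-radix, same radix per variable)."""
    code = 0
    for e in reversed(exponents):
        if not 0 <= e < radix:
            raise ValueError("exponent out of range for the chosen radix")
        code = code * radix + e
    return code


def kronecker_decode(code, nvars, radix):
    """Recover the exponent vector from an integer code."""
    exps = []
    for _ in range(nvars):
        code, e = divmod(code, radix)
        exps.append(e)
    return tuple(exps)


def multiply(p, q):
    """Multiply two sparse polynomials stored as {code: coefficient}.

    Adding two codes corresponds to multiplying the monomials, provided
    no per-variable exponent in the product reaches the radix.
    """
    result = defaultdict(int)
    for code_p, coeff_p in p.items():
        for code_q, coeff_q in q.items():
            result[code_p + code_q] += coeff_p * coeff_q
    return {c: a for c, a in result.items() if a != 0}


# Hypothetical example: (x + y)^2 in two variables with radix 4.
RADIX = 4
x = kronecker_encode((1, 0), RADIX)
y = kronecker_encode((0, 1), RADIX)
p = {x: 1, y: 1}
square = {kronecker_decode(c, 2, RADIX): a for c, a in multiply(p, p).items()}
```

Here `square` maps exponent tuples to coefficients, giving x^2 + 2xy + y^2. The integer codes double as hash keys, which is what lets a single addition replace a per-variable exponent update.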
The inherently limited processing power and battery lifetime of mobile phones hinder the possible execution of computationally intensive applications like content-based video analysis or 3D modeling. Offloading of com...
ISBN: 9780769549309 (print)
With the rapid growth of emerging applications such as social network analysis, semantic Web analysis and bioinformatics network analysis, the variety of data to be processed continues to increase quickly. Effective management and analysis of large-scale data poses an interesting but critical challenge. Recently, big data has attracted a lot of attention from academia, industry and government. This paper introduces several big data processing techniques from the system and application perspectives. First, from the view of cloud data management and big data processing mechanisms, we present the key issues of big data processing, including cloud computing platform, cloud architecture, cloud database and data storage scheme. Following the MapReduce parallel processing framework, we then introduce MapReduce optimization strategies and applications reported in the literature. Finally, we discuss the open issues and challenges, and explore future research directions for big data processing in cloud computing environments.
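For readers unfamiliar with the MapReduce model referred to in the abstract, the following is a minimal single-process sketch of its map, shuffle and reduce phases. It is an assumption-laden illustration rather than anything taken from the paper: the names `map_reduce`, `degree_mapper` and `sum_reducer` are hypothetical, the edge list is made-up sample data, and a real framework would distribute each phase across machines.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run a mapper over records, group emitted values by key, then reduce each group."""
    groups = defaultdict(list)
    for record in records:                  # map phase
        for key, value in mapper(record):
            groups[key].append(value)       # shuffle: group values by key
    return {key: reducer(key, values) for key, values in groups.items()}  # reduce phase


def degree_mapper(edge):
    """Emit (node, 1) for both endpoints of an undirected edge."""
    src, dst = edge
    yield src, 1
    yield dst, 1


def sum_reducer(key, values):
    """Sum all counts emitted for one node."""
    return sum(values)


# Hypothetical sample: node degrees in a tiny social graph.
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")]
degrees = map_reduce(edges, degree_mapper, sum_reducer)
```

Because the mapper sees one record at a time and the reducer sees one key at a time, both phases can in principle run in parallel, which is the property the optimization strategies surveyed in such papers build on.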
Performance optimization, especially in the field of HPC, is an integral part of today's software development process. One powerful way of optimizing applications is to analyze their event traces. Yet, the compari...