This workshop focuses on understanding the implications of accelerators for the architectures and programming environments of future systems. It seeks to ground accelerator research through studies of application kernels or whole applications on such systems, as well as tools and libraries that improve the performance or productivity of applications trying to use these systems. The goal of this workshop is to bring together researchers and practitioners who are involved in application studies for accelerators and other hybrid systems, to learn about the opportunities and challenges in future design trends for HPC applications and systems.
The following topics are dealt with: Grid and distributed computing; scheduling task systems; shared-memory multiprocessors; imaging and visualization; testing and debugging; performance analysis and real-time systems; scheduling for heterogeneous resources; networking; peer-to-peer and mobile computing; compiler technology and run-time systems; load balancing; network routing; parallel programming models; parallel algorithms; scheduling and storage; parallel and distributed performance; software for high performance clusters; decentralized algorithms; multithreading and VLIW; parallel and distributed real-time systems; high-level parallel programming models and supportive environments; Java for parallel and distributed computing; nature inspired distributed computing; high performance computational biology; advances in parallel and distributed computational models; reconfigurable architectures; communication architecture for clusters; next generation systems; fault-tolerant parallel and distributed systems; wireless, mobile and ad hoc networks; parallel and distributed image processing, video processing, and multimedia; formal methods for parallel programming; Internet computing and e-commerce; parallel and distributed scientific and engineering computing with applications; massively parallel processing; performance modeling, evaluation, and optimization of parallel and distributed systems; and parallel and distributed systems: testing and debugging.
HIPS-HPGC 2005 is a full-day workshop focusing on high-performance grid computing and high-level parallel programming models. The papers deal with component models and service-based systems for grids, emphasizing experiences with existing systems. The papers also report on the state of the art of grid applications, for both academic and industrial problems.
ISBN: 3540297707 (print)
The proceedings contain 71 papers. The topics discussed include: on building and updating distributed LSI for P2P systems; performing efficient keyword search by keyword grouping in DHT peer-to-peer network; analysis of GHT in mobile ad hoc networks; a P2P market place based on aggregate signatures; construct optimal diameter network and d-partition routing algorithm; decoupling service and feedback trust in a peer-to-peer reputation system; design and implementation of an efficient multipath for a SAN environment; an efficient cache replacement policy with distinguishing write blocks from read blocks in disk arrays; a cluster LVM for SAN environments; fast sequence similarity computing with LCS on LARPBS; various features with integrated strategies for protein name classification; and grid supporting platform for AMS data processing.
ISBN: 9781665435741 (print)
From the very beginning of the CUDA technology, it was essential to apply state-of-the-art optimization techniques. Only then was it possible to fully utilize the enormous computational power of graphics processing units. However, with the development of the CUDA architecture, the impact of typical optimization techniques on software performance has changed significantly. This article shows how the impact of several optimization techniques on the performance of an image filtering algorithm has changed over successive generations of the CUDA architecture. Then, based on the results obtained, it attempts to answer whether tedious and time-consuming optimization of CUDA software is still necessary.
ISBN: 9781479942930 (print)
Novel modular robot platforms have been developed with important applications in the safety, transportation and sensing domains. In such systems, modular robots are able to change their organization in order to obtain different shapes. Designing distributed programs that allow the "optimal" reorganization of a set of robots into a specific shape is a very challenging problem. In this paper we present an original distributed meta-algorithm for the micro-robot shape-shifting problem. We show that this meta-algorithm, described as a general functioning schema, provides a good framework for designing distributed algorithms for shape-shifting problems. We also demonstrate that the algorithm is easy to instantiate for special target shapes, and we give an adaptation of the algorithm to reach any horizontally convex form. The meta-algorithm has two main advantages: first, there is no need for exact positioning of the robots, and second, the memory storage and communication requirements are significantly reduced.
ISBN: 9781479942930 (print)
Pattern libraries are important tools for high-productivity application development. Achieving the best performance is complicated by the fact that they execute user-provided code, which is not known when the library is created. This makes pattern libraries good candidates for automatic software tuning. In this paper, we deal with automatic online parameter tuning of the HyPHI hybrid pattern library for heterogeneous systems equipped with Intel Xeon Phi coprocessors. We propose a framework that can be used to combine a pattern library with an existing tuning library in a practical and efficient way. Our experiments show that tuning can noticeably improve the performance of the library while introducing very little overhead.
ISBN: 9781665435741 (print)
With the rapid development of edge computing technology, the application of edge computing in smart grids has become more and more extensive. But edge computing has not yet been applied to the operation control of distributed power generation microgrid systems. This article proposes a microgrid-oriented edge computing architecture. First, we introduce the main functions of edge-cloud collaboration. Then we explain the construction plan of the architecture, including the realization of data processing, network communication and security mechanisms. Finally, we introduce the architecture application practice in a rural community in Central China.
ISBN: 9781538637906 (print)
As the Internet of Vehicles (IoV) flourishes and sensor-generated data becomes ubiquitous, IoV applications with widely differing performance requirements are emerging. To meet these requirements, different distributed data processing system (DDPS) clusters have to coexist, e.g., a stream processing cluster for real-time tasks and a batch processing cluster for statistics-based data mining tasks. However, maintaining several kinds of clusters is neither economical nor convenient: developers and administrators have to be familiar with every DDPS involved, and deploying multiple DDPSs wastes resources compared with deploying one. Based on these observations, this paper proposes TDAG as a solution. TDAG allows users to adjust data processing from the streaming style to the batch style by encapsulating the input data with specific packing strategies. We have implemented TDAG in a prototype called TStream. Experimental tests show that TStream is both effective and efficient.
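The idea of moving between streaming and batch processing by packing input records can be illustrated with a minimal sketch. It is not the TDAG or TStream API, which the abstract does not describe; `pack_by_count`, `run`, and the count-based strategy are hypothetical names and assumptions chosen only for illustration.

```python
from typing import Callable, Iterable, Iterator, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def pack_by_count(records: Iterable[T], size: int) -> Iterator[List[T]]:
    """Hypothetical packing strategy: group a record stream into packs
    of at most `size` records each."""
    if size < 1:
        raise ValueError("size must be at least 1")
    pack: List[T] = []
    for record in records:
        pack.append(record)
        if len(pack) == size:
            yield pack
            pack = []
    if pack:
        # Flush the final, possibly shorter, pack.
        yield pack


def run(records: Iterable[T], size: int, operator: Callable[[List[T]], R]) -> List[R]:
    """Apply the same operator to every pack (illustrative driver only)."""
    return [operator(pack) for pack in pack_by_count(records, size)]


if __name__ == "__main__":
    readings = range(5)
    # size=1: one record per operator call, i.e. streaming-like behavior.
    print(run(readings, 1, sum))
    # size >= number of records: a single call over all data, i.e. batch-like.
    print(run(readings, 5, sum))
```

In this sketch the pack size is the only knob: the same operator runs unchanged in both modes, and only the packing of the input differs.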
ISBN: 9781479942930 (print)
As processors and systems on chip in the embedded world increasingly become multicore, parallel programming remains a difficult, time-consuming and complicated task. End users who are not parallel programming experts have a need to exploit such processors and architectures, using high level programming languages, like Scilab or MATLAB. The ALMA toolset solves this problem: it takes Scilab code as input and produces parallel code for embedded multiprocessor systems on chip, using platform quasi-agnostic optimizations. The platform information is provided by an architecture description language designed for the purpose of a flexible system description as well as simulation. A hierarchical system description in combination with a parameterizable simulation environment allows fine-grained trade-offs between simulation performance and simulation accuracy.