ISBN: 9781450304535 (print)
The proceedings contain 10 papers. The topics discussed include: CRM-OO-VM: a checkpointing-enabled Java VM for efficient and reliable e-science applications in grids; multi-domain grid/cloud computing through a hierarchical component-based middleware; a scheduling model for workflows on grids and clouds; adaptive threshold-based approach for energy-efficient consolidation of virtual machines in cloud data centers; resource allocation across multiple cloud data centers; optimizing the pre-processing of scientific visualization techniques using QEF; a middleware for parallel processing of large graphs; Mooshabaya - mashup generator for XBaya; the dark energy survey data management system as a data intensive science gateway; and BISSA: empowering web-gadget communication with tuple spaces.
In this work we present a design space exploration of the memory subsystem of our configurable CoreVA VLIW architecture. The development of resource efficient processor architectures is based on a two-stage tool flow ...
ISBN: 9783642143892 (print)
We present a concept of a parallel implementation of a novel 3-D model of tumor growth. The model is based on particle dynamics, where particles are the building blocks of normal, cancerous and vascular tissues. The dynamics of the system is also driven by processes at microscopic scales (e.g. the cell life-cycle), by diffusive substances (nutrients and TAF, tumor angiogenic factors), and by blood flow. We show that the cell life-cycle (particle production and annihilation), the existence of elongated particles, and the influence of continuum fields and blood flow in capillaries make the model much harder to parallelize than standard MD codes. We present preliminary timings of our parallel implementation and discuss the perspectives of our approach.
ISBN: 9783642144028 (print)
In this paper we analyze energy usage in divisible load processing. Divisible load theory (DLT) applies to computations which can be divided into parts of arbitrary sizes, and the parts can be independently processed in parallel. The shortest schedule for divisible load processing is determined by the speed of computation and communication. Energy usage for such a time-optimum schedule is analyzed in this paper. We propose a simple model of energy consumption. Two states of the computing system are taken into account: an active state and an idle state with reduced energy consumption. Energy consumption is examined as a function of system parameters. We point out possible ways of energy conservation. It is demonstrated that energy can be saved by use of parallel processing.
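The two-state energy idea in the abstract above can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the paper's actual model: identical workers, an originator that sends load to them one after another, linear communication and computation costs (`C` and `A`, time per unit of load), and the hypothetical names `schedule`, `energy`, `p_active` and `p_idle`. The sketch does not attempt to reproduce the paper's conclusions.

```python
def schedule(V, m, A, C):
    """Split load V over m identical workers so that all finish together.

    Assumed setup: the originator sends chunks sequentially; worker i
    starts computing as soon as its chunk has arrived.
    """
    q = A / (A + C)                      # alpha[i+1] = q * alpha[i]
    weights = [q ** i for i in range(m)]
    total = sum(weights)
    alphas = [V * w / total for w in weights]
    T = alphas[0] * (C + A)              # first worker: receive, then compute
    return alphas, T


def energy(V, m, A, C, p_active, p_idle):
    """Return (schedule length, total energy) under a two-state power model.

    A worker draws p_active while receiving or computing and p_idle
    for the rest of the schedule (both are assumed constants).
    """
    alphas, T = schedule(V, m, A, C)
    total = 0.0
    for a in alphas:
        busy = a * (C + A)               # time spent receiving + computing
        total += p_active * busy + p_idle * (T - busy)
    return T, total
```

Calling `energy` for several values of `m` shows how schedule length and total energy move together or apart as the assumed parameters change.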
ISBN: 9783642143892 (print)
OpenMP and Intel Threading Building Blocks (TBB) are two parallel programming paradigms for multicore processors. They have a lot in common but were designed with different parallel execution models in mind. The performance gain of these two paradigms depends to a great extent on the overheads of their parallel mechanisms. Parallel overheads are inevitable, and therefore understanding their potential costs can help developers design more scalable applications. This paper presents a comparative study of OpenMP and TBB parallelization overheads. The study was conducted on a dual-core machine with two different compilers, the Intel compiler and Microsoft Visual Studio C++ 2008, and shows that the Intel compiler outperforms the Microsoft compiler. Nevertheless, the relative performance of TBB versus OpenMP depends mainly on how a specific compiler implements the parallel constructs.
ISBN: 9783642143892 (print)
An approach is presented for extracting both affine and non-linear synchronization-free slices in program loops. It requires an exact dependence analysis. To describe and implement the approach, the dependence analysis by Pugh and Wonnacott was chosen, in which dependences are found in the form of tuple relations. The approach is based on operations on integer tuple relations and sets, and it has been implemented and verified by means of the Omega project software. Results of experiments with the UTDSP benchmark suite are discussed. The speed-up and efficiency of parallel code produced by means of the approach are studied.
We present a case study parallelizing streaming aggregation on three different parallel hardware architectures. Aggregation is a performance-critical operation for data summarization in stream computing, and is common...
ISBN: 9783642143892 (print)
We propose here a parallel implementation of the multidimensional scaling (MDS) method, which can be used for visualization of large datasets of multidimensional data. Unlike traditional approaches, which employ classical minimization methods for finding the global optimum of the "stress function", we use a heuristic based on particle dynamics. This method allows avoiding local minima and converges to the global one. However, due to its O(N^2) complexity, the application of this method in data mining problems involving large datasets requires efficient parallel codes. We show that by employing both optimized Taylor's algorithm and a hybridized model of parallel computations, our solver is efficient enough to visualize multidimensional data sets consisting of 10^4 feature vectors in a matter of minutes.
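To make the quadratic cost mentioned in the abstract above concrete, here is a minimal sketch of evaluating an MDS stress function. The raw, unweighted form used here and the names `stress`, `D` and `X` are assumptions for illustration; the abstract does not specify the exact stress definition, and the particle-dynamics minimization itself is not shown.

```python
import math


def stress(D, X):
    """Raw MDS stress: sum over pairs of (target distance - embedded distance)^2.

    D is an N x N matrix of target dissimilarities (assumed symmetric),
    X is a list of N low-dimensional points. The double loop visits
    N*(N-1)/2 pairs, which is where the O(N^2) cost comes from.
    """
    n = len(X)
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(X[i], X[j])    # Euclidean distance in the embedding
            total += (D[i][j] - d) ** 2
    return total
```

Because each pair contributes independently, the pair loop is the natural unit to distribute across workers in a parallel code.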
Two real-valued signal models based on selective spanning with fast enumeration (SSFE) and layered orthogonal lattice detector (LORD) algorithms are implemented on an Nvidia graphics processing unit (GPU). A 2 x 2 mult...
ISBN: 9789077381557 (print)
Fuel assemblies are very expensive parts of the nuclear reactor. Initially they were used in Hungary for 3 years, now for 4 years, and soon they will stay in the core for 5 years. Each year only 1/3, 1/4 and later 1/5 of them is replaced; therefore the change of the fuel type is a lengthy process, with mixed cores used. The authorities require that the staff be trained on each particular core before they operate it. For this reason the simulator should be upgraded to simulate the exact behavior of each core foreseen for the next 5 years. The RETINA code (Reactor Thermo-hydraulics Interactive) is a 3D offline code developed in our department [5]. KIKO3D (Neutron Kinetics 3D) has also been developed in our Institute, in the Reactor Analysis Department [4]. Both of them should be integrated into our full-scope replica simulator, coupled, and stressed to operate in parallel in real time, using four high-power processors [6]. The simulation-specific details are discussed in the paper.