GPU computing has been accepted as viable alternatives for compute-intensive applications. The radiative process is an important atmospheric physics process and the RRTM_LW radiative scheme has been chosen to be one o...
详细信息
ISBN:
(纸本)9780769544298
GPU computing has been accepted as viable alternatives for compute-intensive applications. The radiative process is an important atmospheric physics process and the RRTM_LW radiative scheme has been chosen to be one of the five benchmark kernels by NCAR for atmospheric applications. We accelerate the RRTM_LW scheme on three different GPU platforms and obtain a speedup of 27.6x. Three decomposition strategies are utilized to exploit data parallelisms existing in various kernels. A systematic performance analysis is performed in the aspects of GPU hardware features, execution configurations, register file utilizations and the application characteristics. Several observations are achieved: (1) the performance of GPU applications is affected greatly by clock rates;(2) a 5.4% to 9.5% performance discrepancy exists between various execution configurations;and (3) hardly any benefit can be brought to the state-heavy atmospheric applications by bounding the register file usage.
This paper discusses Desktop Grid and Volunteer computing System (DGVCS) security issues from a different point of view, namely that of an organisation security team that is external to the unit running DGVCS. In an e...
详细信息
Hadoop provides a sophisticated framework for cloud platform programmers, which, MapReduce is a programming model for large-scale data sets of parallelcomputing. By MapReduce distributed processing framework, we are ...
详细信息
This paper presents an efficient fault tolerance system for heterogeneous many-core processors. The efficiencies and coverage of the presented fault tolerance are optimized by customizing the techniques for different ...
详细信息
This paper describes a Role-based Access Control (RBAC) mechanism for distributed High Performance computing (HPC) systems that will facilitate scalable evaluation, management and enforcement of access control policie...
详细信息
Achieving an efficient realistic illumination is an important aim of research in computer graphics. In this paper a new parallel global illumination method for hybrid systems based on the hierarchical radiosity method...
详细信息
Achieving an efficient realistic illumination is an important aim of research in computer graphics. In this paper a new parallel global illumination method for hybrid systems based on the hierarchical radiosity method is presented. Our solution allows the exploitation of systems that combine independent nodes with multiple cores per node. Thus, multiple nodes work in parallel in the computation of the global illumination for the same scene. Within each node, all the available computational cores are used through a shared-memory multithreading approach. The good results obtained in terms of speedup on several distributed-memory and shared-memory configurations show the versatility of our hybrid proposal.
Network resources monitoring and management is critical to ensure security and load balance of network and information system, especially in the increasingly extensively used cloud computing and distributedparallel a...
详细信息
Multicore computing platforms have emerged as the most common computing platform to overcome challenges stemming from high power densities and thermal hot spots in conventional microprocessors. However, providing mult...
详细信息
The optimization process of large-scale computational problems presents various issues that need to be solved in order to achieve favorable results. One of the most significant challenges faced is the computational co...
详细信息
As the emergence of cloud computing brings the potential for large-scale data analysis to a broader community, architectural patterns for data analysis on the cloud, especially those addressing iterative algorithms, a...
详细信息
暂无评论