the molecular dynamics code CHARMM is a popular research tool for computational biology. An increasing number of researchers are currently looking for affordable and adequate platforms to execute CHARMM or similar cod...
详细信息
the goal of this research is to develop performance profiles of parallel and distributed applications in order to predict their execution time under different network conditions. this paper measures the resource requi...
Weighted reference counting is a very simple and efficient memory management system for multiprocessor architectures. this paper extends the weighted reference counting algorithm to work efficiently with cyclic data s...
详细信息
Weighted reference counting is a very simple and efficient memory management system for multiprocessor architectures. this paper extends the weighted reference counting algorithm to work efficiently with cyclic data structures.
In this paper, we examine some of the challenges present in providing support for OpenMP applications on a Software Distributed Shared Memory(DSM) based cluster system. We present detailed measurements of the performa...
详细信息
Derivatives of almost arbitrary functions can be evaluated efficiently by automatic differentiation whenever the functions are given in the form of computer programs in a high-level programming language such as Fortra...
详细信息
Derivatives of almost arbitrary functions can be evaluated efficiently by automatic differentiation whenever the functions are given in the form of computer programs in a high-level programming language such as Fortran, C, or C++. In contrast to numerical differentiation, where derivatives are only approximated, automatic differentiation generates derivatives that are accurate up to machine precision. Sophisticated software tools implementing the technology of automatic differentiation are capable of automatically generating code for the product of the Jacobian matrix and a so-called seed matrix. It is shown how these tools can benefit from concepts of shared memory programming to parallelize, in a completely mechanical fashion, the gradient operations associated with each statement of the given code. the feasibility of our approach is demonstrated by numerical experiments. they were performed with a code that was generated automatically by the Adifor system and augmented with OpenMP directives.
the present study deals with parallel asynchronous iterations applied to the numerical solution of the obstacle problem defined in a three-dimensional domain. For the considered problem, the convergence analysis of th...
详细信息
the present study deals with parallel asynchronous iterations applied to the numerical solution of the obstacle problem defined in a three-dimensional domain. For the considered problem, the convergence analysis of the algorithm is made. Finally, the implementation of the algorithms are presented and computational experiments on IBM-SP3 are analysed.
Emerging trends in heterogeneous distributed metacomputing and in Web Services technologies exhibit several commonalities that each domain can exploit. In this paper, we present an architectural model and design issue...
详细信息
Emerging trends in heterogeneous distributed metacomputing and in Web Services technologies exhibit several commonalities that each domain can exploit. In this paper, we present an architectural model and design issues in leveragingWeb Services to construct metacomputing frameworks. Our design is based on a combination of concepts currently embodied in the Harness system and those implemented by the Web Services Description Language and associated technologies. We begin by drawing parallels between component based metacomputing and the use of web services in the electronic commerce domain. We then suggest that the direct use of web service technologies is inappropriate and inefficient for highperformance distributed computing, but that natural extensions are possible to enhance the suitability of these technologies. Based on our proposed extensions, specifically with regard to component deployment, localization, and encoding, we describe an operational model, architectural layering, and high-level design for the Harness II metacomputing framework.
In this paper, we describe Peppermint and Sled: tools developed for evaluations of computer systems based on IA-64 processors. Sled generates trace from applications running on IA-64 processors, while Peppermint model...
暂无评论