This paper presents algorithms and principles for the implementation of a novel broadband multimedia service called "TV-anytime". The basic idea of this service is to store broadcast media assets on media se...
详细信息
Efficient job-scheduling strategies are important to improve the performance and usability of Hpc-clusters. In this paper we evaluate job-scheduling strategies (FCFS, SJF, and LJF) used in the resource management syst...
详细信息
In modern resource management systems for supercomputers and Hpc-clusters the job-scheduler plays a major role in improving the performance and usability of the system. The performance of the used scheduling policies ...
详细信息
Distributed high-performance computing - so-called metacomputing - refers to the coordinated use of a pool of geographically distributed high-performance computers. The user's view of an ideal metacomputer is that...
详细信息
Distributed high-performance computing - so-called metacomputing - refers to the coordinated use of a pool of geographically distributed high-performance computers. The user's view of an ideal metacomputer is that of a powerful monolithic virtual machine. The implementor's view, on the other hand, is that of a variety of interacting services implemented in a scalable and extensible manner. In this paper, we present MOL, the Metacomputer Online environment. In contrast to other metacomputing environments, MOL is not based on specific programming models or tools. It has rather been designed as an open, extensible software system comprising a variety of software modules, each of them specialized in serving one specific task such as resource scheduling, job control, task communication, task migration, user interface, and much more. All of these modules exist and are working. The main challenge in the design of MOL lies in the specification of suitable, generic interfaces for the effective interaction between the modules.
A performance prediction method is presented, which accurately predicts the expected program execution time on massively parallel systems. We consider distributed-memory architectures with SMD nodes and a fast communi...
详细信息
High efficient implementations of parallel algorithms need high efficient sequential kernels. Therefore, libraries like BLAS are successfully used in many numerical applications. In this paper we show the tradeoff bet...
详细信息
This paper presents bounds on the quality of partitions induced by space-filling curves. We compare the surface that surrounds an arbitrary index range with the optimal partition in the grid, i. e. the square. It is s...
详细信息
Several performance analysis tools have been developed with the drawback of dedicated hardware solutions or the compute intenseness of simulations. The modern microprocessors, with hardware support for counting of sys...
详细信息
ISBN:
(纸本)9783540634409
Several performance analysis tools have been developed with the drawback of dedicated hardware solutions or the compute intenseness of simulations. The modern microprocessors, with hardware support for counting of system hardware events, now make possible universal software tools for the performance analysis of complex application programs such as the SPEC benchmarks. In this paper, we present a new method to determine system resource utilization (cache miss ratios, CPI values, branch miss predictions) of arbitrary programs, based on a sampling technique, combined with access to processor-internal event counter registers. We present the sprof tool set that is based on this method and enables also the detailed analysis of individual subroutines of a program, as they are executed over time. The high accuracy and the negligible overhead of the tool set is demonstrated. We used the SPEC95 benchmark suite, consisting of 8 integer and 10 floating-point intensive non-trivial programs that are commonly used to define the performance of workstations and servers. As an example, we present the analysis of a SPEC CPU95 benchmark program on different processor architectures.
Exponentially growing capacities of disk drives have increased the problem that not only a complete disk can fail, but also individual, small groups of sectors can be erroneous. These sector errors are especially crit...
详细信息
This paper addresses the potential benefit of sharing jobs between independent sites in a grid computing environment. Also the aspect of parallel multi-site job execution on different sites is discussed. To this end, ...
详细信息
暂无评论