In this paper an adaptive matrix multiplication algorithm for dynamic heterogeneous environments is developed and evaluated. Unlike the state-of-the-art approaches, where load balancing is achieved through unequal dis...
ISBN: 140207008X (print)
The network computing industry has eagerly embraced technologies, welcoming an ever-increasing variety of new service discovery protocols and object architectures. With this abundance now offered across a wide collection of environments, technologies that offer standardized interfaces for the discovery process, while supporting communication for several different types of service access technologies, will provide the greatest achievable interoperability and resilience in the long term. In this paper, we introduce a distributed architecture based on directory services that significantly reduces the complexity of managing the information and services required to support next-generation networked applications, by providing automatic service discovery and a single coherent model for representing the data managed by supporting services. Standards-based solutions are used, and we present a prototype implementation of the CORBA Naming Service designed to illustrate how the architecture incorporates distributed object models, directory services, and multicast-based dynamic service discovery.
The aim of the paper is to propose a three-dimensional graphics chip with a parallel architecture, in accordance with which the chip of the NURBS algorithm was structured. This architecture presents a regular and easily scala...
ISBN: 3540440496 (print)
The proceedings contain 140 papers. The special focus in this conference is on Parallel Processing. The topics include: orchestrating computations on the World-Wide Web; non-massive, non-high performance, distributed computing; facts on performance evaluation and its dependence on workloads; concepts and technologies for a worldwide grid infrastructure; a performance analysis tool for distributed and parallel programs; a hybrid strategy for automated performance problem searches; on the scalability of tracing mechanisms; component based problem solving environment; integrating temporal assertions into a parallel debugger; performance evaluation, analysis and optimization; prototyping and verifying stream-processing systems; symbolic cost estimation of parallel applications; performance modeling and interpretive simulation of PIM architectures and applications; extended overhead analysis for OpenMP; a call-graph based automatic tool for capture of hardware performance metrics for MPI and OpenMP applications; performance tuning through source code interdependence; on scheduling task-graphs to LogP-machines with disturbances; optimal scheduling algorithms for communication constrained parallel processing; an automatic scheduler for parallel machines; non-approximability results for the hierarchical communication problem with a bounded number of clusters; non-approximability of the bulk synchronous task scheduling problem; adjusting time slices to apply coscheduling techniques in a non-dedicated NOW; a semi-dynamic multiprocessor scheduling algorithm with an asymptotically optimal competitive ratio; tiling and memory reuse for sequences of nested loops; towards detection of coarse-grain loop-level parallelism in irregular computations; and parallel and distributed databases, data mining and knowledge discovery.
ISBN: 9783540362654 (digital)
ISBN: 3540003037 (print)
We introduce Aksum, a novel system for performance analysis that helps programmers locate and understand performance problems in message passing, shared memory, and mixed parallel programs. The user must provide the set of problem and machine sizes for which performance analysis should be conducted. The search for performance problems (properties) is user-controllable by restricting the performance analysis to specific code regions, by creating new or customizing existing property specifications and property hierarchies, by indicating the maximum search time and the maximum time a single experiment may take, by providing thresholds that define whether or not a property is critical, and by indicating conditions under which the search for properties stops. Aksum automatically selects and instruments code regions to collect the raw performance data from which performance properties are computed. Heuristics are incorporated to prune the search for performance properties. We have implemented Aksum as a portable Java-based distributed system which displays all properties detected during the search process together with the code regions that cause them. A filtering mechanism allows the examination of properties at various levels of detail. We present an experiment with a financial modeling application to demonstrate the usefulness and effectiveness of our approach.
OpenMP is a relatively new industry standard for programming parallel computers with a shared memory programming model. Given that clusters of workstations are a cost-effective solution for building parallel platforms...
For smooth problems spectral element methods (SEM) exhibit exponential convergence and have been very successfully used in practical problems. However, in many engineering and scientific applications we frequently enc...
Proxy caches are essential for improving the performance of the World Wide Web and reducing user-perceived latency. In this paper, we propose a new Web object based policy to manage the storage system of a proxy cache. We ...
Two-level carry look-ahead (CLA) and novel parallel-prefix Ling adder architectures are introduced in this paper. The adders resulting from these architectures, as well as from the well-known single-level CLA Ling adder architecture, are compared against traditional adders using CMOS realizations. The results reveal that Ling architectures are more attractive in several cases.
This paper describes an interactive activation model of eye movement control in reading, Glenmore, that can account within one mechanism for preview and spillover effects, and for regressions, progressions, and refixations. The model decouples the decision about when to move the eyes from the word recognition process. The time course of activity in a "fixate centre" determines the triggering of a saccade. The other main feature of the model is the use of a saliency map that acts as an arena for the interplay of bottom-up visual features of the text and top-down lexical features. These factors combine to create a pattern of activation that selects one word as the saccade target. Even within the relatively simple framework proposed here, a coherent account has been provided for a range of eye movement control phenomena that have hitherto proved problematic to reconcile.