This paper presents the PuRE/EPOS approach to managing the high complexity of adaptable operating systems and to narrowing the distance between application and operating system. A system designed according to the...
Real-Time Fiber Communications (RTFC) is a gigabit-speed network designed for damage-tolerant local area networks. In addition to its damage-tolerant characteristics, it has several features that make it...
This paper describes a technique that allows an MPI code to be encapsulated into a component. Our technique is based on an extension to the Common Object Request Broker Architecture (CORBA) from the OMG (Object Management Group). The proposed extensions do not modify the CORBA core infrastructure (the Object Request Broker), so they can fully coexist with existing CORBA applications. An MPI code is seen as a new kind of CORBA object that hides most of the cumbersome problems of dealing with parallelism. This technique can be used to connect MPI codes to existing CORBA software infrastructures that are now being developed in the framework of several research and development projects such as JACO3, JULIUS or TENT from DLR. To illustrate the concept of a parallel CORBA object, we present a virtual reality application that couples a light simulation application (radiosity) with a visualization tool using VRML and Java.
A methodology is described for avoiding the bottlenecks that typically arise when data consumers must access and process large amounts of data that were generated on, and reside on, other hosts, and that must pass through a central data cache before the consumer can use them. It rests on the premise that the end result rendered by a data consumer can be produced from a reduced data set that has been distilled or filtered from the original data set. Data distribution bottlenecks for visualization applications are reduced by transferring considerably distilled visual data instead of large amounts of raw data.
ISBN: (print) 0769500048
This paper proposes a novel queue-based programming abstraction, Parallel Dispatch Queue (PDQ), that enables efficient parallel execution of fine-grain software communication protocols. Parallel systems often use fine-grain software handlers to integrate a network message into computation. Executing such handlers in parallel requires access synchronization around resources. Much as a monitor construct in a concurrent language protects accesses to a set of data structures, PDQ allows messages to include a synchronization key protecting handler accesses to a group of protocol resources. By simply synchronizing messages in a queue prior to dispatch, PDQ not only eliminates the overhead of acquiring and releasing synchronization primitives but also prevents busy-waiting within handlers. In this paper, we study PDQ's impact on software protocol performance in the context of fine-grain distributed shared memory (DSM) on an SMP cluster. Simulation results from running shared-memory applications indicate that: (i) parallel software protocol execution using PDQ significantly improves performance in fine-grain DSM, (ii) tight integration of PDQ and embedded processors into a single custom device can offer performance competitive with or better than an all-hardware DSM, and (iii) PDQ best benefits cost-effective systems that use idle SMP processors (rather than custom embedded processors) to execute protocols. On a cluster of four 16-way SMPs, a PDQ-based parallel protocol running on idle SMP processors improves application performance by a factor of 2.6 over a system running a serial protocol on a single dedicated processor.
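The key-based dispatch idea in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: the class name `ParallelDispatchQueue`, its methods, and the demo are hypothetical, and the Python lock that guards the queue itself stands in for whatever mechanism the paper uses, so the sketch shows only the dispatch semantics (handlers that share a synchronization key never run concurrently and take no locks themselves), not the performance claims.

```python
import threading
from collections import deque


class ParallelDispatchQueue:
    """Toy model: messages sharing a key are never dispatched concurrently."""

    def __init__(self):
        self._cond = threading.Condition()  # guards queue state, not handler data
        self._pending = deque()             # (key, handler, payload), arrival order
        self._busy_keys = set()             # keys with a handler currently running
        self._closed = False

    def enqueue(self, key, handler, payload):
        with self._cond:
            self._pending.append((key, handler, payload))
            self._cond.notify_all()

    def close(self):
        """Signal that no further messages will arrive."""
        with self._cond:
            self._closed = True
            self._cond.notify_all()

    def _take(self):
        """Return the oldest message whose key is free; None once drained."""
        with self._cond:
            while True:
                for i, msg in enumerate(self._pending):
                    if msg[0] not in self._busy_keys:
                        del self._pending[i]
                        self._busy_keys.add(msg[0])
                        return msg
                if self._closed and not self._pending:
                    return None
                self._cond.wait()  # sleep in the dispatcher, not in a handler

    def worker(self):
        """Dispatch loop run by each protocol thread."""
        while True:
            msg = self._take()
            if msg is None:
                return
            key, handler, payload = msg
            try:
                handler(payload)  # handler body takes no locks
            finally:
                with self._cond:
                    self._busy_keys.discard(key)
                    self._cond.notify_all()


def run_demo(num_workers=4, num_keys=4, per_key=50):
    """Each key owns one counter and one log, updated without handler locks."""
    counters = {k: 0 for k in range(num_keys)}
    order = {k: [] for k in range(num_keys)}

    def handler(payload):
        key, seq = payload
        counters[key] += 1
        order[key].append(seq)

    pdq = ParallelDispatchQueue()
    for seq in range(per_key):
        for key in range(num_keys):
            pdq.enqueue(key, handler, (key, seq))
    pdq.close()
    threads = [threading.Thread(target=pdq.worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counters, order
```

Calling `run_demo()` returns the per-key counters and the order in which each key's messages were handled. Because a message is skipped while its key is busy and the queue is scanned oldest first, messages with the same key are handled in arrival order while different keys proceed in parallel.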
A parallel version of a Genetic Algorithm is presented and implemented on a cluster of workstations. Even though our algorithm is general enough to be applied to a wide variety of problems, we used it to obtain optima...
With the advent of cheap and powerful hardware for workstations and networks, a new cluster-based architecture for parallel processing applications has been envisioned. However, fine-grained asynchronous applications ...
This paper proposes a method for implementing fractal image compression on a dynamically reconfigurable architecture. In the encoding of this compression, metric computations among image blocks are the most time consumi...
A basic prerequisite for parallel programming is a good communication API. The recent interest in using Java for scientific and engineering applications has led to several international efforts to produce a message pas...
The branch-and-bound problem involves determining the leaf of minimum cost in a cost-labelled, heap-ordered tree, subject to the constraint that only the root is known initially and that the children of each node are ...
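As a rough illustration of the problem statement above, the following sequential best-first sketch finds the minimum-cost leaf. It is not the paper's algorithm. It assumes that a node's children become known only when the node is expanded (the listing truncates that clause) and that heap order means no child costs less than its parent. The function name `min_cost_leaf`, the `cost` and `children` callbacks, and the example tree are all hypothetical.

```python
import heapq
from itertools import count


def min_cost_leaf(root, cost, children):
    """Best-first search for the cheapest leaf of a heap-ordered tree.

    cost(node) returns the node's label; children(node) reveals its children
    (an empty list for a leaf). Since no child is cheaper than its parent,
    the first leaf popped from the frontier is a minimum-cost leaf.
    """
    tie = count()  # tie-breaker so equal-cost nodes are never compared directly
    frontier = [(cost(root), next(tie), root)]
    visited = 0
    while frontier:
        c, _, node = heapq.heappop(frontier)
        kids = children(node)  # the only way to learn a node's children
        visited += 1
        if not kids:
            return node, c, visited
        for kid in kids:
            heapq.heappush(frontier, (cost(kid), next(tie), kid))
    raise ValueError("tree has no nodes")


# Hypothetical example tree: leaves are c (7), d (3) and e (5).
TREE = {"r": ["a", "b"], "a": ["c", "d"], "b": ["e"], "c": [], "d": [], "e": []}
COST = {"r": 0, "a": 2, "b": 1, "c": 7, "d": 3, "e": 5}

result = min_cost_leaf("r", COST.__getitem__, TREE.__getitem__)
print(result)  # ('d', 3, 4)
```

The search pops `r`, `b`, `a` and then `d`, so leaf `c` is never examined: its cost of 7 can only be read after `a` is expanded, and by then the cheaper leaf `d` is already at the front of the frontier.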