In distributed hybrid computing systems, traditional sequential processors are loosely coupled with reconfigurable hardware for optimal performance. This loose coupling proves to be a communication challenge; the proc...
详细信息
In distributed hybrid computing systems, traditional sequential processors are loosely coupled with reconfigurable hardware for optimal performance. This loose coupling proves to be a communication challenge; the processor units cannot efficiently share a physical memory. This paper proposes a distributed shared memory architecture and a method for effective data migration within that shared memory. Data is moved using a novel garbage collection scheme, the dual semispace collector. The new garbage collector and the distributed memory prove to be an effective means of data migration in distributed hybrid computing systems.
Communication libraries have dramatically made progress over the fifteen years, pushed by the success of cluster architectures as the preferred platform for high performance distributed computing. However, many potent...
详细信息
Communication libraries have dramatically made progress over the fifteen years, pushed by the success of cluster architectures as the preferred platform for high performance distributed computing. However, many potential optimizations are left unexplored in the process of mapping application communication requests onto low level network commands. The fundamental cause of this situation is that the design of communication subsystems is mostly focused on reducing the latency by shortening the critical path. In this paper, we present a new communication scheduling engine which dynamically optimizes application requests in accordance with the NICs capabilities and activity. The optimizing code is generic and portable. The database of optimizing strategies may be dynamically extended.
A new class of Java multithreading based parallel approximate inverse preconditioning is introduced, for efficiently solving sparse arrow-type linear systems. The parallel Explicit Preconditioned Biconjugate Conjugate...
详细信息
ISBN:
(纸本)9780769529172;0769529178
A new class of Java multithreading based parallel approximate inverse preconditioning is introduced, for efficiently solving sparse arrow-type linear systems. The parallel Explicit Preconditioned Biconjugate Conjugate Gradient - STAB method for shared memory systems is presented in order to examine the parallel behavior of this scheme using explicit approximate inverses as the suitable preconditioner. Design and implementation issues of Java's multithreading techniques are also discussed. The performance in terms of speedups and parallel efficiencies of the method, using Java multithreading, is illustrated by solving sparse arrow-type linear systems. Static and dynamic workload scheduling systems implemented in Java and the results of their use are presented and discussed.
The proceedings contain 60 papers. The topics discussed include: evolutionary neural network prediction for software reliability modeling;software projects finance feasibility analysis in the context of project manage...
ISBN:
(纸本)9781604231847
The proceedings contain 60 papers. The topics discussed include: evolutionary neural network prediction for software reliability modeling;software projects finance feasibility analysis in the context of project management body of knowledge using 'use case points' estimation approach;correctness in the software life cycle;a framework for constraint checking involving aggregates for multiple XML databases using schematron;an extensive math query language;rank-based feature weighting;using text mining to extract gene and protein synonyms from biomedical texts;case study: a tool centric approach for fault avoidance in microchip designs;a common ground: extensible operating systems design principles;a parallel neural network classification approach to assist in decision making;probabilistic artificial neural networks for Arabic character recognition;and a simple model for distributed-feedback laser using adaptive neuro-fuzzy inference system.
The purpose of content-based image retrieval (CBIR) is to retrieve, from real data stored in a database, information that is relevant to a query. In remote sensing applications, the wealth of spectral information prov...
详细信息
The purpose of content-based image retrieval (CBIR) is to retrieve, from real data stored in a database, information that is relevant to a query. In remote sensing applications, the wealth of spectral information provided by last-generation (hyperspectral) instruments has quickly introduced the need for parallel CBIR systems able to effectively retrieve features of interest from ever-growing data archives. To address this need, this paper develops a new parallel CBIR system which has been specifically designed to be run on heterogeneous networks of computers (HNOCs). These platforms have soon become a standard computing architecture in remote sensing missions due to the distributed nature of data repositories. The proposed heterogeneous system first extracts an image feature vector able to characterize image content with sub-pixel precision, and then uses the obtained feature as a search reference. The system is validated using a complex hyperspectral image database, and implemented on several networks of workstations at University of Maryland.
The previous works in the area of checking integrity constraints in distributeddatabases have suggested that sufficient tests should be explored as these tests are cheaper than the complete tests and the initial cons...
详细信息
The previous works in the area of checking integrity constraints in distributeddatabases have suggested that sufficient tests should be explored as these tests are cheaper than the complete tests and the initial constraints due to their characteristics which can be executed at the local (target) site. This paper presents the various types of integrity tests as reported in previous works and discusses how these tests can significantly improve the performance of the constraint checking mechanisms with respect to the amount of data transferred across the network, without limiting to certain type of test as suggested by previous researchers in this area.
Whole genome comparison consists of comparing or aligning genome sequences with a goal of finding similarities between them. Previously we have shown how SIMD extensions used in Intel processors can be used to efficie...
详细信息
ISBN:
(纸本)9780769529172;0769529178
Whole genome comparison consists of comparing or aligning genome sequences with a goal of finding similarities between them. Previously we have shown how SIMD extensions used in Intel processors can be used to efficiently implement the, genome comparing, Smith-Waterman algorithm. Here we present distributed version of that algorithm. We show that on somewhat outdated hardware we can achieve speeds upwards of 8000 MCUPS; one of the fastest implementations of the Smith-Waterman algorithm.
Coupled clusters usually exhibit a heterogeneous but also hierarchical structure in terms of communication and computation. Therefore, it is inevitable to adapt parallel applications to such systems in order to gain r...
详细信息
ISBN:
(纸本)9780769529172;0769529178
Coupled clusters usually exhibit a heterogeneous but also hierarchical structure in terms of communication and computation. Therefore, it is inevitable to adapt parallel applications to such systems in order to gain reasonable performance results. Moreover, also regular benchmark tools are not capable of exposing the latent potential of such coupled cluster systems. Though without adapted (or better self-adapting) benchmark tools for such systems, it is almost not possible to forecast the scalability of well-adapted applications and one is not able to compare the possibly achievable performance in an application independent manner. In this paper we present such a fair, self-adapting and meaningful benchmark tool for heterogeneous coupled cluster systems, following the MPI standard.
Exudates are the primary signs of diabetic retinopathy which are mainly cause of blindness and could be prevented with an early screening process. Pupil dilation is required in the normal screening process but this af...
详细信息
ISBN:
(纸本)9780889866560
Exudates are the primary signs of diabetic retinopathy which are mainly cause of blindness and could be prevented with an early screening process. Pupil dilation is required in the normal screening process but this affects patients' vision. This paper investigated and proposed automatic methods of exudates detection on low-contrast images taken from non-dilated pupils. The process has two main segmentation steps which are coarse segmentation using Fuzzy C-Means clustering and fine segmentation using morphological reconstruction. Four features, namely intensity, standard deviation on intensity, hue and adapted edge, were selected for coarse segmentation. The detection results are validated by comparing with expert ophthalmologists' hand-drawn ground-truth. The sensitivity and specificity for our exudates detection are 86% and 99% respectively.
Grid computing focuses on making use of a very large amount of resources from a large-scale computing environment. It intends to deliver high-performance computing over distributed platforms for computation and data-i...
详细信息
Grid computing focuses on making use of a very large amount of resources from a large-scale computing environment. It intends to deliver high-performance computing over distributed platforms for computation and data-intensive applications. In this paper, we present an effective parallel hybrid asynchronous method to solve large sparse linear systems by the use of a grid computing platform Grid5000. This hybrid method combines a parallel GMRES(m) (generalized minimum residual) algorithm with the least square method that needs some eigenvalues obtained from a parallel Arnoldi algorithm. All of these algorithms run on the different processors of the platform Grid5000. Grid5000, a 5000 CPUs nation-wide infrastructure for research in grid computing, is designed to provide a scientific tool for computing. We discuss the performances of this hybrid method deployed on Grid5000, and compare these performances with those on the IBM SP series supercomputers.
暂无评论