With the proliferation of workstation clusters connected by high-speed networks, providing efficient system support for concurrent applications engaging in nontrivial interaction has become an important problem. Two principal barriers to harnessing parallelism are: one, efficient mechanisms that achieve transparent dependency maintenance while preserving semantic correctness, and two, scheduling algorithms that match coupled processes to distributed resources while explicitly incorporating their communication costs. This paper describes a set of performance features, their properties, and their implementation in a system support environment called DUNES that achieves transparent dependency maintenance - IPC, file access, memory access, process creation/termination, process relationships - under dynamic load balancing. The two principal performance features are push/pull-based active and passive end-point caching and communication-sensitive load balancing. Collectively, they mitigate the overhead introduced by the transparent dependency maintenance mechanisms. Communication-sensitive load balancing, in addition, affects the scheduling of distributed resources to application processes, where both communication and computation costs are explicitly taken into account. DUNES' architecture endows commodity operating systems with distributed operating system functionality while achieving transparency with respect to their existing application base. DUNES also preserves semantic correctness with respect to single-processor semantics. We show performance measurements of a UNIX-based implementation on Sparc and x86 architectures over high-speed LAN environments, and show that significant performance gains in terms of system throughput and parallel application speed-up are achievable.
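The placement policy behind communication-sensitive load balancing can be sketched as a cost minimization over hosts, where each candidate host is charged for its current computation load plus a penalty for every peer the process would have to reach over the network. The function below is an illustrative sketch only; the names and the uniform per-peer penalty are assumptions, not DUNES' actual cost model.

```python
# Hypothetical sketch of communication-sensitive process placement:
# pick the host minimizing (computation load + communication penalty).
# All names and weights are illustrative, not taken from DUNES.

def place_process(process_peers, host_loads, peer_host, comm_cost=1.0):
    """Return the host with the lowest combined cost.

    process_peers: ids of processes this process communicates with
    host_loads:    {host: current CPU load}
    peer_host:     {peer id: host it currently runs on}
    comm_cost:     penalty per peer located on a different host
    """
    def cost(host):
        # Peers on other hosts incur a communication penalty.
        remote_peers = sum(1 for p in process_peers if peer_host.get(p) != host)
        return host_loads[host] + comm_cost * remote_peers
    return min(host_loads, key=cost)

# A process whose peers run on "a" is drawn toward "a", even though
# "b" is the less loaded host.
hosts = {"a": 0.2, "b": 0.1}
print(place_process(["p1", "p2"], hosts, {"p1": "a", "p2": "a"}))  # → a
```

A purely load-based balancer would choose "b" here; charging for communication keeps coupled processes co-located unless the load imbalance outweighs the traffic.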
A desired mesh architecture, based on connected-cycle modules, is constructed. To enhance the reliability, multiple bus sets and spare nodes are dynamically inserted to construct modular blocks. Two reconfiguration sc...
This paper presents the SCOOPP (SCalable Object Oriented Parallel Programming) approach to support the design and execution of scalable parallel applications. The SCOOPP programming model aims at the portability, dynamic scalability, and efficiency of parallel applications. SCOOPP is a hybrid compile- and run-time system, which can perform parallelism extraction, supports explicit parallelism, and performs dynamic granularity control at run-time. The mechanism that supports dynamic grain-size adaptation is presented and its performance evaluated on two parallel systems. The measured results show the feasibility of the proposed dynamic grain-size adaptation and a scalability improvement of parallel applications over static parallel OO environments, which suggests cost benefits in developing scalable parallel applications to run on multiple platforms.
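The core idea of dynamic grain-size adaptation is that when individual tasks are cheaper than the overhead of running them in parallel, the run-time packs consecutive fine grains into a coarser one. The sketch below illustrates that idea under assumed names and cost estimates; it is not SCOOPP's actual run-time mechanism.

```python
# Illustrative grain-packing sketch (not SCOOPP's implementation):
# consecutive fine-grained tasks are merged until each grain's total
# estimated cost exceeds the per-task parallel spawn overhead.

def pack_grains(task_costs, spawn_overhead):
    """Group consecutive task indices into grains whose accumulated
    cost exceeds `spawn_overhead`; returns a list of index lists."""
    grains, current, acc = [], [], 0.0
    for i, cost in enumerate(task_costs):
        current.append(i)
        acc += cost
        if acc > spawn_overhead:      # grain is now worth spawning
            grains.append(current)
            current, acc = [], 0.0
    if current:                       # leftover cheap tasks join the last grain
        if grains:
            grains[-1].extend(current)
        else:
            grains.append(current)
    return grains

# Eight unit-cost tasks with spawn overhead 2.5 collapse into two grains.
print(pack_grains([1.0] * 8, 2.5))   # → [[0, 1, 2], [3, 4, 5, 6, 7]]
```

With a lower overhead (or costlier tasks) the same code leaves the grains fine; the adaptation is entirely driven by the measured overhead-to-work ratio.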
As high-performance embedded computing systems become more commonplace in a variety of applications, the need for supporting standards becomes more critical. Specifications developed by consensus, such as the Message Passing Interface (MPI) and Vector, Signal, and Image Processing (VSIP), and 'de facto' standards such as MATLAB, provide a means for developers to create real-time applications across multiple platform technologies. The balance between portability and performance presents some significant challenges, including balancing the application of tools tuned to specific platforms with the use of standard, but possibly slower, code and tools.
We present the ParAL system, which compiles Matlab scripts into C programs with calls to a parallel run-time library. The novel feature of the compiler is the optimization of array alignment, which reduces or eliminates unnecessary communication overheads. We have evaluated this technique on several Matlab codes. For comparison, the same applications were hand-coded using the PBLAS library. The aligned codes were on average 43% faster than the misaligned codes, with a speedup factor of almost 4 achieved in some cases. This optimization technique enabled ordinary Matlab scripts to run at a speed similar to manually optimized PBLAS codes.
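Why alignment eliminates communication can be seen with a toy block distribution: when two distributed arrays map corresponding elements to the same process, an elementwise operation is purely local, while a shifted alignment forces data to move. The distribution and names below are assumptions for illustration, not ParAL internals.

```python
# Toy model of alignment-induced communication (illustrative only).
# Arrays are block-distributed over `nprocs` processes; we count how
# many elements of B must travel to A's owner to compute C = A + B
# when B's alignment is shifted by `shift` elements relative to A.

def owner(index, block, nprocs):
    """Block-cyclic owner of element `index`."""
    return (index // block) % nprocs

def elements_communicated(n, block, nprocs, shift):
    """Elements whose owners differ under the shifted alignment."""
    return sum(1 for i in range(n)
               if owner(i, block, nprocs) != owner(i + shift, block, nprocs))

# Perfectly aligned arrays need no communication; a one-block shift
# misplaces every element.
print(elements_communicated(64, 8, 4, shift=0))   # → 0
print(elements_communicated(64, 8, 4, shift=8))   # → 64
```

An alignment-optimizing compiler effectively chooses distributions that drive this count to zero for as many operations as possible, which is where the reported speedups come from.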
ISBN (print): 0769501433
Software distributed shared memory (DSM) systems have successfully provided the illusion of shared memory on distributed-memory machines. However, most software DSM systems use the main memory of each machine as a level in a cache hierarchy, replicating copies of shared data in local memory. Since computer memories tend to be much larger than caches, DSM systems have largely ignored memory capacity issues, assuming there is always enough space in main memory in which to replicate data. Applications that access data exceeding the capacity available in local memory will page to disk, resulting in reduced performance. We have developed a software DSM system based on Cashmere that takes advantage of system-wide memory resources in order to reduce or eliminate paging overhead. Experimental results on a 4-node, 16-processor AlphaServer system demonstrate the improvement in performance using the enhanced software DSM system for applications with large data sets.
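The idea of using system-wide memory instead of disk can be sketched as an eviction policy: when local memory fills, a victim page is parked in another node's free memory, and a later fault fetches it over the network rather than from disk. This is a minimal illustration under assumed names; it is not Cashmere's protocol.

```python
# Minimal sketch of remote-memory paging (illustrative, not Cashmere):
# evicted pages go to remote nodes' free memory, so a later access is
# a network fetch instead of a disk read.

class RemotePager:
    def __init__(self, local_capacity):
        self.local_capacity = local_capacity
        self.local = {}      # page id -> data resident in local memory
        self.remote = {}     # page id -> data parked in remote memory

    def access(self, page_id):
        """Return the page, faulting it in from remote memory if needed."""
        if page_id in self.local:
            return self.local[page_id]
        data = self.remote.pop(page_id, b"")   # network fetch, not disk I/O
        if len(self.local) >= self.local_capacity:
            victim = next(iter(self.local))    # trivial FIFO-ish eviction
            self.remote[victim] = self.local.pop(victim)
        self.local[page_id] = data
        return data

pager = RemotePager(local_capacity=2)
for p in ("a", "b", "c"):
    pager.access(p)          # "a" is evicted to remote memory, not to disk
print(sorted(pager.remote))  # → ['a']
```

The benefit rests entirely on the latency gap: a LAN round trip is orders of magnitude cheaper than a disk access, so a working set that exceeds one node's memory can still avoid paging as long as the cluster's aggregate memory holds it.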
In order for parallel logic programming systems to become popular, they should serve the broadest range of applications. To achieve this goal, designers of parallel logic programming systems would like to exploit maximum parallelism for existing and novel applications, ideally by supporting both and-parallelism and or-parallelism. Unfortunately, combining both forms of parallelism is a hard problem, and available proposals cannot match the efficiency of, say, or-parallel-only systems. We propose a novel approach to And/Or parallelism in logic programs. Our initial observation is that stack copying, the most popular technique in or-parallel systems, does not work well with And/Or systems because memory management is much more complex. Copying is also a significant problem in operating systems, where copy-on-write (COW) has been developed to address it. We demonstrate that this technique can also be applied to And/Or systems, and present both shared memory and distributed shared memory designs.
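The copy-on-write principle the abstract borrows from operating systems can be shown in a few lines: a "copy" of a stack initially shares all of its segments, and a segment is physically duplicated only on the first write to it. The class and segment granularity below are illustrative assumptions, not the paper's design.

```python
# Minimal copy-on-write sketch (illustrative only): forking a stack
# shares its segments; a segment is duplicated lazily on first write.

class COWStack:
    def __init__(self, segments):
        self.segments = segments      # list of segments (lists), possibly shared
        self.private = set()          # segment indices already copied locally

    def fork(self):
        """Cheap copy: share every segment, duplicate nothing yet."""
        return COWStack(list(self.segments))

    def write(self, seg, slot, value):
        if seg not in self.private:   # first write to a shared segment:
            self.segments[seg] = list(self.segments[seg])  # duplicate it now
            self.private.add(seg)
        self.segments[seg][slot] = value

parent = COWStack([[1, 2], [3, 4]])
child = parent.fork()                 # O(number of segments), not O(stack size)
child.write(0, 0, 99)                 # only segment 0 is physically copied
print(parent.segments[0], child.segments[0])   # → [1, 2] [99, 2]
```

The appeal for And/Or systems is the same as for `fork()` in an OS: workers that mostly read a shared stack pay copying cost only for the portions they actually modify.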
Researchers and practitioners in the area of parallel and distributed computing have been lacking a portable, flexible, and robust distributed instrumentation system. We present the Baseline Reduced Instrumentation System Kernel (BRISK), which we have developed as part of a real-time system instrumentation and performance visualization project. The design is based on a simple distributed instrumentation system model for flexibility and extensibility. The basic implementation poses minimal system requirements and achieves high performance. We show evaluations of BRISK using two distinct configurations: one emphasizes isolated simple performance metrics; the other, BRISK's operation on distributed applications, its built-in clock synchronization, and dynamic on-line sorting algorithms.
ISBN (print): 0769500048
Much research has been done in fast communication on clusters and in protocols for supporting software shared memory across them. However, the end performance of applications that were written for the more proven hardware-coherent shared memory is still not very good on these systems. Three major layers of software (and hardware) stand between the end user and parallel performance, each with its own functionality and performance characteristics: the communication layer, the software protocol layer that supports the programming model, and the application layer. These layers provide a useful framework to identify the key remaining limitations and bottlenecks in software shared memory systems, as well as the areas where optimization efforts might yield the greatest performance improvements. This paper performs such an integrated study, using this layered framework, for two types of software distributed shared memory systems: page-based shared virtual memory (SVM) and fine-grained software systems (FG). For the two system layers (communication and protocol), we focus on the performance costs of basic operations in the layers rather than on their functionalities; this is possible because their functionalities are now fairly mature. The less mature application layer is treated through application restructuring. We examine the layers individually and in combination, understanding their implications for the two types of protocols and exposing the synergies among layers.
Our study of a large set of scientific applications over the past three years indicates that the processing for multi-dimensional datasets is often highly stylized. The basic processing step usually consists of mapping the individual input items to the output grid and computing output items by aggregating, in some way, all the input items mapped to the corresponding grid point. In this paper, we discuss the design and performance of T2, an infrastructure for building parallel database systems that integrates storage, retrieval, and processing of multi-dimensional datasets. It achieves its primary advantage from the ability to integrate data retrieval and processing for a wide variety of applications and from the ability to maintain and jointly process multiple datasets with different underlying grids. We present preliminary performance results comparing the implementation of two applications using the T2 services with custom-built integrated implementations.
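The stylized processing step described above, mapping input items to grid points and then aggregating everything that lands on the same point, can be sketched directly. The function names and the sensor-reading example are illustrative assumptions, not T2's API.

```python
# Sketch of the map-then-aggregate processing step (names are
# illustrative, not T2's interface): each input item is mapped to an
# output grid point, and co-located items are folded together.

def process(items, to_grid, aggregate, initial):
    """items:     iterable of input items
    to_grid:   item -> grid point (hashable)
    aggregate: (accumulator, item) -> accumulator
    initial:   starting accumulator value for each grid point"""
    grid = {}
    for item in items:
        point = to_grid(item)
        grid[point] = aggregate(grid.get(point, initial), item)
    return grid

# Example: sum sensor readings into 10x10-unit cells of a 2-D output grid.
readings = [((3, 7), 1.5), ((12, 7), 2.0), ((5, 2), 0.5)]
cells = process(readings,
                to_grid=lambda r: (r[0][0] // 10, r[0][1] // 10),
                aggregate=lambda acc, r: acc + r[1],
                initial=0.0)
print(cells)   # → {(0, 0): 2.0, (1, 0): 2.0}
```

Because the per-point aggregation is associative here, the loop parallelizes naturally: each processor can aggregate its own partition of the input and the partial grids can be merged afterwards, which is the structure an infrastructure like T2 exploits.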