检索结果-内蒙古大学图书馆

Adapting the network interface for high-performance computing: The CNI approach

JOURNAL OF SUPERCOMPUTING 1997年第2期11卷 181-200页

作者： Sarkar, P Bailey, M RINCOM RES CORP TUCSON AZ 85711 USA

As the prices of commodity workstations go down, clusters of workstations have started to emerge as a viable economic solution for scalable computing. Recent advances in networking technology have made it possible to obtain high-bandwidth connections between applications. However, the interconnect latency between workstation nodes in a cluster remains a serious concern and can prove to be the limiting factor in workstation performance. In this paper, we present the CNI or cluster network interface that achieves the twin goals of low latency and high bandwidth. In addition, CNI efficiently supports multiple programming paradigms for programming generality. This is done by functionally coupling the network interface more closely to the CPU without violating the constraints of a standard workstation architecture. CNI results in performance gains for applications, substantially reducing communication overhead and delay.

关键词： network interfaces consistency cluster computing distributed shared memory

来源：评论

学校读者我要写书评

暂无评论

A parallel second-order Moller-Plesset gradient

引用

MOLECULAR PHYSICS 1997年第3期91卷 431-438页

作者： Fletcher, GD Rendell, AP Sherwood, P SERC DARESBURY LABCENT LAB RES COUNCILSWARRINGTON WA4 4ADCHESHIREENGLAND

A second order Moller-Plesset (MP2) energy gradient algorithm for distributed memory parallel computers is described. A direct approach is used in that integrals are recalculated as required, but the degree of recalculation is minimized by exploiting the large global memory typically available on parallel machines. Results, obtained using up to 256 processors of the Gray T3D show very good scalability, with over 99.5% parallelism.

关键词： distributed shared memory PARALLEL computers ALGORITHMS

来源：评论

学校读者我要写书评

暂无评论

Group Consistency Model Which separates the intra-group Consistency Maintenance from the inter-group Consistency Maintenace in Large Scale DSM Systems

引用

Operating Systems Review (ACM) 1997年第2期31卷 23-35页

作者： Li, Qun Ji, Hua Xie, Li Dept. of Computer Science Nanjing University Nanjing 210093 China

According to the characteristics of large scale network computing systems, we proposed a group consistency model based on the concept of group to construct a DSM system. The novel model can use different inter-group and intra-group consistencies and lend itself to flexible, easily-managable, and application-suitable DSM in large scale systems. A group consistency model, which applies entry consistency among groups and lazy release consistency in a group, together with its implementation policy is discussed in this paper. It employs write-update and multiple-writer protocols in a group, and thus facilitates the simultaneous read and write in a group. The suitable protocols eliminate the false sharing and reduce the data acquiring time in a group. Furthermore, the inter-group consistency also suits the features of data sharing among groups and transmits the data modifications originated from a group in bulk to reduce the network traffic. In the end, an example using group consistency model is given and the trivial group consistency is discussed.

关键词： distributed shared memory Entry consistency model memory coherence Release consistency model

来源：评论

学校读者我要写书评

暂无评论

A distributed shared-memory SYSTEM WITH SELF-ADJUSTING COHERENCE SCHEME

引用

PARALLEL COMPUTING 1994年第7期20卷 1007-1025页

作者： WANG, HH CHANG, RC NATL CHIAO TUNG UNIV INST COMP & INFORMAT SCIHSINCHUTAIWAN NATL CHIAO TUNG UNIV INST COMP SCI & INFORMAT ENGNHSINCHUTAIWAN ACAD SINICA INST INFORMAT SCINANKANGTAIWAN

The performance of distributed shared memory depends on the memory coherence algorithms and the access characteristics of shared data. In this paper, we propose an efficient coherence scheme using multiple coherence algorithms with self-adjusting feature. Our method can dynamically choose a more adaptive coherence algorithm for each variable class and the incorrect classification of shared variables will not affect the performance. We show that for each fixed classification, application programs suffer 5.1%, 4.6%, and 48.9% increases in the average execution time, when compared against the performance of a self-adjusting scheme. Experiments have shown our approach achieving good performance.

关键词： distributed shared memory memory COHERENCE MULTIPLE COHERENCE ALGORITHMS SELF-ADJUSTING COHERENCE SCHEME

来源：评论

学校读者我要写书评

暂无评论

MIRAGE+ - A KERNEL IMPLEMENTATION OF distributed shared-memory ON A NETWORK OF PERSONAL COMPUTERS

引用

SOFTWARE-PRACTICE & EXPERIENCE 1994年第10期24卷 887-909页

作者： FLEISCH, BD HYDE, RL JUUL, N UNIV COPENHAGEN DEPT COMP SCIDK-2100 COPENHAGENDENMARK

We describe the evolution of a distributed shared memory (DSM) system, Mirage, and the difficulties encountered when moving the system from a Unix-based kernel on the VAX to a Unix-based kernel on personal computers. Mirage provides a network transparent form of shared memory for a loosely coupled environment. The system hides network boundaries for processes that are accessing shared memory and is upward compatible with the Unix System V Interface Definition. This paper addresses the architectural dependencies in the design of the system and evaluates performance of the implementation. The new version, Mirage+, performs well compared to Mirage even though eight times the amount of data is sent on each page fault because of the larger page size used in the implementation. We show that performance of systems with a large page size to network packet size can be dramatically improved on conventional hardware by applying three well-known techniques: packet blasting, compression, and running at interrupt level. The measured time for a page fault in Mirage+ has been reduced 37 per cent by sending a page using packet blasting instead of using a handshake for each portion of the page. When compression was added to Mirage+, the time to fault a page across the network was further improved by 47 per cent when the page was compressed into one network packet. Our measured performance compares favorably with the amount of time it takes to fault a page from disk. Lastly, running at interrupt level may improve performance 16 per cent when faulting pages without compression.

关键词： distributed shared memory OPERATING SYSTEMS distributed SYSTEMS COMMUNICATION COMPRESSION PERFORMANCE

来源：评论

学校读者我要写书评

暂无评论

IMPLEMENTING OBJECT-BASED distributed shared memory ON TRANSPUTERS

IMPLEMENTING OBJECT-BASED DISTRIBUTED SHARED MEMORY ON TRANS...

引用

1994 World Transputer Congress (WTC 94) - Transputer Applications and Systems 94

作者： HEINZLE, HP BAL, HE LANGENDOEN, K FREE UNIV AMSTERDAM DEPT MATH & COMP SCI1007 MC AMSTERDAMNETHERLANDS

来源：评论

学校读者我要写书评

暂无评论

A HYBRID COHERENCE SCHEME FOR SOFTWARE distributed shared memory

引用

International Journal of High Speed Computing 1994年第4期6卷 519-536页

作者： HSIAO-HSI WANG RUEI-CHUAN CHANG Department of Computer Science and Information Engineering National Chiao Tung University Hsinchu Taiwan Republic of China Institute of Computer and Information Science National Chiao Tung University Hsinchu Taiwan Republic of China To whom all correspondence should be sent. He is also with the Institute of Information Science Academia Sinica Nankang Taipei Republic of China.

Software distributed shared memory (DSM) provides a convenient and effective solution for programming parallel applications on distributed systems. However, the performance of current implementations suffers from large overhead in enforcing memory coherence. Coherence faults are the sources of massive network traffic. Various memory consistency models have been proposed in order to eliminate the effects of network traffic and memory latency. In this paper, we present a novel approach that combines relaxed memory consistency models and a compiler strategy to solve memory coherence problems for DSM. This approach produces fewer coherence faults. Experimental results also show this hybrid approach is effective for reducing the memory coherence overhead of DSM.

关键词： distributed shared memory memory coherence false sharing hybrid coherence scheme

来源：评论

学校读者我要写书评

暂无评论

Message-based efficient remote memory access on a highly parallel computer EM-X

引用

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS 1996年第8期E79D卷 1065-1071页

作者： Kodama, Y Sakane, H Sato, M Yamana, H Sakai, S Yamaguchi, Y Electrotechnical Laboratory Tsukuba-shi 305 Japan Real World Computing Partnership Tsukuba Research Center Tsukuba-shi 305 Japan

Communication latency is central to multiprocessor design. This study presents the design principles of the EM-X distributed-memory multiprocessor towards tolerating communication latency. The EM-X overlaps computation with communication for latency tolerance by multithreading. In particular, we present two types of hardware support for remote memory access: (1) priority-based packet scheduling for thread invocation, and (2) direct remote memory access. The priority-based scheduling policy extends a FIFO ordered thread invocation policy to adopt to different computational needs. The direct remote memory access is designed to overlap remote memory operations with thread execution. The 80-processor prototype of EM-X is developed and is operational since December 1995. We execute several programs on the machine and evaluate how the EM-X effectively overlaps computation with communication toward tolerating communication latency for high performance parallel computing.

关键词： fine grain communication multithread architecture distributed shared memory

来源：评论

学校读者我要写书评

暂无评论

NONH:A New Cache-Based Coherence Protocol for Linked List Structure DSM System and Its Performance Evaluation

引用

Journal of Computer Science & Technology 1996年第4期11卷 405-415页

作者：房至一鞠九滨 DepartmentofComputerScience JilinUniversityChangchun130023 DepartmentofComputerScience JilinUniversi

The management of memory coherence is an important problem in distributed shared memory (DSM) system. In a cache-based coherence DSM system using linked list structure, the key to maintaining the coherence and improving system performance is how to manage the owner in the linked list. This paper presents the design of a new management protocol-NONH (New-OwnerNew-Head) and its performance evaluation. The analysis results show that thisprotocol can improve the scalability and performence of a coherent DSM system using linked list. It is also suitable for managing the cache coherency in tree-like hierarchical architecture.

关键词： Linked list cache coherence distributed shared memory

来源：评论

学校读者我要写书评

暂无评论

Limitations of fast consistency conditions for distributed shared memories

引用

INFORMATION PROCESSING LETTERS 1996年第5期57卷 243-248页

作者： Attiya, H Friedman, R TECHNION ISRAEL INST TECHNOL DEPT COMP SCIIL-32000 HAIFAISRAEL CORNELL UNIV DEPT COMP SCIITHACANY 14853

A consistency condition for distributed shared memory is fast if it has a fast implementation in which the execution time of every operation is significantly faster than the network delay. These conditions include Pipelined RAM, weak consistency, causal memory, and one interpretation of processor consistency. It is shown that if a condition is fast then it does not support non-centralized solutions for mutual exclusion.

关键词： distributed computing distributed shared memory consistency conditions

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：