Search Results - Inner Mongolia University Library

32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Authors: Farres, Albert; Rosas, Claudia; Hanzich, Mauricio; Duran, Alejandro; Yount, Charles

Affiliations: Univ Politecn Cataluna, Barcelona, Spain; Barcelona Supercomp Ctr, Barcelona, Spain; Intel Corp Iberia, Madrid, Spain; Intel Corp, Santa Clara, CA, USA

ISBN: (print) 9781538655559

Improving the performance of stencil computations is a long-standing optimization challenge due to their inherent heavy memory-access patterns. This problem has been explored in many wave-propagation simulation engines. Moving towards implementations with elastic waves instead of acoustic ones (e.g., used in medical imaging) results in computationally more expensive processes along with increased memory usage. Despite the computational demand, the elevated cost of exploration combined with the need for higher success rates is driving the oil & gas industry to adopt elastic anisotropic wave-propagation models as the core of many geophysical imaging mechanisms to extract subsurface features more accurately, increasing return on investment. To reduce time-to-solution, the more complex stencil codes must run efficiently on modern CPU architectures. The Intel Xeon Phi processors emerge as an energy-efficient solution that provides a good trade-off between market price and computing capability. In this paper, we study the effect of several optimization techniques using the YASK stencil-generation framework to implement and evaluate a 25-point stencil of an elastic-wave propagation engine for Intel Xeon Phi processors. The results showed improvements of up to 7x in computations and 8x in memory bandwidth with respect to the non-tuned version, reaching up to 75% of the attainable floating-point performance at the given operational intensity. We collected performance metrics for a set of the most representative optimizations and revealed the relation between each strategy and fundamental characteristics of both code and hardware.

Keywords: Stencil-based wave propagation; Performance optimizations; Intel Xeon Phi; Fully Staggered Grid

A new method to automatically compute processing times for random walks based distributed algorithms

2nd International Symposium on Parallel and Distributed Computing (ISPDC 2003)

Authors: Bernard, T; Bui, A; Bui, M; Sohier, D

Affiliations: URCA, Dept Math & Informat, F-51687 Reims, France

ISBN: (print) 0769520693

Random walks constitute an attractive technique in distributed computing. In this paper, we present an original method that uses the relationship between electrical resistance and random walks to automatically compute quantities such as cover time and, more generally, any processing-time measure defined through hitting times. This method comes from electrical theory and uses Millman's theorem.
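As an illustration of the link between electrical resistance and random walks mentioned in this abstract, the sketch below uses the standard commute-time identity: on a connected graph with m edges, each treated as a unit resistor, the expected number of steps to walk from u to v and back equals 2m times the effective resistance between u and v. It does not reproduce the paper's Millman-based method. The function names `effective_resistance` and `commute_time` and the Gauss-Jordan solver are assumptions made for this example, and the graph is assumed to be connected with u different from v.

```python
from fractions import Fraction

def effective_resistance(n, edges, u, v):
    """Effective resistance between u and v; each edge is a 1-ohm resistor."""
    # Build the graph Laplacian L = D - A.
    lap = [[Fraction(0)] * n for _ in range(n)]
    for a, b in edges:
        lap[a][a] += 1
        lap[b][b] += 1
        lap[a][b] -= 1
        lap[b][a] -= 1
    # Ground node v (potential 0) and inject one unit of current at u.
    keep = [i for i in range(n) if i != v]
    mat = [[lap[i][j] for j in keep] + [Fraction(1 if i == u else 0)]
           for i in keep]
    # Gauss-Jordan elimination on the augmented matrix (exact arithmetic).
    size = len(keep)
    for col in range(size):
        pivot = next(r for r in range(col, size) if mat[r][col] != 0)
        mat[col], mat[pivot] = mat[pivot], mat[col]
        mat[col] = [x / mat[col][col] for x in mat[col]]
        for r in range(size):
            if r != col and mat[r][col] != 0:
                factor = mat[r][col]
                mat[r] = [x - factor * y for x, y in zip(mat[r], mat[col])]
    # With unit current, the potential at u is the effective resistance.
    return mat[keep.index(u)][-1]

def commute_time(n, edges, u, v):
    """Expected steps for a simple random walk to go u -> v -> u."""
    return 2 * len(edges) * effective_resistance(n, edges, u, v)
```

For a path of three nodes, `commute_time(3, [(0, 1), (1, 2)], 0, 2)` gives 8, which matches the hitting time of 4 in each direction obtained by solving the hitting-time equations by hand.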

Keywords: Distributed computer systems

CONCEPTS AND METHODS FOR THE OPTIMIZATION OF DISTRIBUTED DATA-PROCESSING

2nd INTERNATIONAL SYMP ON DATABASES IN PARALLEL AND DISTRIBUTED SYSTEMS (DPDS 90)

Authors: JABLONSKI, S; RUF, T; WEDEKIND, H

Affiliations: Department of Computer Science VI (Data Base Systems), University of Erlangen-Nuernberg, Martensstrasse 3, D-8520 Erlangen

ISBN: (print) 0818620528

In this paper we introduce and discuss a model of distributed data processing. For this purpose, a typical application system is analyzed and divided into sub-applications. To fulfill the task of the global application, the sub-applications have to communicate in an appropriate manner by exchanging data or information. In our model the communication between sub-applications is split into two steps: the offering of information by sending sub-applications, and its acceptance by receiving sub-applications. For both communication steps, synchronous and asynchronous processing modes are defined. By supporting these different communication modes, the cooperation between sub-applications can be tailored closely to the specific demands of the application system. This optimizes distributed data processing. Finally, we demonstrate the prototype implementation of a distributed data management system, which is based on the flexible communication mechanism described in the paper.

Keywords: Data processing

USING JOIN OPERATIONS AS REDUCERS IN DISTRIBUTED QUERY-PROCESSING

2nd INTERNATIONAL SYMP ON DATABASES IN PARALLEL AND DISTRIBUTED SYSTEMS (DPDS 90)

Authors: CHEN, MS; YU, PS

Affiliations: IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, New York

ISBN: (print) 0818620528

Semijoin has traditionally been relied upon for reducing the communication cost required for distributed query processing. However, judiciously applying join operations as reducers can lead to further reduction in the communication cost. In view of this fact, we explore in this paper the approach of using join operations, in addition to semijoins, as reducers in distributed query processing. We first show that the problem of determining a sequence of join operations for a query graph can be transformed to that of finding a set of cuts to that graph, where a cut to a graph is a partition of the nodes in that graph. In light of this mapping, we develop an efficient heuristic algorithm to determine an effective sequence of join reducers for a query. The algorithm, which uses the concept of divide-and-conquer, is shown to have polynomial time complexity. Examples are also given to illustrate our results.
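To make the notion of a reducer concrete, the sketch below shows a plain semijoin between two sites: only the join-column values of one relation are shipped, and the other relation is filtered down to its matching tuples before any further transfer. The relations, the column name `cust_id` and the `semijoin` helper are hypothetical; the paper's join-reducer heuristic and its cost model are not reproduced here.

```python
def semijoin(relation, column, shipped_values):
    """Keep only the tuples of `relation` whose `column` value was shipped."""
    return [row for row in relation if row[column] in shipped_values]

# Hypothetical relations stored at two different sites.
orders = [  # site A
    {"order_id": 1, "cust_id": 10},
    {"order_id": 2, "cust_id": 20},
    {"order_id": 3, "cust_id": 10},
]
customers = [  # site B
    {"cust_id": 10, "name": "Ada"},
    {"cust_id": 30, "name": "Bob"},
    {"cust_id": 40, "name": "Cy"},
]

# Step 1: site A ships only the distinct join-column values to site B.
shipped = {row["cust_id"] for row in orders}

# Step 2: site B reduces its relation before sending it anywhere.
reduced_customers = semijoin(customers, "cust_id", shipped)

# Step 3: the final join runs on the reduced relation.
joined = [
    {**o, **c}
    for o in orders
    for c in reduced_customers
    if o["cust_id"] == c["cust_id"]
]
```

In this toy case site B transfers one customer tuple instead of three. Using a full join as the reducer, as the abstract proposes, would ship the joined result instead of the filtered relation when that is estimated to be cheaper.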

Keywords: Database systems

2nd Workshop on Advances in Parallel and Distributed Computational Models

15th International Parallel and Distributed Processing Symposium, IPDPS 2001

Authors: Ibarra, O.H.; Nakano, K.; Olariu, S.

Affiliations: Department of Computer Science, University of California, Santa Barbara, CA 93106, United States; Dept. of Electrical and Computer Engineering, Nagoya Institute of Technology, Showa-ku, Nagoya 466, Japan; Department of Computer Science, Old Dominion University, Norfolk, VA 23529, United States

ISBN: (print) 0769509908

The main goal of this workshop is to provide a timely forum for the exchange and dissemination of new ideas, techniques and research in the field of the new parallel and distributed computational models. The workshop is meant to bring together researchers and practitioners interested in all aspects of parallel and distributed computing taken in an inclusive, rather than exclusive, sense. We are convinced that the workshop atmosphere will be conducive to open and mutually beneficial exchanges of ideas between the participants. © 2001 IEEE.

Keywords: Computation theory

Comparison of task response times in parallel systems

Proceedings of the 2nd IEEE Workshop on Future Trends of Distributed Computing Systems - Future Trends 90

Authors: Nelson, Randolph; Tantawi, Asser N.

Affiliations: IBM T J Watson Res Center, Yorktown Heights, NY, USA

ISBN: (print) 0818620889

Generic queuing models of parallel systems with K ≥ 2 exponential servers, where jobs may be split into K independent tasks, are considered. The queuing of jobs is distributed if each server has its own queue and centralized if there is a common queue. The scheduling of jobs is "no splitting" if all tasks of a job must run on one processor and "splitting" if they can run concurrently on different processors. Exact and approximate expressions for the mean response time, T_{r:K}, of the rth departing task in a job (r = 1, 2, ..., K) are obtained and compared for four models: distributed/splitting, distributed/no splitting, centralized/splitting, and centralized/no splitting. The queuing models are described. Exact and approximate analyses of the various models are presented where expressions are obtained for the mean task response time. The various models are compared and applications in the areas of distributed query processing and parallel systems are included.
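For orientation only, the sketch below computes the mean time of the rth departure among K independent exponential tasks that all start at once, with no queueing at all. It relies on the standard order-statistics result for exponentials: after j tasks have finished, the next departure takes 1/((K - j)·mu) on average. This is a textbook baseline, not one of the four queueing models analysed in the paper; the service rate `mu` and the function name `mean_rth_departure` are assumptions.

```python
from fractions import Fraction

def mean_rth_departure(r, k, mu=1):
    """Mean of the r-th order statistic of k i.i.d. Exp(mu) task times."""
    if not 1 <= r <= k:
        raise ValueError("r must satisfy 1 <= r <= k")
    # After j tasks have finished, k - j are still running, so the next
    # departure takes 1 / ((k - j) * mu) on average (memoryless property).
    return sum(Fraction(1, k - j) for j in range(r)) / Fraction(mu)
```

With K = 2 and mu = 1, the first task departs after 1/2 on average and the second after 3/2, so even without queueing the last task of a split job dominates the job response time.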

Keywords: Computer Systems, Digital

Commercial virtual reality system based on parallel and distributed approach

Proceedings of the 1996 2nd International Symposium on Parallel Architectures, Algorithms, and Networks, I-SPAN

Authors: Hamilton, Paul; Chen, Shiping; Hintz, Tom

Affiliations: Sydney

Virtual Reality (VR) is an exciting yet challenging area. In commercial VR systems especially, one of the main challenges is how to maintain relatively constant performance under varying load and at low cost. This paper presents a parallel and distributed solution to this problem in the context of a commercial entertainment VR system. The architecture of the system is introduced, and the distribution strategies and the parallel-processing mechanism are discussed.

Keywords: Virtual reality

Proceedings of the 1996 2nd International Symposium on Parallel Architectures, Algorithms, and Networks, I-SPAN

The proceedings contain 92 papers from the 1996 International Symposium on Parallel Architectures, Algorithms and Networks. Topics discussed include: massively parallel processors; distributed memory parallel computers; multistage interconnection networks; Banyan switching fabrics; internetworking; transmission control protocol/Internet protocol networks; train traffic and event driven simulations; universal broadband network access devices; customer premises networks; and parallel random access machines.

Keywords: Parallel processing systems

Integrating fault-tolerant and real-time requirements of distributed systems

Authors: Nett, Edgar; Schumann, Ralf

Affiliations: German Nat Res Center for Comput Sci, Postfach, West Germany

ISBN: (print) 0818620889

The authors propose a distributed dynamic action scheme that allows a recovery concept permitting efficient distributed computing during normal operation to be combined with efficient exception handling in the case of an effective error. They provide a dynamic action model tailored to the dynamic nature of distributed processing. This model offers a recovery concept which allows the recovery region of a recovery line to surmount the size of the corresponding computation in order to gain high efficiency during normal operation. By running the versions of a recovery block as distributed actions, it becomes possible to incorporate a recovery concept that allows efficient distributed processing during normal operation and prompt reaction in the case of error by running the different versions in parallel. To implement the dynamic action model efficiently a redundant recovery graph keeps track of recovery regions. On the basis of this graph the authors provide decentralized protocols that produce a consistent system state that is fast, efficient, and concurrent with normal system activity.

Keywords: Computer Systems, Digital

A 100-MEGA-ACCESS PER SECOND MATCHING MEMORY FOR A DATA-DRIVEN MICROPROCESSOR

IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1990, Vol. 25, No. 1, pp. 95-99

Authors: TAKATA, H; KOMORI, S; TAMURA, T; ASAI, F; SATOH, H; OHNO, T; TOKUDA, T; NISHIKAWA, H; TERADA, H

Affiliations: OSAKA UNIV, DEPT SYST ENGN, SUITA, OSAKA 565, JAPAN

A high-throughput matching memory (MM) for a data-driven microprocessor is discussed. An MM can be constructed using a hashing memory. However, one of the biggest problems with hashing memory is the necessity for selective processing whenever hashed address conflicts occur. To eliminate this problem, the MM incorporates a small amount of associative memory (32 words × 50 b) as well as the hashing memory (512 words × 42 b). The matching operation is subdivided into three pipeline stages, all controlled by the elastic pipeline scheme. With this structure, an MM with a high throughput of 100 mega-accesses/s can be realized.
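A software analogy may help picture the structure: a direct-indexed hash table whose address conflicts spill into a small associative store, so a lookup never has to rehash or walk a chain. The sketch below borrows the word counts from the abstract (512 hashed words, 32 associative words) and assumes a dataflow-style match in which the first operand waits and the second one fires. The class name `MatchingMemory`, the use of Python's `hash`, and the firing rule are assumptions for illustration, not the chip's design, and the three-stage elastic pipeline is not modelled.

```python
class MatchingMemory:
    """Hash table with a small associative overflow store (software analogy)."""

    def __init__(self, hash_words=512, assoc_words=32):
        self.slots = [None] * hash_words  # one (tag, operand) pair per hashed address
        self.assoc = {}                   # overflow entries, searched by tag
        self.assoc_words = assoc_words

    def match(self, tag, operand):
        """Return (waiting_operand, operand) if a partner waits, else store and return None."""
        index = hash(tag) % len(self.slots)
        entry = self.slots[index]
        if entry is not None and entry[0] == tag:
            self.slots[index] = None              # partner found in the hashing memory
            return entry[1], operand
        if tag in self.assoc:
            return self.assoc.pop(tag), operand   # partner found in the associative memory
        if entry is None:
            self.slots[index] = (tag, operand)    # first operand waits at its hashed address
        elif len(self.assoc) < self.assoc_words:
            self.assoc[tag] = operand             # address conflict: spill to associative memory
        else:
            raise OverflowError("associative memory full")
        return None
```

With integer tags, 1 and 513 map to the same hashed address, so the second one is held in the associative store and both can still be matched on their next access without any conflict-resolution pass.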

Keywords: Microprocessors; Pipelines; Latches; Associative memory; Throughput; Parallel processing; Circuits; Large scale integration; Research and development; Laboratories
