Search Results - Inner Mongolia University Library

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1993, vol. 4, no. 5, pp. 507-519

Authors: BURKHART, H; EIGENMANN, R; KINDLIMANN, H; MOSER, M; SCHOLIAN, H. ERGOSOFT AG, CH-8595 ALTNAU, SWITZERLAND; IBM CORP, ZURICH RES LAB, CH-8803 ZURICH, SWITZERLAND; SWISS FED INST TECHNOL, INST ELEKTR, CH-8092 ZURICH, SWITZERLAND; UNIV ILLINOIS, CTR SUPERCOMPUTING RES & DEV, URBANA, IL 61801, USA

Microprocessor-based multiprocessors offer true parallelism at moderate hardware cost. Although such hardware building blocks are now available at many sites, the basic problem is still how to program such systems. We report on an integrated programming environment for the M3 multiprocessor, which has been built at ETH Zurich. Our tools support the software development cycle of a parallel program, that is, the programming, configuration, and debugging/performance measurement phases. Programmer support for performance analysis has been a primary motivation for the system. We identify the sources of performance loss and describe how this information is gathered and analyzed. As a case study, we use a fast maze router algorithm and follow the usage of the different tools. Finally, we compare the M3 environment with other state-of-the-art projects.

Keywords: MAN-MACHINE DIALOG; MODULA-2; MULTIPROCESSOR; PARALLEL ALGORITHM; PERFORMANCE ANALYSIS; PERFORMANCE MONITORING; PROGRAMMING ENVIRONMENT

Optimal parallel preprocessing algorithms for testing weak visibility of polygons from segments

Parallel Algorithms and Applications, 1993, vol. 1, no. 2, pp. 83-98

Authors: Hsu, F.R.; Chang, R.C.; Lee, R.C.T. Institute of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu 300, Taiwan; Department of Computer Science, National Tsing Hua University, Hsinchu 300, Taiwan

For an n-gon P, we say P is weakly visible from segment s if any point on P is visible from at least one point of the segment. In this paper, we present an optimal preprocessing algorithm which runs in O(log n) time using O(n) processors under the concurrent read exclusive write parallel random access machine model such that, after preprocessing, it takes O(log n) time to test if P is weakly visible from a given segment using a single processor. © 1993 Gordon and Breach Science Publishers S.A. All rights reserved. © 1993, Taylor & Francis Group, LLC. All rights reserved.

Keywords: Computational geometry; CREW; Parallel algorithm; Polygon; Visibility

A CONSTANT TIME ALGORITHM FOR REDUNDANCY ELIMINATION IN TASK GRAPHS ON PROCESSOR ARRAYS WITH RECONFIGURABLE BUS SYSTEMS

Parallel Processing Letters, 1993, vol. 3, no. 2, pp. 171-177

Authors: B. PRADEEP; C. SIVA RAM MURTHY. Department of Computer Science and Engineering, Indian Institute of Technology, Madras 600 036, India

The task or precedence graph formalism is a practical tool to study algorithm parallelization. Redundancy in such task graphs gives rise to numerous avoidable inter-task dependencies, which invariably complicate the process of parallelization. In this paper we present an O(1) time algorithm for the elimination of redundancy in such graphs on Processor Arrays with Reconfigurable Bus Systems using O(n^4) processors. The previous parallel algorithm available in the literature for redundancy elimination in task graphs takes O(n^2) time using O(n) processors.

Keywords: Precedence graph; transitive closure; redundancy elimination; reconfigurable bus system; parallel algorithm
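
To make "redundancy elimination" in the record above concrete, here is a minimal sequential sketch in Python. It is not the paper's O(1) reconfigurable-bus algorithm. It assumes the task graph is a DAG stored as a dictionary of successor sets, and it treats an edge (u, v) as redundant when v is still reachable from u through another successor. The names `reachable_sets` and `remove_redundant_edges` are hypothetical, chosen for illustration.

```python
from typing import Dict, Hashable, Set

Graph = Dict[Hashable, Set[Hashable]]


def reachable_sets(graph: Graph) -> Dict[Hashable, Set[Hashable]]:
    """For every task, collect all tasks reachable through one or more edges.

    Assumes the graph is acyclic (a precedence graph), so the recursion ends.
    """
    memo: Dict[Hashable, Set[Hashable]] = {}

    def visit(node: Hashable) -> Set[Hashable]:
        if node in memo:
            return memo[node]
        result: Set[Hashable] = set()
        for succ in graph.get(node, ()):
            result.add(succ)
            result |= visit(succ)
        memo[node] = result
        return result

    for node in graph:
        visit(node)
    return memo


def remove_redundant_edges(graph: Graph) -> Graph:
    """Drop every edge (u, v) for which v is reachable via another successor of u."""
    reach = reachable_sets(graph)
    reduced: Graph = {}
    for node, succs in graph.items():
        reduced[node] = {
            v for v in succs
            if not any(v in reach[w] for w in succs if w != v)
        }
    return reduced


if __name__ == "__main__":
    # The edge a -> c is implied by a -> b -> c, so it is redundant.
    tasks = {"a": {"b", "c"}, "b": {"c"}, "c": set()}
    print(remove_redundant_edges(tasks))
```

This is the classical transitive reduction of a DAG, computed here from the transitive closure, which matches the "transitive closure" keyword of the record.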

NOVEL PIPELINING AND PROCESSOR ALLOCATION STRATEGY FOR MONOID COMPUTATIONS ON UNSHUFFLE-EXCHANGE NETWORKS

Parallel Processing Letters, 1993, vol. 3, no. 2, pp. 189-193

Authors: KUO-LIANG CHUNG; HSUN-WEN CHANG. Department of Information Management, National Taiwan Institute of Technology, Taipei, Taiwan 10672, R.O.C.; Department of Computer Science and Information Engineering, National Chiao Tung University, Hsinchu, Taiwan 30050, R.O.C.

This short paper presents a novel pipelining and processor allocation strategy for monoid computations on an unshuffle-exchange network. In the strategy, the processor utilization is near 1 and the communication is collision-free. With the characteristics of constant connections to each processor and only a single output node on the network, the method given here can compete with the method of Barnard and Skillicorn based on a hypercube network with multiple output nodes.

Keywords: Associative operations; divide-and-conquer; monoid computations; parallel algorithm; pipelined architecture; processor utilization; unshuffle-exchange network
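
As a rough illustration of what a "monoid computation" is, the sketch below combines a sequence with an associative operation level by level, the way a combining tree would. It is sequential Python and does not model the unshuffle-exchange network, the pipelining, or the processor allocation from the record above. The function name `tree_reduce` is hypothetical.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def tree_reduce(values: Sequence[T], op: Callable[[T, T], T], identity: T) -> T:
    """Combine values with an associative operation in about log2(n) levels.

    Each level pairs up neighbours; on parallel hardware all pairs within a
    level could be combined at once. Left-to-right order is preserved, so
    the operation needs to be associative but not commutative.
    """
    level = list(values)
    if not level:
        return identity
    while len(level) > 1:
        nxt = [op(level[i], level[i + 1]) for i in range(0, len(level) - 1, 2)]
        if len(level) % 2 == 1:
            nxt.append(level[-1])  # the odd element is carried to the next level
        level = nxt
    return level[0]


if __name__ == "__main__":
    print(tree_reduce([1, 2, 3, 4, 5], lambda a, b: a + b, 0))
    print(tree_reduce(list("abcde"), lambda a, b: a + b, ""))
```

String concatenation is used in the second call because it is associative but not commutative, which shows that the pairing keeps the original order.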

PARALLEL ALGORITHM DESIGN FOR WORKSTATION CLUSTERS

SOFTWARE-PRACTICE & EXPERIENCE, 1991, vol. 21, no. 3, pp. 235-250

Authors: MAGEE, JN; CHEUNG, SC. Department of Computing, Imperial College, University of London, 180 Queen's Gate, London SW7 2BZ, U.K.

Clusters of workstations connected by local area networks are in common use in many organizations. The combined processing power of these clusters is rarely exploited owing to the lack of suitable parallel algorithms. The paper describes a parallel programming paradigm called supervisor-worker, suitable for the workstation environment, which can be used to speed up the execution of a large class of existing sequential programs. Simple formulae are developed to predict the speed-up of a parallel algorithm developed in this way. The predictions depend on two easily-determined parameters of the sequential program and the characteristic communication cost of the workstation cluster. Consequently, it is possible to estimate the benefits of the parallel program before proceeding with detailed implementation. As an example, the parallel version of a travelling salesman program is developed and the measured speed-up compared with the predicted speed-up.

Keywords: PARALLEL ALGORITHM; WORKSTATION; TRAVELING SALESMAN; SPEED-UP; DISTRIBUTED SYSTEM
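
The supervisor-worker paradigm named in the record above can be sketched, under loud assumptions, as a shared task queue drained by a pool of workers. The Python sketch below uses threads on one machine purely for illustration. The paper targets workstations on a local area network, and its speed-up formulae are not reproduced here because the abstract does not state them. The function name `supervisor_worker` and its parameters are hypothetical.

```python
import queue
import threading
from typing import Callable, List, Sequence, TypeVar

A = TypeVar("A")
R = TypeVar("R")


def supervisor_worker(tasks: Sequence[A], work: Callable[[A], R],
                      num_workers: int = 4) -> List[R]:
    """Supervisor fills a queue; workers pull tasks until the queue is empty."""
    task_queue: "queue.Queue" = queue.Queue()
    results: List = [None] * len(tasks)
    for index, task in enumerate(tasks):
        task_queue.put((index, task))

    def worker() -> None:
        while True:
            try:
                index, task = task_queue.get_nowait()
            except queue.Empty:
                return  # no work left, so the worker terminates
            results[index] = work(task)  # each slot is written by one worker only

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()  # the supervisor waits for all workers before collecting
    return results


if __name__ == "__main__":
    print(supervisor_worker([1, 2, 3, 4, 5], lambda x: x * x, num_workers=2))
```

Because workers pull tasks on demand, faster workers simply take more tasks, which is one reason this style suits machines of uneven speed. In CPython, threads will not speed up CPU-bound work; processes or separate machines would be needed for that.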

BLOCK SYSTOLIC COMPUTATIONS IN DIGITAL FILTERS

International Conference on Computer, Communication, Control and Power Engineering

Authors: C.J. Hwang. Department of CES, Yuan-Ze Institute of Technology

Many efficient systolic algorithms in block computation of digital filters have been *** highly concurrent structures can be implemented from these block systolic computation *** the performances of these highly concurrent structures would be described with the total computation times and the total number of processors needed for each algorithm. A comparison with the sequential algorithms, using tables of speedup rate and efficiency for each block algorithm, is summarized.

Keywords: Systolic Array Architecture; Parallel Algorithm; Block Implementation

AN O(n) PARALLEL ALGORITHM FOR SOLVING THE TRAFFIC CONTROL PROBLEM ON CROSSBAR SWITCH NETWORKS

Parallel Processing Letters, 1991, vol. 1, no. 1, pp. 51-58

Authors: K. T. SUN; H. C. FU. Department of Computer Science and Information Engineering, National Chiao-Tung University, Hsin Chu, Taiwan 300, R.O.C.

In this paper, we propose a parallel algorithm for the traffic control problem (an NP-complete problem) on crossbar switch networks. This problem is to find a set of conflict-free paths such that the maximum number of message packets can be transmitted over the network. The problem can be represented by an energy function. Then by applying our parallel algorithm, the state of the energy function is iteratively updated toward a stable state. When the energy function reaches a stable state, the state represents a solution of the problem. The empirical results show that the throughputs of the proposed algorithm are much better than those of the linear algorithm. We have shown that the time complexity of the parallel algorithm is O(n) using n^2 processors. Furthermore, since the traffic control problem can be reduced to the traveling salesman problem, the proposed algorithm can be further applied to some other NP-complete problems.

Keywords: NP-complete problem; energy function; parallel algorithm

PARALLEL ALGORITHMS FOR TERMINAL-PAIR RELIABILITY

IEEE TRANSACTIONS ON RELIABILITY, 1992, vol. 41, no. 2, pp. 201-209

Authors: DEO, N; MEDIDI, M. University of Central Florida, Orlando, USA

This paper reports: 1) parallelization of the two best known sequential algorithms (Dotson & Gobein, and Page & Perry PP-F2TDN) for computing the terminal-pair reliability in a network; 2) Reduce&Partition (R&P), a new sequential algorithm which combines the most efficient features of these two algorithms. On published benchmark networks, R&P runs almost twice as fast as the previously known fastest algorithm. A parallel version of R&P is also presented. The execution times of all three parallel algorithms with various numbers of processors for different networks on the BBN Butterfly parallel computer are provided. R&P is both fast and parallelizable. The recursive algorithms require memory O(#vertices^2 · #edges), as the recursion depth is limited to (#edges) and at each recursive node an O(#vertices^2) memory is used to represent the network. Thus, the memory requirement of R&P is approximately the same as that of PP-F2TDN and much less than that of the non-recursive Dotson & Gobein algorithm. All 3 algorithms compute exact numerical reliability, but they can easily be modified to produce symbolic reliability expressions. The parallel algorithms were implemented on a shared-memory parallel computer. The R&P approach should be explored to solve other network reliability problems, such as K-terminal reliability. In R&P, the greedy approach was used in selecting shortest paths in order to locally minimize the number of sub-problems. This selection did not consider the effect of reductions on the subproblems to be generated.

Keywords: TERMINAL-PAIR RELIABILITY; DIRECTED NETWORK; NETWORK REDUCTION; FACTORING; PARTITIONING; PARALLEL ALGORITHM; SPEED-UP; GRAPH; DIGRAPH; PARALLEL PROCESSING
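
For readers unfamiliar with the quantity computed in the record above, terminal-pair reliability is the probability that at least one working path connects a source to a target when links fail independently. The Python sketch below computes it by brute-force enumeration of all link states. It is none of the three algorithms in the abstract (Dotson & Gobein, PP-F2TDN, R&P), and it takes exponential time, so it is only usable on tiny networks. The edge format `(u, v, p)` and the function names are assumptions made for illustration.

```python
from itertools import product
from typing import Dict, Hashable, List, Sequence, Tuple

Edge = Tuple[Hashable, Hashable, float]  # (tail, head, probability the link works)


def _connected(adjacency: Dict[Hashable, List[Hashable]],
               source: Hashable, target: Hashable) -> bool:
    """Depth-first search over the links that are up in the current state."""
    seen = {source}
    stack = [source]
    while stack:
        node = stack.pop()
        if node == target:
            return True
        for nxt in adjacency.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return False


def terminal_pair_reliability(edges: Sequence[Edge],
                              source: Hashable, target: Hashable) -> float:
    """Sum the probabilities of all link states in which target is reachable."""
    total = 0.0
    for states in product((True, False), repeat=len(edges)):
        prob = 1.0
        adjacency: Dict[Hashable, List[Hashable]] = {}
        for (u, v, p), up in zip(edges, states):
            prob *= p if up else 1.0 - p
            if up:
                adjacency.setdefault(u, []).append(v)  # directed link u -> v
        if prob > 0.0 and _connected(adjacency, source, target):
            total += prob
    return total


if __name__ == "__main__":
    series = [("s", "a", 0.9), ("a", "t", 0.9)]
    parallel = [("s", "t", 0.9), ("s", "t", 0.9)]
    print(terminal_pair_reliability(series, "s", "t"))
    print(terminal_pair_reliability(parallel, "s", "t"))
```

Two links in series need both to work, while two in parallel need only one, which gives a quick sanity check on small cases.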

OPTIMAL RANDOMIZED PARALLEL ALGORITHMS FOR COMPUTATIONAL GEOMETRY

ALGORITHMICA, 1992, vol. 7, no. 1, pp. 91-117

Authors: REIF, JH; SEN, S. Computer Science Department, Duke University, Durham, USA

We present parallel algorithms for some fundamental problems in computational geometry which have a running time of O(log n) using n processors, with very high probability (approaching 1 as n -> infinity). These include planar-point location, triangulation, and trapezoidal decomposition. We also present optimal algorithms for three-dimensional maxima and two-set dominance counting by an application of integer sorting. Most of these algorithms run on a CREW PRAM model and have an optimal processor-time product, which improves on the previously best-known algorithms of Atallah and Goodrich [5] for these problems. The crux of these algorithms is a useful data structure which emulates the plane-sweeping paradigm used for sequential algorithms. We extend some of the techniques used by Reischuk [26] and Reif and Valiant [25] for the flashsort algorithm to perform divide and conquer in a plane very efficiently, leading to the improved performance of our approach.

Keywords: RANDOMIZED PARALLEL ALGORITHM; COMPUTATIONAL GEOMETRY; POINT LOCATION; TRIANGULATION; TRAPEZOIDAL DECOMPOSITION

A PARALLEL COST-OPTIMAL ALGORITHM TO COMPUTE THE SUPREMUM OF MAX-MIN POWERS

PARALLEL COMPUTING, 1992, vol. 18, no. 5, pp. 551-556

Authors: SURAWEERA, F; BHATTACHARYA, P. UNIV NEBRASKA, DEPT COMP SCI & ENGN, LINCOLN, NE 68588

A parallel cost-optimal algorithm to compute the supremum of max-min powers of any map (graph) is obtained using the EREW SM SIMD computer as the model of computation. The run-time of the algorithm is O(n) using n pro...

Keywords: MAX-MIN PRODUCT; PARALLEL ALGORITHM; SPANNING TREE; DEPTH-FIRST SEARCH
