Search results - Inner Mongolia University Library

Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN, 1999, pp. 22-27

Authors: Dai, H.K. (Oklahoma State Univ, Stillwater, United States)

Concentrators and generalized-concentrators are interconnection networks that provide respectively pairwise vertex-disjoint directed paths and trees to satisfy interconnection requests. An interconnection network is non-blocking in the strict sense if every compatible interconnection request can be satisfied by a path regardless of any existing interconnections. We present an interconnection property equivalent to the generalized-concentration with constrained network capacity and request multiplicity in the strictly non-blocking context, and show a polynomial-time computational complexity result for deciding the strictly non-blocking generalized-concentration properties with constrained network parameters, by using b-matching techniques.

Keywords: parallel algorithms

MPI backend for an automatic parallelizing compiler

Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN, 1999, pp. 152-157

Authors: Kwon, Daesuk; Han, Sangyong; Kim, Heunghwan (Seoul Natl Univ, Seoul, Republic of Korea)

Many naive parallel processing schemes were not as successful as many researchers expected, because of the heavy cost of communication and synchronization resulting from parallelization. In this paper, we identify the reasons for the poor performance and the compiler requirements for performance improvement. We found that parallelization decisions should be derived from overhead information. We added this idea to the automatic parallelizing compiler SUIF. We replaced the original backend of SUIF with our backend using MPI, and gave it the capability of validating parallelization decisions based on overhead parameters. This backend converts a shared-memory parallel program into a distributed-memory parallel program with MPI function calls, without the excessive parallelization that causes performance degradation.

Keywords: parallel processing systems
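
The idea of validating a parallelization decision against overhead parameters can be illustrated with a toy cost model. This is a minimal sketch only: the `Overhead` fields, the linear cost formula, and the function names are assumptions made for illustration, not the model used by the SUIF backend described in the record above.

```python
from dataclasses import dataclass


@dataclass
class Overhead:
    """Hypothetical overhead parameters (not taken from the paper)."""
    startup: float   # fixed cost of starting and synchronizing workers, in seconds
    per_byte: float  # communication cost per byte moved, in seconds per byte


def parallel_time(seq_time, procs, bytes_moved, ov):
    """Estimated parallel run time: ideal speedup plus overhead."""
    return seq_time / procs + ov.startup + ov.per_byte * bytes_moved


def should_parallelize(seq_time, procs, bytes_moved, ov):
    """Accept the parallel version only if the estimate beats sequential time."""
    return parallel_time(seq_time, procs, bytes_moved, ov) < seq_time
```

Under this model a long-running loop is parallelized, while a very short loop that moves the same amount of data is left sequential because the overhead dominates.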

Randomized BSP/CGM algorithm for the maximal independent set problem

Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN, 1999, pp. 284-289

Authors: Ferreira, Afonso; Schabanel, Nicolas (CNRS-I3S-INRIA, Sophia Antipolis, France)

This paper presents a randomized parallel algorithm for the Maximal Independent Set problem. Our algorithm uses a BSP-like computer with p processors and requires that (n+m)/p = Ω(p) for a graph with n vertices and m edges. Under this scalability assumption, and after a preprocessing phase, it computes a maximal independent set after O(log p) communication rounds, with high probability, each round requiring linear computation time O((n+m)/p). The preprocessing phase is deterministic and is needed to ensure that degree computations can be implemented efficiently. For this, we give an optimal parallel BSP/CGM algorithm for the p-quantiles search problem, which runs in O(m log p/p) time and a constant number of communication rounds, and could be of interest in its own right, as shown in the text.

Keywords: parallel algorithms
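
For readers unfamiliar with randomized maximal independent set computation, the sketch below simulates sequentially the classic random-priority rounds in the style of Luby's algorithm. It is not the BSP/CGM algorithm of the record above: there is no processor partitioning, preprocessing phase, or quantile search, and the function names are hypothetical. Vertices are assumed to be comparable (for example integers) and the graph is assumed to have no self-loops.

```python
import random


def luby_mis(adj, seed=0):
    """Return (maximal independent set, number of rounds).

    adj maps each vertex to the set of its neighbours (undirected graph).
    """
    rng = random.Random(seed)
    alive = set(adj)
    mis = set()
    rounds = 0
    while alive:
        rounds += 1
        # Each live vertex draws a random key; the vertex id breaks ties.
        key = {v: (rng.random(), v) for v in alive}
        # A vertex wins if its key is smaller than every live neighbour's key.
        winners = {v for v in alive
                   if all(key[v] < key[u] for u in adj[v] if u in alive)}
        mis |= winners
        # Winners and their neighbours leave the graph.
        removed = set(winners)
        for v in winners:
            removed |= adj[v]
        alive -= removed
    return mis, rounds


def is_maximal_independent(adj, s):
    """Check that s is independent and that no vertex can be added to it."""
    independent = all(adj[v].isdisjoint(s) for v in s)
    maximal = all(v in s or adj[v] & s for v in adj)
    return independent and maximal
```

Each loop iteration corresponds to one round of the randomized scheme; in a distributed setting the key exchange between neighbours is what would cost a communication round.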

A parallel architecture for high speed fractal image coding

Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN, 1999, pp. 88-93

Authors: Lee, Shinhaeng; Aso, Hirotomo (Tohoku Univ, Sendai, Japan)

The main problem of fractal image compression is the long search time of the domain pool. For this reason, a dedicated ASIC architecture for fractal image coding is needed. In this paper, we propose an efficient parallel architecture for fractal image coding which is based on a fixed-size full-search algorithm. One of the main features of this architecture is that it uses only local communication: each processor holds a range block and a domain block, and the domain block is shifted to the next processor. Another main feature is that it has very regular interconnections and data flow. Domain blocks are formed from the range blocks in processors, and the encoding procedure is performed by the regular data flow of domain blocks into the other processors. Each processor performs the fast isometric transformations, which are calculated by one full rotation around the center.

Keywords: parallel processing systems
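
The abstract above refers to isometric transformations of blocks. Fractal coders commonly compare each range block against the eight isometries of a square domain block (four rotations, each with an optional mirror). The sketch below enumerates them in plain Python; the function names are hypothetical, and it does not model the hardware scheme of the record, which derives the transformations from one full rotation around the block center.

```python
def rotate90(block):
    """Rotate a square block (list of rows) 90 degrees clockwise."""
    return [list(row) for row in zip(*block[::-1])]


def flip_h(block):
    """Mirror a block left to right."""
    return [row[::-1] for row in block]


def isometries(block):
    """Return the eight isometries: four rotations, each plain and mirrored."""
    out = []
    cur = [list(row) for row in block]
    for _ in range(4):
        out.append(cur)
        out.append(flip_h(cur))
        cur = rotate90(cur)
    return out
```

A full-search encoder would evaluate a distortion measure between the range block and each of these eight variants of every candidate domain block.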

Parallel algorithms for all nearest neighbors of binary images on the BSP model

International Symposium on Parallel Architectures, Algorithms and Networks (ISPAN)

Authors: T. Ishimizu, A. Fujiwara, I. Inoue, T. Masuzawa, H. Fujiwara (Nara Institute of Science and Technology, Graduate School of Information Science, Ikoma, Nara, Japan; Department of Computer Science and Electronics, Kyushu Institute of Technology, Fukuoka, Japan)

We present two parallel algorithms for computing the nearest neighbors of an n × n binary image on the Bulk-Synchronous Parallel (BSP) model. The first algorithm is for weighted distance, and the second algorithm is for L_p distance. Both algorithms run in O(n²/p + L) computation time and O(gn/√p + L) communication time using p (1 ≤ p ≤ n) processors, and in O(n²/p + (d+L) log(p/n)/log(d+1)) computation time and O(gn/√p + (gd+L) log(p/n)/log(d+1)) communication time using p (n

Keywords: parallel algorithms

Summation algorithms on constrained reconfigurable meshes

International Symposium on Parallel Architectures, Algorithms and Networks (ISPAN)

Authors: A. Matsuura, A. Nagoya (NTT Communication Science Laboratories, Japan)

Constrained reconfigurable meshes are one type of parallel computing model which takes the reconfigurability of hardware into account. With these meshes, a practical assumption is made about the communication power: a signal is propagated through a constant number of processing elements (PEs), say k PEs, in one unit of time. We present algorithms for the fundamental problem of computing the sum of multiple integers. For the problem of summing n binary values, we show an optimal O(n/k)-time algorithm on a constrained reconfigurable mesh of size m × n, where m = Θ(log² k / log log k). For the problem of summing n d-bit integers, we present an O((d + √dmm)/k)-time algorithm on a constrained reconfigurable mesh of size √dmn × √dmn.

Keywords: Hardware; Delay; Laboratories; Electronic mail; Field programmable gate arrays; Information processing; Reconfigurable architectures; Design automation; parallel processing; Power system modeling

A parallel architecture for high speed fractal image coding

International Symposium on Parallel Architectures, Algorithms and Networks (ISPAN)

Authors: Shinhaeng Lee, H. Aso (Department of Electrical and Communication Engineering, Faculty of Engineering, University of Tohoku, Sendai, Japan)

Keywords: parallel architectures; Fractals; Image coding; Concurrent computing

An adaptive parallel grouping algorithm for B-WLL MAC protocol

International Symposium on Parallel Architectures, Algorithms and Networks (ISPAN)

Authors: Min-Su Kim, Tae-Young Byun, Sun-Woo Lee, Sung-Ho Hwang, Ki-Jun Han, Sung-Jo Kim (Department of Computer Engineering, Kyungpook National University, Daegu, South Korea; Electronics and Telecommunications Research Institute, Daejeon, South Korea)

We propose an adaptive parallel grouping algorithm that may reduce the number of collisions and increase overall throughput of B-WLL (Broadband Wireless Local Loop) MAC (Media Access Control) protocol. Also, we present a method for independently executing the adaptive parallel grouping algorithm to reduce the signaling traffic overheads.

Keywords: Media Access Protocol; B-ISDN; Costs; Access protocols; Throughput; Telecommunication computing; Optical fibers; Coaxial components; Investments; Programmable control

Torus assignment for an interconnection network recursive diagonal torus

Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN, 1999, pp. 74-79

Authors: Fan, Qin; Yang, Yulu; Funahashi, Akira; Amano, Hideharu (Nankai Univ, Tianjin, China)

Recursive Diagonal Torus (RDT) is a class of interconnection networks consisting of recursively overlaid two-dimensional square diagonal tori for massively parallel computers with up to 2¹⁶ nodes. Connection structures of the RDT vary according to the assignment of upper rank diagonal tori to a node. Although the traditional simple assignment called RDT(2, 4, 1)/α performs adequately under uniform traffic, the congestion of low rank tori degrades the performance when local communication is dominant. In this paper, the RDT(2, 4, 1)/β torus assignment is proposed, focusing on improving the performance for local communication. With a simplified simulation algorithm, results show that RDT(2, 4, 1)/β improves the average distance compared with the RDT(2, 4, 1)/α assignment when considering a local area.

Keywords: parallel processing systems

Generalized hierarchical completely-connected networks

Proceedings of the International Symposium on Parallel Architectures, Algorithms and Networks, I-SPAN, 1999, pp. 68-73

Authors: Takabatake, Toshinori; Kaneko, Keiichi; Ito, Hideo (Chiba Univ, Chiba, Japan)

In this paper, a new network structure called generalized Hierarchical Completely-Connected networks (HCC) is proposed, and its properties and features are evaluated. The set of HCCs constructed by the proposed method includes some conventional hierarchical networks, hence the name generalized. The construction of an HCC starts from a basic block (a level-1 block) which consists of n nodes with a constant degree. Then a level-h (h ≥ 2) block is constructed recursively by completely interconnecting every pair of macro nodes (n level-(h - 1) blocks). An HCC has a constant node degree regardless of its size (the number of nodes). Furthermore, since an HCC is hierarchically structured and uniform, a wide variety of inter-cluster connections are possible.

Keywords: parallel processing systems
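
The recursive construction in the abstract above implies simple size formulas, sketched here in Python. The only inputs are the quantities named in the abstract (n nodes per level-1 block, n level-(h - 1) blocks per level-h block); the function names are hypothetical, and how many physical links join each pair of macro nodes is not stated in the record, so the code counts connected pairs rather than links.

```python
def hcc_nodes(n, h):
    """Nodes in a level-h block: n level-(h-1) blocks, n nodes at level 1."""
    return n ** h


def hcc_macro_pairs(n, h):
    """Pairs of macro nodes that are interconnected, summed over levels 2..h.

    A level-h block contains n ** (h - level) blocks of the given level, and
    each of those completely connects its n macro nodes: n * (n - 1) / 2 pairs.
    """
    pairs_per_block = n * (n - 1) // 2
    return sum(n ** (h - level) * pairs_per_block for level in range(2, h + 1))
```

For example, with n = 4 and h = 3 the block has 64 nodes; it contains four level-2 blocks with 6 macro-node pairs each, plus 6 pairs at level 3.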
