检索结果-内蒙古大学图书馆

parallel INFERENCE algorithms FOR THE CONNECTION METHOD ON SYSTOLIC ARRAYS

international JOURNAL OF COMPUTER MATHEMATICS 1994年第3-4期53卷 177-188页

作者： HAN, YF EVANS, DJ LOUGHBOROUGH UNIV TECHNOL PARCLOUGHBOROUGH LE11 3TULEICSENGLAND

In practice, various techniques are used to speed up the reasoning in logic programming and parallel machines. Three major approaches have generally been adapted to solve this problem. The most common approach involves some methods of the development of AND and OR parallelism, as in Parlog[Clark84], Concurrent-Prolog[Shapiro83] and IDIOM[Guptas&Hermenegildo]. In these schemes, the three main forms of implicit parallelism-Independent AND-parallelism, Dependent AND-parallelism and OR-parallelism are exploited. The second approach is to build parallel architectures to execute different level parallelism inherent in inference, such as DADO[Stolfo84, Miranker90], NON-VON[Hillyer86] and PSM[Gupta87]. The third approach is to develop faster match and search algorithms, as in Rete[Forgy82] and Treat[Miranker87]. The bottle-neck in inference systems is the match phase. Around 90% of execution rime is consumed in this phase[Gupta87]. In this paper, we present algorithms to realize the connection method on systolic arrays. The algorithms try to partition the paths in connections matrices for parallel inference. Firstly, parallelism in reasoning is discussed;then the parallel inference on systolic arrays and algorithms for partition of paths are introduced. Finally, the correctness and completeness of the algorithms is shown. The paper consists of five sections. The connection method is presented and parallel inference algorithms on systolic arrays are designed after introduction. The third section describes an example in partition of the paths in the connection method, the example executing on normal systolic and tree systolic models are shown. The fourth section discusses the analysis of the algorithms. The final section works out conclusions and related work.

关键词： INFERENCE algorithms CONNECTION METHOD SYSTOLIC ARRAYS

来源：评论

学校读者我要写书评

暂无评论

NCIC's research and development in parallel processing

NCIC's research and development in parallel processing

引用

Proceedings of the international symposium on parallel architectures, algorithms and Networks (ISPAN)

作者： Li, Guo-jie Beijing China

National Research Center for Intelligent Computing Systems (NCIC for short) is the unique national hi-tech R/D center for advanced computing technology in China. In this overview we first introduce China's Hi-Tech R&D Programme (863 programme) and NCIC, then we reported the state of the art of parallel processing at NCIC. This article discussed the key technologies being exploited by the representative Chinese R&D teams and the wide applications of parallel computers in China. The key technologies in parallel processing we are attacking and reported in this article include wormhole routing and other efficient switching techniques, the Easter series MPP systems, the Dawning series symmetric and multi-thread multiprocessor, parallel operating systems and parallel file systems, parallel compiler and efficient programming tool. The future research directions at NCIC are also mentioned.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Efficient barriers for distributed shared memory computers

Efficient barriers for distributed shared memory computers

引用

Proceedings of the 8th international parallel Processing symposium

作者： Grunwald, Dirk Vajracharya, Suvas Univ of Colorado Boulder United States

ISBN: (纸本)0818656026

Barrier algorithms are central to the performance of numerous algorithms on scalable, high-performance architectures. Numerous barrier algorithms have been suggested and studied for Non-Uniform Memory Access (NUMA) architectures, but less work has been done for Cache Only Memory Access (COMA) or attraction memory [1] architectures such as the KSR-1. In this paper, we presented two new barrier algorithms that offer the best performance we have recorded on the KSR-1 distributed cache multiprocessor. We discuss the trade-offs and the performance of seven algorithms on two architectures. The new barrier algorithms adapt well to a hierarchical caching memory model and take advantage of parallel communication offered by most multiprocessor interconnection networks,. Performance results are shown for a 256-processor KSR-1 and a 20-processor Sequent Symmetry.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

parallel relational database algorithms revisited for range declustered data sets

Parallel relational database algorithms revisited for range ...

引用

Proceedings of the international symposium on parallel architectures, algorithms and Networks (ISPAN)

作者： Schikuta, Erich Univ of Vienna

Today available parallel database systems use conventional parallel hardware architectures employing a highly parallel software architecture. It is an emerging technique to speed up the execution by declustering the stored data sets among a number of parallel and independent disk drives. In this paper we revisit parallel relational database algorithms for range declustering. We adapt the conventional known and well studied parallel algorithms to declustered data, exploit the inherent order property of the partitioned data sets and compare analytically the performance of the algorithms. It is shown that the parallel range declustered variants generally outperform their conventional parallel counterparts.

关键词： Relational database systems

来源：评论

学校读者我要写书评

暂无评论

VIRTUAL SHARED-MEMORY - algorithms AND COMPLEXITY

引用

INFORMATION AND COMPUTATION 1994年第2期113卷 199-219页

作者： CHIN, A MCCOLL, WF Univ Oxford Programming Res Grp 11 Keble Rd Oxford OX1 3QD England

We consider the Block PRAM model of Aggarwal et al. (in ''Proceedings, First Annual ACM symposium on parallel algorithms and architectures, 1989,'' pp. 11-21). For a Block PRAM model with n/log n processors and communication latency l = O(log n), we show that prefix sums can be performed in time O(l log n/log 1), but list ranking requires time OMEGA(l log n);these bounds are tight. These results justify an intuitive observation of Gazit et al (in ''Proceedings, 1987 Princeton Workshop on Algorithm, Architecture and Technology Issues for Models of Concurrent Computation,'' pp. 139-156) that algorithm designers should, when possible, replace the list ranking procedure with the prefix sums procedure. We demonstrate the value of this technique in choosing between two optimal PRAM algorithms for finding the connected components of dense graphs. We also give theoretical improvements for integer sorting and many other algorithms based on prefix sums, and suggest a relationship between the issue of graph density for the connected components problem and alternative approaches to integer sorting. (C) 1994 Academic Press, Inc.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Texture analysis for image processing on general-purpose parallel machines

Texture analysis for image processing on general-purpose par...

引用

Proceedings of the international symposium on parallel architectures, algorithms and Networks (ISPAN)

作者： Boroczky, Lilla Cremonesi, Paolo Scarabottolo, Nello Hungarian Acad of Sciences Budapest Hungary

The problem considered in this paper is the definition of an efficient parallel algorithm for texture analysis of an image. The target architectures are distributed-memory general-purpose MIMD parallel machine. The solutions here proposed are based on two different methods: the Statistic Feature Matrix and the Wavelet Decomposition.

关键词： Image processing

来源：评论

学校读者我要写书评

暂无评论

Scheduling algorithms performance with the pSystem parallel programming environment 6th

引用

6th international Conference on parallel architectures and Languages Europe, PARLE 1994

作者： Lopes, Luís M. B. Silva, Fernando M. A. LIACC Universidade do Porto Rua do Campo Alegre 823 Porto4100 Portugal

ISBN: (纸本)9783540581840

The efficiency of scheduling algorithms is essential in order to attain optimal performances from parallel programming systems. In this paper we use a portable parallel programming environment we have implemented, the pSystem, to evaluate and compare the performance of various scheduling algorithms on shared memory parallel machines. © Springer-Verlag Berlin Heidelberg 1994.

关键词： Scheduling algorithms

来源：评论

学校读者我要写书评

暂无评论

Greedy task clustering heuristic that is provably good

Greedy task clustering heuristic that is provably good

引用

Proceedings of the international symposium on parallel architectures, algorithms and Networks (ISPAN)

作者： Palis, Michael A. Liou, Jing-Chiou Wei, David S.L. New Jersey Inst of Technology Newark United States

A simple greedy algorithm is presented for task clustering with duplication (or recomputation) which, for a task graph with arbitrary granularity, produces a schedule whose makespan is at most twice optimal. Furthermore, the quality of the schedule improves as the granularity of the task graph increases. For example, if the granularity is at least 1/2 , the makespan of the schedule is at most 5/3 times optimal. For a task graph with n tasks and e inter-task communication constraints, the algorithm runs in O(n(n lg n + e)) time, which is n times faster than the currently best known algorithm for this problem. Similar algorithms are developed that produce: (1) optimal schedules for coarse grain graphs;(2) 2-optimal schedules for trees with no task duplication;and (3) optimal schedules for coarse grain trees with no task duplication.

关键词： Heuristic programming

来源：评论

学校读者我要写书评

暂无评论

An approach to machine-independent parallel programming 3rd

引用

3rd Joint international Conference on Vector and parallel Processing, CONPAR 1994 - VAPP VI

作者： Zimmermann, Wolf Löwe, Welf Institut für Programm strukturen und Datenorganisation Universität Karlsruhe Karlsruhe76128 Germany

ISBN: (纸本)9783540584308

Currently, many parallel algorithms are defined for shared- memory architectures. The prefered machine model for designing these algorithms is the PRAM. However, this model does not take into account properties of existing architectures. Recently, Culler et al. defined the LogP machine model which better reflects the behaviour of massively parallel computers. We discuss an important class of programs for shared- memory architectures and show how they can be mapped to the LogP machine. We define this class and show how to compute the mapping at compile time. For this mapping a constant factor delay with respect to the optimal LogP execution time can be guaranteed. © Springer-Verlag Berlin Heidelberg 1994.

关键词： Mapping

来源：评论

学校读者我要写书评

暂无评论

Optimal parallel algorithm for edge-coloring partial k-trees with bounded degrees

Optimal parallel algorithm for edge-coloring partial k-trees...

引用

Proceedings of the international symposium on parallel architectures, algorithms and Networks (ISPAN)

作者： Zhou, Xiao Nishizeki, Takao Tohoku Univ Sendai Japan

Many combinatorial problems can be efficiently solved for partial k-trees (graphs of treewidth bounded by k). The edge-coloring problem is one of the well-known combinatorial problems for which no NC algorithms have been obtained for partial k-trees. This paper gives an optimal and first NC parallel algorithm to find an edge-coloring of any given partial k-tree using a minimum number of colors if k and the maximum degree Δ are bounded.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：