检索结果-内蒙古大学图书馆

ISCC 2014 Workshop - 5th IEEE International Workshop on Performance Evaluation of Communications in Distributed Systems and Web based Service Architectures, PEDISWESA 2014

作者： Húdik, Martin Hodoň, Michal University of Žilina Department of Technical Cybernetics Faculty of Management Science and Informatics Univerzitné 8215/1 010 26 Slovakia

ISBN: (纸本)9781479942770

The high intensity of research and modeling in fields of mathematics, physics, biology and chemistry requires new computing resources. For the big computational complexity of such tasks computing time is large and costly. The most efficient way to increase efficiency is to adopt parallel principles. Purpose of this paper is to present the issue of parallel computing with emphasis on the analysis of parallel systems, the impact of communication delays on their efficiency and on overall execution time. Paper focuses is on finite algorithms for solving systems of linear equations, namely the matrix manipulation (Gauss elimination method GEM). algorithms are designed for architectures with shared memory (openMP), distributed-memory (MPI) and for their combination (MPI+openMP). The properties of the algorithms were analytically determined and they were experimentally verified. The conclusions are drawn for theory and practice. © 2014 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Fast and efficient parallel algorithms for the exact inversion of integer matrices 5th

引用

5th Conferences on Foundations of Software Technology and Theoretical Computer Science, FST and TCS 1985

作者： Pan, Victor Computer Science Department State University of New York at Albany AlbanyNY United States

ISBN: (纸本)9783540160427

Let A = (aij) be a nonsingular n×n integer matrix such that log A ≤ no(1), maxi,j |aij| ≤ A ≤ n maxi,j |aij|. Then adj A, A−1 and all the coefficients of the characteristic polynomial of A including det A can be exactly evaluated on arithmetic circuits using O(log2n) parallel steps and M(n) processors where M(n) is the minimum number of processors required in order to multiply two n×n matrices in O(log n) steps, M(n) = o(n2.5). This substantially improves the processor bound √n M(n) of [Preparata and Sarwate, 78] and extends the recent results of [Pan and Reif, 85], where the same complexity estimates were obtained for the approximate evaluation of A−1 and under the additional assumption that A is a well-conditioned or strongly diagonally dominant matrix. All arithmetic operations can be performed with the precision of ≤ d bits so the total cost of computation is only O(log(dn))2 steps, o(n2.496d log d log log d) processors under the Boolean circuit model of parallel computation. Here d is O(n2log(nA) in the worst case;d is O(n log(nA)) with probability 1-O(1/(nh−1Ah)) for arbitrary constant h. This extends our √n-improvement of the efficiency of the previously known algorithms to the case of the Boolean circuit model and consequently increases the efficiency of the known parallel algorithms for several related algebraic and combinatorial problems. © 1985, Springer-Verlag.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Model-driven transformations for mapping parallel algorithms on parallel computing platforms 2

Model-driven transformations for mapping parallel algorithms...

引用

2nd International Workshop on Model-Driven Engineering for High Performance and CLoud Computing, MDHPCL 2013 - Co-located with 16th International Conference on Model Driven Engineering Languages and Systems, MODELS 2013

作者： Arkin, Ethem Tekinerdogan, Bedir Aselsan MGEO Ankara Turkey Bilkent University Dept. of Computer Engineering Ankara Turkey

One of the important problems in parallel computing is the mapping of the parallel algorithm to the parallel computing platform. Hereby, for each parallel node the corresponding code for the parallel nodes must be implemented. For platforms with a limited number of processing nodes this can be done manually. However, in case the parallel computing platform consists of hundreds of thousands of processing nodes then the manual coding of the parallel algorithms becomes intractable and error-prone. Moreover, a change of the parallel computing platform requires considerable effort and time of coding. In this paper we present a model-driven approach for generating the code of selected parallel algorithms to be mapped on parallel computing platforms. We describe the required platform independent metamodel, and the model-to-model and the model-to-text transformation patterns. We illustrate our approach for the parallel matrix multiplication algorithm. Copyright © 2013 for the individual papers by the papers' authors.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

CUDA parallel algorithms for forward and inverse structural gravity problems 1

CUDA parallel algorithms for forward and inverse structural ...

引用

1st Ural Workshop on parallel, Distributed, and Cloud Computing for Young Scientists, Ural-PDC 2015

作者： Tsidaev, Alexander Bulashevich Institute of Geophysics Yekaterinburg Russia

This paper describes usage of CUDA parallelization scheme for forward and inverse gravity problems for structural boundaries. For- ward problem is calculated using the finite elements approach. This means that the whole calculation volume is split into parallelepipeds and then the gravity effect of each is calculated using known formula. In- verse problem solution is found using iteration local corrections method. This method requires only forward problem calculation on each iteration and does not use the operator inversion. Obtained results show that even cheap consumer video cards are highly effective for algorithm parallelization.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Analysis of parallel algorithms for energy conservation in scalable multicore architectures

Analysis of parallel algorithms for energy conservation in s...

引用

38th International Conference on parallel Processing, ICPP-2009

作者： Korthikanti, Vijay Anand Agha, Gul Department of Computer Science University of Illinois Urbana Champaign United States

ISBN: (纸本)9780769538020

This paper analyzes energy characteristics of parallel algorithms executed on scalable multicore processors. Specifically, we provide a methodology for evaluating energy scalability of parallel algorithms while satisfying performance requirements. Four parallel algorithms are analyzed to illustrate our method. We study the sensitivity of our analysis to changes in parameters such as the ratio of power required for computation versus power required for communication. The results suggest that power and performance scalability of a parallel algorithm can be quite different. Our method can be used to determine how many cores to use in order to minimize energy consumption. © 2009 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Generic parallel algorithms

引用

10th Conference on Computability in Europe, CiE 2014

作者： Dershowitz, Nachum Falkovich, Evgenia School of Computer Science Tel Aviv University Tel Aviv Israel

ISBN: (纸本)9783319080185

We develop a nature-inspired generic programming language for parallel algorithms, one that works for all data structures and control structures. Any parallel algorithm satisfying intuitively-appealing postulates can be modeled by a collection of cells, each of which is an abstract state machine, augmented with the ability to spawn new cells. All cells run the same algorithm and communicate via a shared global memory. © 2014 Springer International Publishing.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms and the models of array processors for calculation of multiple sums

Engineering Simulation

引用

Engineering Simulation 1999年第2期16卷 179-191页

作者： Likhoded, N.A. Sobolevskii, P.I. Acad of Sciences of Belarus Minsk

parallel forms of algorithms for the computation of multiple weighted sums are obtained. Appropriate models of parallel-pipelined VLSI array processors are synthesized. The number of processor elements is independent ... 详细信息

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

SPITFIRE: Scalable parallel algorithms for test set partitioned fault simulation

SPITFIRE: Scalable parallel algorithms for test set partitio...

引用

Proceedings of the 1997 15th VLSI Test Symposium

作者： Krishnaswamy, Dilip Rudnick, Elizabeth M. Patel, Janak H. Banerjee, Prithviraj Univ of Illinois Urbana United States

We propose three synchronous parallel algorithms for scalable parallel test set partitioned fault simulation. The algorithms are based on a new two-stage approach to parallelizing fault simulation for sequential VLSI circuits in which the test set is partitioned among the available processors. The test set partitioning inherent in the algorithms overcomes the good circuit logic simulation bottleneck that exists in traditional fault partitioned approaches to parallel fault simulation. The implementations were done on a shared memory multiprocessor and on a network of workstations. Two of the algorithms show a small degree of pessimism in a few cases, with respect to the fault coverage as compared with a uniprocessor run, while the third algorithm provides the same results as in a uniprocessor run. All algorithms provide excellent speedups and perform much better than a traditional fault partitioned approach, on both shared and distributed memory parallel platforms.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Asynchronous parallel algorithms for test set partitioned fault simulation

Asynchronous parallel algorithms for test set partitioned fa...

引用

Proceedings of the 1997 11th Workshop on parallel and Distributed Simulation

作者： Krishnaswamy, Dilip Banerjee, Prithviraj Rudnick, Elizabeth M. Patel, Janak H. Univ of Illinois Urbana IL United States

We propose in this paper two new asynchronous parallel algorithms for test set partitioned fault simulation. The algorithms are based on a new two-stage approach to parallelizing fault simulation for sequential VLSI circuits in which the test set is partitioned among the available processors. These algorithms provide the same result as the previous synchronous two stage approach. However, due to the dynamic characteristics of these algorithms and due to the fact that there is very minimal redundant work, they run faster than the previous synchronous approach. A theoretical analysis comparing the various algorithms is also given to provide an insight into these algorithms. The implementations were done in MPI and are therefore portable to many parallel platforms. Results are shown for a shared memory multiprocessor.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Interactive animation of fault tolerant parallel algorithms

Interactive animation of fault tolerant parallel algorithms

引用

1992 IEEE Workshop on Visual Languages, VL 1992

作者： Apgar, S.W. Digital Equipment Corp 110 Spit Brook Road NashuaNH03062 United States

ISBN: (纸本)0818630906

Animation of algorithms makes understanding them intuitively easier. This paper describes the software tool Raft (Robust Animator of Fault Tolerant algorithms) which allows the user to animate a number of fault tolerant parallel algorithms which achieve fault tolerant execution. The novelty of the system is that the interface allows the user to create new on-line fault-injecting adversaries as the algorithm executes. The various algorithms animated adapt to this interactive input in order to ensure fault-tolerance. The system has an extensive user interface which allows a choice of the number of processors, the number of parallel tasks, and the adversary to control the processor failures. © 1992 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：