检索结果-内蒙古大学图书馆

Locating and computing in parallel all the simple roots of special functions using PVM

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS 2001年第1-2期133卷 545-554页

作者： Plagianakos, VP Nousis, NK Vrahatis, MN Univ Patras Dept Math UPAIRC GR-26500 Patras Greece

An algorithm is proposed for locating and computing in parallel and with certainty all the simple roots of any twice continuously differentiable function in any specific interval. To compute with certainty all the roots, the proposed method is heavily based on the knowledge of the total number of roots within the given interval. To obtain this information we use results from topological degree theory and, in particular, the Kronecker-Picard approach. This theory gives a formula for the computation of the total number of roots of a system of equations within a given region, which can be computed in parallel, With this tool in hand, we construct a parallel procedure for the localization and isolation of all the roots by dividing the given region successively and applying the above formula to these subregions until the final domains contain at the most one root. The subregions with no roots are discarded, while for the rest a modification of the well-known bisection method is employed for the computation of the contained root. The new aspect of the present contribution is that the computation of the total number of zeros using the Kronecker-Picard integral as well as the localization and computation of all the roots is performed in parallel using the parallel virtual machine (PVM). PVM is an integrated set of software tools and libraries that emulates a general-purpose, flexible, heterogeneous concurrent computing framework on interconnected computers of varied architectures. The proposed algorithm has large granularity and low synchronization, and is robust. It has been implemented and tested and our experience is that it can massively compute with certainty all the roots in a certain interval. Performance information from massive computations related to a recently proposed conjecture due to Elbert (this issue, J. Comput. Appl. Math. 133 (2001) 65-83) is reported. (C) 2001 Elsevier Science B.V. All rights reserved.

关键词： Elbert's conjecture special functions Bessel functions computation of special functions construction of tables of special functions parallel and distributed algorithms parallel virtual machine zero isolation Kronecker-Picard theory topological degree theory computing simple roots bisection method

来源：评论

学校读者我要写书评

暂无评论

parallel retrograde analysis on different architecture

Parallel retrograde analysis on different architecture

引用

10th Annual IEEE international symposium on High Performance Distributed Computing

作者： Wu, R Beal, DF HP Laboratories Hewlett-Packard Company Palo Alto CA United States

ISBN: (纸本)0769512968

Retrograde analysis is an efficient exhaustive search method. It is a powerful tool that can be used in solving problems where end states have known values but starting states do not. It has been widely used to solve mathematically-precise games such as chess endgames, and is potentially usable in energy-minimization problems. With increasing computing power, both in speed and storage capacity, retrograde analysis will become more and more useful. This paper looks at successful applications to games, the challenges ahead, and the modifications that are required to utilize distributed hardware. The power and the usefulness of retrograde analysis are still limited by the computing resources one has access to. Today, the best sequential retrograde algorithms are capable of solving problems with about 109 states in a few hours on a standard personal computer Bigger problems need more powerful computers, or take much longer to solve, or are simply out of reach of today's technologies, Introducing parallelism to retrograde analysis is a natural way to attack the bigger problems. There are today three main architectures available for doing parallel retrograde analysis: namely Symmetric Multiprocessor systems, High-speed network based distributed systems, and Internet based distributed systems. In this paper, we discuss some of the key issues in doing parallel retrograde analysis on these different architectures. Technical challenges are addressed in detail, as well as some examples and proposals. These examples and proposals are drawn from various board games, but the ideas can be applied to other problem domains.

关键词： Distributed computer systems

来源：评论

学校读者我要写书评

暂无评论

A two-level checkpoint algorithm in a highly-available parallel single level store system

A two-level checkpoint algorithm in a highly-available paral...

引用

1st IEEE/ACM international symposium on Cluster Computing and the Grid, CCGrid 2001

作者： Morin, Christine Lottiaux, Renaud Kermarrec, Anne-Marie Rennes i Univ. France Microsft Research Cambridge United Kingdom

ISBN: (纸本)0769510108

A parallel single level store system (PSLS) integrates a shared virtual memory and a parallel file system. Managing the data globally it provides programmers of scientific applications with the attractive shared memory programming model combined with a large and efficient file system in a cluster. We present a cheap and efficient two-level checkpointing approach enabling a PSLS to tolerate failures. The first level checkpointing algorithm is very efficient and saves data in memory but requires a large amount of memory space. When memories are saturated, an alternative algorithm, saving a checkpoint on disks is implemented. Performance results present the impact of different variants of the checkpointing algorithms. © 2001 IEEE.

关键词： File organization

来源：评论

学校读者我要写书评

暂无评论

Continuous wavelet transform on reconfigurable meshes 15

Continuous wavelet transform on reconfigurable meshes

引用

15th international parallel and Distributed Processing symposium, IPDPS 2001

作者： Pan, Yi Li, Jie Vemuri, Ranga Department of Computer Science Georgia State University AtlantaGA30303 United States Institute of Information Sciences and Electronics University of Tsukuba Tsukuba Science City Ibaraki305-8573 Japan Department of ECECS University of Cincinnati CincinnatiOH45221-0030 United States

ISBN: (纸本)0769509908

Wavelet transforms have proven to be useful tools for several applications, including signal analysis, signal coding, and image compression. In this paper, faster parallel algorithms for computing the continuous wavelet transform are designed for reconfigurable meshes. An O(n1/2/m(log b + log∗ n))-time algorithm for computing the continuous wavelet transform with n signals and an integer grid (m, k) on a 3-D m x k x n reconfigurable mesh is proposed, where b is the number of bits used to represent the values in calculation. A constant-time algorithm 3-D m x k log3 n x t reconfigurable mesh is also proposed. To the best knowledge of the author, this is the first constant-time algorithm for continuous wavelet transform on any parallel architecture. © 2001 IEEE.

关键词： parallel architectures

来源：评论

学校读者我要写书评

暂无评论

An efficient association mining implementation on cluster of SMPs

An efficient association mining implementation on cluster of...

引用

international symposium on parallel and Distributed Processing (IPDPS)

作者： Ruoming Jin G. Agrawal Department of Computer and Information Sciences University of Delaware Newark DE USA

来源：评论

学校读者我要写书评

暂无评论

A study of implicit data distribution methods for OpenMP using the SPEC benchmarks

A study of implicit data distribution methods for OpenMP usi...

引用

international Workshop on OpenMP Applications and Tools, WOMPAT 2001

作者： Nikolopoulos, Dimitrios S. Ayguadé, Eduard Coordinated Science Lab University of Illinois at Urbana-Champaign 1308 West Main Str UrbanaIL61801 United States Department d’ Arquitectura de Computadors Universitat Politecnica de Catalunya c/Jordi Girona 1-3 Barcelona08034 Spain

ISBN: (纸本)9783540445876

In contrast to the common belief that OpenMP requires data-parallel extensions to scale well on architectures with non-uniform memory access latency, recent work has shown that it is possible to develop OpenMP programs with good levels of memory access locality, without any extension of the OpenMP API. The vehicle for localizing memory accesses transparently to the programming model, is a runtime memory manager, which uses memory access tracing and dynamic page migration to implement automatic data distribution. This paper evaluates the effectiveness of using this runtime data distribution method in non embarrassingly parallel codes, such as the SPEC benchmarks. We investigate the extent up to which sophisticated management of physical memory in the runtime system can speedup programs for which the programmer has no knowledge of the memory access pattern. Our runtime memory management algorithms improve the speedup of five SPEC benchmarks by 20-25% on average. The speedups are close to the theoretical maximum speedups for the problem sizes used and they are obtained with a minimal programming effort of about a couple of hours per benchmark. © Springer-Verlag Berlin Heidelberg 2001.

关键词： Application programming interfaces (API)

来源：评论

学校读者我要写书评

暂无评论

parallel decoding architectures for low density parity check codes

Parallel decoding architectures for low density parity check...

引用

IEEE international symposium on Circuits and Systems (ISCAS)

作者： C. Howland A. Blanksby High Speed Communications VLSI Research Department Agere Systems Holmdel NJ USA

ISBN: (纸本)0780366859

A parallel architecture for decoding low density parity check (LDPC) codes is proposed that achieves high coding gain together with extremely low power dissipation, and high throughput. The feasibility of this architecture is demonstrated through the design and implementation of a 1024 bit, rate-1/2, soft decision parallel LDPC decoder.

关键词： Parity check codes Turbo codes Iterative algorithms Iterative decoding Throughput parallel architectures Block codes Signal processing algorithms Sparse matrices Power dissipation

来源：评论

学校读者我要写书评

暂无评论

A genetic programming ecosystem

A genetic programming ecosystem

引用

international symposium on parallel and Distributed Processing (IPDPS)

作者： J. Devaney J. Hagedorn O. Nicolas G. Garg A. Samson M. Michel National Institute for Standards and Technology Gaithersburg MD USA

来源：评论

学校读者我要写书评

暂无评论

Wavelet image and video coding on parallel architectures

Wavelet image and video coding on parallel architectures

引用

international symposium on Image and Signal Processing and Analysis

作者： M. Feil R. Kutil P. Meerwald A. Uhl RIST and Department of Scientific Computing University of Salzburg Austria

We discuss parallel algorithms for wavelet-based image and video coding. After reviewing fundamentals of the parallel discrete wavelet transform, we cover the parallelization of two state-of-the-art compression scheme... 详细信息

ISBN: (纸本)9539676940

关键词： Video coding parallel architectures Wavelet transforms Image coding Wavelet packets Video codecs Video compression Low pass filters parallel algorithms Discrete wavelet transforms

来源：评论

学校读者我要写书评

暂无评论

High-level data mapping for clusters of SMPs

High-level data mapping for clusters of SMPs

引用

international symposium on parallel and Distributed Processing (IPDPS)

作者： S. Benkner T. Brandes Institute for Software Science University of Technology Vienna Austria GMD-German National Research Center for Information Technology SCAI-Institute for Algorithms and Scientific Computing Saint Petersburg Russia

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：