检索结果-内蒙古大学图书馆

parallel algorithms for coupled-cluster methods

parallel COMPUTING 2000年第7-8期26卷 857-867页

作者： Watts, JD Jackson State Univ Dept Chem Computat Ctr Mol Struct & Interact Jackson MS 39217 USA

Coupled-cluster (CC) methods are now widely used in quantum chemistry to calculate the electron correlation energy and many other properties of atoms and molecules. In this paper we outline the basics of the theory, discuss some computational aspects, and review work that has been done toward developing and implementing algorithms for CC methods on parallel computers. (C) 2000 Elsevier Science B.V. All rights reserved.

关键词： coupled-cluster methods quantum chemistry electron correlation parallel computation parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms FOR CIRCLE DETECTION IN IMAGES

引用

PATTERN RECOGNITION 1994年第8期27卷 1019-1028页

作者： KUMAR, S RANGANATHAN, N GOLDGOF, D Department of Computer Science and Engineering ENG 118 University of South Florida Tampa FL 33620 U.S.A.

The detection of circles in images is an important task in many computer vision applications. When the three parameters (center coordinates and radius) of a circle are quantized into O(n) values each, a sequential algorithm using the Hough transform runs with a time complexity of O(n4), where n x n is the size of the image. When information about the gradient direction is also used, the complexity of the sequential algorithm reduces to O(n3). This paper proposes three parallel algorithms for circle detection on an n x n mesh of processing elements operating in the SIMD mode. The first two algorithms use the Hough transform and the third is based on a tracing algorithm. The first algorithm uses only the gradient magnitude and takes O(n3) time. The second uses both the gradient magnitude and gradient direction and runs in O(n2) time. The third method uses a midpoint circle scan conversion algorithm and runs with a complexity of O(n2). This algorithm is the most efficient of the three. It does not use the gradient direction and offers an improvement of O(n2) over its sequential counterpart that runs in O(n4) time. When implemented with a table look-up operation, this algorithm has a low proportionality constant and offers a significant improvement in computational speed.

关键词： parallel algorithms CIRCLE DETECTION HOUGH TRANSFORM SIMD ARCHITECTURE

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for the spectral transform method

引用

SIAM JOURNAL ON SCIENTIFIC COMPUTING 1997年第3期18卷 806-837页

作者： Foster, IT Worley, PH OAK RIDGE NATL LAB OAK RIDGETN 37831

The spectral transform method is a standard numerical technique for solving partial differential equations on a sphere and is widely used in atmospheric circulation models. Re cent research has identified several promising algorithms for implementing this method on massively parallel computers. however, no detailed comparison of the different algorithms has previously been attempted. In this paper, we describe these different parallel algorithms and report on computational experiments that we have conducted to evaluate their efficiency on parallel computers. The experiments used a testbed code that solves the nonlinear shallow water equations on a sphere: considerable care was taken to ensure that the experiments provide a fair comparison of the different algorithms and that the results are relevant to global models. We focus on hypercube- and mesh-connected multicomputers with cut-through routing, such as the Intel iPSC/860, DELTA, and Paragon, and the nCUBE/2, but we also indicate how the results extend to other parallel computer architectures. The results of this study are relevant not only to the spectral transform method but also to multidimensional fast Fourier transforms (FFTs) and other parallel transforms.

关键词： spectral transform method parallel algorithms performance analysis

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms FOR SOLVING LARGE LINEAR-SYSTEMS

引用

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS 1994年第1-3期50卷 221-232页

作者： DEKKER, TJ HOFFMANN, W POTMA, K UNIV AMSTERDAM DEPT COMP SYST1098 SJ AMSTERDAMNETHERLANDS

The solution of linear systems continues to play an important role in scientific computing. The problems to be solved often are of very large size, so that solving them requires large computer resources. To solve these problems, at least supercomputers with large shared memory or massive parallel computer systems with distributed memory are needed. This paper gives a survey of research on parallel implementation of various direct methods to solve dense linear systems. In particular are considered: Gaussian elimination, Gauss-Jordan elimination and a variant due to Huard (1979), and an algorithm due to Enright (1978), designed in relation to solving (stiff) ODEs, such that stepsize and other method parameters can easily be varied. Some theoretical results are mentioned, including a new result on error analysis of Huard's algorithm. Moreover, practical considerations and results of experiments on supercomputers and on a distributed-memory computer system are presented.

关键词： GAUSSIAN ELIMINATION GAUSS-JORDAN LINEAR SYSTEMS LU FACTORIZATION PIVOTING STRATEGIES parallel algorithms VECTOR COMPUTING

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for Fuzzy Ontology Reasoning

引用

IEEE TRANSACTIONS ON FUZZY SYSTEMS 2013年第4期21卷 775-781页

作者： Bobillo, Fernando Delgado, Miguel Cesar Sanchez-Sanchez, Jose Univ Zaragoza Dept Comp Sci & Syst Engn Zaragoza 50009 12 Spain Univ Granada Dept Comp Sci & Artificial Intelligence Granada 18010 0 Spain

The need to deal with imprecise and vague information in ontologies is rising in importance, as required by several real-world application domains. As a consequence, there is a growing interest in fuzzy ontologies, which combine ontologies and fuzzy logic theory. In fuzzy ontologies, some reasoning tasks usually become harder to solve, such as the concept subsumption problem and the computation of the Best Degree Bound (BDB) of an axiom. In fact, the current existing algorithms to solve these problems usually require performing some simpler tests several times. In this paper, we present a parallelization of these algorithms, implemented in the DeLorean reasoner, and discuss the encouraging results of an empirical evaluation.

关键词： Fuzzy description logics (DLs) fuzzy ontologies logic for the semantic web parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms AND ARCHITECTURES FOR MULTISPLITTING ITERATIVE METHODS

引用

parallel COMPUTING 1989年第2期12卷 171-182页

作者： PAPATHEODOROU, TS SARIDAKIS, YG UNIV PATRAS DEPT COMP ENGNPATRASGREECE CLARKSON UNIV DEPT MATH & COMP SCIPOTSDAMNY 13676

The Multi-Splitting (MS) iterative method, designed exclusively for multiprocessor environments, is considered for the solution of large systems of linear equations. A general parallel algorithm is devised and implemented on a modular two-level parallel architecture, which utilizes the systolic arrays as building blocks, to demonstrate the point iteration. A particular three-term member of the MS family is applied, for the parallel block iterative solution, on the Poisson's equation discretized by the collocation method.

关键词： collocation Iterative methods parallel algorithms splittings systolic architectures

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for finding polynomial Roots on OTIS-torus

引用

JOURNAL OF SUPERCOMPUTING 2010年第2期54卷 139-153页

作者： Lucas, Keny T. Jana, Prasanta K. Xavier Inst Social Serv Dept Informat Management Ranchi 834001 Bihar India Mines Univ Indian Sch Dept Comp Sci & Engn Dhanbad 826004 Bihar India

We present two parallel algorithms for finding all the roots of an N-degree polynomial equation on an efficient model of Optoelectronic Transpose Interconnection System (OTIS), called OTIS-2D torus. The parallel algorithms are based on the iterative schemes of Durand-Kerner and Ehrlich methods. We show that the algorithm for the Durand-Kerner method requires (N (0.75)+0.5N (0.25)-1) electronic moves + 2(N (0.5)-1) OTIS moves using N processors. The parallel algorithm for Ehrlich method is shown to run in (N (0.75)+0.5N (0.25)-1) electronic moves + 2(N (0.5)-1) OTIS moves with the same number of processors. The algorithms have lower AT cost than the algorithms presented in Jana (parallel Comput 32:301-312, 2006). The scalability of the algorithms is also discussed.

关键词： parallel algorithms Optoelectronic parallel computer OTIS-2D torus Polynomial roots Durand-Kerner scheme Ehrlich scheme

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms Based on the Temporal-Window Method for Non-Alternating 3D-WT over Angiographies Using a Multicomputer

引用

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY 2009年第1-3期55卷 267-279页

作者： Moyano-Avila, E. Orozco-Barbosa, L. Quiles, F. J. Univ Castilla La Mancha Dept Informat Technol & Syst Toledo 45071 Spain Univ Castilla La Mancha Comp Syst Dept Albacete Spain

In this paper, we introduce and evaluate the parallel implementations of two video sequences decorrelation algorithms having been developed based on the non-alternating three-dimensional wavelet transform (3D-WT) and the temporal-window method. The proposed algorithms have been proven to outperform the classic 3D-WT algorithm in terms of a better coding efficiency and lower computational requirements while enabling a lossless coding and a top-quality reconstruction: the two most highly relevant features to medical imaging applications. The parallel implementations of the algorithms are developed and tested on a shared memory system, a SGI Origin 3800 supercomputer, making use of a message-passing paradigm. We evaluate and analyze the performance of the implementations in terms of the response time and speed-up factor by varying the number of processors and various video coding parameters. The key point enabling the development of highly efficient implementations rely on a workload distribution strategy supplemented by the use of parallel I/O primitives, for better exploiting the inherent features of the application and computing platform. Two sets of I/O primitives are tested and evaluated: the ones provided by the C compiler and the ones belonging to the MPI/IO library.

关键词： parallel algorithms parallel I/O Multicomputer Non-alternating wavelet transform Temporal-window method Angiography sequences

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms FOR FLUID-STRUCTURE INTERACTION PROBLEMS IN HAEMODYNAMICS

引用

SIAM JOURNAL ON SCIENTIFIC COMPUTING 2011年第4期33卷 1598-1622页

作者： Crosetto, Paolo Deparis, Simone Fourestey, Gilles Quarteroni, Alfio Ecole Polytech Fed Lausanne IACS Chair Modelling & Sci Comp CMCS CH-1015 Lausanne Switzerland Politecn Milan MOX Dipartimento Matemat F Brioschi I-20133 Milan Italy

The increasing computational load required by most applications and the limits in hardware performances affecting scientific computing contributed in the last decades to the development of parallel software and architectures. In fluid-structure interaction (FSI) for haemodynamic applications, parallelization and scalability are key issues (see [L. Formaggia, A. Quarteroni, and A. Veneziani, eds., Cardiovascular Mathematics: Modeling and Simulation of the Circulatory System, Modeling, Simulation and Applications 1, Springer, Milan, 2009]). In this work we introduce a class of parallel preconditioners for the FSI problem obtained by exploiting the block-structure of the linear system. We stress the possibility of extending the approach to a general linear system with a block-structure, then we provide a bound in the condition number of the preconditioned system in terms of the conditioning of the preconditioned diagonal blocks, and finally we show that the construction and evaluation of the devised preconditioner is modular. The preconditioners are tested on a benchmark three-dimensional (3D) geometry discretized in both a coarse and a fine mesh, as well as on two physiological aorta geometries. The simulations that we have performed show an advantage in using the block preconditioners introduced and confirm our theoretical results.

关键词： blood-flow models fluid-structure interaction finite elements preconditioners parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for red-black trees

引用

THEORETICAL COMPUTER SCIENCE 2001年第1-2期262卷 415-435页

作者： Park, H Park, K Seoul Natl Univ Dept Comp Engn Seoul 151742 South Korea

We present parallel algorithms for the following four operations on red-black trees: construction, search, insertion, and deletion. Our parallel algorithm for constructing a red-black tree from a sorted list of n items runs in O(1) time with n processors on the CRCW PRAM and runs in O(loglogn) time with n/loglogn processors on the EREW PRAM. Our construction algorithm does not require the assumptions that previous construction algorithms used. Each of our parallel algorithms for search, insertion, and deletion in red-black trees runs in O(logn + logk) time with k processors on the EREW PRAM, where k is the number of unsorted items to search for, insert, or delete and n is the number of nodes in a red-black tree. (C) 2001 Elsevier Science B.V. All rights reserved.

关键词： red-black trees balanced search trees parallel algorithms dictionary operations

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：