检索结果-内蒙古大学图书馆

Structure preserving parallel algorithms for solving the Bethe-Salpeter eigenvalue problem

LINEAR ALGEBRA AND ITS APPLICATIONS 2016年 488卷 148-167页

作者： Shao, Meiyue da Jornada, Felipe H. Yang, Chao Deslippe, Jack Louie, Steven G. Univ Calif Berkeley Lawrence Berkeley Natl Lab Computat Res Div Berkeley CA 94720 USA Univ Calif Berkeley Dept Phys Berkeley CA 94720 USA Univ Calif Berkeley Lawrence Berkeley Natl Lab Div Mat Sci Berkeley CA 94720 USA Univ Calif Berkeley Lawrence Berkeley Natl Lab NERSC Berkeley CA 94720 USA

The Bethe-Salpeter eigenvalue problem is a dense structured eigenvalue problem arising from discretized Bethe-Salpeter equation in the context of computing exciton energies and states. A computational challenge is that at least half of the eigenvalues and the associated eigenvectors are desired in practice. We establish the equivalence between Bethe-Salpeter eigenvalue problems and real Hamiltonian eigenvalue problems. Based on theoretical analysis, structure preserving algorithms for a class of Bethe-Salpeter eigenvalue problems are proposed. We also show that for this class of problems all eigenvalues obtained from the Tamm-Dancoff approximation are overestimated. In order to solve large scale problems of practical interest, we discuss parallel implementations of our algorithms targeting distributed memory systems. Several numerical examples are presented to demonstrate the efficiency and accuracy of our algorithms. (C) 2015 Elsevier Inc. All rights reserved.

关键词： Bethe-Salpeter equation Tamm-Dancoff approximation Hamiltonian eigenvalue problems Structure preserving algorithms parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

EFFICIENT parallel algorithms AND VLSI ARCHITECTURES FOR MANIPULATOR JACOBIAN COMPUTATION

引用

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS 1989年第5期19卷 1154-1166页

作者： YEUNG, TB LEE, CSG PURDUE UNIV SCH ELECT ENGN W LAFAYETTE IN 47907 USA

The real-time computation of the Jacobian that relates the manipulator joint velocities to the linear and angular velocities of the manipulator end-effector is pursued. Since the Jacobian can be expressed in the form of a first-order linear recurrence, the time lower bound for computing the Jacobian can be proved to be of order O(N) on uniprocessor computers and of order O(log/sub 2/ N) on both single-instruction-stream-multiple-data-stream (SIMD) and VLSI pipelined parallel processors, where N is the number of links of the manipulator. To achieve the lower bound, the authors developed a generalized-k method for uniprocessor computers, a parallel forward and backward recursive doubling algorithm (PFABRD) for SIMD computers, and a parallel systolic architecture for VLSI pipelines. All the methods are capable of computing the Jacobian at any desired reference coordinate frame k from the base coordinate frame to the end-effector coordinate frame. The computational effort in terms of floating-point operations is minimal when k is in the range (4,N-3) for the generalized-k method, and k=(N+1)/2 for both the PFABRD algorithm and the parallel pipeline.< >

关键词： parallel algorithms Very large scale integration Computer architecture Jacobian matrices Concurrent computing Manipulators Pipelines Vectors Angular velocity Robot kinematics

来源：评论

学校读者我要写书评

暂无评论

OPTIMAL parallel algorithms FOR FINDING CUT VERTICES AND BRIDGES OF INTERVAL-GRAPHS

引用

INFORMATION PROCESSING LETTERS 1992年第4期42卷 229-234页

作者： SPRAGUE, AP KULKARNI, KH Department of Computer and Information Sciences University of Alabama at Birminghan Birmingham AL 35294 USA Rust College Holly Springs MS 38635 USA

We present 0(log n) time algorithms in the EREW PRAM model, using n /log n processors, to find cut vertices, bridges, and blocks (often called biconnected components) of an interval graph having n vertices. It is assumed the interval graph is represented by an interval model, with ends presorted. If the ends are not presorted, our algorithms, preceded by an optimal sort, form an 0(log n) time algorithm using n processors, which is shown to be optimal. The algorithms rely heavily on the parallel prefix algorithm.

关键词： parallel algorithms INTERVAL GRAPHS CUT VERTICES BRIDGES

来源：评论

学校读者我要写书评

暂无评论

Can parallel algorithms enhance serial implementation?

引用

COMMUNICATIONS OF THE ACM 1996年第9期39卷 88-91页

作者： Vishkin, U UNIV MARYLAND DEPT ELECT ENGNCOLLEGE PKMD 20742 TEL AVIV UNIV DEPT COMP SCIIL-69978 TEL AVIVISRAEL

Performance improvement is a fundamental concern in computer science and engineering. Observing the history of the field, one would expect that any improvement in the ability of computer systems would be quickly met by applications utilizing it. This article presents a software-centric approach, in which ease of programming is a first priority for both uniprocessors and multiprocessors. This article outlines two concrete reasons and one general reason why parallel programs could give gain in performance over serial code on uniprocessors, especially with the current trends in uniprocessor architecture.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

SCALABLE DATA-parallel algorithms FOR TEXTURE SYNTHESIS USING GIBBS RANDOM-FIELDS

引用

IEEE TRANSACTIONS ON IMAGE PROCESSING 1995年第10期4卷 1456-1460页

作者： BADER, DA JALA, J CHELLAPPA, R UNIV MARYLAND INST ADV COMP STUDIESCOLLEGE PKMD 20742

This correspondence introduces scalable data parallel algorithms for image processing. Focusing on Gibbs and Markov random field model representation for textures, we present parallel algorithms for texture synthesis, compression, and maximum likelihood parameter estimation, currently implemented on Thinking Machines CM-2 and CM-5. Use of fine-grained, data parcel processing techniques yields real-time algorithms for texture synthesis and compression that are substantially faster than the previously known sequential implementations. Although current implementations are on Connection Machines, the methodology presented here enables machine-independent scalable algorithms for a number of problems in image processing and analysis.

关键词： parallel algorithms Image restoration Noise shaping Signal to noise ratio Signal processing algorithms Least squares approximation Signal restoration Optimized production technology Image processing Image coding

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms of Multi-relaxation-time Lattice Boltzmann Method

Parallel Algorithms of Multi-relaxation-time Lattice Boltzma...

引用

1st International Conference on Advanced algorithms and Control Engineering, ICAACE 2018

作者： Xu, Lei Cheng, Pan Liu, Zhixiang Zhang, Wu School of Computer Engineering and Science Shanghai University Shanghai200444 China Shanghai Aircraft Design and Research Institute Shanghai201210 China College of Information Shanghai Ocean University Shanghai201306 China

The lattice Boltzmann method has become an attractive and promising approach in computational fluid dynamics. In this paper, the D3Q19 multi-relaxation-time lattice Boltzmann method is employed to simulate complex fluid flow and its parallel algorithm is presented including Cartesian grid generation, domain decomposition method, and data exchange strategy on clusters. Considering load balancing on large scale cluster, details of domain decomposition method are presented. The numerical results show that the presented algorithm have considerable scalability on 2048 cores and the efficiency can achieve 92.01%. © Published under licence by IOP Publishing Ltd.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

NEW APPROACHES TO DERIVING parallel algorithms

引用

parallel COMPUTING 1990年第1-3期15卷 261-265页

作者： TYRTYSHNIKOV, EE Dept of Numerical Mathematics of the USSR Acad of Sciences Moscow Russia

A method is proposed for converting an algorithm admitting no parallel treatment into a new algorithm, in essence, with much better parallel properties. The method is intended for tackling the so called T-algorithms, the term ensuing from first examples of such algorithms concerned in the context of Toeplitz-like matrices. Generalized T-algorithms are also considered.

关键词： Multilinear forms parallel algorithms Toeplitz matrices

来源：评论

学校读者我要写书评

暂无评论

NEW SEQUENTIAL AND parallel algorithms FOR INTERVAL GRAPH RECOGNITION

引用

INFORMATION PROCESSING LETTERS 1990年第4期34卷 215-219页

作者： RAMALINGAM, G RANGAN, CP INDIAN INST TECHNOL DEPT COMP SCI & ENGN MADRAS 600036 TAMIL NADU INDIA

A characterization of interval graphs is used to arrive at an O(n2) interval graph recognition algorithm which also builds an interval representation for interval graphs. The algorithm is fairly simple and directly yi... 详细信息

关键词： Design of algorithms parallel algorithms interval graphs

来源：评论

学校读者我要写书评

暂无评论

Efficient sequential and parallel algorithms for record linkage

引用

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION 2014年第2期21卷 252-262页

作者： Abdullah-Al Mamun Mi, Tian Aseltine, Robert Rajasekaran, Sanguthevar Univ Connecticut Dept Comp Sci & Engn Storrs CT 06269 USA Univ Connecticut Publ Hlth Res Inst E Hartford CT USA

Background and objective Integrating data from multiple sources is a crucial and challenging problem. Even though there exist numerous algorithms for record linkage or deduplication, they suffer from either large time needs or restrictions on the number of datasets that they can integrate. In this paper we report efficient sequential and parallel algorithms for record linkage which handle any number of datasets and outperform previous algorithms. Methods Our algorithms employ hierarchical clustering algorithms as the basis. A key idea that we use is radix sorting on certain attributes to eliminate identical records before any further processing. Another novel idea is to form a graph that links similar records and find the connected components. Results Our sequential and parallel algorithms have been tested on a real dataset of 1083878 records and synthetic datasets ranging in size from 50000 to 9000000 records. Our sequential algorithm runs at least two times faster, for any dataset, than the previous best-known algorithm, the two-phase algorithm using faster computation of the edit distance (TPA (FCED)). The speedups obtained by our parallel algorithm are almost linear. For example, we get a speedup of 7.5 with 8 cores (residing in a single node), 14.1 with 16 cores (residing in two nodes), and 26.4 with 32 cores (residing in four nodes). Conclusions We have compared the performance of our sequential algorithm with TPA (FCED) and found that our algorithm outperforms the previous one. The accuracy is the same as that of this previous best-known algorithm.

关键词： Data Integration Healthcare Records algorithms parallel algorithms Speedups

来源：评论

学校读者我要写书评

暂无评论

Splitting methods parallel algorithms for problems of pollution transport in atmosphere

引用

Journal of Automation and Information Sciences 2014年第10期46卷 58-71页

作者： Gladky, A.V. Blagoveshchenskaya, T.Yu. Bohaienko, V.A. V.M. Glushkov Institute of Cybernetics National Academy of Sciences of Ukraine Kiev Ukraine

The problem of mathematical modeling of the spread of contamination from point sources in the air has been considered. An approach that uses the idea of splitting and organization of computation with explicit difference schemes of a traveling wave has been proposed for the numerical solution of multi-dimensional convection-diffusion equations. Problems of construction of splitting difference schemes, approximation and stability by the initial data have been investigated. parallel algorithms for GPU and cluster systems have been proposed. © 2014 by Begell House Inc.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：