检索结果-内蒙古大学图书馆

A PRACTICAL parallel ALGORITHM FOR SOLVING BAND SYMMETRIC POSITIVE DEFINITE SYSTEMS OF LINEAR-EQUATIONS

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE 1987年第4期13卷 323-332页

作者： BARON, I NYU COURANT INST MATH SCINEW YORKNY 10012

We give a practical parallel algorithm for solving band symmetric positive definite systems of linear equations in O(m* log n) time using nm/log n processors. Here n denotes the system size and m its bandwidth. Hence, the algorithm is efficient. For tridiagonal systems, the algorithm runs in O(log n) time using n/log n processors. Furthermore, an improved version runs in O(log m log n) time using nm2/(log m log n) processors.

关键词： Band Systems of linear equations Gaussian elimination parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

PRACTICAL USE OF POLYNOMIAL PRECONDITIONINGS FOR THE CONJUGATE-GRADIENT METHOD

引用

SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING 1985年第4期6卷 865-881页

作者： SAAD, Y

This paper presents some practical ways of using polynomial preconditions for solving large sparse linear systems of equations issued from discretizations of partial differential equations. For a symmetric positive definite matrix A these techniques are based on least squares polynomials on the interval $[0,b]$ where b is the Gershgorin estimate of the largest eigenvalue. Therefore, as opposed to previous work in the field, there is no need for computing eigenvalues of A. We formulate a version of the conjugate gradient algorithm that is more suitable for parallel architectures and discuss the advantages of polynomial preconditioning in the context of these architectures.

关键词： conjugate gradient method polynomial preconditionings parallel algorithms vectorization

来源：评论

学校读者我要写书评

暂无评论

DISTRIBUTED AND SHARED MEMORY BLOCK algorithms FOR THE TRIANGULAR SYLVESTER EQUATION WITH SEP(-1) ESTIMATORS

引用

SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS 1992年第1期13卷 90-101页

作者： KAGSTROM, B POROMAA, P

Coarse grain message passing and shared memory algorithms for solving the quasi-triangular Sylvester equation are discussed. The basic algorithm is of block type, i.e., rich in matrix-matrix operations. The focus is on computing reliable estimates of the sep-1 function (a natural condition number for the Sylvester equation and the invariant subspace problem). Estimators based on the Frobenius norm and the 1-norm, respectively, are presented. Accuracy, efficiency, and reliability results are presented. The applicability of the estimators to both the shared memory and distributed memory paradigms are discussed. Some performance results of the parallel block algorithms with condition estimators are also presented. The reliability of both estimators are very good. The Frobenius norm-based estimator is much more efficient in both sequential and parallel settings (on average between four to five times). Further, it is applicable to both the standard and generalized problems.

关键词： SYLVESTER EQUATION parallel algorithms CONDITION NUMBER ESTIMATION

来源：评论

学校读者我要写书评

暂无评论

GRID IMPLEMENTATION OF parallel MULTIOBJECTIVE GENETIC ALGORITHM FOR OPTIMISED ALLOCATION OF CHLORINATION STATIONS IN DRINKING WATER DISTRIBUTION SYSTEMS: CHOJNICE CASE STUDY

引用

IFAC Proceedings Volumes 2006年第14期39卷 238-243页

作者： G. Ewald W. Kurek Mietek A. Brdys Department of Automatic Control Gdansk University of Technology ul. Narutowicza 11/12 80-952 Gdansk Poland Department of Electronic Electric and Computer Engineering The University of Birmingham Birmingham B15 2TT UK

There is a group of problems that require big amount of computing power to solve. Computer grids allow building effective computing platforms at relatively low cost. It is expected that algorithms like Genetic Algorithm will perform well on the grid. In this paper, grid implementation of multiobjective distributed genetic algorithm is proposed. A distributed version of the algorithm is based on a modified island algorithm where genetic data exchange is replaced by introduced new Forgetting Island Elitism. The algorithm is applied to booster station allocation in Chojnice water distribution system.

关键词： parallel algorithms parallel computation Multiobjective optimizations Genetic algorithms Computing systems Decision making Quality control Environmental engineering

来源：评论

学校读者我要写书评

暂无评论

THE SOLUTION OF SINGULAR-VALUE AND SYMMETRIC EIGENVALUE PROBLEMS ON MULTIPROCESSOR ARRAYS

引用

SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING 1985年第1期6卷 69-84页

作者： BRENT, RP LUK, FT CORNELL UNIV DEPT COMP SCIITHACANY 14853

parallel Jacobi-like algorithms are presented for computing a singular-value decomposition of an $m \times n$ matrix $(m \geqq n)$ and an eigenvalue decomposition of an $n \times n$ symmetric matrix. A linear array of $O(n)$ processors is proposed for the singular-value problem; the associated algorithm requires time $O(mnS)$, where S is the number of sweeps (typically $S \leqq 10$). A square array of $O(n^2 )$ processors with nearest-neighbor communication is proposed for the eigenvalue problem; the associated algorithm requires time $O(nS)$.

关键词： multiprocessor arrays systolic arrays singular-value decomposition eigenvalue decomposition real symmetric matrices Hestenes method Jacobi method VLSI real-time computation parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Locally adapted hierarchical basis preconditioning

引用

ACM TRANSACTIONS ON GRAPHICS 2006年第3期25卷 1135-1143页

作者： Szeliski, Richard Microsoft Corp Res Redmond WA 98052 USA

This paper develops locally adapted hierarchical basis functions for effectively preconditioning large optimization problems that arise in computer graphics applications such as tone mapping, gradient-domain blending, colorization, and scattered data interpolation. By looking at the local structure of the coefficient matrix and performing a recursive set of variable eliminations, combined with a simplification of the resulting coarse level problems, we obtain bases better suited for problems with inhomogeneous ( spatially varying) data, smoothness, and boundary constraints. Our approach removes the need to heuristically adjust the optimal number of preconditioning levels, significantly outperforms previously proposed approaches, and also maps cleanly onto data-parallel architectures such as modern GPUs.

关键词： computational photography Poisson blending colorization fast PDE solution multilevel techniques parallel algorithms GPU acceleration

来源：评论

学校读者我要写书评

暂无评论

DISTRIBUTED SPARSE GAUSSIAN-ELIMINATION AND ORTHOGONAL FACTORIZATION

引用

SIAM JOURNAL ON SCIENTIFIC COMPUTING 1995年第6期16卷 1462-1477页

作者： RAGHAVAN, P UNIV ILLINOIS NATL CTR SUPERCOMP APPLICATURBANAIL 61801

A unified framework is presented for a fully parallel solution of large, sparse nonsymmetric linear systems on distributed memory multiprocessors. Unlike earlier work, both symbolic and numeric steps are parallelized. parallel Cartesian nested dissection is used to compute a fill-reducing ordering of A using a compact representation of the column intersection graph, and the resulting separator tree is used to estimate the structure of the factor and to distribute data and perform, multifrontal numeric computations. When the matrix is nonsymmetric but square, the numeric computations involve Gaussian elimination with partial pivoting;when the matrix is overdetermined, row-oriented Householder transforms are applied to compute the triangular factor of an orthogonal factorization. Extensive empirical results are provided to demonstrate that the approach is effective both in preserving sparsity and achieving good parallel performance on an Intel iPSC/860.

关键词： parallel algorithms SPARSE LINEAR SYSTEMS SPARSE MATRIX FACTORIZATION GAUSSIAN ELIMINATION ORTHOGONAL FACTORIZATION NESTED DISSECTION

来源：评论

学校读者我要写书评

暂无评论

An Efficient Large-Scale Sensor Deployment Using a parallel Genetic Algorithm Based on CUDA

引用

INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS 2016年第3期12卷 8612128-8612128页

作者： Seo, Jae-Hyun Yoon, Yourim Kim, Yong-Hyuk Kwangwoon Univ Dept Comp Sci & Engn 20 Kwangwoon Ro Seoul 139701 South Korea Gachon Univ Dept Comp Engn 1342 Sengnamdaero Songnam 461701 Gyeonggi Do South Korea

We have employed evolutionary computation to solve the optimization problem of sensor deployment in battlefield environments. A genetic algorithm has the advantage of delivering results of a higher quality than simple computational algorithms, but it has the drawback of requiring too much computing time. This study aimed not only to shorten the computing time to as close to real-time as possible by using the Compute Unified Device Architecture (CUDA) but also to maintain a solution quality that is as good as or better than the case when the proposed algorithm is not used. In the proposed genetic algorithm, parallelization was applied to speed up the fitness evaluation requiring heavy computation time. The proposed CUDA-based design approach for complex and various sensor deployments is validated by means of simulation. We parallelized two parts in Monte Carlo simulation for the fitness evaluation: moving lots of tested vehicles and calculating the probability of detection (POD) for each vehicle. The experiment was divided into CPU and GPU experiments depending on arithmetic unit types. In the GPU experiment, the results showed similar levels for the detection probability by GPU and CPU, and the computing time decreased by approximately 55-56 times.

关键词： CUDA (Computer architecture) SENSOR placement parallel algorithms GENETIC algorithms LARGE scale systems EVOLUTIONARY computation

来源：评论

学校读者我要写书评

暂无评论

Two packet routing algorithms on a mesh-connected computer

引用

IEEE Transactions on parallel and Distributed Systems 1995年第4期6卷 436-440页

作者： Gu, Qian-Ping Gu, Jun Univ of Aizu Aizu-Wakamatsu Fukushima Japan

In this paper, we give two algorithms for the 1-1 routing problems on a mesh-connected computer. The first algorithm, with queue size 28, solves the 1-1 routing problem on an n × n mesh-connected computer in 2n + O(1) steps. This improves the previous queue size of 75. The second algorithm solves the 1-1 routing problem in 2n - 2 steps with queue size 12ts/s where ts is the time for sorting an s × s mesh into a row major order for all s ≥ 1. This result improves the previous queue size 18.67ts/s.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

NUMERICALLY STABLE SOLUTION OF DENSE SYSTEMS OF LINEAR-EQUATIONS USING MESH-CONNECTED PROCESSORS

引用

SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING 1984年第1期5卷 95-104页

作者： BOJANCZYK, A BRENT, RP KUNG, HT AUSTRALIAN NATL UNIV DEPT COMP SCICANBERRAACT 2600AUSTRALIA CARNEGIE MELLON UNIV DEPT COMP SCIPITTSBURGHPA 15213

We propose a multiprocessor structure for solving a dense system of n linear equations. The solution is obtained in two stages. First, the matrix of coefficients is reduced to upper triangular form via Givens rotations. Second, a back substitution process is applied to the triangular system. A two-dimensional array of $\theta (n^2 )$ processors is employed to implement the first step, and (using a previously known scheme) a one-dimensional array of $\theta (n)$ processors is employed to implement the second step. These processor arrays allow both stages to be carried out in time $O(n)$, and they are well suited for VLSI implementation as identical processors with a simple and regular interconnection pattern are required.

关键词： Givens method least squares linear systems numerical stability orthogonal factorization parallel algorithms QR method special-purpose hardware systolic arrays VLSI

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：