Search results - Inner Mongolia University Library

An efficient parallel algorithm for the layered planar monotone circuit value problem

ALGORITHMICA, 1997, vol. 18, no. 3, pp. 384-404

Authors: Ramachandran, V; Yang, HH
Affiliation: Department of Computer Sciences, University of Texas at Austin, Austin, TX 78712, USA. vlr@cs.utexas.edu, yanghh@cs.utexas.edu

A planar monotone circuit (PMC) is a Boolean circuit that can be embedded in the plane and that contains only AND and OR gates. A layered PMC is a PMC in which all input nodes are in the external face, and the gates can be assigned to layers in such a way that every wire goes between gates in successive layers. Goldschlager, Cook and Dymond, and others have developed NC^2 algorithms to evaluate a layered PMC when the output node is in the same face as the input nodes. These algorithms require a large number of processors (Omega(n^6), where n is the size of the input circuit). In this paper we give an efficient parallel algorithm that evaluates a layered PMC of size n in O(log^2 n) time using only a linear number of processors on an EREW PRAM. Our parallel algorithm is the best possible to within a polylog factor, and is a substantial improvement over the earlier algorithms for the problem.

Keywords: circuit value problem; planar monotone circuit; plane graph; parallel algorithm; EREW PRAM
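
To make the layered-circuit definition in the abstract above concrete, here is a minimal sequential sketch in Python. It is not the parallel algorithm from the paper: it simply evaluates one layer after another. The gate encoding `(op, i, j)` and the function name are assumptions made for illustration, and the code does not check planarity.

```python
def evaluate_layered_circuit(inputs, layers):
    """Evaluate a layered monotone circuit one layer at a time.

    inputs: list of bools (the input layer).
    layers: list of layers; each gate is a tuple (op, i, j) where op is
            "AND" or "OR" and i, j index values in the previous layer.
            This encoding is hypothetical, chosen only for this sketch.
    Returns the list of values produced by the last layer.
    """
    values = list(inputs)
    for layer in layers:
        next_values = []
        for op, i, j in layer:
            if op == "AND":
                next_values.append(values[i] and values[j])
            elif op == "OR":
                next_values.append(values[i] or values[j])
            else:
                # Monotone circuits contain only AND and OR gates.
                raise ValueError(f"unknown gate type: {op}")
        values = next_values
    return values


# Example: three inputs, two gates in the first layer, one output gate.
example_layers = [
    [("AND", 0, 1), ("OR", 1, 2)],
    [("OR", 0, 1)],
]
output = evaluate_layered_circuit([True, False, True], example_layers)
```

Because every wire connects successive layers, each layer depends only on the one before it; the sequential loop above takes time proportional to the circuit size, whereas the paper targets polylogarithmic parallel time.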

Exponentially convergent parallel algorithm for nonlinear eigenvalue problems

IMA JOURNAL OF NUMERICAL ANALYSIS, 2007, vol. 27, no. 4, pp. 818-838

Authors: Gavrilyuk, I. P.; Klimenko, A. V.; Makarov, V. L.; Rossokhata, N. O.
Affiliations: Natl Acad Sci Ukraine Inst Math Dept Numer Math UA-01601 Kiev 4 Ukraine; Staatliche Studienakad Berufsakad Thuringen D-99817 Eisenach Germany

A new algorithm for nonlinear eigenvalue problems is proposed. The numerical technique is based on a perturbation of the coefficients of differential equation combined with the Adomian decomposition method for the nonlinear part. The approach provides an exponential convergence rate with a base which is inversely proportional to the index of the eigenvalue under consideration. The eigenpairs can be computed in parallel. Numerical examples are presented to support the theory. They are in good agreement with the spectral asymptotics obtained by other authors.

Keywords: nonlinear eigenvalue problem; parallel algorithm; exponentially convergent algorithm

Performance analysis of a parallel algorithm for restoring large-scale CT images

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2017, vol. 310, pp. 104-114

Authors: Harizanov, Stanislav; Lirkov, Ivan; Georgiev, Krassimir; Paprzycki, Marcin; Ganzha, Maria
Affiliations: Bulgarian Acad Sci Inst Informat & Commun Technol Acad G Bonchev Bl 25A BU-1113 Sofia Bulgaria; Polish Acad Sci Syst Res Inst Ul Newelska 6 PL-01447 Warsaw Poland; Warsaw Univ Technol Dept Math & Informat Sci Ul Koszykowa 75 PL-00661 Warsaw Poland; Warsaw Management Acad Dept Management & Tech Sci Ul Kaweczynska 36 PL-03772 Warsaw Poland

In multiple areas of image processing, such as Computed Tomography, in which data acquisition is based on counting particles that hit a detector surface, Poisson noise occurs. Using variance-stabilizing transformations, the Poisson noise can be approximated by a Gaussian one, for which classical denoising filters can be used. This paper presents an experimental performance study of a parallel implementation of the Poissonian image restoration algorithm, introduced in Harizanov et al. (2013). Hybrid parallelization based on MPI and OpenMP standards is investigated. The convergence rate of the algorithm heavily depends on both the image size and the choice of input parameters (rho, sigma), thus maximizing its parallel efficiency is vital for real-life applications. The implementation is tested for high-resolution radiographic images, on Linux clusters with Intel processors and on an IBM supercomputer. (C) 2016 Elsevier B.V. All rights reserved.

Keywords: Primal-dual algorithm; Anscombe transform; Image restoration; parallel algorithm; Epigraphical projection
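
The variance-stabilizing step mentioned in the abstract above can be illustrated with the Anscombe transform named in the keywords. The following is a small standard-library sketch, not the restoration algorithm from the paper; the helper names, the Poisson sampler, and the sample size are assumptions chosen for illustration.

```python
import math
import random


def anscombe(x):
    """Anscombe transform: maps a Poisson count to a value whose
    variance is approximately 1 when the Poisson mean is not too small."""
    return 2.0 * math.sqrt(x + 3.0 / 8.0)


def inverse_anscombe(y):
    """Plain algebraic inverse (known to be biased for low counts)."""
    return (y / 2.0) ** 2 - 3.0 / 8.0


def sample_poisson(lam, rng):
    """Knuth's multiplication method; adequate for moderate lam."""
    limit = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def transformed_variance(lam, n=20000, seed=0):
    """Sample variance of Anscombe-transformed Poisson(lam) draws."""
    rng = random.Random(seed)
    ys = [anscombe(sample_poisson(lam, rng)) for _ in range(n)]
    mean = sum(ys) / n
    return sum((y - mean) ** 2 for y in ys) / (n - 1)
```

Calling `transformed_variance(20.0)` should give a value close to 1 regardless of the Poisson mean, which is what allows a Gaussian denoiser with a fixed noise level to be applied after the transform.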

A cost optimal parallel algorithm for weighted distance transforms

PARALLEL COMPUTING, 1999, vol. 25, no. 4, pp. 405-416

Authors: Fujiwara, A; Inoue, M; Masuzawa, T; Fujiwara, H
Affiliations: Kyushu Inst Technol Dept Comp Sci & Elect Iizuka Fukuoka 8208502 Japan; Nara Inst Sci & Technol Grad Sch Informat Sci Nara 6300101 Japan

The distance transform and the nearest feature transform are useful operations in image processing. These transforms are based on various kinds of distance functions because the distance functions have different efficiency or usefulness. In this paper, we consider these transforms based on the weighted distance, which is a generalization of many distances, such as city block, chessboard and chamfer distances. This paper presents a parallel algorithm for these transforms of an n x n binary image. The algorithm runs in O(log n) time using n^2/log n processors on the EREW PRAM and in O(log log n) time using n^2/log log n processors on the common CRCW PRAM. The algorithm also runs in O(n^2/p^2 + n) time on a p x p mesh and in O(n^2/p^2 + (n log p)/p) time on a p^2-processor hypercube (for 1 <= p <= n). From these complexities, the algorithm is cost optimal on all models. Also we obtained an Omega(log n) lower bound for the transform on the CREW PRAM. This implies that the algorithm is time optimal on the EREW PRAM. (C) 1999 Elsevier Science B.V. All rights reserved.

Keywords: parallel algorithm; distance transform; nearest feature transform; PRAM; mesh; hypercube
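
As a reference point for the weighted distance described in the abstract above, here is a sketch of the classic sequential two-pass chamfer distance transform in Python. It is not the parallel algorithm from the paper. The parameters `a` (cost of a horizontal or vertical step) and `b` (cost of a diagonal step) are illustrative names, and treating nonzero pixels as feature pixels is an assumption. The sketch assumes a <= b <= 2a.

```python
INF = float("inf")


def weighted_distance_transform(image, a=3, b=4):
    """Two-pass chamfer distance transform of a binary image.

    image: list of rows; nonzero entries are feature pixels (assumed).
    a, b:  step costs; (1, 2) gives city block, (1, 1) chessboard,
           (3, 4) a common chamfer distance.
    Returns a grid of distances to the nearest feature pixel.
    """
    rows, cols = len(image), len(image[0])
    dist = [[0 if image[r][c] else INF for c in range(cols)]
            for r in range(rows)]
    # Neighbours already visited in a top-left to bottom-right scan.
    forward = [(-1, -1, b), (-1, 0, a), (-1, 1, b), (0, -1, a)]
    # Neighbours already visited in the reverse scan.
    backward = [(1, 1, b), (1, 0, a), (1, -1, b), (0, 1, a)]

    def relax(r, c, mask):
        for dr, dc, w in mask:
            rr, cc = r + dr, c + dc
            if 0 <= rr < rows and 0 <= cc < cols:
                dist[r][c] = min(dist[r][c], dist[rr][cc] + w)

    for r in range(rows):
        for c in range(cols):
            relax(r, c, forward)
    for r in reversed(range(rows)):
        for c in reversed(range(cols)):
            relax(r, c, backward)
    return dist


# A 3 x 3 image with a single feature pixel in the centre.
centre = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
city_block = weighted_distance_transform(centre, a=1, b=2)
chessboard = weighted_distance_transform(centre, a=1, b=1)
```

The two scans do O(n^2) work on an n x n image, which is the sequential cost that a cost-optimal parallel algorithm has to match in total work.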

An efficient parallel algorithm for the efficient domination problem on distance-hereditary graphs

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2002, vol. 13, no. 9, pp. 985-993

Author: Hsieh, SY
Affiliation: Natl Cheng Kung Univ Dept Comp Sci & Informat Engn Tainan 701 Taiwan

In the literature, there are quite a few sequential and parallel algorithms for solving problems on distance-hereditary graphs. With an n-vertex and m-edge distance-hereditary graph G, we show that the efficient domination problem on G can be solved in O(log^2 n) time using O(n + m) processors on a CREW PRAM. Moreover, if a binary tree representation of G is given, the problem can be optimally solved in O(log n) time using O(n/log n) processors on an EREW PRAM.

Keywords: parallel algorithm; PRAM; distance-hereditary graphs; the efficient domination problem; binary tree contraction technique

PSelInv - A distributed memory parallel algorithm for selected inversion: The non-symmetric case

PARALLEL COMPUTING, 2018, vol. 74, pp. 84-98

Authors: Jacquelin, Mathias; Lin, Lin; Yang, Chao
Affiliations: Lawrence Berkeley Natl Lab Computat Res Div Berkeley CA 94720 USA; Univ Calif Berkeley Dept Math Berkeley CA 94720 USA

This paper generalizes the parallel selected inversion algorithm called PSelInv to sparse non-symmetric matrices. We assume a general sparse matrix A has been decomposed as PAQ = LU on a distributed memory parallel machine, where L, U are lower and upper triangular matrices, and P, Q are permutation matrices, respectively. The PSelInv method computes selected elements of A^(-1). The selection is confined by the sparsity pattern of the matrix A^T. Our algorithm does not assume any symmetry properties of A, and our parallel implementation is memory efficient, in the sense that the computed elements of A^(-T) overwrite the sparse matrix L + U in situ. PSelInv involves a large number of collective data communication activities within different processor groups of various sizes. In order to minimize idle time and improve load balancing, tree-based asynchronous communication is used to coordinate all such collective communication. Numerical results demonstrate that PSelInv can scale efficiently to 6,400 cores for a variety of matrices. (C) 2017 Elsevier B.V. All rights reserved.

Keywords: Selected inversion; parallel algorithm; Non-symmetric; High performance computation

An Online Parallel Algorithm for Recursive Estimation of Sparse Signals

IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2016, vol. 2, no. 3, pp. 290-305

Authors: Yang, Yang; Pesavento, Marius; Zhang, Mengyi; Palomar, Daniel P.
Affiliations: Intel Deutschland GmbH D-85579 Neubiberg Germany; Tech Univ Darmstadt Commun Syst Grp D-64283 Darmstadt Germany; Chinese Univ Hong Kong Dept Comp Sci & Engn Hong Kong Hong Kong Peoples R China; Hong Kong Univ Sci & Technol Dept Elect & Comp Engn Kowloon Hong Kong Peoples R China

In this paper, we consider a recursive estimation problem for linear regression where the signal to be estimated admits a sparse representation and measurement samples are only sequentially available. We propose a convergent parallel estimation scheme that consists of solving a sequence of l_1-regularized least-square problems approximately. The proposed scheme is novel in three aspects: 1) all elements of the unknown vector variable are updated in parallel at each time instant, and the convergence speed is much faster than state-of-the-art schemes which update the elements sequentially; 2) both the update direction and stepsize of each element have simple closed-form expressions, so the algorithm is suitable for online (real-time) implementation; and 3) the stepsize is designed to accelerate the convergence but it does not suffer from the common intricacy of parameter tuning. Both centralized and distributed implementation schemes are discussed. The attractive features of the proposed algorithm are also illustrated numerically.

Keywords: LASSO; linear regression; minimization stepsize rule; parallel algorithm; recursive estimation; sparse signal processing; stochastic optimization
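
The abstract above notes that each element's update has a simple closed form. For an l_1-regularized least-squares objective 0.5*||A x - b||^2 + mu*||x||_1, minimizing over a single coordinate with the others held fixed is a soft-thresholding step. The Python sketch below computes these per-coordinate minimizers from the same current point, so they could be evaluated in parallel. It illustrates only that closed form; the paper's stepsize rule and online recursion are not reproduced, and all function names are assumptions. Columns of `A` are assumed to be nonzero.

```python
def soft_threshold(v, mu):
    """Closed-form minimizer of 0.5*(x - v)**2 + mu*abs(x)."""
    if v > mu:
        return v - mu
    if v < -mu:
        return v + mu
    return 0.0


def coordinate_best_responses(A, b, x, mu):
    """Per-coordinate minimizers of 0.5*||A x - b||^2 + mu*||x||_1.

    Every coordinate is minimized with the others fixed at the same
    current point x, so the loop over k has no data dependencies.
    """
    m, n = len(A), len(x)
    residual = [sum(A[i][j] * x[j] for j in range(n)) - b[i]
                for i in range(m)]
    responses = []
    for k in range(n):
        d_k = sum(A[i][k] ** 2 for i in range(m))       # curvature
        grad_k = sum(A[i][k] * residual[i] for i in range(m))
        r_k = d_k * x[k] - grad_k
        responses.append(soft_threshold(r_k, mu) / d_k)
    return responses


# With an identity matrix the problem separates, so the best
# responses are the exact solution.
identity = [[1.0, 0.0], [0.0, 1.0]]
solution = coordinate_best_responses(identity, [3.0, 0.5], [0.0, 0.0], 1.0)
```

When the columns of `A` are correlated, moving every coordinate fully to its best response at once can overshoot, which is why a parallel scheme combines the best responses with a stepsize.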

A new parallel algorithm for vertex priorities of data flow acyclic digraphs

JOURNAL OF SUPERCOMPUTING, 2014, vol. 68, no. 1, pp. 49-64

Authors: Mo, Zeyao; Zhang, Aiqing; Yang, Zhang
Affiliation: Inst Appl Phys & Computat Math Lab Computat Phys Beijing 100088 Peoples R China

Data flow acyclic directed graphs (digraphs) are widely used to describe the data dependency of mesh-based scientific computing. The parallel execution of such digraphs can approximately depict the flowchart of parallel computing. During the period of parallel execution, vertex priorities are key performance factors. This paper first takes the distributed digraph and its resource-constrained parallel scheduling as the vertex priorities model, and then presents a new parallel algorithm for the solution of vertex priorities using the well-known technique of forward-backward iterations. In particular, a more efficient vertex ranking strategy is proposed for each iteration. In the case of simple digraphs, both theoretical analysis and benchmarks show that the vertex priorities produced by such an algorithm will make the digraph scheduling time converge non-increasingly with the number of iterations. In other cases of non-simple digraphs, benchmarks also show that the new algorithm is superior to many traditional approaches. Embedding the new algorithm into the heuristic framework for the parallel sweeping solution of neutron transport applications, the new vertex priorities improve the performance by about 20% while the number of processors scales up from 32 to 2048.

Keywords: Acyclic digraph; parallel algorithm; Neutron transport

A parallel algorithm for the dynamic lot-sizing problem

COMPUTERS & INDUSTRIAL ENGINEERING, 2001, vol. 41, no. 2, pp. 127-134

Authors: Lyu, JJ; Lee, MC
Affiliation: Natl Cheng Kung Univ Dept Ind Management Sci Tainan 70101 Taiwan

The dynamic lot-sizing model (DLS) is one of the most frequently used models in production and inventory system because lot decisions can greatly affect the performance of the system. The practicality of DLS algorithms is hindered by the huge amount of computer resources required for solving these models, even for a modest problem. This study developed a parallel algorithm to solve the lot-sizing problem efficiently. Given that n is the size of the problem, the complexity of the proposed parallel algorithm is O(n(2)p) with p processors. Numerical experiments are provided to verify the complexity of the proposed algorithm. The empirical results demonstrate that the speedup of this parallel algorithm approaches linearity, which means that the proposed algorithm can take full advantage of the distributed computing power as the size of the problem increases. (C) 2001 Elsevier Science Ltd. All rights reserved.

Keywords: materials requirements planning; parallel algorithm; lot sizing models
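
The abstract above does not spell out the lot-sizing model. As a baseline for comparison, here is a sketch of the classic sequential dynamic program for uncapacitated lot sizing (the Wagner-Whitin formulation), written in Python. This is an assumed textbook model, not the parallel algorithm from the paper: a fixed setup cost per order, a linear holding cost per unit per period, no backlogging, and strictly positive demand in every period. All names are illustrative.

```python
def lot_sizing_cost(demand, setup_cost, holding_cost):
    """Minimum total cost of meeting demand over len(demand) periods.

    best[t] is the minimum cost of covering periods 0 .. t-1.
    An order placed in period j that covers periods j .. t pays one
    setup cost plus holding cost for every period a unit waits.
    """
    n = len(demand)
    best = [0.0] + [float("inf")] * n
    for j in range(n):                 # period in which an order is placed
        holding = 0.0
        for t in range(j, n):          # last period covered by that order
            # Units for period t are held for (t - j) periods.
            holding += holding_cost * (t - j) * demand[t]
            best[t + 1] = min(best[t + 1], best[j] + setup_cost + holding)
    return best[n]


cheap_setup = lot_sizing_cost([10, 10], setup_cost=5, holding_cost=1)
costly_setup = lot_sizing_cost([10, 10], setup_cost=20, holding_cost=1)
```

The double loop performs O(n^2) work. For a fixed `j`, the inner loop is sequential only through the running `holding` sum, which is one natural place to look for parallelism across values of `j`.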

pSIN: A scalable, Parallel algorithm for Seismic INterferometry of large-N ambient-noise data

COMPUTERS & GEOSCIENCES, 2016, vol. 93, pp. 88-95

Authors: Chen, Po; Taylor, Nicholas J.; Dueker, Ken G.; Keifer, Ian S.; Wilson, Andra K.; McGuffy, Casey L.; Novitsky, Christopher G.; Spears, Alec J.; Holbrook, W. Steven
Affiliation: Univ Wyoming Dept Geol & Geophys Laramie WY 82071 USA

Seismic interferometry is a technique for extracting deterministic signals (i.e., ambient-noise Green's functions) from recordings of ambient-noise wavefields through cross-correlation and other related signal processing techniques. The extracted ambient-noise Green's functions can be used in ambient noise tomography for constructing seismic structure models of the Earth's interior. The amount of calculations involved in the seismic interferometry procedure can be significant, especially for ambient noise datasets collected by large seismic sensor arrays (i.e., "large-N" data). We present an efficient parallel algorithm, named pSIN (parallel Seismic INterferometry), for solving seismic interferometry problems on conventional distributed-memory computer clusters. The design of the algorithm is based on a two-dimensional partition of the ambient-noise data recorded by a seismic sensor array. We pay special attention to the balance of the computational load, inter-process communication overhead and memory usage across all MPI processes and we minimize the total number of I/O operations. We have tested the algorithm using a real ambient-noise dataset and obtained a significant amount of savings in processing time. Scaling tests have shown excellent strong scalability from 80 cores to over 2000 cores. (C) 2016 Elsevier Ltd. All rights reserved.

Keywords: Seismic interferometry; Ambient-noise; parallel algorithm; Message-passing interface
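
The core operation named in the abstract above is the cross-correlation of recordings from pairs of sensors. The Python sketch below shows a direct time-domain cross-correlation and the enumeration of station pairs, to indicate why the workload grows quickly with array size. It is not the pSIN implementation; the function names and the lag convention are assumptions, and production codes typically work in the frequency domain.

```python
def cross_correlate(a, b, max_lag):
    """Time-domain cross-correlation of two equally sampled traces.

    Returns a dict mapping lag -> sum over t of a[t] * b[t + lag],
    for lags in [-max_lag, max_lag]. A positive peak lag means that
    b is a delayed copy of a (convention assumed for this sketch).
    """
    n = min(len(a), len(b))
    result = {}
    for lag in range(-max_lag, max_lag + 1):
        total = 0.0
        for t in range(n):
            s = t + lag
            if 0 <= s < n:
                total += a[t] * b[s]
        result[lag] = total
    return result


def station_pairs(num_stations):
    """All unordered pairs of distinct stations: N * (N - 1) / 2 pairs."""
    return [(i, j) for i in range(num_stations)
            for j in range(i + 1, num_stations)]


# Trace b is trace a delayed by one sample.
trace_a = [0.0, 1.0, 0.0, 0.0]
trace_b = [0.0, 0.0, 1.0, 0.0]
correlation = cross_correlate(trace_a, trace_b, max_lag=2)
peak_lag = max(correlation, key=correlation.get)
```

Since the number of pairs grows quadratically with the number of sensors, distributing pairs across processes while limiting how often each trace is read is the kind of balance the abstract refers to.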
