检索结果-内蒙古大学图书馆

A parallel algorithm for generating chain code of objects in binary images

INFORMATION SCIENCES 2003年第4期149卷 219-234页

作者： Chia, TL Wang, KB Chen, LR Chen, Z Chung Cheng Inst & Technol Dept Elect Engn Taoyuan 33509 Taiwan Natl Chiao Tung Univ Inst Comp Sci & Informat Engn Hsinchu 30050 Taiwan

This paper addresses parallel execution of chain code generation on a linear array architecture. The contours in the proposed algorithm are viewed as a set of edges (or contour segments) that can be traced by a top-down contour tracing method to generate the chain codes for the outer and inner object contours. A parallel algorithm that contains the chain code generating rules and operations needed is also described, and the algorithm is mapped onto a one-dimensional systolic array containing [(1)/(2)(N + 1)] processing elements (PEs) to devise this architecture. The architecture extracts the contours of objects and quickly generates the corresponding chain codes after the image data in all rows are inputted in a linear fashion. The total processing time for generating the chain codes in an N x N image is O(3N). By doing so, the real-time requirement is fulfilled and its execution time is independent of the image content. In addition, a partition method is developed to process an image when the parallel architecture has a fixed number of PEs;say two or more. The total execution time for an N x N image by employing a fixed number of PEs is N(N + 1)/M + 2(M - 1), when M is the fixed number of PEs. (C) 2002 Elsevier Science Inc. All rights reserved.

关键词： chain code parallel algorithm chain collection chain linking

来源：评论

学校读者我要写书评

暂无评论

Efficient parallel algorithm for the two-dimensional diffusion equation subject to specification of mass

引用

INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS 1997年第1-2期64卷 153-163页

作者： Gumel, AB Ang, WT Twizell, EH UNIV MALAYSIA SARAWAK FAC ENGNKOTA SAMARAHAN 94300SARAWAKMALAYSIA BRUNEL UNIV DEPT MATH & STATUXBRIDGE UB8 3PHMIDDXENGLAND

An efficient L-0-stable parallel algorithm is developed for the two-dimensional diffusion equation with non-local time-dependent boundary conditions. The algorithm is based on subdiagonal Pade approximation to the matrix exponentials arising from the use of the method of lines and may be implemented on a parallel architecture using two processors running concurrently with each processor employing the use of tridiagonal solvers at every time-step. The algorithm is tested on two model problems from the literature for which discontinuities between initial and boundary conditions exist. The CPU times together with the associated error estimates are compared.

关键词： Padi approximant parallel algorithm L-o-stability

来源：评论

学校读者我要写书评

暂无评论

A FAST parallel algorithm FOR FINDING THE LARGEST COMMON 4-CONNECTED COMPONENT FROM TWO MATRICES

引用

TEHNICKI VJESNIK-TECHNICAL GAZETTE 2016年第4期23卷 979-984页

作者： Gao, Ying Liu, Haoshen Huang, Jiancong Duan, Jiajie Mu, Lei South China Univ Technol Sch Comp Sci & Engn Waihuan Dong Rd 382 Guangzhou Guangdong Peoples R China YunNan Elect Power Test & Res Inst Grp CO Ltd Kunming Peoples R China Huanggang Dong Rd Jinan Shangdong Peoples R China

We describe a new design of parallel algorithm for solving the two-dimensional longest common substring (2D LCS) problem, taking advantage of the multi-core graphic processing unit architecture offered by Compute Unified Device Architecture (CUDA). In this article we also define the 2D LCS problem as finding the largest common 4-connected component from two input matrices and present an algorithm which can exactly solve this problem in 0 (mnst/P) time with a P-core GPU.

关键词： CUDA largest common 4-connected component parallel algorithm 2DLCS

来源：评论

学校读者我要写书评

暂无评论

An efficient parallel algorithm for the calculation of canonical MP2 energies

引用

JOURNAL OF COMPUTATIONAL CHEMISTRY 2002年第12期23卷 1150-1156页

作者： Baker, J Pulay, P Parallel Quantum Solut Fayetteville AR 72703 USA Univ Arkansas Dept Chem Fayetteville AR 72701 USA

We present the parallel version of a previous serial algorithm for the efficient calculation of canonical MP2 energies (Pula.y. P.;Saebo, S.:, Wolinski, K. Chem Phys Lett 2001, 344, 543), It is based on the Saeho-Almlof direct-integral transformation. coupled with an efficient prescreening of the AO integrals. The parallel algorithm avoids synchronization delays by spawning a second set of slaves during the bin-sort prior to the second half-transformation, Results are presented for systems with up to 2000 basis functions. MP2 energies for molecule,, with 400-500 basis functions can be routinely calculated to microhartree accuracy on a small number of processors, (6-8) in a matter of minutes with modem PC-based parallel computers.

关键词： canonical MP2 energies parallel algorithm Saebo-Almlof integral transformation

来源：评论

学校读者我要写书评

暂无评论

PSeIInv - A distributed memory parallel algorithm for selected inversion: The non-symmetric case

引用

parallel COMPUTING 2018年 74卷 84-98页

作者： Jacquelin, Mathias Lin, Lin Yang, Chao Lawrence Berkeley Natl Lab Computat Res Div Berkeley CA 94720 USA Univ Calif Berkeley Dept Math Berkeley CA 94720 USA

This paper generalizes the parallel selected inversion algorithm called PSeIInv to sparse non-symmetric matrices. We assume a general sparse matrix A has been decomposed as PAQ = LU on a distributed memory parallel machine, where L, U are lower and upper triangular matrices, and P, Q are permutation matrices, respectively. The PSeIInv method computes selected elements of A(-1). The selection is confined by the sparsity pattern of the matrix AT. Our algorithm does not assume any symmetry properties of A, and our parallel implementation is memory efficient, in the sense that the computed elements of A-T over-writes the sparse matrix L U in situ. PSeIInv involves a large number of collective data communication activities within different processor groups of various sizes. In order to minimize idle time and improve load balancing, tree-based asynchronous communication is used to coordinate all such collective communication. Numerical results demonstrate that PSeIInv can scale efficiently to 6,400 cores for a variety of matrices. (C) 2017 Elsevier B.V. All rights reserved.

关键词： Selected inversion parallel algorithm Non-symmetric High performance computation

来源：评论

学校读者我要写书评

暂无评论

An Online parallel algorithm for Recursive Estimation of Sparse Signals

引用

IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS 2016年第3期2卷 290-305页

作者： Yang, Yang Pesavento, Marius Zhang, Mengyi Palomar, Daniel P. Intel Deutschland GmbH D-85579 Neubiberg Germany Tech Univ Darmstadt Commun Syst Grp D-64283 Darmstadt Germany Chinese Univ Hong Kong Dept Comp Sci & Engn Hong Kong Hong Kong Peoples R China Hong Kong Univ Sci & Technol Dept Elect & Comp Engn Kowloon Hong Kong Peoples R China

In this paper, we consider a recursive estimation problem for linear regression where the signal to be estimated admits a sparse representation and measurement samples are only sequentially available. We propose a convergent parallel estimation scheme that consists of solving a sequence of l(1)-regularized least-square problems approximately. The proposed scheme is novel in three aspects: 1) all elements of the unknown vector variable are updated in parallel at each time instant, and the convergence speed is much faster than state-of-the-art schemes which update the elements sequentially;2) both the update direction and stepsize of each element have simple closed-form expressions, so the algorithm is suitable for online(real-time) implementation;and 3) the stepsize is designed to accelerate the convergence but it does not suffer from the common intricacy of parameter tuning. Both centralized and distributed implementation schemes are discussed. The attractive features of the proposed algorithm are also illustrated numerically.

关键词： LASSO linear regression minimization stepsize rule parallel algorithm recursive estimation sparse signal processing stochastic optimization

来源：评论

学校读者我要写书评

暂无评论

AN IMPROVED parallel algorithm FOR CONSTRUCTING VORONOI DIAGRAM ON A MESH-CONNECTED COMPUTER

引用

parallel COMPUTING 1991年第4-5期17卷 505-514页

作者： JEONG, CS Department of Computer Science Pohang Institute of Science and Technology P.O. Box 125 Pohang Kyungbuk 680 Korea

While constructing a Voronoi diagram V(P) for a set of P of n points on a mesh-connected computer (MCC), it is necessary to find a set B of edges which are intersected by the dividing chain C during the merge process of two Voronoi diagrams V(L) and V(R), where L and R contain the leftmost [n/2] points and the rightmost [n/2] points of P respectively. The computation of B requires two operations: First decide for each edge e in V(L) and V(R) whether its end vertices are closer to L or R, and then from that information, determine whether e is intersected by C. However, in the previous parallel algorithm each of the former and latter operations requires planar point location which takes O(square-root n) time on square-root n x square-root n MCC, and in addition the former operation needs to compute convex hulls of L and R. In this paper, we shall show that the latter operation can be done in O(1) time without executing planar point location and the former operation can be executed without the computation of convex hulls. Therefore, the computation of B is reduced to only one planar point location.

关键词： parallel algorithm VORONOI DIAGRAM MESH-CONNECTED COMPUTER

来源：评论

学校读者我要写书评

暂无评论

A FAST parallel algorithm FOR SELECTED INVERSION OF STRUCTURED SPARSE MATRICES WITH APPLICATION TO 2D ELECTRONIC STRUCTURE CALCULATIONS

引用

SIAM JOURNAL ON SCIENTIFIC COMPUTING 2011年第3期33卷 1329-1351页

作者： Lin, Lin Yang, Chao Lu, Jianfeng Ying, Lexing E, Weinan Princeton Univ Program Appl & Computat Math Princeton NJ 08544 USA Univ Calif Berkeley Lawrence Berkeley Lab Computat Res Div Berkeley CA 94720 USA Courant Inst Math Sci Dept Math New York NY 10012 USA Univ Texas Austin Dept Math Austin TX 78712 USA Univ Texas Austin ICES Austin TX 78712 USA Princeton Univ Dept Math Princeton NJ 08544 USA Princeton Univ PACM Princeton NJ 08544 USA

An efficient parallel algorithm is presented for computing selected components of A(-1) where A is a structured symmetric sparse matrix. Calculations of this type are useful for several applications, including electronic structure analysis of materials in which the diagonal elements of the Green's functions are needed. The algorithm proposed here is a direct method based on a block LDLT factorization. The selected elements of A(-1) we compute lie in the nonzero positions of L+L-T. We use the elimination tree associated with the block LDLT factorization to organize the parallel algorithm, and reduce the synchronization overhead by passing the data level by level along this tree using the technique of local buffers and relative indices. We demonstrate the efficiency of our parallel implementation by applying it to a discretized two dimensional Hamiltonian matrix. We analyze the performance of the parallel algorithm by examining its load balance and communication overhead, and show that our parallel implementation exhibits an excellent weak scaling on a large-scale high performance distributed-memory parallel machine.

关键词： selected inversion parallel algorithm electronic structure calculation

来源：评论

学校读者我要写书评

暂无评论

AN OPTIMAL parallel algorithm FOR THE EUCLIDEAN DISTANCE MAPS OF 2-D BINARY IMAGES

引用

INFORMATION PROCESSING LETTERS 1995年第5期54卷 295-300页

作者： FUJIWARA, A MASUZAWA, T FUJIWARA, H Graduate School of Information Science Nara Institute of Science and Technology (NAIST) 8916-5 Takayama Ikoma Nara 630-01 Japan

This paper presents a PRAM algorithm for computing the n x n Euclidean distance map. This algorithm can be performed in O(log n) time using n(2)/log n processors on the EREW PRAM and in O(log n/log log n) time using n(2) log log n/log n processors on the common CRCW PRAM, respectively. This algorithm is also applicable to many distance maps, for example, cityblock, chessboard, octagonal and chamfer distance maps.

关键词： EUCLIDEAN DISTANCE MAP IMAGE PROCESSING parallel algorithm PRAM

来源：评论

学校读者我要写书评

暂无评论

Study on high speed parallel algorithm using PC grid environment for visualization measurements by Digital Holographic Particle Tracking Velocimetry

引用

COMPUTER PHYSICS COMMUNICATIONS 2008年第1期178卷 1-7页

作者： Satake, Shin-Ichi Anraku, Takafumi Kanamori, Hiroyuki Kunugi, Tomoaki Sato, Kazuho Ito, Tomoyoshi Tokyo Univ Sci Dept Appl Elect Noda Chiba 2788510 Japan Kyoto Univ Grad Sch Engn Dept Nucl Engn Kyoto 6068501 Japan Toyota Ind Corp Aichi 4488671 Japan Chiba Univ Japan Sci & Technol Agcy Inage Ku Chiba 2638522 Japan Chiba Univ Dept Elect & Mech Engn Inage Ku Chiba 2638522 Japan

A micro-digital holographic particle tracking velocimetry with high-speed system is constructed by a PC grid environment that employs Windows XP with AD-POWERs as parallel tool. Two algorithms for high-speed system are evaluated under the same PC grid environment. Both methods are based on a computer-generated hologram algorithm. One method is a division algorithm based on time development for the measurements, while the other is a division algorithm based on spatial reconstruction for the measurement. In case of the former, the performance is increased by a factor of 3.3 by using 4 PCs. The present system can compute huge hologram images and output them "on-site" at an experimental facility. (c) 2007 Elsevier B.V. All fights reserved.

关键词： parallel algorithm grid computing image processing digital holography particle tracking Velocimetry microflow

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：