Search Results - Inner Mongolia University Library

Accelerating computed tomographic imaging spectrometer reconstruction using a parallel algorithm exploiting spatial shift-invariance

OPTICAL ENGINEERING, 2020, Vol. 59, No. 5, pp. 55110-55110

Authors: White, Larz; Bell, W. Bryan; Haygood, Ryan. Affiliation: Lockheed Martin Aeronaut, Ft Worth, TX 76108, USA

Computed tomographic imaging spectrometers capture hyperspectral images in real time. However, postprocessing the imagery can require enormous computational resources, limiting its application to non-real-time scenarios. To overcome these challenges, we developed a highly parallelizable algorithm that exploits spatial shift-invariance. To demonstrate the versatility of our algorithm, we developed implementations on a desktop and an embedded graphics processing unit. To our knowledge, our results show the fastest image reconstruction times reported. (C) 2020 Society of Photo-Optical Instrumentation Engineers (SPIE)

Keywords: GPU; CUDA; CTIS; computed tomography imaging spectrometer; parallel algorithm

Super-Exponentially Convergent Parallel Algorithm for Eigenvalue Problems with Fractional Derivatives

COMPUTATIONAL METHODS IN APPLIED MATHEMATICS, 2016, Vol. 16, No. 4, pp. 633-652

Authors: Demkiv, Ihor; Gavrilyuk, Ivan P.; Makarov, Volodymyr L. Affiliations: NAS Ukraine, Inst Math, 3 Tereshchenkivska Str, UA-01601 Kiev 4, Ukraine; Univ Cooperat Educ Eisenach, Wartenberg 2, D-99817 Eisenach, Germany

A new algorithm for eigenvalue problems for linear differential operators with fractional derivatives is proposed and justified. The algorithm is based on approximating (perturbing) the coefficients of a part of the differential operator by piecewise constant functions, where the eigenvalue problem for the perturbed operator is assumed to be simpler than the original one. Another key ingredient of the algorithm is the homotopy idea, which makes it possible, for a given eigenpair number, to compute recursively a sequence of approximate eigenpairs. This sequence converges to the exact eigenpair at a super-exponential rate. The eigenpairs can be computed in parallel for all prescribed indices. The proposed method has the following principal property: its convergence rate increases with the index of the eigenpair. Numerical examples confirm the theory.

Keywords: Fractional Differential Operator; Eigenvalue Problem; Homotopy Idea; parallel algorithm; Super-Exponentially Convergent algorithm

A parallel algorithm for direct solution of large sparse linear systems, well suitable to domain decomposition methods

EUROPEAN JOURNAL OF COMPUTATIONAL MECHANICS, 2009, Vol. 18, No. 7-8, pp. 589-605

Authors: Gueye, Ibrahima; Juvigny, Xavier; Feyel, Frederic; Roux, Francois-Xavier; Cailletaud, Georges. Affiliations: Off Natl Etud & Rech Aerosp, Ctr Chatillon, 29 Ave Div Leclerc, F-92322 Chatillon, France; Mines ParisTech, UMR CNRS 7633, Ctr Mat PM FOURT, F-91003 Evry, France

The goal of this paper is to develop a parallel algorithm for the direct solution of large sparse linear systems and integrate it into domain decomposition methods. The computational effort for these linear systems, often encountered in numerical simulation of structural mechanics problems by finite element codes, is very significant in terms of run-time and memory requirements. In this paper, two-level parallelism is exploited. The lower level of parallelism is exploited by developing a parallel direct solver with a nested dissection algorithm and introducing it into the FETI methods. This direct solver has the advantage of handling zero-energy modes in floating structures automatically and properly. The upper level of parallelism is a coarse-grain parallelism between the substructures of FETI. Some numerical tests are carried out to evaluate the performance of the direct solver.

Keywords: two-level parallelism; parallel algorithm; nested dissection; zero-energy modes

FAST PARALLEL ALGORITHM FOR ALL-PAIRS SHORTEST-PATH PROBLEM AND ITS VLSI IMPLEMENTATION

IEE PROCEEDINGS-E COMPUTERS AND DIGITAL TECHNIQUES, 1989, Vol. 136, No. 2, pp. 85-89

Authors: DEY, S; SRIMANI, PK. Affiliation: SO ILLINOIS UNIV, DEPT COMP SCI, CARBONDALE, IL 62901

We present a new parallel algorithm to solve the all-pairs shortest path problem in a given graph, which is considerably faster than the most recently published algorithm [7] for the same problem. Next, we propose a suitable VLSI systolic architecture to map our algorithm and evaluate the performance of the proposed architecture in terms of execution time and inter-processor communication time. We show that our implementation has O(log^2 n) execution time (compare-exchange time) and O(n log n) communication time, compared to O(n log n) and O(n^2) in [7].

Keywords: Mathematics computing; O(log^2 n) execution time; Microprocessor chips; VLSI systolic architecture; VLSI; graph theory; parallel algorithms; cellular arrays; Microprocessors and microcomputers; compare-exchange time; parallel architectures; O(n log n) communication; Programming and algorithm theory; Combinatorial mathematics; parallel algorithm; all-pairs shortest path problem
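
The abstract above does not spell out the algorithm, so the following is only a minimal sketch of one standard way to expose parallelism in the all-pairs shortest path problem: repeated min-plus "squaring" of the distance matrix. It is not the authors' systolic design. The function names `min_plus_square` and `all_pairs_shortest_paths` are hypothetical, and the code runs serially; the point is that every entry of one squaring step is independent of the others, and only about log2(n) squaring steps are needed.

```python
import math

INF = math.inf  # marks "no edge" in the weight matrix


def min_plus_square(d):
    """One min-plus product of d with itself.

    Each output entry (i, j) only reads row i and column j of d, so all
    n * n entries could be computed concurrently on parallel hardware.
    """
    n = len(d)
    return [
        [min(d[i][k] + d[k][j] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


def all_pairs_shortest_paths(weights):
    """Shortest path lengths between all vertex pairs.

    Assumes no negative cycles. After each squaring, d covers paths with
    up to twice as many edges, so the loop runs about log2(n) times.
    """
    n = len(weights)
    d = [[0 if i == j else weights[i][j] for j in range(n)] for i in range(n)]
    hops = 1  # d currently accounts for paths of at most `hops` edges
    while hops < n - 1:
        d = min_plus_square(d)
        hops *= 2
    return d


if __name__ == "__main__":
    w = [
        [0, 3, INF, 10],
        [INF, 0, 1, INF],
        [INF, INF, 0, 2],
        [INF, INF, INF, 0],
    ]
    for row in all_pairs_shortest_paths(w):
        print(row)
```

The serial cost of this sketch is O(n^3 log n); the logarithmic number of rounds is what makes the formulation attractive for parallel hardware.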

A parallel algorithm for 3D dislocation dynamics

JOURNAL OF COMPUTATIONAL PHYSICS, 2006, Vol. 219, No. 2, pp. 608-621

Authors: Wang, Zhiqiang; Ghoniem, Nasr; Swaminarayan, Sriram; LeSar, Richard. Affiliations: Univ Calif Los Angeles, Los Angeles, CA 90095, USA; Los Alamos Natl Lab, Los Alamos, NM 87545, USA

Dislocation dynamics (DD), a discrete dynamic simulation method in which dislocations are the fundamental entities, is a powerful tool for investigation of plasticity, deformation and fracture of materials at the micron length scale. However, severe computational difficulties arising from complex, long-range interactions between these curvilinear line defects limit the application of DD in the study of large-scale plastic deformation. We present here the development of a parallel algorithm for accelerated computer simulations of DD. By representing dislocations as a 3D set of dislocation particles, we show here that the problem of an interacting ensemble of dislocations can be converted to a problem of a particle ensemble, interacting with a long-range force field. A grid using binary space partitioning is constructed to keep track of node connectivity across domains. We demonstrate the computational efficiency of the parallel micro-plasticity code and discuss how O(N) methods map naturally onto the parallel data structure. Finally, we present results from applications of the parallel code to deformation in single crystal fcc metals. (c) 2006 Elsevier Inc. All rights reserved.

Keywords: 3D dislocation dynamics; parallel algorithm; single crystal plasticity; large scale simulation

New parallel algorithm for MP2 energy gradient calculations

JOURNAL OF COMPUTATIONAL CHEMISTRY, 2007, Vol. 28, No. 12, pp. 2034-2042

Authors: Ishimura, Kazuya; Pulay, Peter; Nagase, Shigeru. Affiliations: Inst Mol Sci, Dept Theoret Mol Sci, Aichi 4448585, Japan; Univ Arkansas, Dept Chem & Biochem, Fayetteville, AR 72701, USA

A new parallel algorithm has been developed for calculating the analytic energy derivatives of full-accuracy second-order Møller-Plesset perturbation theory (MP2). Its main projected application is the optimization of geometries of large molecules in which noncovalent interactions play a significant role. The algorithm is based on the recently developed two-step MP2 energy calculation algorithm and is implemented in the quantum chemistry program GAMESS. Timings are presented for test calculations on taxol (C47H51NO14) with the 6-31G and 6-31G(d) basis sets (660 and 1032 basis functions, 328 correlated electrons) and luciferin (C11H8N2O3S2) with aug-cc-pVDZ and aug-cc-pVTZ (530 and 1198 basis functions, 92 correlated electrons). The taxol 6-31G(d) calculations are also performed with up to 80 CPU cores. The results demonstrate the high parallel efficiency of the program. (c) 2007 Wiley Periodicals, Inc.

Keywords: MP2; energy gradient; parallel algorithm; large molecule

A PARALLEL ALGORITHM FOR MINIMIZING ESOP EXPRESSIONS

JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2014, Vol. 23, No. 1, pp. 1450015-1450031

Author: Papakonstantinou, George. Affiliation: Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-10682 Athens, Greece

Two parallel algorithms are proposed in this paper for solving the problem of finding exact exclusive-or sum of products (ESOP) expressions for an arbitrary Boolean function. This minimization problem is a very difficult one and solutions have been proposed only for up to seven variables. The processing time for some symmetric functions of seven variables is of the order of weeks. The proposed algorithm is a hybrid one (OpenMP, MPI) and a speed-up of more than nine could be achieved, for a cluster of three nodes with four cores each.

Keywords: Boolean functions; ESOP expressions; minimization; parallel algorithm; cluster

A parallel algorithm for the dynamic lot-sizing problem

COMPUTERS & INDUSTRIAL ENGINEERING, 2001, Vol. 41, No. 2, pp. 127-134

Authors: Lyu, JJ; Lee, MC. Affiliation: Natl Cheng Kung Univ, Dept Ind Management Sci, Tainan 70101, Taiwan

The dynamic lot-sizing model (DLS) is one of the most frequently used models in production and inventory systems because lot decisions can greatly affect the performance of the system. The practicality of DLS algorithms is hindered by the huge amount of computer resources required for solving these models, even for a modest problem. This study developed a parallel algorithm to solve the lot-sizing problem efficiently. Given that n is the size of the problem, the complexity of the proposed parallel algorithm is O(n(2)p) with p processors. Numerical experiments are provided to verify the complexity of the proposed algorithm. The empirical results demonstrate that the speedup of this parallel algorithm approaches linearity, which means that the proposed algorithm can take full advantage of the distributed computing power as the size of the problem increases. (C) 2001 Elsevier Science Ltd. All rights reserved.

Keywords: materials requirements planning; parallel algorithm; lot sizing models
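
For readers unfamiliar with the dynamic lot-sizing problem, here is a minimal serial sketch of the classic Wagner-Whitin dynamic program for the uncapacitated single-item case. It is not the parallel algorithm of the paper above, whose details the abstract does not give. The function name `wagner_whitin` and the assumptions of a constant setup cost and a constant per-unit, per-period holding cost are illustrative choices.

```python
import math


def wagner_whitin(demand, setup_cost, holding_cost):
    """Minimum total cost of meeting `demand` with no backlogging.

    best[t] is the cheapest way to cover periods 0 .. t-1. An order placed
    in period j that covers periods j .. t-1 pays one setup cost plus the
    cost of holding each later period's demand until it is consumed.
    """
    n = len(demand)
    best = [0.0] + [math.inf] * n
    for j in range(n):           # period in which the last order is placed
        hold = 0.0               # holding cost accumulated for periods j .. t-1
        for t in range(j + 1, n + 1):
            hold += holding_cost * (t - 1 - j) * demand[t - 1]
            best[t] = min(best[t], best[j] + setup_cost + hold)
    return best[n]


if __name__ == "__main__":
    # Ordering in period 0 (for periods 0 and 1) and again in period 2
    # costs 50 + 20 + 50 in this instance.
    print(wagner_whitin([10, 20, 30], setup_cost=50, holding_cost=1))
```

The double loop performs O(n^2) work. For a fixed `j`, the updates to different `best[t]` are independent once `hold` is known, which hints at where a parallel variant could distribute work.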

A randomized time-work optimal parallel algorithm for finding a minimum spanning forest

SIAM JOURNAL ON COMPUTING, 2002, Vol. 31, No. 6, pp. 1879-1895

Authors: Pettie, S; Ramachandran, V. Affiliation: Univ Texas, Dept Comp Sci, Austin, TX 78712, USA

We present a randomized algorithm to find a minimum spanning forest (MSF) in an undirected graph. With high probability, the algorithm runs in logarithmic time and linear work on an exclusive read exclusive write (EREW) PRAM. This result is optimal w.r.t. both work and parallel time, and is the first provably optimal parallel algorithm for this problem under both measures. We also give a simple, general processor allocation scheme for tree-like computations.

Keywords: parallel algorithm; minimum spanning tree; optimal algorithm; EREW PRAM

An efficient wavefront parallel algorithm for structured three dimensional LU-SGS

COMPUTERS & FLUIDS, 2016, Vol. 134, pp. 23-30

Authors: Gong, Chunye; Bao, Weimin; Liu, Jie; Tang, Guojian; Jiang, Yuewen. Affiliations: Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China; Sci & Technol Space Phys Lab, Fengtai Sect, Beijing 100076, Peoples R China; Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410073, Hunan, Peoples R China; Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England

Parallel computing is a useful technology for scientific and engineering algorithms and applications. LU-SGS (the lower-upper symmetric Gauss-Seidel method) is an efficient and robust scheme for CFD (computational fluid dynamics) and has strong data dependence in its computation. In this paper, we present an efficient wavefront parallel algorithm for 3D (three-dimensional) LU-SGS with structured meshes. The corresponding data structure and memory access method, with better data locality and communication optimization, are designed. The performance of the presented parallel algorithm is reported for different problem sizes. Some discussion and performance issues are also reported. The results show that the overall speedup of one Intel E5540 CPU (4 CPU cores) ranges from 2.23 to 2.95 compared with one E5540 core. The parallel efficiencies with 1024 and 128 processes reach up to 35.68% and 72.69%, respectively, compared with 32 processes on a distributed memory cluster system. The CFD simulation of the M6 wing model shows the effect of the presented parallel algorithm. (C) 2016 Elsevier Ltd. All rights reserved.

Keywords: CFD; LU-SGS; parallel computing; parallel algorithm; 3D structured meshes
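
The wavefront idea mentioned in the abstract above can be illustrated with a small sketch. It assumes the usual dependence pattern of a lower sweep on a structured grid, in which cell (i, j, k) needs its neighbors (i-1, j, k), (i, j-1, k) and (i, j, k-1); under that assumption, all cells on a hyperplane i + j + k = s are mutually independent. The functions `wavefronts` and `sweep` are hypothetical names, and the sketch does not reproduce the paper's data structures or communication scheme.

```python
def wavefronts(ni, nj, nk):
    """Yield the cells of an ni x nj x nk grid grouped by plane i + j + k = s."""
    for s in range(ni + nj + nk - 2):
        plane = []
        for i in range(max(0, s - (nj - 1) - (nk - 1)), min(ni, s + 1)):
            for j in range(max(0, s - i - (nk - 1)), min(nj, s - i + 1)):
                plane.append((i, j, s - i - j))
        yield plane


def sweep(ni, nj, nk):
    """Toy lower sweep: each cell's value is 1 + the max of its predecessors.

    The planes are visited in order, but the cells inside one plane only
    read values from the previous plane, so the inner loop could be
    distributed across threads or processes.
    """
    level = {}
    for plane in wavefronts(ni, nj, nk):
        for (i, j, k) in plane:
            preds = [(i - 1, j, k), (i, j - 1, k), (i, j, k - 1)]
            level[(i, j, k)] = 1 + max(
                (level[p] for p in preds if p in level), default=0
            )
    return level


if __name__ == "__main__":
    for s, plane in enumerate(wavefronts(2, 2, 2)):
        print(s, plane)
```

The number of planes is ni + nj + nk - 2, so the available parallelism grows toward the middle planes and shrinks at the corners, one reason parallel efficiency tends to drop as the process count rises.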
