Search Results - Inner Mongolia University Library

MESH PERMUTATION ROUTING WITH LOCALITY

INFORMATION PROCESSING LETTERS, 1992, vol. 43, no. 2, pp. 101-105

Authors: CHEUNG, S; LAU, FCM. Affiliation: UNIV HONG KONG, DEPT COMP SCI, HONG KONG, HONG KONG

Given the permutation routing problem on mesh-connected arrays with a known maximum distance, d, between any source-destination pair, we show how sorting and the greedy algorithm can be combined to yield a deterministic, asymptotically optimal algorithm for solving the problem. This simple algorithm runs in d + O(d/f(d)) time and requires an O(f(d)) buffer size (or O(d) time and constant buffer size if we choose f(d) to be a constant). It also gives efficient solutions to the k-k routing problem with locality.

Keywords: parallel algorithms; MESH-CONNECTED COMPUTERS; PERMUTATION ROUTING; K-K ROUTING; ROUTING WITH LOCALITY; MIMD
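
Illustrative sketch (not taken from the article): the greedy component named in the abstract can be pictured as dimension-order routing, in which a packet first moves along its row to the destination column and then along that column. The Python below is a minimal sketch under that assumption. The names greedy_route and max_distance are hypothetical, and the sketch ignores link contention, buffering, and the sorting step that the authors combine with the greedy algorithm.

```python
def greedy_route(src, dst):
    """Return the list of mesh nodes visited by a dimension-order greedy path.

    src and dst are (row, col) tuples. The packet corrects its column first,
    then its row. Contention between packets is NOT modelled (assumption).
    """
    row, col = src
    dst_row, dst_col = dst
    path = [(row, col)]
    while col != dst_col:            # phase 1: move along the row
        col += 1 if dst_col > col else -1
        path.append((row, col))
    while row != dst_row:            # phase 2: move along the column
        row += 1 if dst_row > row else -1
        path.append((row, col))
    return path


def max_distance(perm):
    """Largest Manhattan distance d over all source -> destination pairs."""
    return max(abs(s[0] - t[0]) + abs(s[1] - t[1]) for s, t in perm.items())


if __name__ == "__main__":
    # A small permutation on a 2 x 2 mesh, used only as an example.
    perm = {(0, 0): (1, 1), (1, 1): (0, 0), (0, 1): (0, 1), (1, 0): (1, 0)}
    d = max_distance(perm)
    for s, t in perm.items():
        hops = len(greedy_route(s, t)) - 1
        print(s, "->", t, "hops:", hops, "d:", d)
```

Each greedy path has exactly as many hops as the Manhattan distance between its endpoints, so no packet in this sketch travels more than d hops.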

A SIMPLE NC ALGORITHM TO RECOGNIZE WEAKLY TRIANGULATED GRAPHS

INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1989, vol. 30, no. 3-4, pp. 129-131

Author: OLARIU, S. Affiliation: Department of Computer Science, Old Dominion University, Norfolk, VA 23529, United States

Keywords: parallel algorithms; NC algorithms; perfect graphs; G.1.0; G.2.2

Distributional Fractal Creating Algorithm in Parallel Environment

INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2013, vol. 2013, issue unknown, pp. 1-8

Authors: Liu, Shuai; Fu, Weina; Deng, Huimin; Lan, Caihe; Zhou, Jiantao. Affiliations: Inner Mongolia Univ Coll Comp Sci Hohhot 010012 Peoples R China; Inner Mongolia Univ Sch Phys Sci & Technol Hohhot 010012 Peoples R China; Hohhot Univ Nationalities Dept Comp Sci & Technol Hohhot 010012 Peoples R China

Nowadays, the fractal is used widely everywhere. Then, its creating time becomes an important study area for complex iteration functions because the escape-time algorithm (ETA), which is the most used algorithm in fractal creating, performs not so well in this condition. In this paper, in order to solve this problem, we improve ETA into the parallel environment and reach good performance. At first, we provide a separation method of ETA to reform it into a SIMC-MC2 grid. Secondly, we prove its correctness and compute the complexity of this novel parallel algorithm. Meantime, we separate an improved ETA which we have presented into the same parallel environment and compute its complexity. Additionally, theoretical and experimental results show the characteristics of this novel algorithm. Finally, the computational result shows that a novel environment is needed to decrease large manual allocation strategies, which block the improved benefit.

Keywords: MATHEMATICAL programming; SEPARATION (Technology); FRACTAL analysis; COMPUTATIONAL geometry; parallel algorithms
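
Illustrative sketch (not taken from the article): the escape-time algorithm iterates a complex function at each pixel and records how many steps pass before the orbit leaves a bounding radius. Because pixels are independent, the image can be separated into blocks that are computed separately. The Python below is a minimal sketch that assumes the Mandelbrot iteration z -> z*z + c and an interleaved row split. The names escape_time, render_rows, and split_rows, the viewing window, and the split strategy are all hypothetical and are not the authors' separation method or grid model.

```python
def escape_time(c, max_iter=100, radius=2.0):
    """Number of iterations of z -> z*z + c before |z| exceeds radius."""
    z = 0j
    for n in range(max_iter):
        if abs(z) > radius:
            return n
        z = z * z + c
    return max_iter                  # orbit did not escape within max_iter


def render_rows(rows, width, height, max_iter=100):
    """Escape times for the given row indices only (width, height >= 2).

    The window [-2, 1] x [-1.5, 1.5] is an assumed example region.
    """
    out = {}
    for j in rows:
        y = -1.5 + 3.0 * j / (height - 1)
        out[j] = [
            escape_time(complex(-2.0 + 3.0 * i / (width - 1), y), max_iter)
            for i in range(width)
        ]
    return out


def split_rows(height, workers):
    """Interleaved row blocks, one per worker (assumed split strategy)."""
    return [range(w, height, workers) for w in range(workers)]


if __name__ == "__main__":
    image = {}
    for block in split_rows(height=12, workers=3):
        # Each block is independent and could be handed to its own worker.
        image.update(render_rows(block, width=32, height=12, max_iter=50))
    for j in sorted(image):
        print("".join("#" if v == 50 else "." for v in image[j]))
```

Merging the per-block results gives the same image as a single sequential pass, which is the property any such separation has to preserve.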

Bilateral shear layer between two parallel Couette flows

Physical Review E, 2012, vol. 85, no. 3, article 036302

Authors: Vagesh D. Narasimhamurthy; Simen Å. Ellingsen; Helge I. Andersson. Affiliation: Fluids Engineering Division, Department of Energy and Process Engineering, Norwegian University of Science and Technology, N-7491 Trondheim, Norway

We consider an unusual shear layer occurring between two parallel Couette flows. Contrary to the classical free shear layer, the width of the shear zone does not vary in the streamwise direction but rather exhibits a lateral variation. Based on some simplifying assumptions, an analytic solution is derived for this shear layer. These assumptions are justified by a comparison with numerical solutions of the full Navier-Stokes equations, which accord with the analytical solution to better than 1% in the entire domain. An explicit formula is found for the width of the shear zone as a function of the wall-normal coordinate. This width is independent of the wall velocities in the laminar regime. Preliminary results for a cocurrent laminar-turbulent shear layer in the same geometry are also presented. Shear-layer instabilities are then developed and result in an unsteady mixing zone at the interface between the two cocurrent streams.

Keywords: SHEAR flow; COUETTE flow; parallel algorithms; LAMINAR flow; SHEAR zones (Geology); NAVIER-Stokes equations -- Numerical solutions; MATHEMATICAL formulas

A fast parallel high-precision summation algorithm based on AccSumK

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2022, vol. 406, article 113827

Authors: Lei, Xiaojun; Gu, Tongxiang; Graillat, Stef; Jiang, Hao; Qi, Jin. Affiliations: China Acad Engn Phys Grad Sch Beijing 100088 Peoples R China; Inst Appl Phys & Computat Math Beijing 100094 Peoples R China; Sorbonne Univ LIP6 CNRS F-75005 Paris France; Natl Univ Def Technol Changsha 410073 Peoples R China

In this paper, we present a new parallel accurate algorithm called PAccSumK for computing the summation of floating-point numbers. It is based on the AccSumK algorithm. In the experiments, for summation problems with large condition numbers, our algorithm outperforms the PSumK algorithm in terms of accuracy and computing time. The reason is that our algorithm is based on AccSumK, a more accurate algorithm than the SumL algorithm used in PSumK. The proposed parallel algorithm is designed to compute a result as if computed internally in K-fold the working precision. Numerical results are presented showing the performance and the accuracy of our new parallel algorithm for calculating summation.

Keywords: parallel algorithms; Accurate summation; Higher precision; Floating-point arithmetic
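
Illustrative sketch (not taken from the article): AccSumK and PAccSumK are not reproduced here. As a simpler stand-in for the idea of computing a sum "as if" in higher precision, the Python below uses the error-free transformation TwoSum and the compensated summation Sum2 of Ogita, Rump, and Oishi, which behaves as if the sum were computed in twice the working precision. The chunked variant is a hypothetical illustration of how partial results from independent workers might be combined; it is an assumption, not the authors' parallel scheme.

```python
import math


def two_sum(a, b):
    """Error-free transformation: returns (s, e) with s = fl(a + b), s + e = a + b."""
    s = a + b
    b_virtual = s - a
    a_virtual = s - b_virtual
    return s, (a - a_virtual) + (b - b_virtual)


def sum2_parts(xs):
    """Running sum p and accumulated rounding errors sigma for xs."""
    p, sigma = 0.0, 0.0
    for x in xs:
        p, q = two_sum(p, x)
        sigma += q
    return p, sigma


def sum2(xs):
    """Compensated sum: the rounding errors are added back at the end."""
    p, sigma = sum2_parts(xs)
    return p + sigma


def chunked_sum2(xs, n_chunks):
    """Hypothetical parallel-style variant: compensate each chunk, then combine."""
    parts = [sum2_parts(xs[i::n_chunks]) for i in range(n_chunks)]
    p, sigma = 0.0, 0.0
    for part_p, part_sigma in parts:
        p, q = two_sum(p, part_p)
        sigma += q + part_sigma
    return p + sigma


if __name__ == "__main__":
    data = [1e16, 1.0, -1e16]        # ill-conditioned: heavy cancellation
    naive = 0.0
    for x in data:
        naive += x
    print("naive:", naive, "sum2:", sum2(data), "fsum:", math.fsum(data))
```

The naive loop is written out by hand because recent Python versions already apply compensation inside the built-in sum for floats, which would hide the effect.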

A Parallelized Method for Discrete-Time Models with Dependence on Calculation Order

IFAC-PapersOnLine, 2018, vol. 51, no. 31, pp. 41-45

Authors: Sata, Kota; Matsunaga, Akio; Azuma, Shun-ichi; Ohata, Akira. Affiliations: Toyota Motor Corporation, Advanced Powertrain Management System Development Div., Shizuoka, Japan; Nagoya University, Dept. of Mechanical Systems Engineering, Nagoya, Japan; Sophia University, Tokyo, Japan

The advancement of powertrain control increases the amount of computation. Mass-production ECUs (Electronic Control Units), which are built on a single-core architecture, cannot have a higher clock speed. Using a multi/many-core architecture is the only way to decrease execution time. However, various problems occur when implementing engine control software on a multi/many-core ECU. One of the biggest problems is the sequential structure of control software, because the software can only execute on one core of the multi/many-core ECU. The purpose of this paper is to describe a parallelized control design method for discrete-time models, which decomposes the sequential structure and decreases execution time in the embedded multi/many-core mass-production ECU.

Keywords: Control systems; Automobile engines; Computer architecture; Industrial electronics; Internal combustion engines; parallel algorithms; Servomechanisms; Automotive control; Control design; Discrete time model; Electronic control units; execution times

Algorithm 944: Talbot Suite: Parallel Implementations of Talbot's Method for the Numerical Inversion of Laplace Transforms

ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2014, vol. 40, no. 4, article 29

Authors: Antonelli, Laura; Corsaro, Stefania; Marino, Zelda; Rizzardi, Mariarosaria. Affiliations: CNR ICAR Inst High Performance Comp & Networking I-80131 Naples Italy; Univ Napoli Parthenope Dipartimento Studi Aziendali & Quantitativi I-80132 Naples Italy; Univ Napoli Parthenope Ctr Direzionale M Rizzardi Dipartimento Sci & Tecnol I-80143 Naples Italy

We present Talbot Suite, a C parallel software collection for the numerical inversion of Laplace Transforms, based on Talbot's method. It is designed to fit both single and multiple Laplace inversion problems, which arise in several application and research fields. In our software, we achieve high accuracy and efficiency, making full use of modern architectures and introducing two different levels of parallelism: coarse and fine grained parallelism. They offer a reasonable tradeoff between accuracy, the main aspect for a few inversions, and efficiency, the main aspect for multiple inversions. To take into account modern high-performance computing architectures, Talbot Suite provides different software versions: an OpenMP-based version for shared memory machines and an MPI-based version for distributed memory machines. Moreover, oriented to hybrid architectures, a combined MPI/OpenMP-based implementation is provided too. We describe our parallel algorithms and the software organization. We also report some performance results. Our software includes sample programs to call the Talbot Suite functions from C and from MATLAB.

Keywords: algorithms; Performance; Inverse Laplace transform; parallel algorithms; Talbot's method; hybrid architectures
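
Illustrative sketch (not taken from Talbot Suite): the Python below shows one well-known way to apply Talbot's idea, the fixed-Talbot variant described by Abate and Valkó, which deforms the Bromwich contour to s(theta) = r * theta * (cot(theta) + i) and applies the trapezoidal rule with M nodes. It is a sequential sketch in a different language from the C library in the record, and it does not use or imitate any Talbot Suite function. The name talbot_invert and the default M = 32 are hypothetical choices.

```python
import cmath
import math


def talbot_invert(F, t, M=32):
    """Approximate f(t) from its Laplace transform F(s) (fixed-Talbot rule).

    F must accept complex arguments; t must be positive. The contour
    parameter r = 2M / (5t) follows the fixed-Talbot choice (assumption
    that this variant suits the transform at hand).
    """
    if t <= 0:
        raise ValueError("t must be positive")
    r = 2.0 * M / (5.0 * t)
    # Node theta = 0, where s = r and the weight is one half.
    total = 0.5 * (F(complex(r, 0.0)) * cmath.exp(r * t)).real
    for k in range(1, M):
        theta = k * math.pi / M
        cot = math.cos(theta) / math.sin(theta)
        s = r * theta * complex(cot, 1.0)            # point on the contour
        sigma = theta + (theta * cot - 1.0) * cot    # from ds/dtheta = i*r*(1 + i*sigma)
        total += (cmath.exp(t * s) * F(s) * complex(1.0, sigma)).real
    return r / M * total


if __name__ == "__main__":
    # F(s) = 1 / (s + 1) is the transform of f(t) = exp(-t).
    for t in (0.5, 1.0, 2.0):
        approx = talbot_invert(lambda s: 1.0 / (s + 1.0), t)
        print(t, approx, math.exp(-t))
```

Each value of t is inverted independently, so a batch of inversions could be distributed across workers; within one inversion the terms of the sum are also independent. This only loosely mirrors the coarse and fine grained levels mentioned in the abstract.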

A MASSIVELY PARALLEL EULERIAN-LAGRANGIAN METHOD FOR ADVECTION-DOMINATED TRANSPORT IN VISCOUS FLUIDS

SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2022, vol. 44, no. 3, pp. C260-C285

Authors: Kohl, Nils; Mohr, Marcus; Eibl, Sebastian; Rude, Ulrich. Affiliations: Friedrich Alexander Univ Erlangen Nurnberg Comp Sci 10 D-91058 Erlangen Germany; Ludwig Maximilians Univ Munchen Dept Earth & Environm Sci D-80539 Munich Germany; Ctr Europeen Rech & Format Avancee Calcul Sci CER F-31100 Toulouse France

Motivated by challenges in the Earth's mantle convection, we present a massively parallel implementation of an Eulerian-Lagrangian method for the advection-diffusion equation in the advection-dominated regime. The advection term is treated by a particle-based characteristics method coupled to a block-structured finite element framework. Its numerical and computational performance is evaluated in multiple two- and three-dimensional benchmarks, including curved geometries, discontinuous solutions, and pure advection, and it is applied to a coupled nonlinear system modeling buoyancy-driven convection in Stokes flow. We demonstrate the parallel performance in a strong and weak scaling experiment, with scalability to up to 147,456 parallel processes, solving for more than 5.2 x 10^10 (52 billion) degrees of freedom per time-step.

Keywords: Eulerian-Lagrangian methods; advection-diffusion; parallel algorithms

Scalable Parallel Distance Field Construction for Large-Scale Applications

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2015, vol. 21, no. 10, pp. 1187-1200

Authors: Yu, Hongfeng; Xie, Jinrong; Ma, Kwan-Liu; Kolla, Hemanth; Chen, Jacqueline H. Affiliations: Univ Nebraska Lincoln NE 68588 USA; Univ Calif Davis Davis CA 95616 USA; Sandia Natl Labs Albuquerque NM 87123 USA

Computing distance fields is fundamental to many scientific and engineering applications. Distance fields can be used to direct analysis and reduce data. In this paper, we present a highly scalable method for computing 3D distance fields on massively parallel distributed-memory machines. A new distributed spatial data structure, named parallel distance tree, is introduced to manage the level sets of data and facilitate surface tracking over time, resulting in significantly reduced computation and communication costs for calculating the distance to the surface of interest from any spatial locations. Our method supports several data types and distance metrics from real-world applications. We demonstrate its efficiency and scalability on state-of-the-art supercomputers using both large-scale volume datasets and surface models. We also demonstrate in-situ distance field computation on dynamic turbulent flame surfaces for a petascale combustion simulation. Our work greatly extends the usability of distance fields for demanding applications.

Keywords: Distance field; in-situ processing; parallel algorithms; scalability; spatial data structures; scientific simulations; geometric modeling; large-scale scientific data analytics and visualization

Massively parallel computation of absolute binding free energy with well-equilibrated states

Physical Review E, 2009, vol. 79, no. 2, article 021914

Authors: Hideaki Fujitani; Yoshiaki Tanida; Azuma Matsuura. Affiliation: Fujitsu Laboratories Ltd., 10-1 Morinosato-Wakamiya, Atsugi 243-0197, Japan

A force field formulator for organic molecules (FF-FOM) was developed to assign bond, angle, and dihedral parameters to arbitrary organic molecules in a unified manner including proteins and nucleic acids. With the unified force field parametrization we performed massively parallel computations of absolute binding free energies for pharmaceutical target proteins and ligands. Compared with the previous calculation with the ff99 force field in the Amber simulation package (Amber99) and the ligand charges produced by the Austin Model 1 bond charge correction (AM1-BCC), the unified parametrization gave better absolute binding energies for the FK506 binding protein (FKBP) and ligand system. Our method is based on extensive work measurement between thermodynamic states to calculate the free energy difference and it is also the same as the traditional free energy perturbation. There are important requirements for accurate calculations. The first is a well-equilibrated bound structure including the conformational change of the protein induced by the binding of the ligand. The second requirement is the convergence of the work distribution with a sufficient number of trajectories and dense spacing of the coupling constant between the ligand and the rest of the system. Finally, the most important requirement is the force field parametrization.

Keywords: binding energy; free energy; molecular force constants; organic compounds; parallel algorithms; MOLECULAR-DYNAMICS SIMULATIONS; AMBER FORCE-FIELD; EFFICIENT GENERATION; ATOMIC CHARGES; AM1-BCC MODEL; NUCLEIC-ACIDS; ENSEMBLE; RNA; POTENTIALS; STABILITY
