Search Results - Inner Mongolia University Library

Scalable computational algorithms for geospatial COVID-19 spread using high performance computing

MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, Vol. 20, No. 8, pp. 14634-14674

Authors: Sharma, Sudhi; Dolean, Victorita; Jolivet, Pierre; Robinson, Brandon; Edwards, Jodi D.; Kendzerska, Tetyana; Sarkar, Abhijit. Affiliations: Carleton Univ Dept Civil & Environm Engn Ottawa ON Canada; Univ Strathclyde Dept Math & Stat Glasgow Scotland; Univ Cote dAzur Lab JA Dieudonne CNRS Nice France; Sorbonne Univ CNRS Paris France; Univ Ottawa Sch Epidemiol & Publ Hlth Ottawa ON Canada; Univ Ottawa Heart Inst Ottawa ON Canada; ICES Ottawa ON Canada; Ottawa Hosp Res Inst Ottawa ON Canada; Univ Ottawa Fac Med Dept Med Div Respirol Ottawa ON Canada

A nonlinear partial differential equation (PDE) based compartmental model of COVID-19 provides a continuous trace of infection over space and time. Finer resolutions in the spatial discretization, the inclusion of additional model compartments and model stratifications based on clinically relevant categories contribute to an increase in the number of unknowns to the order of millions. We adopt a parallel scalable solver that permits faster solutions for these high fidelity models. The solver combines domain decomposition and algebraic multigrid preconditioners at multiple levels to achieve the desired strong and weak scalabilities. As a numerical illustration of this general methodology, a five-compartment susceptible-exposed-infected-recovered-deceased (SEIRD) model of COVID-19 is used to demonstrate the scalability and effectiveness of the proposed solver for a large geographical domain (Southern Ontario). It is possible to predict the infections for a period of three months for a system size of 186 million (using 3200 processes) within 12 hours, saving months of computational effort needed for the conventional solvers.

Keywords: COVID-19; spatio-temporal model; overlapping Schwarz method; high performance computing
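
For readers unfamiliar with the five compartments named in the abstract above, the following is a minimal sketch of a well-mixed (non-spatial) SEIRD model advanced with forward Euler steps. It is not the paper's PDE model or its parallel solver: it drops the spatial terms entirely, and the function names and all rate parameters (`beta`, `sigma`, `gamma`, `mu`) are hypothetical values chosen only for illustration.

```python
def seird_step(state, beta, sigma, gamma, mu, dt):
    """Advance (S, E, I, R, D) by one forward Euler step of size dt (days)."""
    s, e, i, r, d = state
    n = s + e + i + r                   # living population
    new_exposed = beta * s * i / n      # S -> E (infection)
    ds = -new_exposed
    de = new_exposed - sigma * e        # E -> I (end of incubation)
    di = sigma * e - (gamma + mu) * i   # I -> R (recovery) or I -> D (death)
    dr = gamma * i
    dd = mu * i
    return (s + dt * ds, e + dt * de, i + dt * di, r + dt * dr, d + dt * dd)


def simulate(state, days, dt=0.1, beta=0.3, sigma=0.2, gamma=0.1, mu=0.01):
    """Integrate the SEIRD system for `days` days; all rates are illustrative."""
    for _ in range(int(round(days / dt))):
        state = seird_step(state, beta, sigma, gamma, mu, dt)
    return state


# Hypothetical population of 10,000 with 10 initial infections, over 90 days.
final_state = simulate((9990.0, 0.0, 10.0, 0.0, 0.0), days=90)
print(final_state)
```

The five rates of change sum to zero, so the total of all compartments stays constant; in the spatial setting described in the abstract each compartment would instead be a field over the domain, which is what drives the number of unknowns into the millions.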

A DIRECTIONAL EQUISPACED INTERPOLATION-BASED FAST MULTIPOLE METHOD FOR OSCILLATORY KERNELS

SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2023, Vol. 45, No. 1, pp. C20-C48

Authors: Chollet, Igor; Claeys, Xavier; Fortin, Pierre; Grigori, Laura. Affiliations: INRIA Alpines Inst Sci Calcul & Donnees ISCD Paris France; Sorbonne Univ Inria Equipe ALPINES Lab Jacques Louis Lions F-75005 Paris France; Sorbonne Univ CNRS LIP6 F-75005 Paris France; Univ Lille CNRS Cent Lille UMR CRIStAL 9189 F-59000 Lille France

Fast multipole methods (FMMs) based on the oscillatory Helmholtz kernel can reduce the cost of solving N-body problems arising from boundary integral equations (BIEs) in acoustics or electromagnetics. However, their cost strongly increases in the high-frequency regime. This paper introduces a new directional FMM for oscillatory kernels (defmm: directional equispaced interpolation-based fmm), whose precomputation and application are FFT-accelerated due to polynomial interpolations on equispaced grids. We demonstrate the consistency of our FFT approach and show how symmetries can be exploited in the Fourier domain. We also describe the algorithmic design of defmm, well-suited for the BIE nonuniform particle distributions, and present performance optimizations on one CPU core. Finally, we exhibit important performance gains on all test cases for defmm over a state-of-the-art FMM library for oscillatory kernels.

Keywords: directional fast multipole method; fast Fourier transform; high performance computing; symmetries; SIMD computing

High Resolution Tsunami Inundation Simulations

20th International Congress on Modelling and Simulation (MODSIM)

Authors: Roberts, S. G.; Oishi, Y.; Li, M. Affiliations: Australian Natl Univ Inst Math Sci Canberra ACT 0200 Australia; Fujitsu Labs Europe London England

ISBN: (print) 9780987214331

In this paper we investigate the high performance computing efficiency of the shallow water software package ANUGA. This package is developed as a collaborative project between the Australian National University (ANU) and Geoscience Australia (GA) and is available as Free and Open Source Software (FOSS). ANUGA uses a shallow water model and approximates the model using the finite volume method based on unstructured meshes of triangles. The geometrical flexibility of unstructured meshes is convenient for tsunami inundation modeling, where the tsunami wave source generally consists of long wavelength components and waves around the coast consist of short wavelengths, which can both be modeled in the same simulation. ANUGA is written in the high level computer language PYTHON. We will present an overview of the model and the numerical method in the early sections of the paper. We will then present our work on parallelizing the ANUGA code, in particular our efforts to obtain efficient simulations using hundreds of CPU cores. Our results demonstrate that our PYTHON based software can obtain high efficiency on highly parallel computers. The results presented in this paper demonstrate better than real time simulation of medium resolution (millions of triangles) tsunami models. Our ultimate goal is the solution of high resolution (tens of millions of triangles) simulations in better than real time.

Keywords: shallow water wave equations; tsunami simulation; finite volume method; high performance computing; Python

Speeding Up the Training of Neural Networks with CUDA Technology

11th International Conference on Artificial Intelligence and Soft Computing (ICAISC)

Authors: Chevitarese, Daniel Salles; Szwarcman, Dilza; Vellasco, Marley. Affiliation: Pontif Catholic Univ Dept Elect Engn Gavea Rio De Janeiro Brazil

ISBN: (print) 9783642293467; 9783642293474

Training feed-forward neural networks can take a long time when there is a large amount of data to be used, even when training with more efficient algorithms like Levenberg-Marquardt. Parallel architectures have been a common solution in the area of high performance computing, since the technology used in current processors is reaching the limits of speed. An architecture that has been gaining popularity is the GPGPU (General-Purpose Computing on Graphics Processing Units), which has received large investments from companies such as NVIDIA, which introduced CUDA (Compute Unified Device Architecture) technology. This paper proposes a faster implementation of neural network training with the Levenberg-Marquardt algorithm using CUDA. The results obtained demonstrate that the whole training time can be almost 30 times shorter than code using the Intel Math Kernel Library (MKL). A case study for classifying electrical company customers is presented.

Keywords: Artificial Neural Networks; Software Engineering; high performance computing; GPGPU; CUDA
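
As a rough illustration of the Levenberg-Marquardt update named in the abstract above, the sketch below fits the two parameters of a hypothetical model y = a * exp(b * x) on the CPU in plain Python. It is not the paper's CUDA implementation and does not train a neural network; the model, the function name `lm_fit`, and the damping schedule (divide or multiply by 10) are assumptions made for the example.

```python
import math


def lm_fit(xs, ys, a, b, lam=1e-2, iters=100):
    """Fit y = a * exp(b * x) by Levenberg-Marquardt (illustrative sketch)."""
    def sse(a_, b_):
        # Sum of squared residuals for candidate parameters.
        return sum((y - a_ * math.exp(b_ * x)) ** 2 for x, y in zip(xs, ys))

    err = sse(a, b)
    for _ in range(iters):
        # Accumulate J^T J and J^T r, where r = y - f(x) and J = df/d(a, b).
        jtj = [[0.0, 0.0], [0.0, 0.0]]
        jtr = [0.0, 0.0]
        for x, y in zip(xs, ys):
            ex = math.exp(b * x)
            r = y - a * ex
            j = (ex, a * x * ex)
            for p in range(2):
                jtr[p] += j[p] * r
                for q in range(2):
                    jtj[p][q] += j[p] * j[q]
        # Damped normal equations: (J^T J + lam * diag(J^T J)) delta = J^T r.
        m00 = jtj[0][0] * (1.0 + lam)
        m11 = jtj[1][1] * (1.0 + lam)
        m01 = jtj[0][1]
        det = m00 * m11 - m01 * m01
        if det == 0.0:
            break
        da = (m11 * jtr[0] - m01 * jtr[1]) / det
        db = (m00 * jtr[1] - m01 * jtr[0]) / det
        new_err = sse(a + da, b + db)
        if new_err < err:
            # Step reduced the error: accept it and trust the model more.
            a, b, err = a + da, b + db, new_err
            lam /= 10.0
        else:
            # Step made things worse: reject it and increase the damping.
            lam *= 10.0
    return a, b


# Synthetic, noise-free data generated from a = 2.0, b = 0.5.
xs = [0.0, 0.5, 1.0, 1.5, 2.0]
ys = [2.0 * math.exp(0.5 * x) for x in xs]
print(lm_fit(xs, ys, a=1.5, b=0.3))
```

For a network with many weights, forming and solving the damped normal equations is the dominant cost of each step, which is presumably the part that benefits from a GPU.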
