Search results - Inner Mongolia University Library

Seventh International Symposium on Parallel Architectures, Algorithms and Programming

Authors: Yang, Zhi; Zhang, Chunping; Hu, Mu; Lin, Feng (State Grid Elect Power Sci Res Inst, Nanjing, Jiangsu, Peoples R China)

ISBN: (print) 9781467391160

In Big Data computing, improving performance through memory computing is a hot topic. In memory computing, data deployment directly affects load balance and task efficiency. For memory computing of electric power data, two problems remain unsolved: (1) only memory space, and not CPU frequency or the number of cores, can be considered for load balancing and performance improvement; (2) so many manual operations are involved that it is difficult to complete data deployment automatically. This paper presents an electric power data deployment solution for distributed memory computing that addresses these challenges. In the solution, the data deployment strategy is established according to the business logic and the hardware configuration of the cluster nodes. The deployment scheme is then implemented through interface operations. Finally, the cluster nodes load data according to the deployment scheme. The solution has been applied to Objectification Parallel computing (OPC). The application results show that OPC achieves the best performance, meeting the system's efficiency requirements, and that the data deployment operation is simple.

Keywords: Big Data; distributed memory computing; Objectification Parallel computing; Data Deployment strategy

OPC: A Distributed Computing and Memory Computing-based Effective Solution of Big Data

2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity)

Authors: Yang, Zhi; Zhang, Chunping; Hu, Mu; Lin, Feng (State Grid Elect Power Sci Res Inst, Nanjing, Jiangsu, Peoples R China)

ISBN: (print) 9781509018932

Big Data computing is one of the hot topics in the Internet of Things and cloud computing. Computing efficiently on Big Data is the key to improving performance. Many companies and institutions provide technologies and products based on distributed computing or memory computing, but these are ineffective in scenarios with real-time demands on a low-configuration cluster. To deal with this problem, this paper presents an effective solution based on distributed computing and memory computing (Objectification Parallel computing, OPC). In the solution, the data is formatted into objects. The objects are then stored in a distributed manner in the memories of the computers and computed in parallel to complete tasks. OPC is applied to the Electric Asset Quality Supervision Manage System (EAQSMS) of the State Grid of China; the results show that, with PCs, the system is efficiently available, reliable, and flexibly expandable.

Keywords: Big Data; distributed memory computing; Parallel computing; Objectification Parallel computing; Architecture of Objectification Parallel computing

Distributed-memory concepts in the wave model WAVEWATCH III

Parallel Computing, 2002, vol. 28, no. 1, pp. 35-52

Author: Tolman, HL (NOAA NCEP SAIC GSO, Environm Modeling Ctr, Camp Springs, MD 20746, USA)

Parallel concepts for spectral wind-wave models are discussed, with a focus on the WAVEWATCH III model, which runs in a routine operational mode at NOAA/NCEP. After a brief description of relevant aspects of wave models, basic parallelization concepts are discussed. It is argued that a method including data transposes is more suitable for this model than conventional domain decomposition techniques. Details of the implementation, including specific buffering techniques for the data to be communicated between processors, are discussed. Extensive timing results are presented for up to 450 processors on an IBM RS6000 SP. The resulting model is shown to exhibit excellent parallel behavior for a large range of numbers of processors. (C) 2002 Elsevier Science B.V. All rights reserved.

Keywords: ocean wind-wave modelling; distributed memory computing; message passing

Large-eddy simulations on distributed shared memory clusters

Journal of Parallel and Distributed Computing, 2004, vol. 64, no. 10, pp. 1103-1112

Authors: Stone, C; Menon, S (Georgia Inst Technol, Dept Aerosp Engn, Atlanta, GA 30332, USA)

The practicality of large-eddy simulation (LES) of turbulent combustion, as is found in gas turbine engines, on clusters of commodity PC-based symmetric multi-processor (SMP) systems in 2-, 4-, and 8-way configurations has been investigated. Bandwidth demands from both memory and networking in the benchmark LES algorithm are shown to be the primary performance inhibitors. Contention in the various SMP architectures tested is shown to compound these two hardware limitations. To investigate the ability of the parallel clustered systems, low-level hardware studies are conducted in conjunction with benchmarking of the LES application. The hardware tests focus on memory and communication contention under loads found in the LES algorithm. For comparison, the benchmarks are also applied to two industry-leading high-performance super-computing architectures. It is found that contention in the 4- and 8-way SMP architectures studied here limits their applicability, while the 2-way system shows competitive performance and speed-up compared to its industry counterparts. It is concluded that design-level combustion LES on clusters of commodity hardware, when equipped with sufficient memory and communication bandwidth, is a viable substitute for more expensive super-computing platforms. (C) 2004 Elsevier Inc. All rights reserved.

Keywords: computational fluid dynamics (CFD); distributed memory computing; parallel performance benchmarks; memory bandwidth; network bandwidth; cluster computing

Implementing scoped behavior for flexible distributed data sharing

IEEE Concurrency, 2000, vol. 8, no. 3, pp. 63-73

Author: Lu, P (Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2H1, Canada)

In the Aurora distributed shared data system, the programmer instantiates shared-data objects and uses scoped behavior to incrementally tune applications on a per-object and per-context basis. A class library implements shared-data objects as abstract data types and scoped behavior implements the optimizations within standard C++. Using a network of workstations connected by an ATM switch, the author demonstrates that Aurora performs comparably to message passing

Keywords: distributed memory computing; Shared Data; Data Sharing Patterns; Optimizations; Scoped Behavior; Network Of Workstations

Performance of preconditioned iterative solvers in MFiX-Trilinos for fluidized beds

Journal of Supercomputing, 2018, vol. 74, no. 8, pp. 4104-4126

Authors: Kotteda, V. M. Krushnarao; Kumar, Vinod; Spotz, William (Univ Texas El Paso, 500 W Univ Ave, El Paso, TX 79968, USA; Sandia Natl Labs, POB 5800, MS 1320, Albuquerque, NM, USA)

MFiX, a general-purpose Fortran-based suite, simulates the complex flow in fluidized bed applications via BiCGStab and GMRES methods along with plane relaxation preconditioners. Trilinos, an object-oriented framework, contains various first- and second-generation Krylov subspace solvers and preconditioners. We developed a framework to integrate MFiX with Trilinos, as MFiX does not possess advanced linear methods. The framework allows MFiX to access advanced linear solvers and preconditioners in Trilinos. The integrated solver is hereafter called MFiX-Trilinos. In the present work, we study the performance of variants of GMRES and CGS methods in MFiX-Trilinos and of BiCGStab and GMRES solvers in MFiX for a 3D gas-solid fluidized bed problem. Two right preconditioners are employed along with various solvers in MFiX-Trilinos: Jacobi and smoothed aggregation. The flow from MFiX-Trilinos is validated against that from MFiX for BiCGStab and GMRES methods. The effect of the preconditioning on the iterative solvers in MFiX-Trilinos is also analyzed. In addition, the effect of left and right smoothed aggregation preconditioning on the solvers is studied. The performance of the first- and second-generation solver stacks in MFiX-Trilinos is studied as well for two different problem sizes.

Keywords: Linear solvers; Preconditioners; MFiX-Trilinos; Trilinos; MFiX; Fluidized beds; distributed memory computing

A case study in mechanically deriving dense linear algebra code

International Journal of High Performance Computing Applications, 2013, vol. 27, no. 4, pp. 440-453

Authors: Marker, Bryan; Batory, Don; van de Geijn, Robert (Univ Texas Austin, Dept Comp Sci, Austin, TX 78701, USA)

Design by Transformation (DxT) is a top-down approach to mechanically derive high-performance algorithms for dense linear algebra. We use DxT to derive the implementation of a representative matrix operation, two-sided Trmm. We start with a knowledge base of transformations that were encoded for a simpler set of operations, the level-3 BLAS, and add only a few transformations to accommodate the more complex two-sided Trmm. These additions explode the search space of our prototype system, DxTer, requiring the novel techniques defined in this paper to eliminate large segments of the search space that contain suboptimal algorithms. Performance results for the mechanically optimized implementations on 8192 cores of a BlueGene/P architecture are given.

Keywords: program generation; distributed memory computing; high-performance numerical algorithms; dense linear algebra; autotuning
