In this paper, two different acceleration techniques for a deterministic DIRECT (DIviding RECTangles)-type global optimization algorithm, DIRECT-GLce, are considered. We adopt dynamic data structures for better memory usage in the MATLAB implementation. We also study shared and distributed parallel implementations of the original DIRECT-GLce algorithm, and a distributed parallel version for the aggressive counterpart. The efficiency of the DIRECT-type parallel versions is evaluated by solving box- and generally constrained global optimization problems of varying complexity, including a practical NASA speed reducer design problem. Numerical results show good efficiency, especially for the distributed parallel version of the original DIRECT-GLce on a multi-core PC. (C) 2020 Elsevier Inc. All rights reserved.
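At its core, a parallel DIRECT-type iteration amounts to evaluating the objective at many hyper-rectangle centers concurrently. A minimal hedged sketch of that pattern in MATLAB, using a parfor loop from the Parallel Computing Toolbox, is given below; the objective g and the candidate-center matrix C are illustrative placeholders, not the authors' DIRECT-GLce code.

    % A minimal sketch, not the authors' implementation: evaluate the
    % objective at many candidate hyper-rectangle centers in parallel.
    g = @(x) sum(x.^2);          % placeholder objective function
    C = rand(1000, 5);           % 1000 candidate centers in [0,1]^5 (illustrative)
    f = zeros(size(C, 1), 1);
    parfor i = 1:size(C, 1)      % workers evaluate disjoint subsets of centers
        f(i) = g(C(i, :));
    end
    [fmin, imin] = min(f);       % best candidate found in this iteration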
ISBN:
(Print) 9781479969180
Today, multiprocessors, multicores, clusters, and heterogeneous computing are becoming the most popular architectures for achieving high-performance computing. System designers take different approaches to enhancing system performance, such as increasing the clock frequency of CPUs from MHz to GHz and adding more CPU cores, i.e., moving from single-core processors to dual-core, quad-core, hexa-core, octa-core, ten-core, and larger processors. Still, multicore processing creates some challenges of its own: the extra cores result in increased processor size and higher power consumption. Meanwhile, General-Purpose Graphics Processing Units (GPGPUs) have been designed and implemented that contain hundreds of cores with a larger number of arithmetic logic units and control units. These GPGPUs can be used alongside the CPU in heterogeneous computing to enhance system performance for selected applications through data parallelism. A heterogeneous programming environment that includes other processors, such as a GPGPU, in addition to the CPU can be used to improve the execution performance of computationally intensive programs. It is therefore necessary for the programmer to run and analyze the selected computationally intensive programs on both homogeneous and heterogeneous programming platforms. The homogeneous programming environment makes use of a multi-core CPU, whereas the heterogeneous programming environment makes use of different processors, such as General-Purpose Graphics Processing Units (GPGPUs), Field Programmable Gate Arrays (FPGAs), and Digital Signal Processors (DSPs), in addition to the CPU. Hence, the programmer needs to write code that uses both the CPU and other processors via a heterogeneous software environment, such as parallel MATLAB with GPU-enabled functions, MATLAB-supported CUDA kernels, and CUDA C, to execute parallel code and achieve high performance in the heterogeneous programming environment in comparison with the homogeneous (sequential) one.
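The MATLAB GPU path mentioned above (GPU-enabled functions operating on gpuArray data) can be illustrated with a short, hedged sketch; the matrix size and the choice of fft2 are illustrative assumptions, not taken from the paper.

    % A minimal sketch of MATLAB's GPU-enabled-function workflow:
    % move data to the GPU, call a built-in that supports gpuArray
    % input, and gather the result back to host memory.
    A = rand(4096, 'single');    % data created on the CPU (size illustrative)
    G = gpuArray(A);             % transfer the matrix to GPU memory
    H = fft2(G);                 % fft2 executes on the GPU for gpuArray input
    R = gather(abs(H));          % copy the result back to the CPU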
ISBN:
(Print) 9781467315760; 9781467315777
Big Data (as embodied by Hadoop clusters) and Big Compute (as embodied by MPI clusters) provide unique capabilities for storing and processing large volumes of data. Hadoop clusters make distributed computing readily accessible to the Java community, and MPI clusters provide high parallel efficiency for compute-intensive workloads. Bringing the big data and big compute communities together is an active area of research. The LLGrid team has developed and deployed a number of technologies that aim to provide the best of both worlds. LLGrid MapReduce allows the map/reduce parallel programming model to be used quickly and efficiently in any language on any compute cluster. D4M (Dynamic Distributed Dimensional Data Model) provides a high-level distributed-array interface to the Apache Accumulo database. The accessibility of these technologies is assessed by measuring the effort required to use them, typically a few lines of code. The performance is assessed by measuring the insert rate into the Accumulo database. Using these tools, a database insert rate of 4M inserts/second has been achieved on an 8-node cluster.
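The "few lines of code" claim can be made concrete with a hedged sketch in D4M's associative-array style, modeled on the public D4M examples; the row, column, and value strings are illustrative, and the table binding is an assumption about a separately configured Accumulo connection.

    % A minimal sketch, after the public D4M examples: build an
    % associative array from (row, column, value) triples and query it.
    r = 'alice,bob,alice,';          % rows (comma-terminated triple strings)
    c = 'color,color,size,';         % columns
    v = 'blue,red,large,';           % values
    A = Assoc(r, c, v);              % D4M associative array
    Arow = A('alice,', :);           % all columns of row 'alice'
    % With a bound Accumulo table T (connection setup omitted here),
    % put(T, A) would insert these triples into the database.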
MATLAB® has emerged as one of the languages most commonly used by scientists and engineers for technical computing, with approximately one million users worldwide. The primary benefits of MATLAB are reduced code development time via high levels of abstraction (e.g., first-class multi-dimensional arrays and thousands of built-in functions), interpretive, interactive programming, and powerful mathematical graphics. The compute-intensive nature of technical computing means that many MATLAB users have codes that can significantly benefit from the increased performance offered by parallel computing. pMatlab provides this capability by implementing parallel global array semantics using standard operator-overloading techniques. The core data structure in pMatlab is a distributed numerical array whose distribution onto multiple processors is specified with a "map" construct. Communication operations between distributed arrays are abstracted away from the user, and pMatlab transparently supports redistribution between any block-cyclic-overlapped distributions of up to four dimensions. pMatlab is built on top of the MatlabMPI communication library and runs on any combination of heterogeneous systems that support MATLAB, which includes Windows, Linux, MacOS X, and SunOS. This paper describes the overall design and architecture of the pMatlab implementation. Performance is validated by implementing the HPC Challenge benchmark suite and comparing pMatlab performance with the equivalent C+MPI codes. These results indicate that pMatlab can often achieve comparable performance to C+MPI, usually at one tenth the code size. Finally, we present implementation data collected from a sample of real pMatlab applications drawn from the approximately one hundred users at MIT Lincoln Laboratory. These data indicate that users are typically able to go from a serial code to an efficient pMatlab code in about 3 hours while changing less than 1% of their code.
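A hedged sketch of the "map" construct described above, modeled on the published pMatlab examples; the process count, array size, and 1-D block distribution are illustrative assumptions.

    % A minimal pMatlab-style sketch: a map assigns a block distribution,
    % and arrays constructed with that map are distributed transparently.
    Np = 4;                           % number of MATLAB processes (assumed)
    m  = map([Np 1], {}, 0:Np-1);     % block-distribute rows across Np processes
    A  = zeros(1000, 1000, m);        % distributed array; each process owns a slab
    Aloc = local(A);                  % extract the locally owned block
    Aloc = Aloc.^2;                   % purely local computation
    A = put_local(A, Aloc);           % write the block back
    B = agg(A);                       % aggregate the full array on the leader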
ISBN:
(Print) 9783642152900
Several important laser-based medical treatments rest on the crucial knowledge of the response of tissues to laser penetration. Optical properties are often localised and are measured using optically active fluorescent microspheres injected into the tissue. However, the measurement process combines the tissue properties with the optical characteristics of the measuring device which in turn requires numerically intensive mathematical simulations for extracting the tissue properties from the data. In this paper, we focus on exploiting the algorithmic parallelism in the bio-computational simulation, in order to achieve significant runtime reductions. The entire simulation accounts for over 30,000 spatial points and is too computationally demanding to run in a serial fashion. We discuss our strategies of parallelisation at different levels of granularity and we present our results on two different parallel platforms. We also emphasise the importance of retaining a high level of code abstraction in the application to benefit both agile coding and interdisciplinary collaboration between research groups.
Parallel computing with the MATLAB® language and environment has received interest from various quarters. The Parallel Computing Toolbox™ and MATLAB® Distributed Computing Server™ from The MathWorks are among several available tools that offer this capability. We explore some of the key features of the parallel MATLAB language that these tools offer. We describe the underlying mechanics as well as the salient design decisions and rationale for certain features in the toolset. The paper concludes by identifying some issues that we must address as the language features evolve.
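Two of the toolbox's headline language features, parfor loops and spmd blocks, can be sketched briefly; the pool size is an illustrative assumption, and labindex/numlabs are the worker-identification functions of that era's toolbox.

    % A minimal sketch of the Parallel Computing Toolbox language features:
    % parfor distributes loop iterations, spmd runs one program on all workers.
    parpool(4);                       % open a pool of 4 workers (size assumed)
    s = zeros(1, 8);
    parfor i = 1:8                    % iterations spread across the pool
        s(i) = i^2;
    end
    spmd                              % single program, multiple data
        fprintf('worker %d of %d\n', labindex, numlabs);
    end
    delete(gcp('nocreate'));          % shut the pool down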
A distributed approach is described for solving lineality (or linearity) space (LS) problems with large cardinalities and a large number of dimensions. The LS solution has applications in engineering, science, and business, and includes a subset of solutions of the more general extended linear complementarity problem (ELCP). A parallel MATLAB framework is employed, and results are computed on an 8-node Rocks-based cluster computer using Remote Procedure Calls (RPCs) and the MPICH2 Message Passing Interface (MPI). Results show that both approaches perform comparably when solving distributed LS problems. This indicates that when deciding which parallel approach to use, the implementation details particular to the method are the decisive factors, which in this investigation give MPICH2 MPI the advantage.
It has been documented in the literature that the pseudospectrum of a matrix is a powerful concept that broadens our understanding of phenomena based on matrix computations. When the matrix A is non-normal, however, the computation of the pseudospectrum becomes a very expensive computational task. Thus, the use of high-performance computing resources becomes key to obtaining useful answers in acceptable amounts of time. In this work we describe the design and implementation of an environment that integrates a suite of state-of-the-art algorithms running on a cluster of workstations to enable the matrix pseudospectrum to become a practical tool for scientists and engineers. The user interacts with the environment via the graphical user interface PPsGUI. The environment is constructed on top of CMTM, an existing environment that enables distributed computation via an MPI API for MATLAB. (c) 2005 Elsevier B.V. All rights reserved.
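The expense the abstract refers to comes from the standard grid algorithm: the eps-pseudospectrum of A is the region where sigma_min(zI - A) <= eps, so a singular value decomposition is needed at every grid point. A hedged serial sketch follows (not the CMTM/PPsGUI code); the test matrix and grid are illustrative, and since every grid point is independent, the computation parallelizes naturally across a cluster.

    % A minimal serial sketch of grid-based pseudospectrum computation:
    % evaluate sigma_min(z*I - A) over a grid in the complex plane.
    n = 50;
    A = triu(randn(n), 0) + diag(10*rand(n, 1));  % illustrative non-normal matrix
    x = linspace(-5, 15, 80);
    y = linspace(-10, 10, 80);
    smin = zeros(numel(y), numel(x));
    for j = 1:numel(x)                % grid points are independent: trivially parallel
        for k = 1:numel(y)
            z = x(j) + 1i*y(k);
            smin(k, j) = min(svd(z*eye(n) - A));
        end
    end
    contour(x, y, log10(smin));       % contours approximate pseudospectra boundaries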
In many projects the true costs of high-performance computing are currently dominated by software. Addressing these costs may require shifting to higher-level languages such as MATLAB. MatlabMPI is a MATLAB implementation of the Message Passing Interface (MPI) standard and allows any MATLAB program to exploit multiple processors. MatlabMPI currently implements the basic six functions that are the core of the MPI point-to-point communications standard. The key technical innovation of MatlabMPI is that it implements the widely used MPI "look and feel" on top of standard MATLAB file I/O, resulting in an extremely compact (~350 lines of code) and "pure" implementation which runs anywhere MATLAB runs, and on any heterogeneous combination of computers. The performance has been tested on both shared- and distributed-memory parallel computers (e.g., Sun, SGI, HP, IBM, Linux, MacOS X, and Windows). MatlabMPI can match the bandwidth of C-based MPI at large message sizes. A test image filtering application using MatlabMPI achieved a speedup of ~300 using 304 CPUs and ~15% of the theoretical peak (450 Gigaflops) on an IBM SP2 at the Maui High Performance Computing Center. In addition, this entire parallel benchmark application was implemented in 70 software lines of code, illustrating the high productivity of this approach. (C) 2004 Published by Elsevier Inc.
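A hedged sketch of the six-function core named above, modeled on the examples distributed with MatlabMPI; the tag value and message contents are illustrative.

    % A minimal MatlabMPI-style sketch: rank 0 sends a variable to rank 1
    % using the six core point-to-point functions.
    MPI_Init;                          % initialize MatlabMPI
    comm = MPI_COMM_WORLD;             % the world communicator
    nprocs  = MPI_Comm_size(comm);     % number of MATLAB processes
    my_rank = MPI_Comm_rank(comm);     % this process's rank
    tag = 1;                           % illustrative message tag
    if my_rank == 0
        MPI_Send(1, tag, comm, 'hello');   % send a MATLAB variable to rank 1
    elseif my_rank == 1
        msg = MPI_Recv(0, tag, comm);      % receive the variable from rank 0
        disp(msg);
    end
    MPI_Finalize;                      % clean up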