ISBN (print): 9781905088416
This contribution presents a computational framework for simulation and gradient-based structural optimization of geometrically nonlinear and large-scale structural finite element models. CAGD-free optimization methods have been developed to integrate shape optimization in an early stage of design and to reduce the related modelling effort. To overcome the problem of an increasing numerical cost due to the large design space, the design sensitivities for objectives and constraints are evaluated via adjoint formulations. A new parallel computation strategy for sensitivity evaluation is presented which takes advantage of a completely parallelized simulation and optimization environment. Two application examples illustrate the method and demonstrate the high parallel efficiency.
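The cost argument for the adjoint approach can be made concrete with a toy example. The sketch below is plain Python with a diagonal "stiffness matrix" K(s) = diag(s) standing in for a real FE system; all names and the compliance objective are illustrative assumptions, not the authors' code. It shows how one state solve plus one adjoint solve yield the gradient of the objective with respect to every design variable at once:

```python
# Toy sketch of adjoint sensitivity evaluation (illustrative only).
# K(s) = diag(s) stands in for a real FE stiffness matrix, with
# compliance J = f . u as the objective.

def solve_diag(K_diag, f):
    # state solve: K u = f (trivial for a diagonal K)
    return [fi / ki for ki, fi in zip(K_diag, f)]

def compliance(f, u):
    # objective J(u) = f . u
    return sum(fi * ui for fi, ui in zip(f, u))

def adjoint_sensitivities(K_diag, f):
    # one state solve plus one adjoint solve give dJ/ds_i for ALL
    # design variables; the cost is independent of the design-space size
    u = solve_diag(K_diag, f)
    lam = solve_diag(K_diag, f)  # adjoint: K^T lam = dJ/du = f (K symmetric)
    # dJ/ds_i = -lam^T (dK/ds_i) u; here dK/ds_i has a single diagonal 1
    return [-li * ui for li, ui in zip(lam, u)]
```

For this diagonal case the adjoint result -u_i^2 agrees with the analytic derivative of J = sum(f_i^2 / s_i), namely -f_i^2 / s_i^2, and the number of solves stays at two no matter how large the design space grows.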
ISBN (print): 9783642132162
The process-thread hybrid programming paradigm is commonly employed in SMP clusters. XPFortran, a parallel programming language that specifies a set of compiler directives and library routines, can be used to realize process-level parallelism in distributed memory systems. In this paper, we introduce hybrid parallel programming with XPFortran to SMP clusters, in which thread-level parallelism is realized by OpenMP. We present the language support and compiler implementation of OpenMP directives in XPFortran, and share some of our experiences in XPFortran-OpenMP hybrid programming. For nested loops parallelized by process-thread hybrid programming, it is common practice to use process parallelization for outer loops and thread parallelization for inner ones. However, we have found that in some cases it is possible to write an XPFortran-OpenMP hybrid program the other way around, i.e., OpenMP outside, XPFortran inside. Our evaluation results show that this programming style sometimes delivers better performance than the traditional one. We therefore recommend applying hybrid parallelization flexibly.
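The two nesting orders can be sketched with a toy nested loop. The following plain-Python analogue is not XPFortran: both levels use threads here so the sketch stays self-contained, whereas in the paper one level is process-parallel. It only contrasts the "outer parallel" structure with the reversed "inner parallel" one:

```python
from concurrent.futures import ThreadPoolExecutor

def work(i, j):
    # stand-in loop body
    return i * 10 + j

def outer_parallel(n, m):
    # conventional order: parallelize the outer loop,
    # run the inner loop serially inside each task
    with ThreadPoolExecutor() as pool:
        rows = pool.map(lambda i: [work(i, j) for j in range(m)], range(n))
        return list(rows)

def inner_parallel(n, m):
    # reversed order: serial outer loop, parallel inner loop
    with ThreadPoolExecutor() as pool:
        return [list(pool.map(lambda j, i=i: work(i, j), range(m)))
                for i in range(n)]
```

Both orders compute the same result; which one performs better depends on loop extents, granularity, and the runtime, which is exactly why the paper argues for choosing flexibly.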
ISBN (print): 9781538683194
Parallel programming can be extremely challenging. Programming models have been proposed to simplify this task, but wide acceptance of these remains elusive for many reasons, including the demand for greater accessibility and productivity. In this paper, we introduce a parallel programming model and framework called CharmPy, based on the Python language. CharmPy builds on Charm++, and runs on top of its C++ runtime. It presents several unique features in the form of a simplified model and API, increased flexibility, and the ability to write everything in Python. CharmPy is a high-level model based on the paradigm of distributed migratable objects. It retains the benefits of the Charm++ runtime, including dynamic load balancing, an asynchronous execution model with automatic overlap of communication and computation, high performance, and scalability from laptops to supercomputers. By being Python-based, CharmPy also benefits from modern language features, access to popular scientific computing and data science software, and interoperability with existing technologies like C, Fortran and OpenMP. To illustrate the simplicity of the model, we show how to implement a distributed parallel map function based on the Master-Worker pattern using CharmPy, with support for asynchronous concurrent jobs. We also present performance results from running stencil code and molecular dynamics mini-apps fully written in Python on the Blue Waters and Cori supercomputers. For stencil3d, we show performance similar to an equivalent MPI-based program, and significantly improved performance for imbalanced computations. Using Numba to JIT-compile the critical parts of the code, we show performance for both mini-apps similar to the equivalent C++ code.
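The Master-Worker map pattern the authors describe can be sketched, structurally, with Python's standard library. This is an analogue of the pattern only, not the CharmPy API: a thread-backed pool stands in for distributed migratable chares, and all names are illustrative.

```python
from multiprocessing.dummy import Pool  # thread-backed pool keeps the sketch portable

def parallel_map(func, items, workers=4):
    """Master-worker map: the master hands out items, workers compute
    them concurrently, and ordered results come back to the master.
    CharmPy would distribute the workers across nodes; threads stand in here."""
    with Pool(workers) as pool:
        async_result = pool.map_async(func, items)  # asynchronous job submission
        return async_result.get()                   # master collects the results
```

The asynchronous submission (`map_async`) mirrors the abstract's "support for asynchronous concurrent jobs": the master could launch several maps before collecting any of them.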
ISBN (print): 9789819735556; 9789819735563
The use of modern browsers reveals itself as more and more essential to the world. Features like Web Workers are being adopted across the most used browsers on the Internet, enabling performance enhancements in web applications and, as a consequence, the execution of tasks with higher computational demand inside the browser. This work presents a technique for task parallelization using Web Workers, with an algorithm for crossword generation executed in a browser context as a case study. The results show even superlinear speedups for a parallel version of the algorithm.
ISBN (print): 9781467313513
Parallel programming is an important issue for current multi-core processors and necessary for new generations of many-core architectures. This includes processors, computers, and clusters. However, the introduction of parallel programming in undergraduate courses demands new efforts to prepare students for this new reality. This paper describes an experiment on a traditional Computer Science course over a two-year period. The main focus is the question of when to introduce parallel programming models in order to improve the quality of learning. The goal is to propose a method of introducing parallel programming based on OpenMP (a shared-variable model) and MPI (a message-passing model). Results show that the best learning outcomes are achieved when the OpenMP model is introduced before the MPI model. The main contribution of this paper is the proposed method, which correlates several concepts such as concurrency, parallelism, speedup, and scalability to improve student motivation and learning.
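Two of the concepts the method correlates, speedup and scalability, can be made concrete for students with Amdahl's law. A minimal sketch, where the serial fraction s and processor count p are the standard textbook symbols, not notation from the paper:

```python
def speedup(serial_fraction, processors):
    # Amdahl's law: S(p) = 1 / (s + (1 - s) / p)
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / processors)

def efficiency(serial_fraction, processors):
    # efficiency = S(p) / p; falling efficiency signals limited scalability
    return speedup(serial_fraction, processors) / processors
```

A perfectly parallel code (s = 0) gives linear speedup, while even s = 0.5 caps the speedup at 2 no matter how many processors are added, which is the usual classroom route from "speedup" to "scalability".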
ISBN (print): 0818684275
The paper discusses the relationships between hierarchically composite MPP architectures and the software technology derived from the structured parallel programming methodology, in particular the architectural support for successive modular refinements of parallel applications, and the architectural support for the parallel programming paradigms and their combinations. The structured parallel programming methodology referred to here is an application of the Skeletons model. The considered hierarchically composite architectures are MPP machine models for PetaFlops computing, composed of proper combinations of current architectural models of different granularities, where the Processors-In-Memory model is adopted at the finest granularity level. The methodologies are discussed with reference to the current PQE2000 Project on MPP general-purpose systems.
ISBN (print): 9780769533087
As a network middleware, Tuplespace provides a powerful way to do distributed computing. Building on the TSpaces implementation, we describe the design and implementation of TSPI, a Tuplespace-based parallel programming library that can be called from C. In particular, TSPI supports important functions such as reliability, high performance, and computers joining and leaving dynamically. Furthermore, the corresponding parallel program structure is proposed. Compared with MPI, TSPI is simple, and it supports dynamic environments and load balancing.
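The tuplespace operations underlying TSPI can be illustrated with a minimal in-memory sketch. Python is used here for brevity, and the operation names follow the classic Linda model (out/rd/in), not TSPI's actual C API; a real tuplespace such as TSpaces adds networking, blocking semantics, and fault tolerance on top of this.

```python
class TupleSpace:
    """Minimal in-memory sketch of Linda-style tuplespace operations."""

    def __init__(self):
        self._tuples = []

    def out(self, tup):
        # write a tuple into the space
        self._tuples.append(tup)

    def rd(self, pattern):
        # non-destructive read; None acts as a wildcard field
        for t in self._tuples:
            if self._match(t, pattern):
                return t
        return None

    def inp(self, pattern):
        # destructive take ("in" in Linda; renamed, as `in` is a Python keyword)
        t = self.rd(pattern)
        if t is not None:
            self._tuples.remove(t)
        return t

    @staticmethod
    def _match(t, pattern):
        return len(t) == len(pattern) and all(
            p is None or p == v for v, p in zip(t, pattern))
```

Workers coordinating through such a space never address each other directly, which is what makes dynamic joining and leaving natural in this model.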
ISBN (print): 9781728101903
In recent decades, the continuous proliferation of High-Performance Computing (HPC) systems and data centers has augmented the demand for expert HPC system designers, administrators, and programmers. For this reason, most universities have introduced courses on HPC systems and parallel programming in their degrees. However, the laboratory assignments of these courses generally use clusters that are owned, managed, and administrated by the university. This methodology has been shown to be effective for teaching parallel programming, but using a remote cluster prevents the students from experimenting with the design, setup, and administration of such systems. This paper presents a methodology and framework for teaching HPC systems and parallel programming using a small-scale cluster of single-board computers. These boards are very cheap, their processors are fundamentally very similar to the ones found in HPC systems, and they are ready to execute Linux out of the box, so they represent a perfect laboratory playground for students to experience assembling a cluster, setting it up, and configuring its system software. Also, we show that these small-scale clusters can be used as evaluation platforms for both introductory and advanced parallel programming assignments.
This paper proposes a parallel programming scheme for the cross-point array with resistive random access memory (RRAM). Synaptic plasticity in unsupervised learning is realized by tuning the conductance of each RRAM cell. Inspired by spike-timing-dependent plasticity (STDP), the programming strength is encoded into the spike firing rate (i.e., pulse frequency) and the overlap time (i.e., duty cycle) of the pre-synaptic node and post-synaptic node, and is simultaneously applied to all RRAM cells in the cross-point array. Such an approach achieves parallel programming of the entire RRAM array, requiring only local information from the pre-synaptic and post-synaptic nodes of each RRAM cell. As demonstrated by digital peripheral circuits implemented in 65 nm CMOS, the programming time of a 40 kb RRAM array is 84 ns, indicating a 900X speedup compared to the state-of-the-art software approach of sparse coding in image feature extraction.
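A toy behavioural model can clarify why the scheme parallelizes. Everything below (the function names, the multiplicative update rule, the learning rate) is an illustrative assumption, not the paper's circuit; the only point carried over is that each cell's update depends solely on its own pre- and post-synaptic signals, so every cell can be programmed at the same time:

```python
def overlap_update(pre_rate, post_rate, duty_cycle, g,
                   g_min=0.0, g_max=1.0, eta=0.01):
    # programming strength grows with the spike rates and the pulse overlap
    # (duty cycle); the cell conductance g is nudged and clipped to its range
    strength = pre_rate * post_rate * duty_cycle * eta
    return min(g_max, max(g_min, g + strength))

def program_array(pre_rates, post_rates, duty, G, eta=0.01):
    # cell (i, j) uses only its pre-synaptic row rate and post-synaptic
    # column rate -> all updates are independent, hence fully parallel
    return [[overlap_update(pre_rates[i], post_rates[j], duty,
                            G[i][j], eta=eta)
             for j in range(len(post_rates))]
            for i in range(len(pre_rates))]
```

Because no cell reads any other cell's state, the loop nest could be applied in one shot across the whole cross-point array, which is the property the hardware exploits.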
ISBN (print): 9781479927289
Many parallel and distributed message-passing programs are written in a parametric way over available resources, in particular the number of nodes and their topologies, so that a single parallel program can scale over different environments. This paper presents a parameterised protocol description language, Pabble, which can guarantee safety and progress in a large class of practical, complex parameterised message-passing programs through static checking. Pabble can describe an overall interaction topology, using a concise and expressive notation, designed for a variable number of participants arranged in multiple dimensions. These parameterised protocols in turn automatically generate local protocols for type checking parameterised MPI programs for communication safety and deadlock freedom. In spite of the undecidability of endpoint projection and type checking in the underlying parameterised session type theory, our method guarantees the termination of both endpoint projection and type checking.