The purpose of this book is to help you program shared-memory parallel systems without risking your sanity. Nevertheless, you should think of the information in this book as a foundation on which to build, rather tha...
Parallel task-based programming models such as OpenMP support the declaration of task data dependences. This information is used to delay task execution until the task's data is available. The dependences between tasks are calculated at runtime using shared graphs that are updated concurrently by all threads. However, only one thread can modify the task graph at a time to ensure correctness; the others must wait before making their modifications. This waiting limits the application's parallelism and becomes critical in many-core systems. This paper characterizes this behavior, analyzing how it hinders performance, and presents an alternative organization suitable for the runtimes of task-based programming models. This organization allows the runtime structures to be managed asynchronously or synchronously, adapting the runtime to reduce wasted computation resources and increase performance. Results show that the new runtime structure outperforms the peak speedup of the original runtime model when contention is high, and achieves similar or better performance on real applications.
While modern parallel computing systems provide high-performance resources, utilizing them to the fullest extent requires advanced programming expertise. Programming for parallel computing systems is much more difficu...
A major component of many advanced programming courses is an open-ended "end-of-term project" assignment. Delivering and evaluating open-ended parallel programming projects for hundreds or thousands of students brings a need for broad system reconfigurability coupled with challenges of testing and development uniformity, access to esoteric hardware and programming environments, scalability, and security. We present RAI, a secure and extensible system for delivering open-ended programming assignments configured with access to different hardware and software requirements. We describe how the system was used to deliver a programming-competition-style final project in an introductory GPU programming course at the University of Illinois Urbana-Champaign.
In the past, the tenacious semiconductor problems of operating temperature and power consumption limited performance growth for single-core microprocessors. Microprocessor vendors therefore adopted multicore chip organizations with parallel processing, because the new technology promised higher speed at lower power. This trend quickly reached first CPU development and then other components such as GPUs. Modern GPUs are very efficient at manipulating computer graphics, and their highly parallel structure makes them more effective than general-purpose CPUs for a range of complex graphical algorithms. However, multicore processor technology brought a revolution, and an unavoidable collision, to programmers. Multicore processors offer high performance, but parallel processing brings a challenge as well as an opportunity. Efficiency, and the way the programmer or compiler explicitly parallelizes the software, are the keys to improving performance on a multicore chip. In this paper, we propose a parallel programming approach using hybrid CUDA, OpenMP, and MPI programming. Two verification experiments are presented. In the first, we verify the availability and correctness of auto-parallelization tools and discuss performance issues on CPU, GPU, and embedded systems. In the second, we verify how hybrid programming can improve performance. Copyright (C) 2016 John Wiley & Sons, Ltd.
ISBN:
(Print) 9781509015405
A hash function maps a message of arbitrary length to a fixed-length shorter string called a message digest. Inevitably, many different messages will hash to the same or a similar digest; we call this a collision or partial collision. Utilizing multiple processors at the CUNY High Performance Computing Center's facility, we locate partial collisions for MD5 and SHA-1 by brute force, using parallel programs written in C with the MPI library. The brute-force method of finding a second-preimage collision entails systematically computing all of the permutations, digests, and Hamming distances of the target preimage. We explore varying target string sizes and processor allocations and examine the effect these variables have on finding partial collisions. The results show that, for the same message space, the search time for partial collisions is roughly halved for each doubling of the number of processors, and that longer messages produce better partial collisions.
ISBN:
(Print) 9781509035311
Cyber-physical systems (CPSs) are embedded systems that are tightly integrated with their physical environment. The correctness of a CPS depends on the output of its computations and on the timeliness of completing them. This paper proposes the ForeC language for deterministic parallel programming of CPS applications on multi-core execution platforms. ForeC's synchronous semantics is designed to greatly simplify the understanding and debugging of parallel programs. ForeC allows programmers to express many forms of parallel patterns while ensuring that programs are amenable to static timing analysis. One of ForeC's main innovations is its shared-variable semantics, which provides thread isolation and deterministic thread communication. Through benchmarking, we demonstrate that ForeC can achieve better parallel performance than Esterel, a widely used synchronous language for concurrent safety-critical systems, and OpenMP, a popular desktop solution for parallel programming. We also demonstrate that the worst-case execution time of ForeC programs can be estimated precisely.
ISBN:
(Print) 9781509036820
This paper presents an experience of problem-based learning in a parallel programming course. The course covers the basics of parallel programming, from methodological and technological aspects to the analysis and design of parallel algorithms. The students work with an optimization problem in the field of parallel computing: the execution time and energy consumption of a simplified master-slave scheme on a simplified heterogeneous system are optimized, treating it as a bi-objective optimization problem that is addressed with sequential, shared-memory, message-passing, and hybrid parallel programming. In this way, the students follow the various parts of the syllabus by working with a problem that combines topics studied in previous courses (green computing, computational systems architecture, optimization, heuristics), which contributes to a deeper understanding of these topics and motivates the introduction of new concepts.
JavaScript is the most popular programming language for client-side Web applications, and *** has popularized the language for server-side computing, too. In this domain, the minimal support for parallel programming r...