检索结果-内蒙古大学图书馆

2nd USENIX Workshop on Hot Topics in parallelism, HotPar 2010

作者： Jenista, James C. Eom, Yong Hun Demsky, Brian

Developing parallel software using current tools can be challenging. Developers must reason carefully about the use of locks to avoid both race conditions and deadlocks. We present a compiler-assisted approach to parallel programming inspired by out-of-order hardware. In our approach, the developer annotates code blocks as reorderable to decouple these blocks from the parent thread of execution. OoOJava uses static analysis to extract all data dependences from both variables and data structures to generate an executable that is guaranteed to preserve the behavior of the original sequential code. We have implemented OoOJava and achieved significant speedups for a ray tracer and a K-Means cluster benchmark. The straightforward development model, compiler feedback, and speedups are promising indicators that a simple deterministic parallel programming model with strong guarantees can become mainstream. © HotPar 2010.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

Implementation and performance evaluation of XcalableMP: A parallel programming language for distributed memory systems

Implementation and performance evaluation of XcalableMP: A p...

引用

International Workshop on Scheduling and Resource Management for parallel and Distributed Systems

作者： Lee, Jinpil Sato, Mitsuhisa Graduate School of Systems and Information Engineering University of Tsukuba Tsukuba Japan Center for Computational Sciences University of Tsukuba Tsukuba Japan

ISBN: (纸本)9780769541570

Although MPI is a de-facto standard for parallel programming on distributed memory systems, writing MPI programs is often a time-consuming and complicated process. XcalableMP is a language extension of C and Fortran for parallel programming on distributed memory systems that helps users to reduce those programming efforts. XcalableMP provides two programming models. The first one is the global view model, which supports typical parallelization based on the data and task parallel paradigm, and enables parallelizing the original sequential code using minimal modification with simple, OpenMP-like directives. The other one is the local view model, which allows using CAF-like expressions to describe internode communication. Users can even use MPI and OpenMP explicitly in our language to optimize performance explicitly. In this paper, we introduce XcalableMP, the implementation of the compiler, and the performance evaluation result. For the performance evaluation, we parallelized HPCC Benchmark in XcalableMP. It shows that users can describe the parallelization for distributed memory system with a small modification to the original sequential code. © 2010 IEEE.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

A parallelised distributed implementation of a Branch and Fix Coordination algorithm

引用

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2015年第1期244卷 77-85页

作者： Pages-Bernaus, Adela Perez-Valdes, Gerardo Tomasgard, Asgeir Norwegian Univ Sci & Technol Inst Ind Econ & Technol Management Trondheim Norway SINTEF Technol & Soc Appl Econ Dept Trondheim Norway

Branch and Fix Coordination is an algorithm intended to solve large scale multi-stage stochastic mixed integer problems, based on the particular structure of such problems, so that they can be broken down into smaller subproblems. With this in mind, it is possible to use distributed computation techniques to solve the several subproblems in a parallel way, almost independently. To guarantee non-anticipativity in the global solution, the values of the integer variables in the subproblems are coordinated by a master thread. Scenario 'clusters' lend themselves particularly well to parallelisation, allowing us to solve some problems noticeably faster. Thanks to the decomposition into smaller subproblems, we can also attempt to solve otherwise intractable instances. In this work, we present details on the computational implementation of the Branch and Fix Coordination algorithm. (C) 2015 The Authors. Published by Elsevier B.V.

关键词： Stochastic mixed-integer problems Branch and fix coordination algorithm parallel programming

来源：评论

学校读者我要写书评

暂无评论

The last mile: parallel programming and usability

The last mile: Parallel programming and usability

引用

FSE/SDP Workshop on the Future of Software Engineering Research, FoSER 2010

作者： Sadowski, Caitlin Shewmaker, Andrew Computer Science Department University of California at Santa Cruz Santa Cruz CA United States

ISBN: (纸本)9781450304276

Multiprocessors are now commonplace, and cloud computing is swiftly following suit. While it is possible to write high performance code for these systems, concurrency bugs are extremely common and theoretical performance is often difficult to realize. In order to take advantage of increasing numbers of parallel resources, numerous parallel programming systems have been proposed and deployed, usually without a systematic evaluation of their usability. In order to make both programmers and their parallel applications more effective, we need more useful metrics for measuring programmer productivity and a better way to evaluate such metrics. We posit that usability is a key factor in the effectiveness of a parallel programming system, and that theoretical performance gains can only be realized if programmers are able to successfully reason about their parallel code. Copyright 2010 ACM.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

A task-parallel programming language for interactive applications 10

A task-parallel programming language for interactive applica...

引用

ACM SIGGRAPH ASIA 2010 Posters

作者： Tebbs, Duncan Square-Enix Research Center Japan

ISBN: (纸本)9781450305242

Task-parallel programming is a methodology in which algorithms are specified as a set of tasks to be executed, and the dependencies between them. A scheduler can then automatically determine the correct execution order and extract parallelism. Task programming is well-known to be a very effective way to leverage parallel hardware (and is gaining popularity among game developers [Lavaire and Quenin 2010]), however there is significant programming overhead associated with maintaining a program in this form.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

The STAPL Skeleton Framework 27

The STAPL Skeleton Framework

引用

27th International Workshop on Languages and Compilers for parallel Computing (LCPC)

作者： Zandifar, Mani Thomas, Nathan Amato, Nancy M. Rauchwerger, Lawrence Texas A&M Univ Dept Comp Sci Parasol Lab College Stn TX 77843 USA

ISBN: (纸本)9783319174730;9783319174723

This paper describes the stapl Skeleton Framework, a high-level skeletal approach for parallel programming. This framework abstracts the underlying details of data distribution and parallelism from programmers and enables them to express parallel programs as a composition of existing elementary skeletons such as map, map-reduce, scan, zip, butterfly, allreduce, alltoall and user-defined custom skeletons. Skeletons in this framework are defined as parametric data flow graphs, and their compositions are defined in terms of data flow graph compositions. Defining the composition in this manner allows dependencies between skeletons to be defined in terms of point-to-point dependencies, avoiding unnecessary global synchronizations. To show the ease of composability and expressivity, we implemented the NAS Integer Sort (IS) and Embarrassingly parallel (EP) benchmarks using skeletons and demonstrate comparable performance to the hand-optimized reference implementations. To demonstrate scalable performance, we show a transformation which enables applications written in terms of skeletons to run on more than 100,000 cores.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

Unfolding based Minimal Test Suites for Testing Multithreaded Programs 15

Unfolding based Minimal Test Suites for Testing Multithreade...

引用

15th International Conference on Application of Concurrency to System Design (ACSD)

作者： Ponce-de-Leon, Hernan Saarikivi, Olli Kahkonen, Kari Heljanko, Keijo Aalto Univ HIIT Helsinki Finland Aalto Univ Dept Comp Sci Helsinki Finland

ISBN: (纸本)9781467378826

This paper focuses on the problem of computing the minimal test suite for a terminating multithreaded program that covers all its executable statements. We have in previous work shown how to use unfoldings to capture the true concurrency semantics of multithreaded programs and to generate test cases for it. In this paper we rely on this earlier work and show how the unfolding can be used to generate the minimal test suite that covers all the executable statements of the program. The problem of generating such a minimal test suite is shown to be NP-complete in the size of the unfolding, and as a side result, covering executable transitions of any terminating safe Petri net is also NP-complete in the size of its unfolding. We propose SMT-encodings to these problems and give initial results on applying this encoding to compute the minimal test suite for several benchmarks.

关键词： Concrete Concurrent computing Electronic mail Encoding Instruction sets Petri nets Testing Multithreaded programs SMT unfoldings instruction sets Petri nets electronic mail Surface mount technology parallel programming Multithreading Concrete Nondeterministic polynomial-time hard Minimal

来源：评论

学校读者我要写书评

暂无评论

parallel Objects for Multicores: A Glimpse at the parallel Language ENCORE 15th

Parallel Objects for Multicores: A Glimpse at the Parallel L...

引用

15th International School on Formal Methods for the Design of Computer, Communication, and Software Systems (SFM)

作者： Brandauer, Stephan Castegren, Elias Clarke, Dave Fernandez-Reyes, Kiko Johnsen, Einar Broch Pun, Ka I. Tarifa, S. Lizeth Tapia Wrigstad, Tobias Yang, Albert Mingkun Uppsala Univ Dept Informat Technol Uppsala Sweden Univ Oslo Dept Informat N-0316 Oslo Norway

ISBN: (纸本)9783319189413;9783319189406

The age of multi-core computers is upon us, yet current programming languages, typically designed for single-core computers and adapted post hoc for multi-cores, remain tied to the constraints of a sequential mindset and are thus in many ways inadequate. New programming language designs are required that break away from this old-fashioned mindset. To address this need, we have been developing a new programming language called Encore, in the context of the European Project UpScale. The paper presents a motivation for the Encore language, examples of its main constructs, several larger programs, a formalisation of its core, and a discussion of some future directions our work will take. The work is ongoing and we started more or less from scratch. That means that a lot of work has to be done, but also that we need not be tied to decisions made for sequential language designs. Any design decision can be made in favour of good performance and scalability. For this reason, Encore offers an interesting platform for future exploration into object-oriented parallel programming.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

Exascaling Your Library: Will Your Implementation Meet Your Expectations? 15

Exascaling Your Library: Will Your Implementation Meet Your ...

引用

29th ACM International Conference on Supercomputing (ICS)

作者： Shudler, Sergei Calotoiu, Alexandru Hoefler, Torsten Strube, Alexandre Wolf, Felix Tech Univ Darmstadt Darmstadt Germany Swiss Fed Inst Technol Zurich Switzerland Julich Supercomp Ctr Julich Germany

ISBN: (纸本)9781450335591

Many libraries in the HPC field encapsulate sophisticated algorithms with clear theoretical scalability expectations. However, hardware constraints or programming bugs may sometimes render these expectations inaccurate or even plainly wrong. While algorithm engineers have already been advocating the systematic combination of analytical performance models with practical measurements for a very long time, we go one step further and show how this comparison can become part of automated testing procedures. The most important applications of our method include initial validation, regression testing, and benchmarking to compare implementation and platform alternatives. Advancing the concept of performance assertions, we verify asymptotic scaling trends rather than precise analytical expressions, relieving the developer from the burden of having to specify and maintain very fine-grained and potentially non-portable expectations. In this way, scalability validation can be continuously applied throughout the whole development cycle with very little effort. Using MPI as an example, we show how our method can help uncover non-obvious limitations of both libraries and underlying platforms.

关键词： software engineering high performance computing parallel programming performance analysis

来源：评论

学校读者我要写书评

暂无评论

Performance Evaluation of Unscented Kalman Filter Using Multi-Core Processors Environment

Performance Evaluation of Unscented Kalman Filter Using Mult...

引用

IEEE International Conference on Computer, Communication and Control (IC4)

作者： Sharma, Suresh Kumar Nene, Manisha J. Deemed Univ Def Inst Adv Technol Pune 411025 Maharashtra India

ISBN: (纸本)9781479981632

The Unscented Kalman Filter (UKF) is widely used to solve nonlinear systems, like submarine tracking, aircraft surveillance, autonomous robotics and mobile systems. One of the typical problems solved using UKF is Bearing-Only Target Motion Analysis (BOTMA) for manoeuvring and non manoeuvring targets. This paper proposes a methodology for parallel execution of UKF with an aim to enhance its performance in terms of computational throughput. parallel algorithm and its execution of UKF for BOTMA will use multi-core processor environment. The study concentrate on identifying the phases of UKF enabled BOTMA that can be parallelized to execute on the hardware underneath to enhance the response time. The performance is observed and results are verified.

关键词： Unscented Kalman Filter (UKF) Bearing-Only Target Motion Analysis (BOTMA) parallel programming time complexity computational complexity

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：