检索结果-内蒙古大学图书馆

Library-Independent Data Race Detection

IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2014年第10期25卷 2606-2616页

作者： Jannesari, Ali Tichy, Walter F. German Res Sch Simulat Sci Multicore Programming Grp Aachen Germany Rhein Westfal TH Aachen Aachen Germany Karlsruhe Inst Technol D-76021 Karlsruhe Germany

Data races are a common problem on shared-memory parallel computers, including multicores. Analysis programs called race detectors help find and eliminate them. However, current race detectors are geared for specific concurrency libraries. When programmers use libraries unknown to a given detector, the detector becomes useless or requires extensive reprogramming. We introduce a new synchronization detection mechanism that is independent of concurrency libraries. It dynamically detects synchronization constructs based on a characteristic code pattern. The approach is non-intrusive and applicable to various concurrency libraries. Experimental results confirm that the approach identifies synchronizations and detects data races regardless of the concurrency libraries involved. With this mechanism, race detectors can be written once and need not be adapted to particular libraries.

关键词： parallel programming parallelization libraries ad hoc synchronization synchronization primitives dynamic analysis data race detection debugging multicore

来源：评论

学校读者我要写书评

暂无评论

parallel classification and feature selection in microarray data using SPRINT

引用

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2014年第4期26卷 854-865页

作者： Mitchell, Lawrence Sloan, Terence M. Mewissen, Muriel Ghazal, Peter Forster, Thorsten Piotrowski, Michal Trew, Arthur Univ Edinburgh Sch Phys & Astron EPCC Edinburgh EH9 3JZ Midlothian Scotland Univ Edinburgh Sch Med Div Pathway Med Edinburgh EH16 4SB Midlothian Scotland

The statistical language R is favoured by many biostatisticians for processing microarray data. In recent times, the quantity of data that can be obtained in experiments has risen significantly, making previously fast analyses time consuming or even not possible at all with the existing software infrastructure. High performance computing (HPC) systems offer a solution to these problems but at the expense of increased complexity for the end user. The Simple parallel R Interface is a library for R that aims to reduce the complexity of using HPC systems by providing biostatisticians with drop-in parallelised replacements of existing R functions. In this paper we describe parallel implementations of two popular techniques: exploratory clustering analyses using the random forest classifier and feature selection through identification of differentially expressed genes using the rank product method.

关键词： HPC Genomics parallel programming

来源：评论

学校读者我要写书评

暂无评论

Multiprocessing with GUI-awareness using OpenMP-like directives in Java

引用

parallel COMPUTING 2014年第2期40卷 69-89页

作者： Vikas Giacaman, Nasser Sinnen, Oliver Univ Auckland Dept Elect & Comp Engn Auckland Mail Ctr Auckland 1142 New Zealand

Directives based incremental parallelism is an uncomplicated and expressive parallelisation practice and has led to wide adoption of OpenMP. However, the OpenMP specification does not present a binding for the Java language and the OpenMP threading model finds limited use for GUI (Graphical User Interface) application development. This paper focuses on the study of a semantic interpretation of OpenMP in the context of an object orientated environment. It proposes novel concepts to extend OpenMP for applications with a Graphical User Interface (GUI), based on the distinction between parallelism and concurrency. We present a compiler-runtime system for OpenMP-like directives in Java, enhanced with GUI related constructs. Acknowledging the productivity gains of the incremental parallelism approach of OpenMP, the GUI related constructs enable the developer to incrementally introduce concurrency. We present and discuss the performance of programs written using our system by comparing them with previous attempts and traditional ways of parallelisation-concurrency, using the parallel Java Grande Forum (JGF) benchmarks and a set of GUI applications. (C) 2013 Elsevier B.V. All rights reserved.

关键词： parallel programming OpenMP Object orientation Graphical User Interface Java

来源：评论

学校读者我要写书评

暂无评论

parallel from the beginning: The case for multicore programming in the computer science undergraduate curriculum

Parallel from the beginning: The case for multicore programm...

引用

44th ACM Technical Symposium on Computer Science Education, SIGCSE 2013

作者： Ko, Yousun Burgstaller, Bernd Scholz, Bernhard Yonsei University Seoul Korea Republic of University of Sydney Sydney Australia

ISBN: (纸本)9781450320306

The computing landscape has shifted towards multicore architectures. To learn about software development, it is increasingly important for students to gain hands-on parallel programming experience in multicore environments. This experience will be significantly different from programming for uniprocessors, because it involves a profound understanding of how to write software that is (1) free of concurrency bugs and (2) able to effectively utilize the underlying parallel hardware architecture. We present our work at Yonsei University and The University of Sydney to teach parallel programming to first and second-year undergraduate students. Our objective is to introduce parallelism early on in the curriculum, to instill it as a first principle of computation. We introduce a series of five parallel programming course modules suitable for a one semester introductory programming course. Each module teaches one fundamental concept of parallel programming: parallelism and execution indeterminism, thread-and-lock based programming, performance of parallel programs, hardware acceleration using OpenCL, and stream-parallel programming with StreamIt. We report our experience from four course offerings (2008-2011) at Yonsei University, and two course offerings at The University of Sydney. Over 73%of students surveyed enjoyed this multicore programming experience and preferred exposure to parallelism at this early stage of their CS education. Our course has been awarded an Intel microgrant for "parallelism in the Classroom", and it is available online at Intel's Multicore Curriculum Initiative Website. Copyright © 2013 ACM.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

Optimized Cell programming for Flash Memories With Quantizers

引用

IEEE TRANSACTIONS ON INFORMATION THEORY 2014年第5期60卷 2780-2795页

作者： Qin, Minghai Yaakobi, Eitan Siegel, Paul H. Univ Calif San Diego Dept Elect & Comp Engn La Jolla CA 92093 USA Univ Calif San Diego Ctr Magnet Recording Res La Jolla CA 92093 USA CALTECH Dept Elect Engn Pasadena CA 91125 USA

Multilevel flash memory contains blocks of cells that represent data by the amount of charge stored in them. The cell writing-or programming-process applies specified voltages in a sequential manner, injecting charge to achieve a desired level. Reducing a cell level requires a costly block erasure, so programming only increases cell levels. parallel programming, whereby a common voltage is applied to a group of cells to inject charge simultaneously, simplifies circuitry and increases programming speed. However, cell-to-cell variations and limited programming round can adversely affect its precision. In this paper, we consider algorithms for efficient cell programming. Since cell levels are quantized to a discrete set of values, our objective is to minimize the number of cells that are not quantized to their target levels. For a specified number of programming rounds, we derive an optimal parallel programming algorithm with complexity that is polynomial in the number of cells. We extend the algorithm to account for intercell interference, where the voltage applied to a cell can affect the level of adjacent cells. We then consider noisy programming of a single cell, with and without feedback about the cell level. In both scenarios, we present an algorithm that, for a given number of programming rounds, minimizes the probability of an incorrect cell level.

关键词： parallel programming flash memory programming intercell interference

来源：评论

学校读者我要写书评

暂无评论

Software Engineering with Transactional Memory Versus Locks in Practice

引用

THEORY OF COMPUTING SYSTEMS 2014年第3期55卷 555-590页

作者： Pankratius, Victor Adl-Tabatabai, Ali-Reza Intel Corp Programming Syst Lab Santa Clara CA USA

Transactional Memory (TM) promises to simplify parallel programming by replacing locks with atomic transactions. Despite much recent progress in TM research, there is very little experience using TM to develop realistic parallel programs from scratch. In this article, we present the results of a detailed case study comparing teams of programmers developing a parallel program from scratch using transactional memory and locks. We analyze and quantify in a realistic environment the development time, programming progress, code metrics, programming patterns, and ease of code understanding for six teams who each wrote a parallel desktop search engine over a fifteen week period. Three randomly chosen teams used Intel's Software Transactional Memory compiler and Pthreads, while the other teams used just Pthreads. Our analysis is exploratory: Given the same requirements, how far did each team get? The TM teams were among the first to have a prototype parallel search engine. Compared to the locks teams, the TM teams spent less than half the time debugging segmentation faults, but had more problems tuning performance and implementing queries. Code inspections with industry experts revealed that TM code was easier to understand than locks code, because the locks teams used many locks (up to thousands) to improve performance. Learning from each team's individual success and failure story, this article provides valuable lessons for improving TM.

关键词： parallel programming Transactional memory Language design Human factors Synchronization programming techniques

来源：评论

学校读者我要写书评

暂无评论

parallel indirect solution of optimal control problems

引用

OPTIMAL CONTROL APPLICATIONS & METHODS 2014年第2期35卷 204-230页

作者： Fabien, Brian C. Univ Washington Dept Mech Engn Seattle WA 98195 USA

This paper presents an algorithm for the indirect solution of optimal control problems that contain mixed state and control variable inequality constraints. The necessary conditions for optimality lead to an inequality constrained two-point BVP with index-1 differential-algebraic equations (BVP-DAEs). These BVP-DAEs are solved using a multiple shooting method where the DAEs are approximated using a single-step linearly implicit Runge-Kutta (Rosenbrock-Wanner) method. An interior-point Newton method is used to solve the residual equations associated with the multiple shooting discretization. The elements of the residual equations, and the Jacobian of the residual equations, are constructed in parallel. The search direction for the interior-point method is computed by solving a sparse bordered almost block diagonal (BABD) linear system. Here, a parallel-structured orthogonal factorization algorithm is used to solve the BABD system. Examples are presented to illustrate the efficiency of the parallel algorithm. It is shown that an American National Standards Institute C implementation of the parallel algorithm achieves significant speedup with the increase in the number of processors used. Copyright (c) 2013 John Wiley & Sons, Ltd.

关键词： BVP multiple shooting optimal control differential-algebraic equations interior-point method parallel programming

来源：评论

学校读者我要写书评

暂无评论

Systematic Debugging Methods for Large-Scale HPC Computational Frameworks

引用

COMPUTING IN SCIENCE & ENGINEERING 2014年第3期16卷 48-56页

作者： Humphrey, Alan Meng, Qingyu Berzins, Martin de Oliveira, Diego Caminha B. Rakamaric, Zvonimir Gopalakrishnan, Ganesh Univ Utah Sci Comp & Imaging Inst Salt Lake City UT 84112 USA Univ Utah Sch Comp Salt Lake City UT 84112 USA Univ Utah Salt Lake City UT 84112 USA Univ Utah Ctr Parallel Comp Salt Lake City UT 84112 USA

parallel computational frameworks for high-performance computing are central to the advancement of simulation-based studies in science and engineering. Finding and fixing bugs in these frameworks can be time consuming. If left unchecked, these bugs diminish the amount of new science performed. A systematic study of the Uintah Computational Framework investigates debugging approaches, leveraging the framework's modular structure.

关键词： computational modeling and frameworks debugging aids parallel programming reliability scientific computing

来源：评论

学校读者我要写书评

暂无评论

The SpiNNaker Project

引用

PROCEEDINGS OF THE IEEE 2014年第5期102卷 652-665页

作者： Furber, Steve B. Galluppi, Francesco Temple, Steve Plana, Luis A. Univ Manchester Sch Comp Sci Manchester M13 9PL Lancs England

The spiking neural network architecture (SpiNNaker) project aims to deliver a massively parallel million-core computer whose interconnect architecture is inspired by the connectivity characteristics of the mammalian brain, and which is suited to the modeling of large-scale spiking neural networks in biological real time. Specifically, the interconnect allows the transmission of a very large number of very small data packets, each conveying explicitly the source, and implicitly the time, of a single neural action potential or "spike.'' In this paper, we review the current state of the project, which has already delivered systems with up to 2500 processors, and present the real-time event-driven programming model that supports flexible access to the resources of the machine and has enabled its use by a wide range of collaborators around the world.

关键词： Brain modeling multicast algorithms multiprocessor interconnection networks neural network hardware parallel programming

来源：评论

学校读者我要写书评

暂无评论

New Faster CHARMM Molecular Dynamics Engine

引用

JOURNAL OF COMPUTATIONAL CHEMISTRY 2014年第5期35卷 406-413页

作者： Hynninen, Antti-Pekka Crowley, Michael F. Natl Renewable Energy Lab Computat Sci Ctr Golden CO 80401 USA Natl Renewable Energy Lab Biosci Ctr Golden CO 80401 USA

We introduce a new faster molecular dynamics (MD) engine into the CHARMM software package. The new MD engine is faster both in serial (i.e., single CPU core) and parallel execution. Serial performance is approximately two times higher than in the previous version of CHARMM. The newly programmed parallelization method allows the MD engine to parallelize up to hundreds of CPU cores. (c) 2013 Wiley Periodicals, Inc.

关键词： CHARMM molecular dynamics parallel programming domain decomposition

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：