检索结果-内蒙古大学图书馆

A practical automatic polyhedral parallelizer and locality optimizer

acm sigplan NOTICES 2008年第6期43卷 101-113页

作者： Bondhugula, Uday Hartono, Albert Ramanujam, J. Sadayappan, P. Ohio State Univ Dept Comp Sci & Engn Columbus OH 43210 USA Louisiana State Univ Dept Elect & Comp Engn Baton Rouge LA 70803 USA Louisiana State Univ CCT Baton Rouge LA 70803 USA

We present the design and implementation of an automatic polyhedral source-to-source transformation framework that can optimize regular programs ( sequences of possibly imperfectly nested loops) for parallelism and locality simultaneously. Through this work, we show the practicality of analytical model-driven automatic transformation in the polyhedral model-far beyond what is possible by current production compilers. Unlike previous works, our approach is an end-to-end fully automatic one driven by an integer linear optimization framework that takes an explicit view of finding good ways of tiling for parallelism and locality using affine transformations. The framework has been implemented into a tool to automatically generate OpenMP parallel code from C program sections. Experimental results from the tool show very high speedups for local and parallel execution on multi-cores over state-of-the-art compiler frameworks from the research community as well as the best native production compilers. The system also enables the easy use of powerful empirical/iterative optimization for general arbitrarily nested loop sequences.

关键词： algorithms design experimentation performance automatic parallelization locality optimization polyhedral model loop transformations affine transformations tiling

来源：评论

学校读者我要写书评

暂无评论

design and evaluation of a compiler for embedded stream programs

引用

acm sigplan NOTICES 2008年第7期43卷 131-140页

作者： Newton, Ryan R. Girod, Lewis D. Craig, Michael B. Madden, Samuel R. Morrisett, J. Greg MIT CSAIL Cambridge MA 02139 USA Harvard Univ Cambridge MA 02138 USA

Applications that combine live data streams with embedded, parallel, and distributed processing are becoming more commonplace. WaveScript is a domain-specific language that brings high-level, type-safe, garbage-collected programming to these domains. This is made possible by three primary implementation techniques, each of which leverages characteristics of the streaming domain. First, we employ a novel evaluation strategy that uses a combination of interpretation and reification to partially evaluate programs into stream dataflow graphs. Second, we use profile-driven compilation to enable many optimizations that are normally only available in the synchronous (rather than asynchronous) dataflow domain. Finally, we incorporate an extensible system for rewrite rules to capture algebraic properties in specific domains (such as signal processing). We have used our language to build and deploy a sensor-network for the acoustic localization of wild animals, in particular, the Yellow-Bellied marmot. We evaluate WaveScript's performance on this application, showing that it yields good performance on both embedded and desktop-class machines, including distributed execution and substantial parallel speedups. Our language allowed us to implement the application rapidly, while outperforming a previous C implementation by over 35%, using fewer than half the lines of code. We evaluate the contribution of our optimizations to this success.

关键词： design languages performance stream processing language sensor networks

来源：评论

学校读者我要写书评

暂无评论

The design and implementation of Typed Scheme

The Design and Implementation of Typed Scheme

引用

35th acm-sigplan-SIGACT Symposium on Principles of programming languages

作者： Tobin-Hochstadt, Sam Felleisen, Matthias Northeastern Univ PLT Boston MA 02115 USA

ISBN: (纸本)9781595936899

When scripts in untyped languages grow into large programs, maintaining them becomes difficult. A lack of types in typical scripting languages means that programmers must (re)discover critical pieces of design information every time they wish to change a program. This analysis step both slows down the maintenance process and may even introduce mistakes due to the violation of undiscovered invariants. This paper presents Typed Scheme, an explicitly typed extension of an untyped scripting language. Its type system is based on the novel notion of occurrence typing, which we formalize and mechanically prove sound. The implementation of Typed Scheme additionally borrows elements from a range of approaches, including recursive types, true unions and subtyping, plus polymorphism combined with a modicum of local inference. Initial experiments with the implementation suggest that Typed Scheme naturally accommodates the programming style of the underlying scripting language, at least for the first few thousand lines of ported code. © 2008 acm.

关键词： Type Systems Scheme

来源：评论

学校读者我要写书评

暂无评论

Automatic volume management for programmable microfluidics

引用

acm sigplan NOTICES 2008年第6期43卷 56-67页

作者： Amin, Ahmed M. Thottethodi, Mithuna Vijaykumar, T. N. Wereley, Steven Jacobson, Stephen C. Purdue Univ Sch Elect & Comp Engn W Lafayette IN 47907 USA Purdue Univ Sch Mech Engn W Lafayette IN 47907 USA Indiana Univ Dept Chem Bloomington IN 47405 USA

Microfluidics has enabled lab-on-a-chip technology to miniaturize and integrate biological and chemical analyses to a single chip comprising channels, valves, mixers, heaters, separators, and sensors. Recent papers have proposed programmable labs-on-a-chip as an alternative to traditional application-specific chips to reduce design effort, time, and cost. While these previous papers provide the basic support for programmability, this paper identifies and addresses a practical issue, namely, fluid volume management. Volume management addresses the problem that the use of a fluid depletes it and unless the given volume of a fluid is distributed carefully among all its uses, execution may run out of the fluid before all its uses are complete. Additionally, fluid volumes should not overflow (i.e., exceed hardware capacity) or underflow (i.e., fall below hardware resolution). We show that the problem can be formulated as a linear programming problem ( LP). Because LP's complexity and slow execution times in practice may be a concern, we propose another approach, called DAGSolve, which over-constrains the problem to achieve linear complexity while maintaining good solution quality. We also propose two optimizations, called cascading and static replication, to handle cases involving extreme mix ratios and numerous fluid uses which may defeat both LP and DAGSolve. Using some real-world assays, we show that our techniques produce good solutions while being faster than LP.

关键词： algorithms design microfluidics programmable lab-on-a-chip fluid volume management

来源：评论

学校读者我要写书评

暂无评论

XMem: Type-safe, transparent, shared memory for cross-runtime communication and coordination

引用

acm sigplan NOTICES 2008年第6期43卷 327-338页

作者： Wegiel, Michal Krintz, Chandra Univ Calif Santa Barbara Dept Comp Sci Santa Barbara CA 93106 USA

Developers commonly build contemporary enterprise applications using type-safe, component-based platforms, such as J2EE, and architect them to comprise multiple tiers, such as a web container, application server, and database engine. Administrators increasingly execute each tier in its own managed runtime environment (MRE) to improve reliability and to manage system complexity through the fault containment and modularity offered by isolated MRE instances. Such isolation, however, necessitates expensive cross-tier communication based on protocols such as object serialization and remote procedure calls. Administrators commonly co-locate communicating MREs on a single host to reduce communication overhead and to better exploit increasing numbers of available processing cores. However, state-of-the-art MREs offer no support for more efficient communication between co-located MREs, while fast inter-process communication mechanisms, such as shared memory, are widely available as a standard operating system service on most modern platforms. To address this growing need, we present the design and implementation of XMem - type-safe, transparent, shared memory support for co-located MREs. XMem guarantees type-safety through coordinated, parallel, multi-process class loading and garbage collection. To avoid introducing any level of indirection, XMem manipulates virtual memory mapping. In addition, object sharing in XMem is fully transparent: shared objects are identical to local objects in terms of field access, synchronization, garbage collection, and method invocation, with the only difference being that shared-to-private pointers are disallowed. XMem facilitates easy integration and use by existing communication technologies and software systems, such as RMI, JNDI, JDBC, serialization/XML, and network sockets. We have implemented XMem in the open-source, production-quality HotSpot Java Virtual Machine. Our experimental evaluation, based on core communication technologies un

关键词： design experimentation languages management measurement performance interprocess communication managed runtimes shared memory transparent type-safe garbage collection synchronization class loading parallel

来源：评论

学校读者我要写书评

暂无评论

Merge: A programming model for heterogeneous multi-core systems

引用

acm sigplan NOTICES 2008年第3期43卷 287-296页

作者： Linderman, Michael D. Collins, Jamison D. Wang, Hong Meng, Teresa H. Stanford Univ Dept Elect Engn Stanford CA 94305 USA Intel Corp Microarchitecture Res Lab Santa Cruz CA USA

In this paper we propose the Merge framework, a general purpose programming model for heterogeneous multi-core systems. The Merge framework replaces current ad hoc approaches to parallel programming on heterogeneous platforms with a rigorous, library-based methodology that can automatically distribute computation across heterogeneous cores to achieve increased energy and performance efficiency. The Merge framework provides (1) a predicate dispatch-based library system for managing and invoking function variants for multiple architectures;(2) a high-level, library-oriented parallel language based on map-reduce;and (3) a compiler and runtime which implement the map-reduce language pattern by dynamically selecting the best available function implementations for a given input and machine configuration. Using a generic sequencer architecture interface for heterogeneous accelerators, the Merge framework can integrate function variants for specialized accelerators, offering the potential for to-the-metal performance for a wide range of heterogeneous architectures, all transparent to the user. The Merge framework has been prototyped on a heterogeneous platform consisting of an Intel Core 2 Duo CPU and an 8-core 32-thread Intel Graphics and Media Accelerator X3000, and a homogeneous 32-way Unisys SNIP system with Intel Xeon processors. We implemented a set of benchmarks using the Merge framework and enhanced the library with X3000 specific implementations, achieving speedups of 3.6x - 8.5x using the X3000 and 5.2x - 22x using the 32-way system relative to the straight C reference implementation on a single IA32 core.

关键词： performance design languages heterogeneous multi-core GPGPU predicate dispatch

来源：评论

学校读者我要写书评

暂无评论

EventScript: An Event-Processing language Based on Regular Expressions with Actions 08

EventScript: An Event-Processing Language Based on Regular E...

引用

conference on languages, Compilers and Tools for Embedded Systems

作者： Cohen, Norman H. Kalleberg, Karl Trygve IBM Thomas J Watson Res Ctr Hawthorne NY USA

ISBN: (纸本)9781605581040

EventScript is a simple but powerful language for programming reactive processes. A stream of incoming events is matched against a regular expression. Actions embedded within the regular expression are executed in response to the matching of patterns of events. These actions include assigning computed values to variables and emitting output events. The definition of EventScript presented a number of novel and interesting language-design choices. EventScript has an efficient implementation, and has been used in a development environment for complex event-based applications. We have used EventScript to program both small examples and large industrial applications. Readers of EventScript programs find them easy to understand, and are comfortable with the familiar model of matching regular expressions.

关键词： event processing regular expressions reactive programs sensors actuators

来源：评论

学校读者我要写书评

暂无评论

design and evaluation of a compiler for embedded stream programs 08

Design and evaluation of a compiler for embedded stream prog...

引用

conference on languages, Compilers and Tools for Embedded Systems

作者： Newton, Ryan R. Girod, Lewis D. Craig, Michael B. Madden, Samuel R. Morrisett, J. Greg MIT CSAIL Cambridge MA 02139 USA Harvard Univ Cambridge MA 02138 USA

ISBN: (纸本)9781605581040

关键词： design languages performance stream processing language sensor networks

来源：评论

学校读者我要写书评

暂无评论

From massively monster machines to microchips: Forces affecting lisp language design for five decades

From massively monster machines to microchips: Forces affect...

引用

Celebrating the 50th Anniversary of Lisp, Lisp50@OOPSLA'08

作者： White, Jonl Bourbaki, Nickieben Ginger IceCream Factory of Palo Alto United States Morrison Tombstone Designs United States

ISBN: (纸本)9781605583839

I worked on Lisp design and implementation from the late 1960s almost until I retired about 5 years ago - and since then I've remained in the community by helping organize Lisp conferences. This means I've been in the thick of Lisp for most of its lifetime. In my talk there were a couple of points I wanted to make. First, computer hardware over the years has imposed constraints on the design of Lisp, ranging from gigantic machines in the early days - gigantic in size but miniscule in computing power - to tiny ones today (whose computing power was once considered "super".) Second, it was certain mindsets of the people involved in the design and implementation of Lisp that most strongly influenced its design - in particular, it was their educational background, driven by interests and talents, that had a great impact on the language. Copyright © 2008 acm.

关键词： Computing power

来源：评论

学校读者我要写书评

暂无评论

design and implementation of transactional constructs for C/C++ 08

Design and implementation of transactional constructs for C/...

引用

23rd acm conference on Object-Oriented programming Systems, languages, and Applications, OOPSLA 2008

作者： Ni, Yang Welc, Adam Adl-Tabatabai, Ali-Reza Bach, Moshe Berkowits, Sion Cownie, James Geva, Robert Kozhukow, Sergey Narayanaswamy, Ravi Olivier, Jeffrey Preis, Serguei Saha, Bratin Tal, Ady Tian, Xinmin Intel Corporation Santa Clara CA USA Intel Corporation Haifa Israel Intel Corporation Glasgow United Kingdom Intel Corporation Novosibirsk Russian Fed. Intel Corporation Champaign IL USA Intel Coporation Novosibirsk Russian Fed.

ISBN: (纸本)9781605582153

This paper presents a software transactional memory system that introduces first-class C++ language constructs for transactional programming. We describe new C++ language extensions, a production-quality optimizing C++ compiler that translates and optimizes these extensions, and a high-performance STM runtime library. The transactional language constructs support C++ language features including classes, inheritance, virtual functions, exception handling, and templates. The compiler automatically instruments the program for transactional execution and optimizes TM overheads. The runtime library implements multiple execution modes and implements a novel STM algorithm that supports both optimistic and pessimistic concurrency control. The runtime switches a transaction's execution mode dynamically to improve performance and to handle calls to precompiled functions and I/O libraries. We present experimental results on 8 cores (two quad-core CPUs) running a set of 20 non-trivial parallel programs. Our measurements show that our system scales well as the numbers of cores increases and that our compiler and runtime optimizations improve scalability. Copyright © 2008 acm.

关键词： Concurrency control

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：