Google’s MapReduce enables automatic parallelization of programs by partitioning input data and replicating functions, but it does not directly support more complex parallel modes such as pipelining. However, many such parallel modes are helpful for optimizing the solution of parallel computing problems. In this paper, we propose EFC (Execution Flow Control), a novel programming model and its implementation. EFC exposes an execution-flow control interface that makes the model compatible with a wider range of parallel modes and allows users to modify the execution flow as needed. The new model enables a simple, compact expression of most parallel modes.
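A minimal sketch of what such an execution-flow control interface could look like, layered over a map/reduce-style framework; the Stage and ExecutionFlow names are illustrative assumptions, not the EFC API described in the paper:

// Hypothetical sketch: user-composed execution flow (here, pipeline mode)
// instead of a fixed map -> reduce order. Names are illustrative only.
#include <functional>
#include <iostream>
#include <vector>

using Record = int;
using Stage = std::function<std::vector<Record>(const std::vector<Record>&)>;

class ExecutionFlow {
public:
    ExecutionFlow& then(Stage s) { stages_.push_back(std::move(s)); return *this; }
    std::vector<Record> run(std::vector<Record> data) const {
        for (const auto& s : stages_) data = s(data);   // stages chained as a pipeline
        return data;
    }
private:
    std::vector<Stage> stages_;
};

int main() {
    ExecutionFlow flow;
    flow.then([](const std::vector<Record>& in) {       // "map"-like stage
            std::vector<Record> out;
            for (Record r : in) out.push_back(r * r);
            return out;
        })
        .then([](const std::vector<Record>& in) {       // filtering stage
            std::vector<Record> out;
            for (Record r : in) if (r % 2 == 0) out.push_back(r);
            return out;
        });
    for (Record r : flow.run({1, 2, 3, 4})) std::cout << r << ' ';  // prints: 4 16
}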
We introduce the Concurrent Collections (CnC) programming model. CnC supports flexible combinations of task and data parallelism while retaining determinism. CnC is implicitly parallel, with the user providing high-level operations along with semantic ordering constraints that together form a CnC graph. We formally describe the execution semantics of CnC and prove that the model guarantees deterministic computation. We evaluate the performance of CnC implementations on several applications and show that CnC offers performance and scalability equivalent to or better than that offered by lower-level parallel programming models.
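A toy, sequential C++ rendering of the CnC ingredients named above (item collections for data, tag collections for control, and a step prescribed by tags); it only illustrates the semantics and is not the actual CnC runtime or its API:

// Ordering is implied solely by data and control dependences, so distinct
// step instances could run in parallel without changing the result.
#include <functional>
#include <iostream>
#include <map>

template <typename K, typename V>
struct ItemCollection {                    // single-assignment key -> value store
    std::map<K, V> items;
    void put(const K& k, const V& v) { items.emplace(k, v); }
    const V& get(const K& k) const { return items.at(k); }
};

struct TagCollection {                     // putting a tag prescribes a step instance
    std::function<void(int)> prescribed_step;
    void put(int tag) { prescribed_step(tag); }
};

int main() {
    ItemCollection<int, double> in, out;
    TagCollection tags;
    // Step: reads in[tag], writes out[tag].
    tags.prescribed_step = [&](int tag) { out.put(tag, 2.0 * in.get(tag)); };

    for (int i = 0; i < 4; ++i) in.put(i, i + 0.5);
    for (int i = 0; i < 4; ++i) tags.put(i);        // launch step instances
    std::cout << out.get(3) << '\n';                // prints 7
}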
Technical advances are leading to a pervasive computational ecosystem that integrates computing infrastructures with embedded sensors and actuators, and are giving rise to a new paradigm for monitoring, understanding, and managing natural and engineered systems, one that is information/data-driven. In this paper, we present a programming system that can support such end-to-end sensor-based dynamic data-driven applications. Specifically, the programming system enables these applications at two levels. First, it provides programming abstractions for integrating sensor systems with computational models for scientific and engineering processes and with other application components in an end-to-end experiment. Second, it provides programming abstractions and system software support for developing in-network data processing mechanisms. The former supports complex querying of the sensor system, while the latter enables the development of in-network data processing mechanisms such as aggregation, adaptive interpolation and assimilation. Furthermore, for the latter, we also explore the use of temporal and spatial correlations of sensor measurements in the targeted application domains to trade off the complexity of coordination among sensor clusters against the savings that result from having fewer sensors for in-network processing, while maintaining an acceptable error threshold. The research is evaluated using two application scenarios: the management and optimization of an instrumented oil field and the management and optimization of an instrumented data center. Experimental results show that the provided programming system reduces overheads while achieving near-optimal and timely management and control in both application scenarios. (C) 2010 Elsevier B.V. All rights reserved.
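A small sketch of one in-network processing idea mentioned above: a cluster head that exploits temporal correlation by suppressing readings that stay within an error threshold of the last reported value. The ClusterHead class and the threshold are illustrative assumptions, not the paper's actual system:

#include <cmath>
#include <iostream>
#include <vector>

class ClusterHead {
public:
    explicit ClusterHead(double error_threshold) : eps_(error_threshold) {}
    // Returns true if the reading must be forwarded upstream to the model.
    bool report(double reading) {
        if (!has_last_ || std::fabs(reading - last_reported_) > eps_) {
            last_reported_ = reading;
            has_last_ = true;
            return true;
        }
        return false;   // suppressed: the consumer interpolates from the last value
    }
private:
    double eps_;
    double last_reported_ = 0.0;
    bool has_last_ = false;
};

int main() {
    ClusterHead head(0.5);   // tolerate up to 0.5 units of interpolation error
    std::vector<double> temps = {21.0, 21.2, 21.4, 22.1, 22.2};
    for (double t : temps)
        std::cout << t << (head.report(t) ? " -> forwarded\n" : " -> suppressed\n");
}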
The paper presents the SmartGridRPC model, an extension of the GridRPC model which aims to achieve higher performance. The traditional GridRPC provides a programming model and API for mapping individual tasks of an application in a distributed Grid environment, and is based on the client-server model characterized by a star network topology. SmartGridRPC provides a programming model and API for mapping a group of tasks of an application in a distributed Grid environment, based on a fully connected network topology. The SmartGridRPC programming model and API, and their performance advantages over the GridRPC model, are outlined in this paper. In addition, experimental results using a real-world application are also presented. Copyright (C) 2010 John Wiley & Sons, Ltd.
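A conceptual sketch, using hypothetical names rather than the real GridRPC or SmartGridRPC bindings, of why mapping a group of tasks over a fully connected topology can beat per-task mapping over a star topology: dependent tasks can be co-located so intermediate results need not pass back through the client:

#include <iostream>
#include <map>
#include <string>
#include <vector>

struct Task { std::string name; std::string depends_on; };   // "" = no dependency

// GridRPC-style: choose a server per call; intermediates flow via the client.
std::map<std::string, std::string> map_individually(const std::vector<Task>& ts) {
    std::map<std::string, std::string> placement;
    int next = 0;
    for (const auto& t : ts) placement[t.name] = "server" + std::to_string(next++ % 2);
    return placement;
}

// SmartGridRPC-style: map the whole group at once, co-locating dependent tasks.
std::map<std::string, std::string> map_group(const std::vector<Task>& ts) {
    std::map<std::string, std::string> placement;
    int next = 0;
    for (const auto& t : ts) {
        if (!t.depends_on.empty() && placement.count(t.depends_on))
            placement[t.name] = placement[t.depends_on];      // reuse the producer's server
        else
            placement[t.name] = "server" + std::to_string(next++ % 2);
    }
    return placement;
}

int main() {
    std::vector<Task> app = {{"factorize", ""}, {"solve", "factorize"}};
    std::cout << "per-task mapping:\n";
    for (const auto& [task, server] : map_individually(app))
        std::cout << "  " << task << " -> " << server << '\n';   // different servers
    std::cout << "group mapping:\n";
    for (const auto& [task, server] : map_group(app))
        std::cout << "  " << task << " -> " << server << '\n';   // co-located
}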
The SARC architecture is composed of multiple processor types and a set of user-managed Direct Memory Access (DMA) engines that let the runtime scheduler overlap data transfer and computation. The runtime system automatically allocates tasks to the heterogeneous cores and schedules the data transfers through the DMA engines. SARC's programming model supports various highly parallel applications, with matching support from specialized accelerator processors.
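A double-buffering sketch of the transfer/compute overlap that user-managed DMA makes possible, assuming hypothetical dma_get_async/dma_wait primitives in place of the real runtime calls:

#include <cstddef>
#include <vector>

constexpr std::size_t BLOCK = 1024;

// Hypothetical asynchronous DMA primitives provided by the runtime (stubs here).
void dma_get_async(float* local, const float* remote, std::size_t n, int tag) {}
void dma_wait(int tag) {}

void compute(float* block, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) block[i] *= 2.0f;     // placeholder kernel
}

void process(const float* remote, std::size_t nblocks) {
    std::vector<float> buf[2] = {std::vector<float>(BLOCK), std::vector<float>(BLOCK)};
    dma_get_async(buf[0].data(), remote, BLOCK, /*tag=*/0);   // prefetch the first block
    for (std::size_t b = 0; b < nblocks; ++b) {
        int cur = static_cast<int>(b % 2), nxt = static_cast<int>((b + 1) % 2);
        if (b + 1 < nblocks)                                  // start the next transfer early
            dma_get_async(buf[nxt].data(), remote + (b + 1) * BLOCK, BLOCK, /*tag=*/nxt);
        dma_wait(cur);                                        // wait only for the block we need
        compute(buf[cur].data(), BLOCK);                      // overlaps with the in-flight DMA
    }
}

int main() { std::vector<float> data(4 * BLOCK, 1.0f); process(data.data(), 4); }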
ISBN (print): 9783642119491
Efficient transaction nesting is one of the ongoing challenges for hardware transactional memory. To increase the efficiency of closed nesting, this paper proposes a conditional partial rollback (CPR) scheme which supports conditional partial rollback without significantly increasing hardware complexity. Instead of rolling back to the outermost transaction as in the commonly used flattening model, the CPR scheme rolls back only to the conflicting transaction itself or to one of its outer-level transactions if given conditions are satisfied. By recording the access status of each nested transaction, the scheme uses one global data set for all of the nested transactions rather than an independent data set for each nested transaction. A hardware transactional memory architecture with support for the CPR scheme is also proposed, based on a multi-core processor and the current cache coherence mechanism. The system is implemented in simulation and evaluated using seven benchmark applications. Evaluation results show that the CPR scheme achieves better performance and scalability than the flattening model commonly used in hardware transactional memory.
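A toy model of the rollback decision, contrasting flattening with conditional partial rollback; the condition used here (restart from the outermost nesting level that recorded an access to the conflicting address) is a simplified stand-in for the paper's conditions, not the actual CPR hardware logic:

#include <algorithm>
#include <iostream>
#include <vector>

struct NestLevel { std::vector<int> access_set; };    // addresses accessed at this depth

// Returns the nesting depth to restart from when `addr` conflicts: the outermost
// nested transaction that touched `addr`. If that is depth 0, the behaviour
// degenerates to the flattening model.
int rollback_level_cpr(const std::vector<NestLevel>& nest, int addr) {
    for (std::size_t level = 0; level < nest.size(); ++level) {
        const auto& as = nest[level].access_set;
        if (std::count(as.begin(), as.end(), addr) > 0)
            return static_cast<int>(level);           // partial rollback to this level
    }
    return 0;                                         // not recorded: flatten to outermost
}

int main() {
    std::vector<NestLevel> nest = {{{10, 11}}, {{20}}, {{30, 31}}};   // depths 0..2
    std::cout << "conflict on 31 -> restart level " << rollback_level_cpr(nest, 31) << '\n'; // 2
    std::cout << "conflict on 10 -> restart level " << rollback_level_cpr(nest, 10) << '\n'; // 0
}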
ISBN (print): 9783642156717
The Compute Unified Device Architecture (CUDA) programming environment from NVIDIA is a milestone towards making the programming of many-core GPUs more accessible to programmers. However, there are still many challenges for programmers when using CUDA. One is having to deal explicitly with GPU device memory and with data transfers between host memory and GPU device memory. In this study, source-to-source compilation and runtime library technologies are used to implement an experimental programming system based on CUDA, called memCUDA, which can automatically map GPU device memory to host memory. With a few pragma directives, the programmer can use host memory directly in CUDA kernel functions, while the tedious and error-prone data transfers and device memory management are shielded from the programmer. Performance is also improved with several near-optimal techniques. Experimental results show that memCUDA programs achieve an effect similar to well-optimized CUDA programs with more compact source code.
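For contrast, a host-side C++ sketch of the explicit device-memory and transfer boilerplate (standard CUDA runtime API calls) that memCUDA's directives are intended to generate automatically; scale_kernel is a hypothetical kernel name and its launch is left as a comment:

#include <cuda_runtime.h>
#include <vector>

void scale_on_gpu(std::vector<float>& host_data) {
    const size_t bytes = host_data.size() * sizeof(float);

    float* dev_data = nullptr;
    cudaMalloc(reinterpret_cast<void**>(&dev_data), bytes);                 // device allocation
    cudaMemcpy(dev_data, host_data.data(), bytes, cudaMemcpyHostToDevice);  // host -> device

    // scale_kernel<<<blocks, threads>>>(dev_data, host_data.size());
    // With memCUDA-style directives the kernel would reference host_data
    // directly and the surrounding transfers would be generated by the tool.

    cudaMemcpy(host_data.data(), dev_data, bytes, cudaMemcpyDeviceToHost);  // device -> host
    cudaFree(dev_data);                                                     // device cleanup
}

int main() { std::vector<float> v(1024, 1.0f); scale_on_gpu(v); }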
In principle, the entire world can exploit ubiquitous and pervasive systems to great societal benefit. In practice, however, there is as yet no fundamental basis or widely accepted programming model for such systems...
ISBN (print): 9781424482641
Parallel programming is an important tool used in flash memories to achieve high write speed. In parallel programming, a common program voltage is applied to many cells for simultaneous charge injection. This property significantly simplifies the memory hardware, but it is also a constraint that limits the storage capacity of flash memories. Another important property is that cells differ in their hardness for charge injection, which makes the injected charge differ across cells even when the same program voltage is applied to them. In this paper, we study the parallel programming of flash memory cells, focusing on the above two properties. We present algorithms for parallel programming when there is information on the cells' hardness for charge injection but no feedback information on cell levels during programming. We then proceed to the programming model with feedback information on cell levels, and study how well the information on the cells' hardness for charge injection can be obtained. The results can be useful for understanding the storage capacity of flash memories with parallel programming.
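A toy open-loop simulation of the setting described above: one shared sequence of program-voltage pulses, per-cell hardness scaling the injected charge, and no feedback on cell levels. All parameters and the conservative deselection rule are illustrative assumptions, not the paper's algorithms:

#include <iostream>
#include <vector>

struct Cell { double hardness; double target; double level = 0.0; };

void program_in_parallel(std::vector<Cell>& cells, double pulse, int max_pulses) {
    for (int p = 0; p < max_pulses; ++p) {
        bool applied = false;
        for (auto& c : cells) {
            // A cell stays selected only while the next increment cannot push
            // it past its target level (cells can only be charged, never erased).
            if (c.level + c.hardness * pulse <= c.target) {
                c.level += c.hardness * pulse;    // shared voltage, per-cell hardness
                applied = true;
            }
        }
        if (!applied) break;                      // every cell is as close as it can get
    }
}

int main() {
    std::vector<Cell> cells = {{1.0, 4.0}, {0.6, 4.0}, {1.4, 4.0}};
    program_in_parallel(cells, /*pulse=*/0.5, /*max_pulses=*/20);
    for (const auto& c : cells) std::cout << c.level << ' ';   // undershoots, never overshoots
}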
In the past several years, grid computing has emerged as a way to harness computing resources geographically distributed across multiple organizations. Due to its inherently large-scale, distributed and heterogeneous nature, grid computing has increased the importance of specific requirements, such as scalability, performance and the need for an adequate programming model. Several programming models have been proposed for grid programming; nonetheless, so far none of them has met all the requirements. In contrast, in the field of high-performance cluster computing, the message-passing model became a true standard, with a large number of libraries and legacy applications. This work proposes a hybrid framework that combines the high performance and wide acceptance of the MPI standard with intuitive extensions that enable developers to design grid applications, or "gridify" existing ones, with the flexibility of a component-based runtime modeling the resource hierarchy and offering support for inter-cluster communication. The proposed solution relies on the addition of new MPI communicators and a related API, which may offer support well suited to programmers used to MPI in order to reflect a hierarchical topology within the deployed application. Experiments with three applications (a Monte Carlo simulation, a Mergesort and a Poisson3D solver) have shown that the "gridification" of applications improves their performance on grid environments. Even if the goal is not to compete against existing MPI distributions, the performance of the solution is comparable with MPI performance, and even better in some cases. From the results obtained in the evaluation of this prototype, we conclude that the overhead introduced by the components is not negligible but stays within expected bounds. However, we can expect the benefits to grid applications to outweigh the generated overhead. Moreover, the extended interface may offer users adequate abstractions to design parallel algorithms in a hierarchical way that addresses grid environments.
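A sketch of the underlying idea using only standard MPI calls (not the paper's extended communicators or API): MPI_Comm_split carves MPI_COMM_WORLD into intra-cluster communicators so that collectives and point-to-point traffic can respect the resource hierarchy; the cluster_of() mapping below is a made-up placeholder for information a real deployment would take from the resource description:

#include <mpi.h>
#include <cstdio>

// Hypothetical mapping from a global rank to the cluster hosting it.
static int cluster_of(int world_rank) { return world_rank / 4; }   // 4 ranks per cluster

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);

    int world_rank = 0;
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

    // All ranks with the same color end up in the same intra-cluster communicator.
    MPI_Comm cluster_comm;
    MPI_Comm_split(MPI_COMM_WORLD, cluster_of(world_rank), world_rank, &cluster_comm);

    int cluster_rank = 0, cluster_size = 0;
    MPI_Comm_rank(cluster_comm, &cluster_rank);
    MPI_Comm_size(cluster_comm, &cluster_size);
    std::printf("world rank %d -> cluster %d (local rank %d of %d)\n",
                world_rank, cluster_of(world_rank), cluster_rank, cluster_size);

    MPI_Comm_free(&cluster_comm);
    MPI_Finalize();
    return 0;
}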