A visit to the neighborhood PC retail store provides ample proof that we are in the multi-core era. The key differentiator among manufacturers today is the number of cores that they pack onto a single chip. The clock frequency of commodity processors has reached its limit, however, and is likely to stay below 4 GHz for years to come. As a result, adding cores is not synonymous with increasing computational power. To take full advantage of the performance enhancements offered by the new multi-core hardware, a corresponding shift must take place in the software infrastructure - a shift to parallel computing.
The MultiFlex system is an application-to-platform mapping tool that integrates heterogeneous parallel components - H/W or S/W - into a homogeneous platform programming environment. This leads to higher quality designs through encapsulation and abstraction. Two high-level parallel programming models are supported by the following MultiFlex platform mapping tools: a distributed system object component (DSOC) object-oriented message passing model and a symmetrical multiprocessing (SMP) model using shared memory. We demonstrate the combined use of the MultiFlex multiprocessor mapping tools, supported by high-speed hardware-assisted messaging, context-switching, and dynamic scheduling using the StepNP demonstrator multiprocessor system-on-chip platform, for two representative applications: 1) an Internet traffic management application running at 2.5 Gb/s and 2) an MPEG4 video encoder (VGA resolution, at 30 frames/s). For these applications, a combination of the DSOC and SMP programming models was used in an interoperable fashion. After optimization and mapping, processor utilization rates of 85%-91% were demonstrated for the traffic manager. For the MPEG4 encoder, the average processor utilization was 88%.
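The two models the abstract contrasts can be sketched in miniature. The following Python analogue is illustrative only (it is not the MultiFlex API): the same parallel sum is written once in a message-passing style, as in DSOC, and once in a shared-memory style, as in SMP.

```python
import threading, queue

# --- Message-passing style (DSOC-like): workers communicate via queues ---
def mp_sum(values, n_workers=2):
    tasks, results = queue.Queue(), queue.Queue()
    def worker():
        while True:
            chunk = tasks.get()
            if chunk is None:          # poison pill: shut the worker down
                break
            results.put(sum(chunk))
    threads = [threading.Thread(target=worker) for _ in range(n_workers)]
    for t in threads: t.start()
    step = max(1, len(values) // n_workers)
    chunks = [values[i:i+step] for i in range(0, len(values), step)]
    for c in chunks: tasks.put(c)
    for _ in threads: tasks.put(None)
    for t in threads: t.join()
    return sum(results.get() for _ in chunks)

# --- Shared-memory style (SMP-like): workers update one location under a lock ---
def smp_sum(values, n_workers=2):
    total, lock = [0], threading.Lock()
    def worker(chunk):
        local = sum(chunk)             # compute privately ...
        with lock:                     # ... then publish under the lock
            total[0] += local
    step = max(1, len(values) // n_workers)
    threads = [threading.Thread(target=worker, args=(values[i:i+step],))
               for i in range(0, len(values), step)]
    for t in threads: t.start()
    for t in threads: t.join()
    return total[0]

print(mp_sum(list(range(100))), smp_sum(list(range(100))))  # 4950 4950
```

In MultiFlex the interest is that both styles interoperate in one application; here they simply coexist as two functions.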
Parallel programming of high-performance computers has emerged as a key technology for the numerical solution of large-scale problems arising in computational science and engineering (CSE). The authors believe that principles and techniques of parallel programming are among the essential ingredients of any CSE curriculum as well as any computer science curriculum. Today, opinions on the role and importance of parallel programming are diverse. Rather than seeing it as a marginally beneficial skill optionally taught at the graduate level, we understand parallel programming as a crucial basic skill that should be taught as an integral part of the undergraduate computer science curriculum. A practical training course developed for computer science undergraduates at Aachen University is described. Its goal is to introduce young computer science students to different parallel programming paradigms for shared and distributed memory computers as well as to give a first exposition to the field of computational science by simple, yet carefully chosen sample problems. (C) 2003 Elsevier B.V. All rights reserved.
ISBN (print): 9781424423712
High-throughput distributed data analysis based on clustered computing is gaining increasing importance in the field of computational biology. This paper describes a parallel programming approach and its software implementation using the Message Passing Interface (MPI) to parallelize a computationally intensive algorithm for identifying cellular contexts. We report a successful implementation on a 1,024-processor Beowulf cluster to analyze microarray data consisting of hundreds of thousands of measurements from different datasets. Detailed performance evaluation shows that data analysis that could have taken months on a stand-alone computer was accomplished in less than a day.
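The paper's scatter-compute-gather pattern can be sketched as follows. This is illustrative only: the authors use MPI on a cluster, while here Python's multiprocessing stands in for MPI's scatter/gather, and score() is a hypothetical placeholder for the computationally intensive cellular-context kernel.

```python
from multiprocessing import Pool

def score(probe_row):
    # stand-in for the per-measurement kernel (hypothetical)
    return sum(x * x for x in probe_row)

def analyze(dataset, n_procs=4):
    with Pool(n_procs) as pool:            # "scatter" rows across workers
        scores = pool.map(score, dataset)  # compute in parallel
    return max(scores)                     # "gather" and reduce on the master

if __name__ == "__main__":
    data = [[i, i + 1, i + 2] for i in range(1000)]
    print(analyze(data))
```

The same shape maps directly onto `MPI_Scatter`, a local compute loop, and `MPI_Reduce` when moving from one machine to a cluster.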
This paper proposes a parallel programming scheme for the cross-point array with resistive random access memory (RRAM). Synaptic plasticity in unsupervised learning is realized by tuning the conductance of each RRAM cell. Inspired by spike-timing-dependent plasticity (STDP), the programming strength is encoded into the spike firing rate (i.e., pulse frequency) and the overlap time (i.e., duty cycle) of the pre-synaptic node and post-synaptic node, and simultaneously applied to all RRAM cells in the cross-point array. Such an approach achieves parallel programming of the entire RRAM array, requiring only local information from the pre-synaptic and post-synaptic nodes of each RRAM cell. As demonstrated by digital peripheral circuits implemented in 65 nm CMOS, the programming time of a 40 kb RRAM array is 84 ns, a 900X speedup compared with a state-of-the-art software approach to sparse coding in image feature extraction.
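The locality property can be illustrated with a toy numerical model (not the paper's circuit): assume pulses are aligned at the start of each period, and take each cell's conductance update to be proportional to the overlap time of its pre- and post-synaptic pulse trains. Every cell then updates in the same step using only its own row and column signals.

```python
def overlap_time(duty_pre, duty_post, period=1.0):
    # with aligned pulses, the overlap is the smaller of the two on-times
    return min(duty_pre, duty_post) * period

def program_array(pre_duties, post_duties, g, eta=0.1):
    # one parallel programming step: every cell (i, j) updates at once,
    # using only its local pre (row) and post (column) signals
    for i, dp in enumerate(pre_duties):
        for j, dq in enumerate(post_duties):
            g[i][j] += eta * overlap_time(dp, dq)
    return g

g = [[0.0] * 2 for _ in range(2)]
g = program_array([0.5, 1.0], [0.2, 0.8], g)
print([[round(x, 3) for x in row] for row in g])
```

The update rule and the learning rate `eta` are assumptions for illustration; in the paper the strength is realized physically through pulse frequency and duty cycle rather than an explicit multiply-accumulate.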
ISBN (print): 9781605587349
We present Chorus, a high-level parallel programming model suitable for irregular, heap-manipulating applications like mesh refinement and epidemic simulations, and JChorus, an implementation of the model on top of Java. One goal of Chorus is to express the dynamic and instance-dependent patterns of memory access that are common in typical irregular applications. Its other focus is locality of effects: the property that in many of the same applications, typical imperative commands only affect small, local regions in the shared heap. Chorus addresses dynamism and locality through the unifying abstraction of an object assembly: a local region in a shared data structure equipped with a short-lived, speculative thread of control. The thread of control in an assembly can only access objects within the assembly. While objects can migrate from assembly to assembly, such migration is local - i.e., objects only move from one assembly to a neighboring one - and does not lead to aliasing. Programming primitives include a merge operation, by which an assembly merges with an adjacent assembly, and a split operation, which splits an assembly into smaller ones. Our abstractions are race- and deadlock-free, and inherently data-centric. We demonstrate that Chorus and JChorus allow natural programming of several important applications exhibiting irregular data-parallelism. We also present an implementation of JChorus based on a many-to-one mapping of assemblies to lower-level threads, and report on preliminary performance numbers.
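The assembly abstraction can be sketched in a few lines. This is a hypothetical rendering, not the JChorus API: the heap is partitioned into disjoint assemblies, an assembly's code touches only its own objects, and merge/split change the partition locally.

```python
class Assembly:
    def __init__(self, objects):
        self.objects = set(objects)    # disjoint from every other assembly

    def merge(self, neighbor):
        # merge with an adjacent assembly: union the two object sets
        return Assembly(self.objects | neighbor.objects)

    def split(self, part):
        # split off a subset of our objects into a new assembly
        assert part <= self.objects
        self.objects -= part
        return Assembly(part)

a = Assembly({1, 2})
b = Assembly({3})
ab = a.merge(b)                 # grow to cover work that spans both regions
c = ab.split({2})               # shrink back to small local regions
print(sorted(ab.objects), sorted(c.objects))  # [1, 3] [2]
```

The real model attaches a speculative thread of control to each assembly and restricts merges to neighbors in the data structure; those parts are omitted here.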
ISBN (print): 9783642131356
A soft-core system allows designers to conveniently modify the components of the architecture they design. In some systems, a uni-core processor cannot provide enough computing power for applications that demand a large amount of computation. To improve the performance of a multi-core system, parallel programming is as important an issue as the hardware architecture design. Current parallelizing compilers have difficulty parallelizing programs effectively, so the programmer must decide from the start how to allot tasks to each processor. In this paper, we present a software framework for designing parallel programs. The proposed framework provides a convenient parallel programming environment for programmers developing a multi-core system's software. The experiments show that the proposed framework can parallelize programs effectively using the provided functions.
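The up-front allotment the abstract describes can be sketched as a static schedule: tasks are assigned to cores before execution (round-robin here) instead of relying on a parallelizing compiler or a runtime scheduler. The names are illustrative, not the paper's framework API.

```python
import threading

def run_static(tasks, n_cores=2):
    results = [None] * len(tasks)
    plan = [[] for _ in range(n_cores)]
    for i, t in enumerate(tasks):          # programmer-chosen allotment
        plan[i % n_cores].append((i, t))
    def core(my_tasks):
        for i, t in my_tasks:
            results[i] = t()               # each core runs only its share
    threads = [threading.Thread(target=core, args=(p,)) for p in plan]
    for th in threads: th.start()
    for th in threads: th.join()
    return results

print(run_static([lambda: 1, lambda: 2, lambda: 3]))  # [1, 2, 3]
```

A static plan trades load balance for predictability, which suits a soft-core design where the number of processors is fixed at synthesis time.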
ISBN (print): 9781450359337
SyDPaCC is a set of libraries for the Coq interactive theorem prover. It allows the development of correct functional parallel programs on distributed lists through the transformation of naive sequential programs that are considered as specifications. To offer the parallelization of functions on other data structures, the first step is to implement a parallel version of the considered data structure and to provide parallel implementations of the primitive functions manipulating it. This paper presents such a first step: a binary tree extension which includes new map and reduce pure functional algorithmic skeletons for binary trees. Such algorithmic skeletons are templates of parallel algorithms, realized in a functional context as higher-order functions implemented in parallel. The use of these new primitives is illustrated on example applications.
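A sequential model of the two skeletons gives their specification. In SyDPaCC these are verified higher-order functions with parallel implementations extracted from Coq; the tree type and names below are illustrative Python, not the library's API.

```python
class Leaf:
    def __init__(self, v): self.v = v

class Node:
    def __init__(self, v, l, r): self.v, self.l, self.r = v, l, r

def tree_map(f, t):
    # apply f to every value, preserving the tree's shape
    if isinstance(t, Leaf):
        return Leaf(f(t.v))
    return Node(f(t.v), tree_map(f, t.l), tree_map(f, t.r))

def tree_reduce(op, t):
    # combine a node's value with the reductions of its subtrees;
    # op must be associative for a parallel implementation to be valid
    if isinstance(t, Leaf):
        return t.v
    return op(t.v, op(tree_reduce(op, t.l), tree_reduce(op, t.r)))

t = Node(1, Leaf(2), Node(3, Leaf(4), Leaf(5)))
doubled = tree_map(lambda x: 2 * x, t)
print(tree_reduce(lambda a, b: a + b, doubled))  # 30
```

A parallel implementation evaluates independent subtrees on different processors; the associativity requirement on `op` is what licenses regrouping the reductions.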