Search Results - Inner Mongolia University Library

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2011, vol. 6161 LNCS, p. XVII

Author: Gries, Matthias (Intel Germany Research Lab, Germany)

ISBN: (print) 9783642243219

We present the design of the experimental single-chip cloud computer (SCC) by Intel Labs. The SCC is a research microprocessor containing the most Intel architecture cores ever integrated on a single silicon chip: 48 cores. We envision SCC as a concept vehicle for research in the areas of parallel computing including system software, compilers and applications. It incorporates technologies intended to scale multi-core processors to 100 cores and beyond including an on-chip network, advanced power management technologies, new data-sharing options using software-managed memory coherency or hardware-accelerated message passing, and intelligent resource management. SCC is implemented in a 45-nm process integrating 1.3-B transistors. It is based on a tiled architecture with each tile containing two Pentium class cores, private L1 and L2 caches, and one mesh router. All 24 tiles have access to four DDR3 memory channels. These channels can provide up to 64-GB of main memory to the system. The on-die communication is organized in a regular 6×4 mesh of tiles using 16-B-wide data links. The SCC contains one frequency domain for each tile and eight voltage domains: two for on and off chip I/O and six for the cores. Each tile contains sensors to monitor the thermal state. SCC has a NUMA architecture including local caches and on-die distributed memory for low latency, hardware-assisted message passing or scratchpad use as well as an abundant external DRAM bandwidth and capacity. Thus, the processor can be used as a proxy for future manycore platforms by running several independent applications and operating systems concurrently on dedicated resources while applying fine-grain voltage and frequency scaling for best energy efficiency. In this talk we review the chip’s architecture and highlight different system configurations that enable the exploration of compute, memory or communication limited workloads. We show the emulation-based design flow that enabled us to build the SCC w


Compiling Esterel for Multi-core Execution


Euromicro Symposium on Digital System Design

Authors: Simon Yuan; Li Hsien Yoong; Partha S. Roop (Electrical and Computer Engineering, University of Auckland, Auckland, New Zealand)

Esterel is a synchronous language suited for describing reactive embedded systems. It combines fine-grained parallelism with precise timing control for the execution of threads. Due to this, Esterel programs have typically been compiled into sequential code in software implementations, as tight synchronization between a large number of threads cannot be efficiently managed with an operating system (OS). This has enabled concurrent Esterel programs to be executed directly on single-core processors. Recently, however, multi-core processors have been increasingly used to achieve better performance in embedded applications. The conventional approach of generating sequential code from Esterel programs is unable to take advantage of multi-core processors. We overcome this limitation by compiling Esterel into a limited number of thread partitions (up to the number of available cores) to avoid the large overheads of implementing each Esterel thread separately within a conventional multithreading scheme. These partitions are then distributed onto separate cores using a static load balancing heuristic. The Esterel threads within a partition may then be dynamically scheduled with or without an OS. To evaluate the viability of this approach, we present experimental results comparing the execution of a set of benchmarks using one to four cores on the Intel Core 2 Quad with Linux, and one to two cores on the Xilinx MicroBlaze without any OS. We have performed extensive benchmarking over large Esterel programs to illustrate that achieving throughput with parallel execution of Esterel is benchmark dependent.

Keywords: Signal resolution; Message systems; Multicore processing; Synchronization; Benchmark testing; Program processors; Concurrent computing
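
The abstract above mentions distributing thread partitions onto cores with a static load balancing heuristic but does not say which one. As a rough illustration only, the sketch below uses the generic longest-processing-time-first greedy rule; the function name `partition_threads`, the thread names, and the cost estimates are all hypothetical and are not taken from the paper.

```python
import heapq


def partition_threads(costs, num_cores):
    """Assign threads to cores with a greedy longest-processing-time-first rule.

    costs: dict mapping a (hypothetical) thread name to an estimated cost.
    Returns a list with one list of thread names per core.
    """
    # Min-heap of (accumulated load, core index): the least loaded core pops first.
    heap = [(0, core) for core in range(num_cores)]
    heapq.heapify(heap)
    partitions = [[] for _ in range(num_cores)]

    # Place the most expensive threads first; break cost ties by name.
    for name, cost in sorted(costs.items(), key=lambda kv: (-kv[1], kv[0])):
        load, core = heapq.heappop(heap)
        partitions[core].append(name)
        heapq.heappush(heap, (load + cost, core))
    return partitions


if __name__ == "__main__":
    # Illustrative cost estimates, not measurements from the paper.
    example = {"t1": 7, "t2": 5, "t3": 4, "t4": 3, "t5": 1}
    print(partition_threads(example, 2))
```

Because the assignment is computed once from cost estimates, it is static: nothing migrates between cores at run time, which matches the abstract's description only at this general level.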


Performance Analysis of Parallel FEM Codes Using TAU Toolkits


9th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2010)

Authors: Ru Zhongliang; Wang Min; Zhao Hongbo (Henan Tech Univ, Coll Civil Engn, Jiaozuo, Peoples R China)

ISBN: (print) 9780769541105

The domain decomposition method is a popular algorithm adopted for the parallel finite element method (FEM). The formulation for solving sparse linear systems of equations is presented. The TAU performance analysis software is used to analyze and understand the execution behavior of the parallel algorithm, including communication patterns, processor load balance, computation versus communication ratios, timing characteristics, and processor idle time. This is all done through displays of post-mortem trace files. Performance bottlenecks can easily be identified at the appropriate level of detail. A large-scale mechanical calculation of a dam was carried out with the parallel FEM program on the Dawning 5000A parallel computer at the Henan Technical University Supercomputer Center. Statistics show that the formulation is efficient in parallel computing environments, and that it is significantly faster and consumes less memory.

Keywords: domain decomposed method; performance; parallel FEM; TAU


Parallel Simulation of Multi-Agent Systems using Terracotta


14th IEEE/ACM International Symposium on Distributed Simulation and Real-Time Applications (DS-RT 2010)

Authors: Cicirelli, Franco; Furfaro, Angelo; Giordano, Andrea; Nigro, Libero (Univ Calabria, Lab Ingn Software, Dipartimento Elettron Informat & Sistemist, I-87036 Arcavacata Di Rende, CS, Italy)

ISBN: (print) 9780769542515

This paper describes a novel approach to the parallel simulation of complex multi-agent systems, based on actors and the Java middleware Terracotta. The approach aims to exploit the computing power of modern multi-core machines. Terracotta was chosen because it allows JVMs to be clustered transparently. The paper discusses design and implementation aspects of the approach, and demonstrates the achievable execution performance through the parallel simulation of a scalable multi-agent system based on the predator/prey model.

Keywords: Actors; modeling and parallel simulation; multi-core architectures; complex multi-agent systems; Terracotta; Java


The Implementation and Comparison of Two Kinds of Parallel Genetic Algorithm Using Matlab


9th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2010)

Authors: Li Nan; Gao Pengdong; Lu Yongquan; Yu Wenhua (Commun Univ China, Ctr High Performance Comp, Beijing 100024, Peoples R China)

ISBN: (print) 9780769541105

Two kinds of parallel genetic algorithm (PGA) are implemented in this paper based on the MATLAB(R) Parallel Computing Toolbox(TM) and Distributed Computing Server(TM) software. Parallel for-loops, SPMD (Single Program Multiple Data) blocks, and co-distributed arrays, three basic parallel programming modes in MATLAB, are employed to implement the global and coarse-grained PGAs. To validate and compare the implementations, both PGAs are applied to the problem of range image registration. A set of experiments illustrates that it is convenient and effective to use MATLAB to parallelize existing algorithms, and that a clear speed-up and performance improvement can be obtained.

Keywords: parallel genetic algorithm; MATLAB; distributed computing; parallel programming


Benefits of software rejuvenation on HPC systems


IEEE International Symposium on Parallel and Distributed Processing with Applications

Authors: Naksinehaboon, Nichamon; Taerat, Narate; Leangsuksun, Chokchai; Chandler, Clayton F.; Scott, Stephen L. (College of Engineering and Science, Louisiana Tech. University, Ruston, LA 71270, United States; Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, United States)

ISBN: (print) 9780769541907

Rejuvenation is a technique expected to mitigate failures in HPC systems by replacing, repairing, or resetting system components. Because of the small overhead required by software rejuvenation, we primarily focus on OS/kernel rejuvenation. In this paper, we propose three rejuvenation scheduling techniques. Moreover, we investigate the claim that software rejuvenation prolongs failure times in HPC systems. Also, we compare the lost computing times of the checkpoint/restart mechanism with and without rejuvenation after each checkpoint. © 2010 IEEE.

Keywords: Endocrinology


A Guess to Detect the Downloader-like Programs


9th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2010)

Authors: Wu Peng; Guo Qingping; Song Huijuan; Tang Xiaoyi (Wuhan Univ Technol, Distributed Parallel Proc Lab, Wuhan 430063, Peoples R China)

ISBN: (print) 9780769541105

Increasingly, computer malware and viruses have evolved into a new form that depends on the Internet, called a downloader. In this article, we describe the downloader's destructive power and several available methods for bypassing the heuristic scanning of Kaspersky's and ESET's newest antivirus software, whose heuristic scanning technology is the most advanced on Windows platforms. Even though heuristic scanning is the key component of protection software, more and more new methods are being built to bypass it. I then give my guess about how to detect and intercept downloader-like programs. Note that I never intend to harm Kaspersky's or ESET's products, only to learn.

Keywords: downloader; Timing attacks; bypass; vc


A Scalability Metric Based on Beowulf Cluster System


9th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2010)

Authors: Zhu, Yongzhi; Cao, Baoxiang (Qufu Normal Univ, Coll Comp Sci, RiZhao 276826, Peoples R China)

ISBN: (print) 9780769541105

With the rapid development of parallel computing technology and the popularity of Beowulf cluster systems, the scalability of parallel algorithm-machine combinations, which measures the capacity of a parallel algorithm to effectively utilize an increasing number of processors, is becoming more and more important. This paper reviews the ratio of parallel overhead to computation as a scalability metric and points out its merits and deficiencies. The metric is then improved so that it applies to distributed parallel computing environments based on Beowulf clusters, yielding a new extensible function that reflects the scalability of distributed parallel systems more directly and precisely as the machine size and the problem scale grow in a Beowulf cluster environment. Finally, the new metric is used to analyze and prove the scalability of parallel algorithms and Beowulf clusters.

Keywords: scalability; iso ratio of parallel overhead to computation; Beowulf cluster; distributed computation
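
The abstract above builds on the ratio of parallel overhead to computation but gives no formula. A minimal sketch, assuming the common textbook definition in which total overhead is `p * T_p - T_1` (with `T_1` the serial time and `T_p` the time on `p` processors), is shown below; the paper's improved metric is not reproduced here, and the function names and timings are hypothetical.

```python
def overhead_ratio(t_serial, t_parallel, num_procs):
    """Ratio of total parallel overhead to useful computation.

    Assumes the textbook definition: overhead = p * T_p - T_1.
    """
    total_overhead = num_procs * t_parallel - t_serial
    return total_overhead / t_serial


def efficiency(t_serial, t_parallel, num_procs):
    """Parallel efficiency T_1 / (p * T_p), which equals 1 / (1 + overhead ratio)."""
    return t_serial / (num_procs * t_parallel)


if __name__ == "__main__":
    # Illustrative timings in seconds, not measurements from the paper.
    ratio = overhead_ratio(100.0, 30.0, 4)
    print(ratio, efficiency(100.0, 30.0, 4), 1 / (1 + ratio))
```

Under this definition, keeping the ratio constant while the processor count and problem size grow together keeps efficiency constant, which is one common way to read such a ratio as a scalability measure.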


STUDY OF APPROXIMATE DISTRIBUTED DYNAMIC MODEL OF MULTISOURCE AND MULTI-SINK STEAM MANIFOLD SYSTEM ON THERMAL POWER PLANT


CHEMICAL ENGINEERING COMMUNICATIONS, 2010, vol. 197, no. 2, pp. 204-212

Authors: Pan, Lei; Shen, Jiong (Southeast Univ, Sch Energy & Environm, Nanjing 210096, Peoples R China)

Due to the deficiencies of prior modeling methods for systems of boilers in parallel operation through a common manifold, in this article we use simplified approximate distributed modeling based on the compartmentalization and combination of the manifold. First, a principle is established to partition the manifold into basic pipe sections between each pair of source/sink points. Second, a simplified approximate distributed decoupled transfer function matrix model without steady errors is built for each pipe section. Finally, a smooth joint arithmetic is adopted to combine all the conterminous subsections of the pipe section with boilers/turbines at its ends into a whole system model. This method can decrease the size of the equipment and, therefore, makes it easier to model larger systems. The model presents more distributed characteristics and is well suited to control system simulation. Some experiments have been done with favorable results to prove the validity of the method.

Keywords: Boilers in parallel operation; distributed dynamic model; Source and sink; Steam manifold


The design and implement of memory manager in STM


9th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES 2010)

Authors: Zhang Ping; Li QingBao; Huang GuoRui; Zeng GuangYu (Zhengzhou Inst Informat Sci & Technol, Dept Comp Sci, Zhengzhou, Henan, Peoples R China)

ISBN: (print) 9780769541105

Software transactional memory (STM) is one of the promising models in parallel programming for multi-core processor systems and has been studied by many researchers. Memory management is an important aspect of STM system design that directly affects the performance of the whole STM system. This paper presents the design and implementation of an effective memory manager. It uses a private heap to manage the transactions' memory space of each thread and a global heap to manage the whole memory space in the STM system. The algorithms ensure that memory access is non-blocking. Tests show that the performance of the memory manager is satisfactory.

Keywords: Multi-core processor; software transactional memory; memory management; memory allocation; garbage collection
