检索结果-内蒙古大学图书馆

ON RELIABILITY ANALYSIS OF CHORDAL RINGS

Journal of Circuits, Systems and Computers 1995年第2期5卷 199-213页

作者： AMIYA NAYAK NICOLA SANTORO Center for Parallel & Distributed Computing School of Computer Science Carleton University Ottawa Canada K1S 5B6 Canada

A common technique to improve the reliability of loop (or ring) networks is by introducing link redundancy; that is, by providing several alternative paths for communication between pairs of nodes. With alternate paths between nodes, the network can now sustain several node and link failures by bypassing the faulty components. However, faults occurring at strategic locations in a ring can prevent the computation by disrupting I/O operations, blocking the flow of information, or even segmenting the structure into pieces which can no longer be suitable for any practical purpose. An extensive characterization of fault-tolerance in ring topologies is given in this paper. This characterization augments the results known in the literature to date. The characterization has revealed several properties which describe the problem of constructing subrings and linear arrays in the presence of node failures in the original ring for a specified link configuration. Also in this paper, bounds are established on the degree of fault tolerance achievable in a redundant loop network, with a given degree of redundancy, when performing a computation that requires a minimal number of operational nodes. Also the bounds on the size of the problems guaranteed to be solved in the presence of a given number of faults in the network are derived.

关键词：

来源：评论

学校读者我要写书评

暂无评论

An automated and power-aware framework for utilization of IP cores in hardware generated from C descriptions targeting FPGAs

An automated and power-aware framework for utilization of IP...

引用

Annual IEEE Symposium on Field-Programmable Custom computing Machines (FCCM)

作者： A. Jones P. Banerjee Center for Parallel and Distributed Computing Department of Electrical and Computer Engineering Technological Institute Northwestern University Evanston IL USA

Use of hand optimized Intellectual Property (IP) logic cores is prolific in hardware design. These IP cores range from rather complicated signal processing transforms and filters to arithmetic operators. While IP cores remain a standard way to utilize the improvement in FPGA technology and contend with time to market pressure through reuse, popularity of tools generating hardware descriptions from high-level languages is increasing in popularity. The PACT HDL behavioral synthesis tool attempts to combine these two methods within a power-aware framework. PACT HDL generates RTL HDL codes in VHDL and Verilog using a finite state machine (FSM) style. These codes use intrinsic operators to represent calculations such as addition, subtraction, and multiplication. The output HDL codes are passed to commercial RTL synthesis tools that generate the gate-level hardware descriptions. Each intrinsic operator is replaced with a hardware implementation of the calculation by the synthesis tool. Unfortunately, by leaving this decision to the synthesis tool, the gate-level instantiation may not be appropriate for the desired constraints, particularly those relating to power consumed. The synthesis tools tend to use combinational implementations that are area and power hungry. In some cases, the tool may not be able to instantiate the appropriate logic, such as the division operator, at all.

关键词： Power generation Field programmable gate arrays Hardware design languages Design optimization Intellectual property Logic design Signal processing Filters Arithmetic Time to market

来源：评论

学校读者我要写书评

暂无评论

Resynthesis of sequential circuits for low power

Resynthesis of sequential circuits for low power

引用

IEEE International Symposium on Circuits and Systems (ISCAS)

作者： S. Roy P. Banerjee Ambit Design Systems Inc. Santa Clara USA Center for Parallel & Distributed Computing Northwestern University USA

At the logic level, a popular approach is to power down the sequential machine during the self-loops of the underlying finite state machine (FSM). In this work, we extend this idea to resynthesize existing sequential circuits to reduce power. We report a novel technique based on symbolic simulation of a sequential circuit to extract its self-loops without extracting the corresponding state transition diagram (STG). Since self loops may not be inherently present in the corresponding FSM, we partition the circuit heuristically and identify partial-self-loops for each partition to bring down the corresponding sub-circuit by gating the clock sub-tree feeding that partition. By using this approach, we could save up to 45% of the total power on a controller circuit of a microprocessor design, where traditional techniques could not save any power.

关键词： Sequential circuits Binary decision diagrams Circuit simulation Data structures Boolean functions Input variables Equations

来源：评论

学校读者我要写书评

暂无评论

A NOTE ON THE LOAD BALANCING PROBLEM FOR COARSE-GRAINED HYPERCUBE DICTIONARY MACHINES

引用

JOINT INTERNATIONAL CONF ON VECTOR AND parallel PROCESSING ( CONPAR 90 - VAPP IV )

作者： DEHNE, F GASTALDO, M Center for Parallel and Distributed Computing School of Computer Science Carleton University Ottawa K1S 5B6 Canada Laboratoire de I'lnformatique du Parallelisms - IMAG Ecole Normale Superieure de Lyon Lyon cedex 07 69364 France

ISBN: (纸本)3540530657

The main problem for the design of dictionary machines on coarse grained hypercube multiprocessors, in comparison to the widely studied dictionary problem for fine grained hypercube multiprocessors, is that due to unequal distribution of the inserted and deleted records, the sizes of the sets stored at the individual processors may vary considerably. This problem, which is usually referred to as the load balancing problem, may lead to considerable degradation of the dictionary machine's performance. In this note we show that the load balancing problem for coarse grained hypercube dictionary machines can be solved with provable bounds on the sizes of the data sets, and with only little computational overhead. © Springer-Verlag Berlin Heidelberg 1990.

关键词： Geometry

来源：评论

学校读者我要写书评

暂无评论

Noncontiguous I/O accesses through MPI-IO 03

Noncontiguous I/O accesses through MPI-IO

引用

IEEE/ACM International Symposium on Cluster computing and the Grid (CCGRID)

作者： A. Ching A. Choudhary K. Coloma Wei-keng Liao R. Ross W. Gropp Center for Parallel and Distributed Computing Northwestern University Evanston IL USA Argonne National Laboratory Argonne IL USA

ISBN: (纸本)9780769519197

I/O performance remains a weakness of parallel computing systems today. While this weakness is partly attributed to rapid advances in other system components, I/O interfaces available to programmers and the I/O methods supported by file systems have traditionally not matched efficiently with the types of I/O operations that scientific applications perform, particularly noncontiguous accesses. The MPI-IO interface allows for rich descriptions of the I/O patterns desired for scientific applications and implementations such as ROMIO have taken advantage of this ability while remaining limited by underlying file system methods. A method of noncontiguous data access, list I/O, was recently implemented in the parallel Virtual File System (PVFS). We implement support for this interface in the ROMIO MPI-IO implementation. Through a suite of noncontiguous I/O tests we compared ROMIO list I/O to current methods of ROMIO noncontiguous access and found that the list I/O interface provides performance benefits in many noncontiguous cases.

关键词： File systems Testing distributed computing Mathematics Computer science Laboratories parallel processing Programming profession Tiles Checkpointing

来源：评论

学校读者我要写书评

暂无评论

parallel branch and bound on fine-grained hypercube multiprocessors

Parallel branch and bound on fine-grained hypercube multipro...

引用

International Conference on Tools for Artificial Intelligence (ICTAI)

作者： F. Dehne A.G. Ferreira A. Rau-Chaplin Center for Parallel and Distributed Computing School of Computer Science Carleton University Ottawa Canada Laboratoire de I'lnformatique du Parallelisme-IMAG Ecole Normale Superieure Lyon France Center for Parallel and Distributed Computing. School of Computer Science Carleton University Ottawa Canada

An efficient branch and bound algorithm for fine-grained hypercube multiprocessors is presented. The method uses a global storage allocation scheme where all processors collectively store all back-up paths such that each processor needs to store only a constant amount of information. At each iteration of the algorithm, all nodes of the current back-up tree may decide whether they need to create new children, be pruned, or remain unchanged. An algorithm that, on the basis of these decisions, updates the current back-up tree and distributes global information in O(log m) steps, where m is the current number of nodes, is described. This method also provides a dynamic allocation mechanism that obtains optimal load balancing. Another important property of the method is that, even if very drastic changes in the current back-up tree occur, the performance of the load balancing mechanism remains constant. The method is currently being implemented on the Connection Machine.< >

关键词： Hypercubes Load management distributed computing Computer science Search methods Linear programming Artificial intelligence Operations research High definition video Concurrent computing

来源：评论

学校读者我要写书评

暂无评论

Recognition of catastrophic faults

Recognition of catastrophic faults

引用

1992 IEEE International Workshop on Defect and Fault Tolerance in VLSI Systems, DFT 1992

作者： Nayak, Amiya Pagli, Linda Santoro, Nicola Center for Parallel and Distributed Computing School of Computer Science Carleton University OttawaONK1S 5B6 Canada Dipartimento di Scienze dell'Informazione University of Pisa Corso Italia 40 Pisa56100 Italy

ISBN: (纸本)0818628375

Fault tolerance through the incorporation of redundancy and reconfiguration is quite common. Regular systems are being designed with massive redundancy built into them [5,6,12], These systems also make use of the redundancy to reconfigure in the event of failure in one or more components;normally, a reconfiguration process is triggered as soon as a fault is detected. Many different reconfiguration schemes [1-4,6,7,10-13] have been proposed in the literature which reconfigure regular systems in the presence of faulty components. The distribution of faults can have severe impact on the effectiveness of any reconfiguration scheme;in fact, patterns of faults occurring at strategic locations may render an entire system unusable regardless of its component redundancy and of its reconfiguration capabilities. For a given design, it is not difficult to identify a set of elements whose failure will have catastrophic consequence. There exist many patterns (random distribution) of faults, not in a block, which can be fatal for the system [9]. Therefore, the characterization of such fault patterns is crucial for the identification, testing and detection of such catastrophic events. © 1992 IEEE.

关键词： Redundancy

来源：评论

学校读者我要写书评

暂无评论

On implementation of an enhanced and integrated parallel programming web-based toolkit

On implementation of an enhanced and integrated parallel pro...

引用

ITRE 2005 - 3rd International Conference on Information Technology: Research and Education

作者： Li, Kuan-Ching Yang, Chao-Tung Parallel and Distributed Processing Center Dept. of Computer Science and Information Management Providence University Shalu Taichung 43301 Taiwan High Performance Computing Laboratory Dept. of Computer Science and Information Engineering Tunghai University Taichung City Taichung 40704 Taiwan

ISBN: (纸本)0780389328

The computing power provided by high performance and low cost PC-based clusters and Grid computing platforms are attractive and they are equal or superior to supercomputers and mainframes. In parallel, discussions on how to obtain more computing power from these computing platforms become an interesting issue. The development of applications for these high-performance computing platforms is complicated for several reasons: The complexity of applications themselves, which combines aspects of supercomputing and distributed computing, and by the need to achieve higher performance. This paper describes the design rationale and implementation of a parallel programming web-based toolkit, to ease the parallel programming learning process, with the use of web-based interface. The toolkit has widely been used in MPI parallel programming courses (both in graduate and undergraduate levels) and industry trainings. © 2005 IEEE.

关键词： Computer programming

来源：评论

学校读者我要写书评

暂无评论

The anatomy of a course in cluster and grid computing

The anatomy of a course in cluster and grid computing

引用

ITRE 2005 - 3rd International Conference on Information Technology: Research and Education

作者： Yang, Chao-Tung Li, Kuan-Ching High Performance Computing Laboratory Dept. of Computer Science and Information Engineering Tunghai University Taichung City Taichung 40704 Taiwan Parallel and Distributed Processing Center Dept. of Computer Science and Information Management Providence University Shalu Taichung 43301 Taiwan

ISBN: (纸本)0780389328

Cluster and grid computing is a relatively new interdisciplinary field, where computer science, engineering and computational biology as its core supporting disciplines. The rise of cluster and grid computing discipline brings to computer science faculty members new opportunities and challenges, both in education and in research. In this paper, we will explore issues in teaching Cluster and Grid computing, the developing of curricula, and thoughts to foster student to work on research projects in Cluster and Grid computing. © 2005 IEEE.

关键词： Education

来源：评论

学校读者我要写书评

暂无评论

Some fast parallel algorithms for parentheses matching 3rd

引用

3rd International Conference on computing and Information, ICCI 1991

作者： Das, Sajal K. Chen, Calvin C.-Y. Lewis, Gene Prasad, Sushil Center for Research in Parallel and Distributed Computing Department of Computer Science University of North Texas P.O. Box 13886 DentonTX76203-3886 United States Department of Mathematics and Computer Science Georgia State University AtlantaGA30303 United States

ISBN: (纸本)9783540540298

The parentheses matching problem is to determine the mate of each parenthesis in a balanced string of n parentheses. In this paper, we present three novel and elegant parallel algorithms for this problem on parallel random-access machine (PRAM) models. Each of our algorithms has polylog-time complexity and two of them are cost-optimal. © Springer-Verlag Berlin Heidelberg 1991.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：