Increase in the complexity of VLSI digital circuit design demands faster logic simulation techniques than those currently available. One of the ways of speeding up existing logic Simulation Agorithms is by exploiting ...
详细信息
Temporal logic is widely acclaimed to be a highly successful tool for analyzing non-real-time properties of programs. However, a few fundamental problems arise while designing temporal logic-based-techniques to verify...
详细信息
作者:
Pifarre, G.D.Gravano, L.Felperin, S.A.Sanz, J.L.C.ESLAI
Escuela Superior Latino Americana de Informática CC 3193 (1000) Buenos Aires Argentina. and Computer Research and Advanced Applications Group IBM Argentina Ing. E. Butti 275 (1300) Buenos Aires Argentina. and Computer Science Dept. IBM Almadeu Research Center San José California Computer Science Dept.
IBM Almaden Research Center San José California. Computer Research and Advanced Applications Group IBM Argentina Ing. E. Butti 275 (1300) Buenos Aires Argentina
This paper deals with the problem of packet-switched routing in parallel machines. Several new routing algorithms for different interconnection networks are presented. While the new techniques apply to a wide variety ...
详细信息
ISBN:
(纸本)0897914384
This paper deals with the problem of packet-switched routing in parallel machines. Several new routing algorithms for different interconnection networks are presented. While the new techniques apply to a wide variety of networks, routing algorithms will be shown for the hypercube, the 2-dimensional mesh, and the shuffle-exchange. The techniques presented for hypercubes and meshes are fully-adaptive and minimal. A similar technique can be devised for tori. A fully-adaptive and minimal routing is one in which all possible minimal paths between a source and a destination are of potential use at the time a message is injected into the network. Minimal paths followed by messages ultimately depend on the local congestion encountered in each node of the network. In the shuffle-exchange network, the routing scheme also exhibits adaptivity but paths could be up to 3 log TV long for an iV node machine. The shuffle-exchange algorithm is the first adaptive and deadlock-free method that requires a small (and independent of N) number of buffers and queues in the routing nodes for that network. Furthermore, all of the new techniques are completely free of deadlock situations. In dynamic message injection models, the routing methods are also ensured to be free of livelock if messages competing for resources are handled with fairness. In contrast to other approaches in which adaptivity, deadlock and livelock freedom can be guaranteed at the expense of complex architectures, the algorithms presented in this paper require a very moderate amount of routing hardware. In particular, it will be shown that only two central queues per routing node of the network are necessary for the cases of the 2-dimensionaJ mesh and the hypercube, and four queues for the shuffle-exchange. This paper demonstrates that "hanging" an interconnection network from a node [Gun81, MS80, BGSS89, Kon90] is a convenient methodology for creating and visualizing routing functions and understanding deadlock-free polic
A new approach is presented for explicitly relating image observables to models of curved three-dimensional objects. This relationship is used for object recognition and positioning. Object models consist of collectio...
详细信息
This paper discusses the parallel implementation of a hypercube-based circuit partitioning scheme. The circuit partitioner is based on a simulated annealing algorithm and is designed to provide near optimal circuit pa...
详细信息
This paper discusses the parallel implementation of a hypercube-based circuit partitioning scheme. The circuit partitioner is based on a simulated annealing algorithm and is designed to provide near optimal circuit partitions for simulating the circuits on a hypercube architecture. The circuit partitioning scheme optimizes multiple criteria such as tight coupling of nodes within a subcircuit, load balancing among processors and communication overheads. Design of a partitioner that obtains near optimal solutions satisfying such multiple criteria is difficult using any simple heuristic techniques other than simulated annealing.
The importance of circuit simulation in the design of VLSI circuits has channelised research work in the direction of finding methods to speedup this highly compute-intensive problem. On one hand, attempts have been m...
详细信息
The importance of circuit simulation in the design of VLSI circuits has channelised research work in the direction of finding methods to speedup this highly compute-intensive problem. On one hand, attempts have been made to find better algorithms and use faster hardware; and on the other hand, to use parallel architectures for accelerating the circuit simulation task. In this paper, we examine the various issues involved in parallelizing two well-known circuit simulation approaches – direct methods and relaxation methods. A number of parallel computer architectures which have been used for this purpose are also surveyed.
Circuit simulation is a highly compute-intensive task as it involves solving thousands of ordinary differential equations (ODEs) describing the VLSI circuit under consideration. This paper describes an effort towards ...
详细信息
Circuit simulation is a highly compute-intensive task as it involves solving thousands of ordinary differential equations (ODEs) describing the VLSI circuit under consideration. This paper describes an effort towards speeding up this task using a hypercube-based architecture. The paper focusses on the design and development of HIRECS (Hypercube Implementation of RElaxation-Based Circuit Simulation). HIRECS is based on the relaxation approach of solving the ODEs describing the circuit. The natural decomposition of the problem makes the relaxation algorithms amenable to parallel implementation. HIRECS employs the Waveform Relaxation (WR) algorithm. The special feature of WR algorithm is that the latency of the circuit can be exploited better, effecting a saving in the total computation time. The concept of ’windowing’ has been incorporated in HIRECS to effect a saving in the memory requirement. Another important feature of HIRECS is a novel synchronization scheme called partial synchronization. HIRECS runs on a DEC-1090 system and is developed using SIMULA. Performance studies of HIRECS based on parameters such as speedup, efficiency, and utilization of processors have been carried out. The performance evaluation of HIRECS in the simulation of some bench mark circuits like inverter chains and multiplexers indicates that a significant speedup, almost linear, can be obtained using a hypercube. For circuits with large number of nodes, such an implementation can result in tremendous saving in the computation time.
This paper presents a modeling methodology devoted to the performance evaluation of parallel architectures. The methodology is based on the decomposition of the modeling process into seven stages. In each stage specif...
详细信息
This paper presents a modeling methodology devoted to the performance evaluation of parallel architectures. The methodology is based on the decomposition of the modeling process into seven stages. In each stage specific techniques, appropriate to parallel architectures, are applied, such as aggregation methods, a thorough distinction between the architecture model and the application program model, the constitution of program classes using data analysis techniques. The methodology is then illustrated through a case study, the loosely Coupled Array of Processors (lCAP) system, designed at IBM Kingston. Use of the lCAP model allows to predict the performance of the lCAP system, in terms of response time, resource utilization, waiting times, and to investigate many alternatives with regard to the system configuration (e. g. number of system components, component interconnection scheme, component characteristics), or to the parallel program structure (e. g. parallel task granularity, load imbalance).
The paper describes a 3-dimensional Shape Descriptor Function (SDF) which is invariant under the action of SO(3). We concentrate here on the analytical derivation of the SSD, and show how certain requirements constrai...
详细信息
The design of a 10-20 MW, 40 nsec cyclotron auto-resonance maser experiment is presented. The basic components of the CARM are a novel 600 kV, 200 A. field emission gun and a "whispering gallery" mode ripple...
详细信息
暂无评论