Self-timed circuits present an attractive solution to the problem of process variation. However implementing self-timed combinational logic is complex and expensive. This paper presents a novel method for synthesising...
详细信息
ISBN:
(纸本)9781424439331
Self-timed circuits present an attractive solution to the problem of process variation. However implementing self-timed combinational logic is complex and expensive. This paper presents a novel method for synthesising indicating implementations of arbitrary encoded function blocks. The synthesis method reduces the cost of the implementations by distributing indication between the individual outputs of a function block. Covers are constructed by determining the minimal cost set of Prime Indicants which are required to indicate all of the input transitions of the function block The results of the procedure are demonstrated on a wide range of combinational logic blocks and show a reduction in literal count of between 38-99%.
Designing asynchronouscircuits is becoming easier as a number of design styles are making the transition from research projects to real, usable tools. However, designing asynchronous "systems" is still a di...
详细信息
In this paper, we put forward an energy-efficient configurable asynchronous SNN accelerator for energy-constrained applications, which includes 256 neurons and 131K synapses with 8-bit fixed point weight. To achieve h...
详细信息
ISBN:
(纸本)9781728141329
In this paper, we put forward an energy-efficient configurable asynchronous SNN accelerator for energy-constrained applications, which includes 256 neurons and 131K synapses with 8-bit fixed point weight. To achieve high energy efficiency and on-chip learning ability, we propose a sparse target propagation (S-TP) algorithm and design the accelerator with Click-based bundled-data asynchronouscircuits. The SNN accelerator is implemented in 28nm CMOS process, and the post place and router (post-PAR) simulation results indicate that the SNN accelerator achieves on-chip learning with inference power efficiency of 3.97 pJ/SOP and 95.7% classification accuracy on NMNIST test dataset, which outperforms prior neuromorphic on-chip learning systems.
This paper presents design and simulation results of two high-performance asynchronous pipeline circuits. The first circuit is a two-phase micropipeline but uses pseudo-static Svensson-style double edge-triggered D-fl...
详细信息
Neuromorphic processing architectures seek to emulate the functionality of the brain by realizing parallel, efficient, event-based processing which can be directly applied to solve many of the pressing problems within...
详细信息
ISBN:
(纸本)9781665451093
Neuromorphic processing architectures seek to emulate the functionality of the brain by realizing parallel, efficient, event-based processing which can be directly applied to solve many of the pressing problems within artificial intelligence and big data. However, implementation of these systems leads to slow response times, high power dissipation, or incoherent output. In this paper, an analog cellular neural network processing element is demonstrated to perform asynchronous spatiotemporal filtering operations in an area and power efficient manner. It utilizes a pair of analog memories to encode spike timings and perform event-based bandpass temporal processing. Information from the local clique of temporal filters is leveraged by a parallel, spatial processor which maps CNN arithmetic to the current-domain for compact computation. Preliminary circuit verification demonstrated the ability of the element to perform spatiotemporal filtering operations with latencies less than 1.8 mu s while only consuming 1.6pJ/spike.
The Integrated systems Laboratory (IIS) of ETH Zurich (Swiss Federal Institute of Technology) has been active in Globally-asynchronous Locally-Synchronous (GALS) research since 1998. During this time, a number of GALS...
详细信息
ISBN:
(纸本)0769524982
The Integrated systems Laboratory (IIS) of ETH Zurich (Swiss Federal Institute of Technology) has been active in Globally-asynchronous Locally-Synchronous (GALS) research since 1998. During this time, a number of GALS circuits have been fabricated and tested successfully on silicon. From a hardware designers point of view, this article summarizes the evolution from proof of concept designs over multi-point interconnects to applications that specifically take advantage of GALS operation to improve cryptographic security. In spite of the fact that they fail to address numerous idiosyncrasies of GALS (such as good partitioning into synchronous islands, port controller design, pausable clock generators, design for test, etc.), hierarchical design flows have been found to form a workable basis. What prevents GALS from gaining a wider acceptance mainly is the initial effort required to come up with a design flow that is efficient and dependable.
An asynchronous pipeline scheme that combines a low power static circuit with a high-speed dual-rail dynamic circuit is proposed. The scheme utilizes a dual-rail circuit only in the critical path of an SRT division an...
详细信息
An asynchronous pipeline scheme that combines a low power static circuit with a high-speed dual-rail dynamic circuit is proposed. The scheme utilizes a dual-rail circuit only in the critical path of an SRT division and square root calculation unit. The proposed implementation of the calculation unit reduced power consumption by more than 1/2 of the full-dynamic implementation while maintaining the calculation speed. Because of the elimination of spurious transitions, the proposed implementation showed even less power consumption over synchronous static circuit implementations. By using 0.3 μm triple metal CMOS technology, the calculation time of floating point 56-b full mantissa division and square root is expected to be 45ns in the worst case.
This work explores the use of asynchronous approximate multiply-accumulate (MAC) operators and research ways to alleviate the inherent area overhead of such circuits, while leveraging on asynchronouscircuits advantag...
详细信息
ISBN:
(纸本)9781665484855
This work explores the use of asynchronous approximate multiply-accumulate (MAC) operators and research ways to alleviate the inherent area overhead of such circuits, while leveraging on asynchronouscircuits advantages. It analyzes three approximate MAC architectures with varying error rates and area trade-offs. Accurate and approximate, synchronous and asynchronous MAC operators are compared. Experiments show it is feasible to decrease the area overhead of the accurate asynchronous MAC from 8.1x down to 1.6x by recurring to approximate multipliers, for varying controlled error rates. The use of asynchronous quasi-delay insensitive (QDI) circuits allows applying extensive voltage scaling to all asynchronous MAC operators. Power and energy per operation can thus be significantly reduced, achieving savings of up to 2.66x in power and 3.17x in energy per operation when compared to the accurate asynchronous MAC.
This paper proposes multiple-clock multiple-edge-triggered multiple-bit flip-flops for designing simple and straightforward asynchronous control circuits of the two-phase handshaking protocol. The proposed flip-flops ...
详细信息
ISBN:
(纸本)9781479934324
This paper proposes multiple-clock multiple-edge-triggered multiple-bit flip-flops for designing simple and straightforward asynchronous control circuits of the two-phase handshaking protocol. The proposed flip-flops have multiple clocks and multiple data inputs, and each data input can be stored in the flip-flop at both the rising edge and the falling edge of the corresponding clock. They can be applied in the asynchronous design of the two-phase handshaking protocol not only for synthesizing simple control circuits, but also for obtaining robust circuits. The performance of the proposed flip-flops have been evaluated using the PTM 22nm HP device parameters.
This paper is concerned with the containment control problem for discrete-time high-order multi-agent systems with switching topologies under the asynchronous setting. Based on the distributed asynchronous consensus p...
详细信息
ISBN:
(纸本)9781728103976
This paper is concerned with the containment control problem for discrete-time high-order multi-agent systems with switching topologies under the asynchronous setting. Based on the distributed asynchronous consensus protocol using only each agent's own information and its neighbors' partial information, the asynchronous high-order containment control problem with switching topologies is transformed into a product problem of infinite time-varying row-stochastic matrices. Then the properties of row-stochastic matrices are explored to derive a sufficient condition involving graph topologies for asynchronous containment control of high-order multi-agent systems. The theoretical results are finally validated through numerical simulations.
暂无评论