The increased use of multi-bit processing elements such as digital signal processors, multipliers, multi-bit addressable memory cells, and CPU cores has presented new opportunities for field-programmablegate Array (F...
详细信息
The increased use of multi-bit processing elements such as digital signal processors, multipliers, multi-bit addressable memory cells, and CPU cores has presented new opportunities for field-programmablegate Array (FPGA) architects to utilize the regularity of multi-bit signals to increase the area efficiency of FPGAs. In particular, configuration memory sharing has been traditionally used to exploit multi-bit regularity for area. We observe that the process of creating configuration memory sharing routing resources often leads to the use of much sparser switch patterns for connecting multi-bit elements to their routing tracks. In this work, we empirically evaluate the effect of these sparse switch patterns on the area efficiency of FPGAs. It is shown that the sparse switch patterns alone contribute significantly to the area reduction observed in configuration memory sharing FPGAs. In particular, our experiments show that, without configuration memory sharing, sparse switch patterns can reduce the implementation area of multi-bit routing resources by 10.4% while configuration memory sharing contributes to an additional 1.2% in area savings. The observation holds over a wide range of connection block flexibility values and demonstrates that efficient switch pattern designs can be effectively used to increase the area efficiency of FPGA routing resources.
In the paper, the mathematical model of memory consumption and through-put parameter estimation of a cryptographic system that employs transposition chaotic maps is investigated. The target platform for the system imp...
详细信息
ISBN:
(数字)9781728141848
ISBN:
(纸本)9781728141855
In the paper, the mathematical model of memory consumption and through-put parameter estimation of a cryptographic system that employs transposition chaotic maps is investigated. The target platform for the system implementation is field-programmablegate Array (FPGA) devices. The proposed mathematical model takes into account limitations of FPGA circuits and can be used to assess usage of FPGA memory resources preliminary and to evaluate throughput efficiency of the encryption system. First, expressions for memory consumption are introduced. The second part is devoted to analysis of the system throughput. Different architectures for memory organization have been considered in the paper as well as all stages of the cipher pipeline. The architectures are characterized by the different level of parallelism on the level of message processing that results in variations in throughput value. The work presents generalized expression for throughput estimation for all the considered cipher architectures. It allows the designer of the system to select an FPGA die that fulfills the basic requirements for cryptographic system implementation.
The course graph expansion (CGE) detailed routing algorithm is presented for FPGAs (field-programmablegatearrays). The algorithm has the ability to resolve routing conflicts by considering the side-effects of one co...
详细信息
The course graph expansion (CGE) detailed routing algorithm is presented for FPGAs (field-programmablegatearrays). The algorithm has the ability to resolve routing conflicts by considering the side-effects of one connection on another, and can be used over a wide range of FPGA interconnection architectures. CGE has been used to obtain excellent routing results for several industrial circuits with various FPGA routing architectures. The results show that CGE is able to route relatively large FPGAs in the absolute minimum number of tracks as determined by global routing, and that CGE has a linear run-time over circuit size.< >
Rapid system prototyping is one of the main applications for field-programmablegatearrays (FPGAs). At the stage of rapid system prototyping, design specifications can often be changed since they cannot always be det...
详细信息
Rapid system prototyping is one of the main applications for field-programmablegatearrays (FPGAs). At the stage of rapid system prototyping, design specifications can often be changed since they cannot always be determined completely. In this paper, layout design change is focused on and a layout reconfiguration algorithm is proposed for FPGAs. In layout reconfiguration, the main problem is to add LUTs to initial layouts. Our algorithm consists of two steps: For given placement and global routing of LUTs, Step 1 places an added LUT with allowing that the position of the added LUT may overlap that of a preplaced LUT; Then Step 2 moves preplaced LUTs to their adjacent positions so that the overlap of the LUT positions can be resolved. Global routes are updated corresponding to reconfiguration of placement. The algorithm keeps routing congestion small by evaluating global routes directly both in Steps 1 and 2. Especially in Step 2, if the minimum number of preplaced LUTs are moved to their adjacent positions, our algorithm minimizes routing congestion. Experimental results demonstrate the effectiveness and efficiency of the algorithm.
Motivated by improving FPGA performance, we propose a new three-dimensional (3D) FPGA architecture, along with a fabrication methodology. We analyze the expected manufacturing yield, and raise several physical-design ...
详细信息
Motivated by improving FPGA performance, we propose a new three-dimensional (3D) FPGA architecture, along with a fabrication methodology. We analyze the expected manufacturing yield, and raise several physical-design issues in the new 3D paradigm. Our techniques also have good implications for resource utilization, physical size, and power consumption.
This paper investigates area-speed trade-offs for Hierarchical FPGA (HFPGA) architectures. Using a set of new CAD tools, we measured the timing performance of HFPGAs and conventional symmetrical FPGAs using data gathe...
详细信息
This paper investigates area-speed trade-offs for Hierarchical FPGA (HFPGA) architectures. Using a set of new CAD tools, we measured the timing performance of HFPGAs and conventional symmetrical FPGAs using data gathered from experiments on a subset of benchmark circuits from the Microelectronics Centre of North Carolina (MCNC). Experiments were also performed to determine the effect of timing optimized placements on routing channel requirements. These experiments demonstrate that HFPGAs can achieve both better area and speed than symmetrical FPGA architectures.
In this paper, we investigate the use of field-programmablegatearrays (FPGAs) in the design of a highly scalable variable block size motion estimation architecture for the H.264/AVC video encoding standard. The scal...
详细信息
ISBN:
(纸本)9781424419606
In this paper, we investigate the use of field-programmablegatearrays (FPGAs) in the design of a highly scalable variable block size motion estimation architecture for the H.264/AVC video encoding standard. The scalability of the architecture allows one to incorporate the system into low cost single FPGA solutions for low-resolution video encoding applications as well as into high performance multi-FPGA solutions targeting high-resolution applications. To overcome the performance gap between FPGAs and application specific integrated circuits, our algorithm intelligently increases its parallelism as the design scales while minimizing the use of memory bandwidth. The core computing unit of the architecture is implemented on FPGAs and its performance is reported. It is shown that the computing unit is able to achieve 28 frames per second (fps) performance for 640x480 resolution VGA video while incurring only 4% device utilization on a Xilinx XC5VLX330 FPGA. With 8 computing units at 37% device utilization, the architecture is able to achieve 31 fps performance for encoding full 1920x1088 progressive HDTV video.
Preisach-type operators model hysteresis via weighted superposition of a large (and even infinite) number of basic hysteretic elements (called hysterons), and they have proven capable of capturing various complicated ...
详细信息
Preisach-type operators model hysteresis via weighted superposition of a large (and even infinite) number of basic hysteretic elements (called hysterons), and they have proven capable of capturing various complicated hysteretic behaviors. While inverse compensation is an effective approach to control of hysteretic systems, inversion of Preisach-type operators is a bottleneck in demanding, high-speed applications due to the high computational cost. In this paper a novel and general framework is proposed for fast inversion of a wide class of Preisach-type operators, by exploiting the massive parallelism offered by field-programmablegatearrays (FPGAs) to process the inherently parallel hysteresis operators. The theory, algorithm, and implementation of the inversion are presented. The inversion output is computed iteratively with guaranteed convergence (up to machine precision) provided the hysteresis operator is piecewise monotone and Lipschitz continuous. For an operator consisting of m hysterons, the proposed approach shows a computational complexity of O(log m), in contrast to O(m) for methods using general DSPs. The effectiveness of the fast inversion approach is demonstrated by implementation results on open-loop tracking of kHz reference signals, based on inversion of a Krasnosel'skii-Pokrovskii operator.
A third-generation family of field-programmablegatearrays (FPGAs) that utilizes a combination of architectural and process improvements and features up to twice the density and speed of currently available FPGA devi...
详细信息
A third-generation family of field-programmablegatearrays (FPGAs) that utilizes a combination of architectural and process improvements and features up to twice the density and speed of currently available FPGA devices is discussed. The architecture allows complete and efficient automated design implementation of FPGA-based designs, as well as maximum density and performance. User-configurable on-chip static memory resources further contribute to the high integration levels available to users of the third-generation devices.< >
This paper describes a new detailed routing algorithm, specially designed for those architectures that are found in most recent generations of field-programmablegatearrays (FPGAs). The algorithm also brings a soluti...
详细信息
This paper describes a new detailed routing algorithm, specially designed for those architectures that are found in most recent generations of field-programmablegatearrays (FPGAs). The algorithm also brings a solution for those architectures where multiplexed switches are used in order to decrease the chip area like the recently proposed FIPSOC FPGA. The algorithm, called RAISE, can be applied to a broad range of optimization problems and has been used for detailed routing of symmetrical FPGAs, whose routing architecture consists of rows and columns of logic cells interconnected by routing channels, with or without the use of multiplexed switches. RAISE (Router using Adaptive Simulated Evolution) searches not just for a possible solution, but tries to find the one with minimum delay. Excellent routing results have been obtained over a set of several benchmark circuits getting solutions close to the minimum number of tracks.
暂无评论