Traditional placement algorithms for FPGAs are normally carried out on a fixed clustering solution of a circuit. the impact of clustering on wirelength and delay of the placement solutions is not well quantified. In t...
详细信息
Several implementations of Artificial Neural Networks have been reported in scientific papers. Nevertheless, these implementations do not allow the direct use of off-line trained networks because of the much lower pre...
详细信息
In this paper we describe a parameterizable FPGA-based implementation of a sigma-delta converter used in a 96kHz audio DAC. From specifications of the converter’s input bitwidth and data sampling frequency, VHDL gene...
详细信息
this paper presents a hardware algorithm for finding a maximum clique of a given graph, and shows experimental results of the proposed algorithm running on an FPGA. the proposed algorithm is constructed according to a...
详细信息
Summary form only given. the exponentially increasing complexity of many scientific applications and the high cost of supercomputing force us to explore new, sustainable, and affordable high-performance computing plat...
详细信息
Summary form only given. the exponentially increasing complexity of many scientific applications and the high cost of supercomputing force us to explore new, sustainable, and affordable high-performance computing platforms. Recent significant advances in FPGA technology and the inherent advantages of configurable logic have brought about new research efforts in the configurable computing field: parallel processing on configurable chips. We explore here parallel LU factorization of large sparse block-diagonal-bordered (BDB) matrices on a configurable multiprocessor that we have designed and implemented. A dynamic load balancing strategy is proposed and analyzed. Performance results for IEEE power test systems are provided. Our research provides evidence that configurable logic can be a viable alternative to high-performance scientific computing.
In this paper, we develop a unified theory in analyzing optimal switch box design problems, particularly for the unsolved irregular cases, where different pin counts are allowed on different sides. the results drawn f...
详细信息
Recently, the appearance of very large (3 – 10M gate) FPGAs with embedded arithmetic units has opened the door to the possibility of floating point computation on these devices. While previous researchers have descri...
详细信息
Summary form only given. We have searched for a performance platform to run billions of simulations in the cellular automata model for optimizing applications. the question was how much speed-up could be gained by usi...
详细信息
Summary form only given. We have searched for a performance platform to run billions of simulations in the cellular automata model for optimizing applications. the question was how much speed-up could be gained by using the FPGA technology compared to optimized software. We have implemented to cellular automata rules in software on a PC and in hardware. On our low end experimental platform we reached a speed-up of 3 for a medium complex rule and 22 for a complex rule. If we would use the latest high end FPGA technology, speed-ups up to many thousand are realistic. A cluster of thousands of workstations would be necessary to reach the same performance, which is much more costly than the FPGA solution.
Function evaluation is at the core of many compute-intensive applications which perform well on reconfigurable ***, in order to implement function evaluation efficiently, the FPGA programmer has to choose between a mu...
详细信息
Hybrid reconfigurable systems integrate DSPs and general purpose processors with an FPGA fabric. these systems may support features such as efficient start-up and shut-down, dynamic voltage scaling, and reconfiguratio...
详细信息
暂无评论