In this paper, the synchronization control of a non-autonomous Lotka-Volterra system with time delay and stochastic effects is studied. The purpose is to firstly establish sufficient conditions for the existence of gl...
详细信息
In this paper, the synchronization control of a non-autonomous Lotka-Volterra system with time delay and stochastic effects is studied. The purpose is to firstly establish sufficient conditions for the existence of global positive solution by constructing a suitable Lyapunov function. Some synchronization criteria are then derived by designing an appropriate full controller and a pinning controller, respectively. Finally, an example is presented to illustrate the feasibility and validity of the main theoretical results based on the field-programmable gate array hardware simulation tool.
Image processing algorithms, implemented in hardware, have recently emerged as the most viable solution for improving the performance of image processing systems. In this paper, a version of an anisotropic diffusion t...
详细信息
Image processing algorithms, implemented in hardware, have recently emerged as the most viable solution for improving the performance of image processing systems. In this paper, a version of an anisotropic diffusion technique is used to reduce noise from retinal images, namely Speckle Reducing Anisotropic Diffusion ( SRAD). The SRAD filter can improve images corrupted by multiplicative or additive noise, but it has been the most computationally complex and it has not been suitable for software implementation in real-time processing. In this paper, an efficient field-programmable gate array ( FPGA)-based implementation of the SRAD filter is presented to accelerate the processing time. A comparison of the most used classical suppression filters like Gaussian, Median, Perona and Malik anisotropic diffusion has been carried out. The experimental results reveal a 38x performance improvement over the original MATLAB implementation and a 1.33x performance improvement over the hardware implementation using the Xilinx System Generator tool.
A technique for debugging the software of a data acquisition and preliminary processing device with a network interface for a 2D position-sensitive thermal-neutron detector with delay-line readout is described. The or...
详细信息
A technique for debugging the software of a data acquisition and preliminary processing device with a network interface for a 2D position-sensitive thermal-neutron detector with delay-line readout is described. The original software transfers data through two ring buffers. Changes are proposed in the software code to check the possibility of ring-buffer overflow. It is shown that there is no data loss in the ring buffers at input-pulse frequencies up to 1 MHz. The corresponding dead time of recording is on the order of 1 mu s.
We demonstrate real-time CD equalization (CDE) for coherent optical transmission systems using a low complexity time-domain (TD) multiplierless finite-impulse response (FIR)-based equalizer, based on a field-programma...
详细信息
ISBN:
(纸本)9781538608593
We demonstrate real-time CD equalization (CDE) for coherent optical transmission systems using a low complexity time-domain (TD) multiplierless finite-impulse response (FIR)-based equalizer, based on a field-programmable gate array (FPGA) implementation. The real-time operation is performed for a single-channel 2.5 Gb/s QPSK optical signal with a performance penalty of only similar to 0.15 dB with respect to the maximum performance. The hardware complexity is also evaluated in terms of occupation in a Virtex-6 FPGA-XC6VLX240T, revealing the high efficiency of the proposed CDE algorithm.
Although the expectation maximization (EM)based 3D computed tomography (CT) reconstruction algorithm lowers radiation exposure, its long execution time hinders practical usage. To accelerate this process, we introduce...
详细信息
ISBN:
(纸本)9781479951116
Although the expectation maximization (EM)based 3D computed tomography (CT) reconstruction algorithm lowers radiation exposure, its long execution time hinders practical usage. To accelerate this process, we introduce a novel external memory bandwidth reduction strategy by reusing both the sinogram and the voxel intensity. Also, a customized computing engine based on field-programmable gate array (FPGA) is presented to increase the effective memory bandwidth. Experiments on actual patient data show that 85X speedup can be achieved over single-threaded CPU.
The paper proposes a method for implementing Space Vector Modulation (SVM) on the CompactRio embedded platform. The concept of space vectors is presented and the mathematical model of the SVM is derived. A description...
详细信息
ISBN:
(纸本)9781538671894
The paper proposes a method for implementing Space Vector Modulation (SVM) on the CompactRio embedded platform. The concept of space vectors is presented and the mathematical model of the SVM is derived. A description of the CompactRio RTOS/FPGA Embedded platform is presented. A new method for analyzing and generating SVM gate control pulses is proposed. The proposed method to generate SVM pulses based on the symmetric SVM switching pattern using the clock of the FPGA processor is implemented and performance studied.
This paper presents the architecture of a hardware accelerator for a cellular neural network (CeNN) with an application to real-time edge detection on visible-range and infrared video. The accelerator features fully-p...
详细信息
ISBN:
(纸本)9781728128610
This paper presents the architecture of a hardware accelerator for a cellular neural network (CeNN) with an application to real-time edge detection on visible-range and infrared video. The accelerator features fully-pipelined processing elements (PEs) that exploit the data parallelism in the algorithm to perform an iteration of the CeNN on a stream of video data with high throughput. The memory architecture exploits the locality of reference in the CeNN, so that each PE uses only 5 line buffers to store pixel, state, and output data, thus achieving low on-chip memory utilization. Implemented on a Xilinx XC7A200T FPGA running at 245MHz, the accelerator performs edge detection on 1080p video using a single CeNN iteration with a throughput of 118 frames per second (fps), a total latency of 15.7 mu s, and 618mW of power consumption. The architecture features static reconfiguration to store built-in kernels and to add more PEs to support multiple iterations of the CeNN algorithm. More kernels can be added dynamically through a serial interface.
THE OIL AND GAS INDUSTRY IS A MAJOR USER OF HIGH-PERFORMANCE COMPUTING, AND GEOSCIENCE COMPUTATIONAL CYCLES ARE DOMINATED BY KERNELS THAT ARE RELATIVELY FEW AND WELL DEFINED. THIS PROJECT EXPLORES ACCELERATING GEOSCIE...
详细信息
THE OIL AND GAS INDUSTRY IS A MAJOR USER OF HIGH-PERFORMANCE COMPUTING, AND GEOSCIENCE COMPUTATIONAL CYCLES ARE DOMINATED BY KERNELS THAT ARE RELATIVELY FEW AND WELL DEFINED. THIS PROJECT EXPLORES ACCELERATING GEOSCIENCE APPLICATIONS USING FPGA-BASED HARDWARE, OPTIMIZING THE ALGORITHM AND THE HARDWARE TO ACHIEVE MAXIMUM PERFORMANCE. THIS APPROACH CAN DELIVER SPEEDUP OF 20 TO 70 TIMES COMPARED WITH A CONVENTIONAL HPC NODE.
A novel hardware architecture for the reconstruction of digital holograms with autofocusing is presented in this paper. The architecture is based on a novel autofocusing algorithm operating on a smaller local block lo...
详细信息
A novel hardware architecture for the reconstruction of digital holograms with autofocusing is presented in this paper. The architecture is based on a novel autofocusing algorithm operating on a smaller local block located at the center of the source digital holograms. By exploiting global information contained in the local block, accurate focus distance can be computed with less computational complexities. Interval search is also adopted to further accelerate the process. The circuits for the fast autofocusing algorithm and subsequent reconstruction operations are effectively integrated in the proposed architecture. Two fast Fourier transform cores are shared by the operations for parallel computations with low area costs. The architecture is implemented by fieldprogrammablegatearray, and is used as a hardware accelerator in a network on chip system for performance evaluation. Experimental results demonstrate that the proposed circuit exhibits the advantages of high speed computation, low power dissipation, accurate focus distance search, and hologram reconstruction for three-dimensional rendering applications.
The paper presents an enhanced architecture of finite impulse response digital filters. The proposed architecture contains one multiply-accumulate unit and random access men-wiry to store data. The architecture utiliz...
详细信息
The paper presents an enhanced architecture of finite impulse response digital filters. The proposed architecture contains one multiply-accumulate unit and random access men-wiry to store data. The architecture utilizes serial calculation to achieve minimum requirements on area. Time architecture is suitable for implementation in application specific integrated circuits and fieldprogrammablegatearrays. The main advantages of the architecture are higher operating frequency, lower power consumption and smaller area utilization in particular cases. (C) 2015, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.
暂无评论