The smartphone-based human activity recognition (HAR) systems are not capable to deliver high-end performance for challenging applications. We propose a dedicated hardware-based HAR system for smart military wearables...
详细信息
The smartphone-based human activity recognition (HAR) systems are not capable to deliver high-end performance for challenging applications. We propose a dedicated hardware-based HAR system for smart military wearables, which uses a multilayer perceptron (MLP) algorithm to perform activity classification. To achieve the flexible and efficient hardware design, the inherent MLP architecture with parallel computation is implemented on FPGA. The system performance has been evaluated using the UCI human activity dataset with 7767 feature samples of 20 subjects. The three combinations of a dataset are trained, validated, and tested on ten different MLP models with distinct topologies. The MLP design with the 7-6-5 topology is finalized from the classification accuracy and cross entropy performance. The five versions of the final MLP design (7-6-5) with different data precision are implemented on FPGA. The analysis shows that the MLP designed with 16-bit fixed-point data precision is the most efficient MLP implementation in the context of classification accuracy, resource utilization, and power consumption. The proposed MLP design requires only 270 ns for classification and consumes 120 mW of power. The recognition accuracy and hardware results performance achieved are better than many of the recently reported works.
The Cognitive Radio (CR) is one of the significant technology to enhance the spectrum utilization efficiency. The Radio Frequency (RF) spectrum is the most valuable natural resource. Due to an increased usage of the c...
详细信息
The Cognitive Radio (CR) is one of the significant technology to enhance the spectrum utilization efficiency. The Radio Frequency (RF) spectrum is the most valuable natural resource. Due to an increased usage of the cognitive radio, the sensing issues in spectrum are increasing day by day. The CR is emerging technologies that increase the effectiveness and efficiency of the spread spectrum communication. Generally, the spectrum sensing is based on the energy detection and cyclostationary feature detection. The energy detection is a fundamental basic spectrum sensing technique. However, it performs poorly under a low Signal-to-Noise Ratio (SNR) environment. The cyclostationary based sensing technique improves Primary Users (PUs) detection performance with the high complexity of implementation and hardware utilization. Traditional median and Finite Impulse Response (FIR) filters are utilized complex adder and multiplier, which requires large memory space to store the filter coefficients and more hardware utilization. Hence, these filters have computational complexity and more hardware utilization. To overcome above-mentioned problems, Adaptive Absolute SCORE (AA-SCORE) architecture is designed based on optimal FIR filter for further reducing the system complexity to enhance the spectrum utilization efficiency in CR. In this research, the FIR filter is designed by using Radix-8 and Carry Select Adder (CSLA) for reducing the filter complexity. The proposed method is named as AAS-R8-CSLA architecture. The AAS-R8-CSLA architecture was implemented in the field programmable gate array (FPGA) platform through Verilog code. In order to improve the spectrum utilization efficiency of CR by using AAS-R8-CSLA. The experimental outcome showed that the proposed AAS-R8-CSLA architecture has improved FPGA performance up to 2-3% compared to existing methods like MS and ACS architecture. (C) 2019 Elsevier B.V. All rights reserved.
Image encryption has attained a great attention due to the necessity to safeguard confidential images. Digital documents, site images, battlefield photographs, etc. need a secure approach for sharing in an open channe...
详细信息
Image encryption has attained a great attention due to the necessity to safeguard confidential images. Digital documents, site images, battlefield photographs, etc. need a secure approach for sharing in an open channel. Hardware - software co-design is a better option for exploiting unique features to cipher the confidential images. Cellular automata (CA) and synthetic image influenced transform domain approach for image encryption is proposed in this paper. The digital image is initially divided into four subsections by applying integer wavelet transform. Confusion is accomplished on low - low section of the transformed image using CA rules 90 and 150. The first level of diffusion with consecutive XORing operation of image pixels is initiated by CA rule 42. A synthetic random key image is developed by extracting true random bits generated by Cyclone V field programmable gate array 5CSEMA5F31C6. This random image plays an important role in second level of diffusion. The proposed confusion and two level diffusion assisted image encryption approach has been validated through the entropy, correlation, histogram, number of pixels change rate, unified average change intensity, contrast and encryption quality analyses.
Intensity modulation/direct detection (IM/DD) orthogonal frequency division multiplexing (OFDM) is very suitable for high-speed cost-sensitive passive optical network (PON) applications. However, real-time OFDM system...
详细信息
Intensity modulation/direct detection (IM/DD) orthogonal frequency division multiplexing (OFDM) is very suitable for high-speed cost-sensitive passive optical network (PON) applications. However, real-time OFDM systems are sensitive to synchronization bias, and thus symbol timing synchronization (STS) becomes a critical issue. We proposed a proportional-sign cross-correlation (PS-CC) STS algorithm with the advantages of distance-independence and low resource utilization. In the proposed PS-CC STS algorithm, the sign operation is applied instead of the multiplication operation with high computational complexity to decrease the resource utilization and the characteristic of low resource utilization will drive its application in real-time systems. Meanwhile, we use field programmable gate array (FPGA) to design a baseband IM/DD OFDM receiver with pipeline structure and built a universal loop test system, which is applied to implement the proposed PS-CC STS algorithm and evaluate its performance. In the experiment of a 16-QAM OFDM-PON system, a 30-km fiber transmission only results in 1.98% ripple on the peak amplitude of correlation, which verifies the distance-independence of the proposed method.
field programmable gate arrays (FPGAs), due to their programmability, have become a popular design choice for control and processing blocks of modern-day digital design. However, this flexibility makes them larger, sl...
详细信息
field programmable gate arrays (FPGAs), due to their programmability, have become a popular design choice for control and processing blocks of modern-day digital design. However, this flexibility makes them larger, slower, and less power-efficient when compared to Application Specific Integrated Circuits (ASICs). On the other hand, ASICs have their own drawbacks, such as lack of programmability and inflexibility. One potential solution is specialized fine-grained reconfigurable architectures that have improved flexibility over ASICs and better resource utilization than FPGAs. However, designing a fine-grained reconfigurable architecture is a daunting task in itself due to lack of high-level design-flow support. This article proposes an automated design-flow for the system-level simulation, optimization, and resource estimation of generic as well as custom fine-grained reconfigurable architectures. The proposed framework is generic in nature as it can be used for both control-oriented and compute-intensive applications and then generates a homogeneous or heterogeneous reconfigurable architecture for them. Four sets of homogeneous and heterogeneous benchmarks are used in this work to show the efficacy of our proposed design-flow, and simulation results reveal that our framework can generate both generic and custom fine-grained reconfigurable architectures. Moreover, the area and power estimations show that auto-generated domain-specific reconfigurable architectures are 76% and 73% more area and power-efficient, respectively, than generic FPGA-based implementations. These results are consistent with the savings reported for manual designs in the literature.
Radiation effects encountered in space or aviation environments can affect the configuration bits in field programmable gate arrays (FPGA) causing errors in FPGA output. One method of increasing FPGA reliability in ra...
详细信息
Radiation effects encountered in space or aviation environments can affect the configuration bits in field programmable gate arrays (FPGA) causing errors in FPGA output. One method of increasing FPGA reliability in radiation environments includes adding redundant logic to mask errors and allow time for repair. Despite the redundancy added with triple modular redundancy (TMR) and configuration scrubbing there exist some configuration bits that individually affect multiple TMR domains causing errors in FPGA output. A new tool called DeBit is introduced that identifies hardware resources associated with a single bit failure. This tool identifies a novel failure mode involving global routing resources and the failure mode is verified through a series of directed tests on global routing resources. Lastly, a mitigation strategy is proposed and tested on a single error in a triple modular redundancy (TMR) design.
In contact resonance force microscopy and related dynamic atomic force microscopy methods, an accurate description of the real-time cantilever dynamics is essential to the mapping of local material properties, such as...
详细信息
In contact resonance force microscopy and related dynamic atomic force microscopy methods, an accurate description of the real-time cantilever dynamics is essential to the mapping of local material properties, such as viscoelasticity, piezo response, and chemical composition. Stiffness and damping variations of the tip-sample contact result in variations in the cantilever's resonance frequency and quality factor as it scans a sample of interest. When measuring characteristics of the resonance, generally, there is a tradeoff between full spectral coverage, best obtained by sweeping the amplitude versus frequency response in the time or frequency domain, and high-speed information, obtained by observing the cantilever response at one or two discrete frequencies, that may be required to track a resonance frequency that changes spatially. Here, we introduce a new option for performing contact resonance force microscopy with a low-cost multifrequency lock-in amplifier system with up to eight simultaneous independent excitation and detection frequencies. We demonstrate how the multifrequency approach can measure contact resonance frequency, quality factor, amplitude, and phase during imaging, with high precision and error estimation, without the need for frequency-tracking feedback. We show, using a wood composite sample, that this multifrequency approach can determine resonance frequency and quality factor, and associated uncertainty. This ability to estimate uncertainty of resonance parameters is not possible with 1 and 2 frequency methods. We further utilize the multifrequency lock-in to develop a novel means of increasing the stiffness range for highly sensitive nanomechanical sensing by dividing the eight lock-in frequencies to monitor two or four simultaneous eigenmodes, each of which is optimized for sensitivity in a particular stiffness regime. Overall, we show how multifrequency lock-in amplifiers with observation frequency chosen to coincide with an expected eig
Space-Time Adaptive Processing (STAP) can harness the efficacy of interference and clutter significantly. Calculations of the STAP weights involve solving linear equations which require very intensive computations. In...
详细信息
Space-Time Adaptive Processing (STAP) can harness the efficacy of interference and clutter significantly. Calculations of the STAP weights involve solving linear equations which require very intensive computations. In this paper, the QR decomposition (QRD) using the modified gram-schmidt (MGS) algorithm is parameterized with vector size to create a trade-off between the hardware resources utilization and computation time. To achieve an efficient floating point structure, the proposed architecture of QRD-MGS algorithm is simulated and implemented in two modes: single-vector and multi-vector. Results show that the multi-vector method can lead to a high-performance design with higher operating frequency, lower power consumption, and less resource utilization than the single-vector method. For example, Modelism simulations show that the decomposition of a 12x51 matrix with vector size of 17 takes 7.86 mu s with the maximum clock frequency of 282MHz, for implementation on the Arria10 FPGA. In real STAP applications, the matrix sizes are too large to be fit on FPGAs and the update rate of the weights are high. Therefore, this method can fit any matrix in the contemporary FPGAs with an acceptable update rate.
In AES, the total time taken by the architecture while implementing in low power and high-speed circuit is the most important thing that to be considered. Also in AES, implementing S-Boxes consumes the major part of t...
详细信息
In AES, the total time taken by the architecture while implementing in low power and high-speed circuit is the most important thing that to be considered. Also in AES, implementing S-Boxes consumes the major part of the total time consumed by the entire architecture. In this research paper, we propose a very low-power and high efficient S-Box circuit architecture: a multi-stage modified version of PPRM architecture over composite fields. In this modified S box design, only AND and Hazard transparent XOR gates are used. Because of this architecture dynamic hazards which form the main aspect of power consumption in S box gets eliminated. A low propagation delay of 4.58ns and occupies 120 slices in the xilinx FPGA device xc6vlx75t-3ff784, while the low propagation delay and slice area is 5.552ns and 120 respectively for the conventional PPRM architecture. This new proposed architecture is used to protect the mammographic images from being unauthorized access.
Changing trends in the communication industry pertaining to configuration of devices and their processing for maximised result. Each device needs a processing unit comprising a microcontroller or a fieldprogrammable ...
详细信息
Changing trends in the communication industry pertaining to configuration of devices and their processing for maximised result. Each device needs a processing unit comprising a microcontroller or a field programmable gate array (FPGA). This paper deals with the use of FPGAs and how they can be configured as Hardware in Loop (HIL) for validation along with Simulink and Xilinx System Generator (XSG). Further, their compatibility is mentioned for long term use and durability in communication. The comparison of related work in the field of communication is done with the FPGA implementation of Long Term Evolution (LTE) physical layer with different modulation schemes, different antenna configurations and different signal to noise ratio systems implemented on Virtex and Spartan FPGA boards. On the other hand the simulation is carried out with Xilinx Vivado Design suite to analyse the power, resource utilisation, timing summary and memory utilisation.
暂无评论