In complex-valued neural network (CVNN) applications, complex number calculations require high performance rather than high precision. However, most previous studies focused on high-precision approaches, which have lo...
详细信息
In complex-valued neural network (CVNN) applications, complex number calculations require high performance rather than high precision. However, most previous studies focused on high-precision approaches, which have low speed and high hardware costs. This paper proposes a universal methodology of complex number computation for low-complexity and high-speed implementation. The proposed methodology is based on the piecewise linear (PWL) method and can be used for different types of complex number computations. Considering that multiplication operations consume considerable resources, multiplication, fused square-add (FSA) and fused multiply-add (FMA) operations are the focus of optimization. The partial products of the square operation are reduced by folding and merging techniques because of their symmetry in the FSA operation. The partial products of the multiplication and FMA operations are reduced via Booth encoding. In addition, the partial products are further reduced by the proposed step-by-step truncation method. The proposed segmenter, which simulates the hardware implementation, automatically divides the nonlinear functions in the complex number computations into the smallest number of segments according to the required precision. The results show that the proposed approach improves performance and reduces hardware costs compared with the state-of-the-art methods for complex number calculations involving square roots, reciprocals and logarithms.
The Lempel-Ziv (LZ) 4 compression algorithm, widely used in data transmission and storage, faces the challenge of high-speed implementation and increased complexity in the era of big data. Therefore, this paper propos...
详细信息
The Lempel-Ziv (LZ) 4 compression algorithm, widely used in data transmission and storage, faces the challenge of high-speed implementation and increased complexity in the era of big data. Therefore, this paper proposes a single-core parallel architecture for LZ4 algorithm with high throughput and low complexity. Firstly, to enhance throughput, two innovative approaches are introduced from the perspective of parallelism and frequency with an acceptable compression ratio loss: each parallelization window is restricted to performing a single match, bridging the gap between actual and theoretical parallelism;the feedback loop in the circuit is broken by utilizing the spatial correlation between adjacent matches for higher frequency. Secondly, two optimization schemes are employed on resource-consuming modules to achieve low complexity. Multi-port hash tables using Live Value Table (LVT) are improved based on inherent data characteristics, significantly reducing the hardware resource consumption while ensuring excellent scalability on hash table depth and frequency. The match comparison operation is moved ahead, further reducing the logic resources by 64.36%. Finally, our design is implemented on FPGA and ASIC platforms. Experimental results on FPGA demonstrate that the proposed architecture achieves a throughput of 17.39 Gb/s, exhibiting a 2.86x improvement over the state-of-the-art, along with a 6.46x enhancement in area efficiency. Further optimizations including Canonic Signed Digit (CSD) coding and computational reuse on the ASIC platform result in a 45x improvement in area efficiency.
Intelligent reflecting surfaces (IRSs) are a promising low-cost solution for achieving high spectral and energy efficiency in future communication systems by enabling the customization of wireless propagation environm...
详细信息
Intelligent reflecting surfaces (IRSs) are a promising low-cost solution for achieving high spectral and energy efficiency in future communication systems by enabling the customization of wireless propagation environments. Despite the plethora of research on resource allocation design for IRS-assisted multiuser wireless communication systems, the optimal design and the corresponding performance upper bound are still not fully understood. To bridge this gap in knowledge, in this paper, we investigate the optimal resource allocation design for IRS-assisted multiuser multiple-input single-output (MISO) systems employing practical discrete IRS phase shifters. In particular, we jointly optimize the beamforming vector at the base station (BS) and the discrete IRS phase shifts to minimize the total transmit power for the cases of perfect and imperfect channel state information (CSI) knowledge. To this end, two novel algorithms based on the generalized Benders decomposition (GBD) method are developed to obtain the globally optimal solution for perfect and imperfect CSI, respectively. Moreover, to facilitate practical implementation, we propose two corresponding low-complexity suboptimal algorithms with guaranteed convergence by capitalizing on successive convex approximation (SCA). In particular, for imperfect CSI, we adopt a bounded error model to characterize the CSI uncertainty and propose a new transformation to convexify the robust quality-of-service (QoS) constraints. Our numerical results confirm the optimality of the proposed GBD-based algorithms for the considered system for both perfect and imperfect CSI. Furthermore, we unveil that both proposed SCA-based algorithms can attain a locally optimal solution within a few iterations. Moreover, compared with the state-of-the-art solution based on alternating optimization (AO), the proposed low-complexity SCA-based schemes achieve a significant performance gain, especially for moderate-to-large numbers of IRS elements.
This paper presents the development and implementation of a human-like robotic hand integrated with advanced triboelectric nanogenerator (TENG) based tactile sensors for shape and material recognition. Meanwhile, trad...
详细信息
This paper presents the development and implementation of a human-like robotic hand integrated with advanced triboelectric nanogenerator (TENG) based tactile sensors for shape and material recognition. Meanwhile, traditional piezo sensors' effectiveness is limited, sensitive to the temperature, and the manufacturing cost is high. TENG sensors offer a self-powered alternative with simplified circuitry, cost-effective fabrication, and enhanced durability. To capitalize on these benefits, we propose a novel machine learning approach that represents time-series data as two-dimensional images processed using a two-dimensional convolutional neural network (2D CNN). This method is compared against the traditional one-dimensional convolutional neural network (1D CNN) method. The research methodology encompasses TENG sensor preparation, noise cancellation, robotic hand design, and control electronics. Experimental results demonstrate that the proposed 2D CNN method significantly improves shape and material recognition accuracy, achieving 98% and 99%, respectively, compared to 94% and 98% with the 1D CNN method. Real-time evaluation further validates the robustness and adaptability of the proposed model in unstructured environments. These findings underscore the potential of integrating TENG sensors with advanced neural network architectures for autonomous dexterous manipulation in various industrial applications, paving the way for future advancements in robotic tactile sensing.
Hybrid beamforming (HBF) is emerging as a key technology for future wireless networks, particularly in millimeter wave (mmWave) bands. In this paper, we present a non-uniform sub-connected hybrid beamforming (NSC-HBF)...
详细信息
Hybrid beamforming (HBF) is emerging as a key technology for future wireless networks, particularly in millimeter wave (mmWave) bands. In this paper, we present a non-uniform sub-connected hybrid beamforming (NSC-HBF) system that enables the implementation of the difference co-array for the underdetermined angle-of-arrival (AOA) estimation. We propose a broad beam synthesis technique to mitigate performance degradation caused by uneven analog combining gains. To further enhance estimation efficiency, we modified the co-array least mean squares (co-array LMS) method by directly solving the least-squares problem to determine the filter weights. The proposed method reduces computational complexity and eliminates iterative procedures for faster processing. Our numerical results demonstrate the pseudo-spectrum and detection performance for both pencil and broad beams, highlighting the importance of analog combining codebook design. Additionally, the proposed method achieves higher estimation accuracy compared to the co-array root-LMS.
In this paper, we propose the decentralized likelihood ascent search (DLAS)-aided detection for the distributed large-scale multiple-input multiple-output (MIMO) systems to achieve more remarkable performance gains. W...
详细信息
This paper presents the development and evaluation of a calibration methodology for pressure-resistive sensors, implemented within a specialised diagnostic laboratory. The approach utilises algorithmic correction, str...
详细信息
This paper presents the development and evaluation of a calibration methodology for pressure-resistive sensors, implemented within a specialised diagnostic laboratory. The approach utilises algorithmic correction, structured measurement routines, and real-time signalprocessing to perform sensor calibration exclusively at the software level, without requiring hardware intervention. A custom sensor profiling framework has been designed to enable precise analysis of sensor characteristics, degradation, and long-term behaviour. The calibration algorithm employs mathematical interpolation and dataset synchronisation techniques to enhance accuracy and reduce measurement uncertainty. Experimental validation demonstrated a reduction in average deviation from 18.1% to 1.1%, with an overall tolerance range of just 2.6%. These results confirm the effectiveness of the proposed methodology in improving sensor precision and stability. The system is particularly suited for industrial environments where reliable pressure sensing is essential for safety, efficiency, and predictive maintenance.
This paper introduces a synthesizable μ-architectural design method to boost the performance of a given RISC-V processor architecture by utilizing Canonical Signed Digit (CSD) representation during the execution stag...
详细信息
The Cochlea, a spiral-shaped structure in the inner ear, plays a crucial role in the process of hearing by converting sound waves into electrical signals that the brain can interpret. This study introduces a cost-effe...
详细信息
The Cochlea, a spiral-shaped structure in the inner ear, plays a crucial role in the process of hearing by converting sound waves into electrical signals that the brain can interpret. This study introduces a cost-effective adaptation of 2D artificial Cochlea mathematical modeling using a planar approximation technique. The main novelty and contribution of our work is a method employs surface-based functions and is known as the Surface-Based Approximation Model of Cochlea (SBAMoC). By simplifying complex multiplication processes in nonlinear components, the SBAMoC reduces costs and enhances efficiency, making it suitable for FPGA implementation with minimal hardware requirements. The proposed model is evaluated in scenarios involving two-coupled oscillations and grid-based cochlear networks to better understand its performance. Through hardware synthesis on a Virtex-II board, the SBAMoC demonstrates improved efficiency and reduced computational expenses compared to the original model, achieving faster speeds and greater cost-effectiveness. In practical tests, the SBAMoC exhibits higher operational speeds and increased scalability, outperforming the original model by replicating accurate cochlear behaviors with minimal deviations. Specifically, the single SBAMoC implementation in our model achieves a speed boost of approximately 1.333 times compared to the original model (381.292 MHz vs. 286.029 MHz) and supports a greater number of fitted SBAMoCs (75 vs. 35), showcasing its superior efficiency and performance enhancements. In case of real-world applications, it can be considered for the development of more efficient and cost-effective cochlear implants, leading to improved hearing restoration solutions for individuals with hearing impairments. Also, the findings from this study could also be leveraged to enhance the design and implementation of signalprocessingsystems in various audio and communication devices, paving the way for advanced audio processing technolog
Complete simulation environments for automotive scenarios require simulated sensor data. The widespread use of vision sensors led to the availability of comprehensive simulations of their measurements. The same level ...
详细信息
暂无评论