This work presents an optimized VLSI hardware design of the exponential function based on Taylor series expansion. The proposed architecture approximates the exponential with the logic design of a 4th-order Taylor series and explores efficient CMOS arithmetic operation strategies. It implements a shift-based divider and uses an efficient 4-2 adder compressor in the adder tree. For a 32-bit output and an input range of −7 to 11, the design shows an output error of around 2% in MRED with a reduced energy consumption of 3.63 pJ/operation. For a 64-bit output, the energy per operation of the VLSI exponential unit is 14.97 pJ/operation, and it can process a wider input range (i.e., −14 to 22) with a small mean output error of around 1.7% in MRED.
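The idea of a 4th-order Taylor polynomial evaluated with shift-based division can be sketched in software as follows. This is only an illustration under stated assumptions, not the architecture from the abstract: the 16-bit fractional fixed-point format, the function names, and the shift-and-add approximation of 1/6 and 1/24 are all hypothetical choices. The sketch also omits any range handling, so it is only accurate for inputs near zero and does not reproduce the reported input ranges or MRED figures.

```python
FRAC_BITS = 16            # hypothetical fixed-point format: 16 fractional bits
ONE = 1 << FRAC_BITS      # fixed-point representation of 1.0


def to_fx(x):
    """Convert a float to the assumed fixed-point format."""
    return int(round(x * ONE))


def from_fx(v):
    """Convert a fixed-point value back to a float."""
    return v / ONE


def fx_mul(a, b):
    """Fixed-point multiply; the shift rescales the product."""
    return (a * b) >> FRAC_BITS


def div6(v):
    """Approximate v / 6 with shifts and adds only.

    1/6 = 1/8 + 1/32 + 1/128 + 1/512 + ... (geometric series, ratio 1/4).
    Five terms are kept here, which is an assumption of this sketch.
    """
    return (v >> 3) + (v >> 5) + (v >> 7) + (v >> 9) + (v >> 11)


def div24(v):
    """Approximate v / 24 as (v / 6) / 4, i.e. one extra right shift by 2."""
    return div6(v) >> 2


def exp_taylor4(x_fx):
    """4th-order Taylor polynomial: 1 + x + x^2/2 + x^3/6 + x^4/24."""
    x2 = fx_mul(x_fx, x_fx)
    x3 = fx_mul(x2, x_fx)
    x4 = fx_mul(x3, x_fx)
    return ONE + x_fx + (x2 >> 1) + div6(x3) + div24(x4)


if __name__ == "__main__":
    import math
    for x in (-0.5, 0.0, 0.5):
        approx = from_fx(exp_taylor4(to_fx(x)))
        print(x, approx, math.exp(x))
```

Dividing by 2 is a single right shift, while 6 and 24 are not powers of two, so the sketch replaces those divisions with a short sum of shifted copies. A hardware design would trade the number of shifted terms against accuracy and adder-tree size.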
The ingrained principles of fairness in a dialogue system’s decision-making process and generated responses are crucial for user engagement, satisfaction, and task achievement. Absence of equitable and inclusive prin...
Teaching concepts in Thailand's universities have changed abruptly due to the COVID-19 pandemic, including a shift from classroom to online formats, as well as administrative difficulties. The re...
In-band network telemetry is a powerful framework for network monitoring. It allows the collection of telemetry data in real time and provides network-wide visibility. However, depending on the routing of network flows and which telemetry data are collected, the network-wide visibility and the performance of monitoring applications may decrease. In this paper, we present the in-band network telemetry problem and extend the existing mathematical optimization models of the problem by proposing a new model that computes the routing of network flows. Results show that the new model outperforms existing models in terms of network coverage and monitoring application performance. The results of this work can be useful for network managers and enterprises to gain real-time insights into network performance.
In the intricate domain of software systems verification, dynamically model checking multifaceted system characteristics remains paramount, yet challenging. This research proposes the advanced observe-based statistica...
This study investigates the application of diffusion models in medical image classification (DiffMIC), focusing on skin and oral lesions. Utilizing the datasets PAD-UFES-20 for skin cancer and P-NDB-UFES for oral canc...
The Middle 40 Per Cent (M40) demographic, the median 40% of income earners, makes up a substantial proportion of Malaysia's consumer base. Th...
Speech content is closely related to the stability of speaker embeddings in speaker verification tasks. In this paper, we propose a novel architecture based on self-constraint learning (SCL) and reconstruction task (R...
Federated Learning (FL) is a privacy-preserving distributed machine learning scheme, where each participant's data remains on the participant's devices and only the local model generated utilizing the local co...
Addition units are present across many computational kernels inherent in various error-tolerant applications, including machine learning, signal, image, and video processing. Notably, adder compressors are the target when high speed and low power are the main design concerns. This work introduces energy-efficient structures of 3-2 approximate adder compressors (Ax3-2) and three distinct versions of 4-2 approximate adder compressors (Ax4-2). We compared our proposed Ax3-2 and Ax4-2 compressors with state-of-the-art energy-efficient approximate adder compressors (AxACs). Both the Ax3-2 and Ax4-2 models were tested rigorously as standalone units. Furthermore, we integrated the Ax4-2 models into an essential application kernel appearing in video coding, i.e., the Sum Squared Differences (SSD) video accelerator. The proposed Ax3-2 and Ax4-2 compressors yield a new Pareto front for both the energy-quality and area-quality trade-offs, a marked improvement over state-of-the-art energy-efficient AxACs.
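For readers unfamiliar with adder compressors, the following sketch models the textbook exact 4-2 compressor, which serves as the baseline that approximate variants simplify. It is not one of the Ax4-2 designs from the abstract, whose logic is not given here. The construction from two cascaded full adders and all function names are assumptions of this illustration.

```python
from itertools import product


def full_adder(a, b, c):
    """One-bit full adder: returns (sum, carry)."""
    s = a ^ b ^ c
    carry = (a & b) | (a & c) | (b & c)
    return s, carry


def compressor_4_2(x1, x2, x3, x4, cin):
    """Exact 4-2 compressor built from two cascaded full adders.

    Returns (sum, carry, cout); carry and cout both have weight 2.
    """
    s1, cout = full_adder(x1, x2, x3)
    total, carry = full_adder(s1, x4, cin)
    return total, carry, cout


def is_exact():
    """Check x1+x2+x3+x4+cin == sum + 2*(carry + cout) for all 32 inputs."""
    for bits in product((0, 1), repeat=5):
        s, carry, cout = compressor_4_2(*bits)
        if sum(bits) != s + 2 * (carry + cout):
            return False
    return True


if __name__ == "__main__":
    print(is_exact())
```

An approximate compressor would deliberately break this equality for some input combinations in exchange for fewer gates, which is the energy-quality trade-off the abstract refers to.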