Formal techniques offer an opportunity to significantly reduce the cost of microprocessor verification. We propose a model checking based approach to automatically generate functional test programs for pipelined proce...
详细信息
Formal techniques offer an opportunity to significantly reduce the cost of microprocessor verification. We propose a model checking based approach to automatically generate functional test programs for pipelined processors. We specify the processor architecture in an Architecture Description Language (ADL). The processor model is extracted from the ADL specification. Specific properties are applied to the processor model using SMV model checker to generate test programs. We applied this methodology on a single-issue DLX processor to demonstrate the usefulness of our approach.
Quantum computing has gained considerable attention, especially after the arrival of the Noisy Intermediate-Scale Quantum (NISQ) era. Quantum processors and cloud services have been made worldwide increasingly availab...
详细信息
This paper brings together reliability and testability and introduces certain rules for generating high level test macros for processors. These rules help to generate higher quality test macros. On the other hand, the...
详细信息
ISBN:
(纸本)9781479920976
This paper brings together reliability and testability and introduces certain rules for generating high level test macros for processors. These rules help to generate higher quality test macros. On the other hand, these rules can be a reference guide for a programmer to write more reliable codes. The basic idea of these rules comes from the motto that a more testable code results in a lower reliability and vice versa. The empirical results show the effect of these rules in generating high quality high-level test macros and use of which results in a less reliable overall code. The programmer can use these guidelines for generating of less efficient testable code, and better reliable programs.
This paper presents a novel multiplier design leveraging In-Memory Computation (IMC) with a Content Addressable Memory (CAM) module and associated processors to achieve high energy efficiency and optimal area utilizat...
详细信息
ISBN:
(数字)9798331508456
ISBN:
(纸本)9798331508463
This paper presents a novel multiplier design leveraging In-Memory Computation (IMC) with a Content Addressable Memory (CAM) module and associated processors to achieve high energy efficiency and optimal area utilization. Un-like conventional multipliers that rely on separate memory and processing units, leading to latency and power inefficiencies, the proposed architecture integrates computation within the memory. By utilizing the parallel search and match capabilities of CAM, the design minimizes data movement, reducing power consumption and hardware footprint. Implemented in Verilog HDL and validated using Xilinx VIVADO, the proposed multiplier demonstrates a power reduction of 83.06 % compared to approximate multipliers and 90.09 % relative to accurate multipliers. Additionally, the design achieves a 54.05% reduction in LUT utilization and a 49.23% decrease in flip-flop usage. While a marginal increase in I/O ports and buffer gates utilization is observed due to the CAM module's delay compensation, the results highlight the potential of in-memory computation as a transformative approach for energy-efficient. compact hardware in modern computing systems.
Fine-tuning large pre-trained LLMs generally demands extensive GPU memory. Traditional first-order optimizers like SGD encounter substantial difficulties due to increased memory requirements from storing activations a...
详细信息
Multicore processors spend a varying amount of time and energy when executing a user application. The specific amount of time and energy consumed depends on application parameters as well as on the execution mode, whi...
详细信息
ISBN:
(数字)9798331524937
ISBN:
(纸本)9798331524944
Multicore processors spend a varying amount of time and energy when executing a user application. The specific amount of time and energy consumed depends on application parameters as well as on the execution mode, which includes the number of threads or the operational frequency used for the execution of an application. However, also the characteristics of the multicore processor can have a significant impact. In this article, three Intel multicore processors (Broadwell, Cascade Lake and Sapphire Rapid) of different generations are investigated with respect to their time and energy expenditure. The user application is a numerical solution of time-dependent partial differential equations. The experiments include an analysis of each processor’s unique power characteristics, with an emphasis on investigating and modeling the power behavior of these multicore processors. Processor specific power models distinguishing static and dynamic power are presented and their validity is shown with experimental data. The best suited power models differ strongly for the different processors reflecting the progress in internal processor design with respect to energy management.
The rising trend of artificial intelligence (AI) usage in many applications requires high-performance processors, demanding power to an unprecedented level. Recently, in the 48 V two-stage conversion system, the verti...
详细信息
ISBN:
(数字)9798331516116
ISBN:
(纸本)9798331516123
The rising trend of artificial intelligence (AI) usage in many applications requires high-performance processors, demanding power to an unprecedented level. Recently, in the 48 V two-stage conversion system, the vertical power delivery (VPD) solution is sought, where the second stage is placed directly underneath the processor to remove the "last inch" power loss found in the lateral power delivery (LPD) architecture. However, the VPD solution gives a strict size requirement for the voltage regulator module (VRM), where the bottleneck is usually the magnetic component. In addition, high-performance processors require a fast transient response. This article proposed a new air gap-less powder-core-based multiphase integrated lateral flux negative coupled inductor structure with a footprint of only 100 mm 2 and a height of 2.9 mm, enabling a high-density and fast-transient VRM solution. The negative coupling is achieved electrically through the coupled winding, which enables symmetrical N-phase coupling and straight-phase winding, resulting in an extremely small DCR. The proposed inductor is experimentally tested at up to 300 A (75 A/phase) to prove its high current handling capability, achieving a high 3 A/mm 2 current density.
Transformer-based large language models (LLMs) have demonstrated exceptional capabilities in sequence modeling and text generation, with improvements scaling proportionally with model size. However, the limitations of...
详细信息
With the advent of the generative AI era, high-bandwidth memory (HBM) has emerged as an irreplaceable solution that can provide ultra-high memory bandwidth (BW) of more than 1TB/s to AI processors. To enable such a hi...
详细信息
ISBN:
(数字)9798331541019
ISBN:
(纸本)9798331541026
With the advent of the generative AI era, high-bandwidth memory (HBM) has emerged as an irreplaceable solution that can provide ultra-high memory bandwidth (BW) of more than 1TB/s to AI processors. To enable such a high BW, HBM3E accommodates 16 channels (CHs) with two pseudo CHs (pCHs) each, and HBM4 increases them to 32 CHs to double the BW. Each pCH receives a dedicated differential write data strobe (WDQS) from the host. Then, the quadrature clocks (WDQS/2 IN S) at half frequency are passed to the WDQS buffer, which generates the output clocks, 5 OUT xS (X = I, Q, IB, QB), to sample the DQ data in parallel (top left of Fig. 8.5.1). To ensure error-free sampling for all DQs, low jitter is necessary in the $S_{\text{OUT}X}\mathrm{s}$ , but it is difficult to achieve due to the power-supply-induced jitter (PSIJ) issues of the WDQS buffer. A sudden toggle of WDQS/2 IN in response to the command signal $(ACT_{\mathrm{C}\text{MD}})$ from the host causes a surge in the instantaneous load current $(l_{\mathrm{L}})$ of the WDQS buffer (e.g., edge time (T EDGE ) < 100ps) [1] (top right of Fig. 8.5.1). This results in a significant voltage droop in the supply voltage $(V_{\mathrm{D}\mathrm{D}})$ and a substantial increase in PSIJ [2]. This problem is particularly severe when the $V_{\mathrm{D}\mathrm{D}}$ is unstable and noisy due to complex power grids and limited capacitor availability.
“Automated Crop Harvesting using Robotics and AI” research may truly change the future of modern farming. New Robot Systems Integrated with Smarter Artificial Intelligence So the objective for making self-operating ...
详细信息
ISBN:
(数字)9798331543358
ISBN:
(纸本)9798331543365
“Automated Crop Harvesting using Robotics and AI” research may truly change the future of modern farming. New Robot Systems Integrated with Smarter Artificial Intelligence So the objective for making self-operating harvesters that can move around fields and find ripe crops using advanced machine vision and executing selective harvesting accurately. Among them, the project on real-time data from Using environmental sensors and AI-driven predictive analytics, the system hopes to improve harvesting. operations, increase crop yield, reduce waste. Innovation addresses pivotal issues such as labour Shortages and high operational costs while promoting sustainable farming practices. Ultimately, this project It works toward making farming better, ensuring that food is produced in a more reliable and efficient way. Also, this project considers how these technologies can work with farm management systems to Provides full monitoring and control. The all-inclusive method not only makes immediate harvesting better. Tasks and also helps with long-term farming plans and managing resources. By encouraging with new developments in robots and AI, the project aims to create better ways in farming technology, opening the This is the way forward to smart farming, the future.
暂无评论