fpgas are now used in public clouds to accelerate a wide range of applications, including many that operate on sensitive data such as financial and medical records. We present ShEF, a trusted execution environment (TE...
详细信息
ISBN:
(纸本)9781450392051
fpgas are now used in public clouds to accelerate a wide range of applications, including many that operate on sensitive data such as financial and medical records. We present ShEF, a trusted execution environment (TEE) for cloud-based reconfigurable accelerators. ShEF is independent from CPU-based TEEs and allows secure execution under a threat model where the adversary can control all software running on the CPU connected to the FPGA, has physical access to the FPGA, and can compromise the FPGA interface logic of the cloud provider. ShEF provides a secure boot and remote attestation process that relies solely on existing FPGA mechanisms for root of trust. It also includes a Shield component that provides secure access to data while the accelerator is in use. The Shield is highly customizable and extensible, allowing users to craft a bespoke security solution that fits their accelerator's memory access patterns, bandwidth, and security requirements at minimum performance and area overheads. We describe a prototype implementation of ShEF for existing cloud fpgas, map ShEF to a performant and secure storage application, and measure the performance benefits of customizable security using five additional accelerators.
In recent years, the emergence of deep learning and the need for big data analysis have created a demand for computers with high computational performance and energy efficiency. Since conventional ASICs and general-pu...
详细信息
Cloud automation is one of the most promising and upcoming applications of IoT [1], as it allows users to control and monitor their environment remotely. Adopting the Internet of Things (IoT) in the automation industr...
详细信息
Modern software architectures for robotics map tasks to heterogeneous computing platforms comprising multi-core CPUs, GPUs, and fpgas. fpgas promise huge potential for energy efficient and fast computation, but their ...
详细信息
ISBN:
(纸本)9781665472609
Modern software architectures for robotics map tasks to heterogeneous computing platforms comprising multi-core CPUs, GPUs, and fpgas. fpgas promise huge potential for energy efficient and fast computation, but their use in robotics requires profound knowledge of hardware design and is thus challenging. ReconROS, a combination of the reconfigurable operating system ReconOS and the robot operating system (ROS) aims to overcome this challenge with a consistent programming model across the hardware/software boundary and support of event-driven programming. In this paper, we summarize different approaches for mapping tasks to computational resources in ReconROS. These approaches include static and dynamic mappings, and the exploitation of data parallelism for single ROS nodes. Further, for dynamic mapping we propose and analyse different replacement strategies for hardware nodes to minimize reconfiguration overhead. We evaluate the presented techniques and illustrate ReconROS' capabilites through an autonomous vehicle example in a hardware-in-the-loop simulation.
The demand for energy-efficient and high-performance embedded systems drives the evolution of new hardware architectures, including concepts like approximate computing. This paper presents a novel reconfigurable embed...
详细信息
ISBN:
(纸本)9798350380392;9798350380385
The demand for energy-efficient and high-performance embedded systems drives the evolution of new hardware architectures, including concepts like approximate computing. This paper presents a novel reconfigurable embedded platform named "phoeniX", using the standard RISC-V ISA, maximizing energy efficiency while maintaining acceptable application-level accuracy. The platform enables the integration of approximate circuits at the core level with diverse structures, accuracies, and timings without requiring modifications to the core, particularly in the control logic. The platform introduces novel control features, allowing configurable trade-offs between accuracy and energy consumption based on specific application requirements. To evaluate the effectiveness of the platform, experiments were conducted on a set of applications, such as image processing and Dhrystone benchmark. The core with its original execution engine, occupies 0.024mm(2) of area, with average power consumption of 4.23mW at 1.1V operating voltage, average energy-efficiency of 7.85pJ per operation at 620MHz frequency in 45nm CMOS technology. The configurable platform with a highly optimized 3-stage pipelined RV32I(E)M architecture, possesses a DMIPS/MHz of 1.89, and a CPI of 1.13, showcasing remarkable capabilities for an embedded processor.
The recent advances in large vision and language models, along with generative artificial intelligence, have led to a growing demand for compute resources such as increased processing capacity, faster memory, and larg...
详细信息
Within the disciplines of electronic engineering and computer science, the quest for cutting-edge integrated circuit design has become more acute, reflecting the burgeoning need for high-performance computing solution...
详细信息
The addition of a 16-bit unsigned divider to Xilinx's Kintex and Virtex FPGA devices improves digital design and computational performance. Developers may divide nonnegative 16-bit integers with the 16-bit unsigne...
详细信息
In the past conventional von Neumann computer architecture, the data that are stored in the memory unit and the computing core are separated which leads to the increase in power consumption and latency as data size be...
详细信息
The proceedings contain 15 papers. The topics discussed include: extensible embedded hardware description languages with compilation, simulation and verification;breaking boundaries: optimizing FPGA CAD with flexible ...
ISBN:
(纸本)9798400700439
The proceedings contain 15 papers. The topics discussed include: extensible embedded hardware description languages with compilation, simulation and verification;breaking boundaries: optimizing FPGA CAD with flexible and multi-threaded re-clustering;efficient FPGA implementation of amoeba-inspired SAT solver with feedback and bounceback control: harnessing variable-level parallelism for large-scale problem solving in edge computing;quantitative study of floating-point precision on modern fpgas;resource-efficient RISC-V vector extension architecture for FPGA-based accelerators;CSA based Radix-4 Gemmini systolic array for machine learning applications;and noise resilience of reduced precision neural networks.
暂无评论