The elaboration of new systems on embedded targets is becoming more and more complex. In particular, multimedia devices are now implemented using mixed hardware and software architecture, which improve the computation...
详细信息
The proceedings contain 9 papers. The topics discussed include: architecture of a low latency H.264/AVC video codec for robust ML based image classification;automotive perception system evaluation with reference data ...
ISBN:
(纸本)9781450389013
The proceedings contain 9 papers. The topics discussed include: architecture of a low latency H.264/AVC video codec for robust ML based image classification;automotive perception system evaluation with reference data obtained by a UAV;convolutional fully-connected capsule network (CFC-CapsNet);DExIE – an IoT-class hardware monitor for real-time fine-grained control-flow integrity;GEGELATI: lightweight artificial intelligence through generic and evolvable tangled program graphs;hardware-software implementation of the PointPillars network for 3D object detection in point clouds;low-power sign-magnitude FFT design for FMCW radar signalprocessing;multiple transform selection concept modeling and implementation using interface based SDF graphs;and on cache limits for dataflow applications and related efficient memory management strategies.
In the multiuser MIMO broadcast channel, the use of precoding techniques is required in order to detect the signal at the users' terminals without any cooperation between them. This contribution presents the desig...
详细信息
In order to obtain depth information about a scene in computer vision, one needs to process pairs of stereo images. The calculation of dense depth maps in real-time is computationally challenging as it requires search...
详细信息
Modern computer vision and imageprocessing embedded systems exploit hardware acceleration inside scalable parallel architectures, such as tightly-coupled clusters, to achieve stringent performance and energy efficien...
详细信息
ISBN:
(纸本)9791092279016
Modern computer vision and imageprocessing embedded systems exploit hardware acceleration inside scalable parallel architectures, such as tightly-coupled clusters, to achieve stringent performance and energy efficiency targets. Architectural heterogeneity typically makes software development cumbersome, thus shared memory processor-to-accelerator communication is typically preferred to simplify code offloading to HW IPs for critical computational kernels. However, tightly coupling a large number of accelerators and processors in a shared memory cluster is a challenging task, since the complexity of the resulting system quickly becomes too large. We tackle these issues by proposing a template of heterogeneous shared memory cluster which scales to a large number of accelerators, achieving up to 40% better performance/area/watt than simply designing larger main interconnects to accommodate several HW IPs. In addition, following a trend towards standardization of acceleration capabilities of future embedded systems, we develop a programming model which simplifies application development for heterogeneous clusters.
In the last few years, efficient resource management turned out to be one of the major challenges for hardware designers. Strategies of reusability through reconfiguration have demonstrated interesting potentials to a...
详细信息
ISBN:
(纸本)9791092279016
In the last few years, efficient resource management turned out to be one of the major challenges for hardware designers. Strategies of reusability through reconfiguration have demonstrated interesting potentials to address it, providing also power and area minimization. The Multi-Dataflow Composer (MDC) tool has been presented to the scientific community to automatically build-up runtime coarse-grained reconfigurable platforms. Originally conceived in the field of Reconfigurable Video Coding (RVC), the MDC allows achieving attractive results also for hardware designers operating in different application fields. In this work, we intend to demonstrate the potential orthogonality of the MDC approach with respect to the RVC domain. A runtime reconfigurable wavelet denoiser, targeted for biomedical applications, has been developed and prototyped onto an FPGA Development Board Spartan 3E 1600. Impressive results in terms of resource minimization have been achieved with respect to more traditional solutions.
The proceedings contain 9 papers. The special focus in this conference is on design and architectures for signal and imageprocessing. The topics include: Brain Blood Vessel Segmentation in Hyperspectral images T...
ISBN:
(纸本)9783031299698
The proceedings contain 9 papers. The special focus in this conference is on design and architectures for signal and imageprocessing. The topics include: Brain Blood Vessel Segmentation in Hyperspectral images Through Linear Operators;Neural Network Predictor for Fast Channel Change on DVB Set-Top-Boxes;AINoC: New Interconnect for Future Deep Neural Network Accelerators;Real-Time FPGA Implementation of the Semi-global Matching Stereo Vision Algorithm for a 4K/UHD Video Stream;TaPaFuzz - An FPGA-Accelerated Framework for RISC-V IoT Graybox Fuzzing;Adaptive Inference for FPGA-Based 5G Automatic Modulation Classification;High-Level Online Power Monitoring of FPGA IP Based on Machine Learning.
The proceedings contain 9 papers. The special focus in this conference is on design and Architecture for signal and imageprocessing. The topics include: sEMG-Based Gesture Recognition with Spiking Neural Network...
ISBN:
(纸本)9783031628733
The proceedings contain 9 papers. The special focus in this conference is on design and Architecture for signal and imageprocessing. The topics include: sEMG-Based Gesture Recognition with Spiking Neural Networks on Low-Power FPGA;A Highly Configurable Platform for Advanced PPG Analysis;preface;Standalone Nested Loop Acceleration on CGRAs for signalprocessing Applications;optimising Graph Representation for Hardware Implementation of Graph Convolutional Networks for Event-Based Vision;Improving the Energy Efficiency of CNN Inference on FPGA Using Partial Reconfiguration;scratchy: A Class of Adaptable architectures with Software-Managed Communication for Edge Streaming Applications.
Embedded applications are usually coming with stringent constraints in term of cost, energy consumption and realtime. Consequently, fixed-point arithmetic is mainstream for their implementation into embedded systems. ...
详细信息
Noise often limits the performance of transmitted signals and degrades signals quality. Moreover, stochastic nature of noise makes it difficult to predict, and hence, is hard to detect. In hardware implementation, the...
详细信息
暂无评论