Embedded systems typically require the transmission of significant amounts of data to small-scale CPUs for applications such as radar signal processing, imageprocessing, and embedded AI. Ensuring data integrity durin...
详细信息
ISBN:
(纸本)9798350377217;9798350377200
Embedded systems typically require the transmission of significant amounts of data to small-scale CPUs for applications such as radar signal processing, imageprocessing, and embedded AI. Ensuring data integrity during transmission is typically managed using Cyclic Redundancy Check (CRC) algorithms. However, achieving real-time CRC calculation and data storage poses challenges, often necessitating large FIFO memories and multiple clock domains. These additional resources involve a greater hardware complexity. This paper presents an approach aimed at synchronizing the CPU frequency with data transmission. This enables having a single clock domain and a reduction of power consumption. Using hardware/software co-design, it is possible to achieve real-time data storage and CRC calculation without data loss and with a low power consumption.
Division is one of the most commonly sort after algorithm for performing imageprocessing operations such as normalization, filtering, enhancement, deconvolution etc. Hence, the design of efficient division algorithm ...
详细信息
Medical image segmentation plays a pivotal role in computer-aided diagnosis by facilitating the extraction of essential features necessary for disease detection and treatment strategies. The continuous progress in ima...
详细信息
In order to comply with the trend of intelligent visual communication, this study proposed an innovative visual communication scenario based on imageprocessingalgorithms. The framework aims to optimize traditional k...
详细信息
Deep Neural Network (DNN) algorithms have become ubiquitous within the vision domain, encompassing various tasks, including object detection, segmentation, and classification. However, executing complex DNNs in real-t...
详细信息
ISBN:
(纸本)9781510679344;9781510679351
Deep Neural Network (DNN) algorithms have become ubiquitous within the vision domain, encompassing various tasks, including object detection, segmentation, and classification. However, executing complex DNNs in real-time systems demands improved energy efficiency, runtime, and accuracy. Traditional embedded imaging designs, typically implemented on homogeneous architectures, face hardware limitations, prompting the need for heterogeneous computing architectures. These architectures combine CPUs, GPUs, FPGAs, and other accelerators, enabling applications to use the most efficient architecture for a given algorithm. The challenge lies in scheduling and partitioning algorithms across accelerators with different computing paradigms and tool-sets. This requires balancing computational power, memory bandwidth, and communication overhead. Effective scheduling involves considering task dependencies, resource availability, and synchronisation. Current deep learning libraries often target single architectures and lack mechanisms to intelligently partition sub-operations across the most suitable processors. This paper introduces a scheduler for heterogeneous vision systems that finely partitions and maps sub-operations of convolutional neural networks and imageprocessingalgorithms. Leveraging state-of-the-art compiler frameworks such as PyTorch, TVM, and ONNX, the proposed scheduler optimally distributes tasks across heterogeneous components. Experimental results show that the heterogeneous platform achieves on average 1.12x & 1.08x improvements in kernel runtime and energy consumption, compared to the best-performing discrete hardware counterparts, GPU and FPGA. The study demonstrates that partitioning algorithms based on their runtime and energy properties and optimally scheduling them improves energy and runtime efficiency compared to homogeneous components executing the complete algorithm.
This paper investigates classical imageprocessing techniques and unsupervised deep learning algorithms for segmenting images with high variance for an under researched industrial problem, focusing on beam burns gener...
详细信息
Watermelon is a commonly cultivated fruit worldwide, especially in Southeast Asia. As one of the top exports in Asia, the commercialization of watermelon has its market. Its consumer appeal makes it one of the most so...
详细信息
ISBN:
(纸本)9798350372113;9798350372106
Watermelon is a commonly cultivated fruit worldwide, especially in Southeast Asia. As one of the top exports in Asia, the commercialization of watermelon has its market. Its consumer appeal makes it one of the most sought-after fruits globally. With the watermelon fruit consisting of different varieties, consumers usually have difficulty in classifying watermelons due to their similar external appearances, especially when labeled under the same name. This study is conducted to implement a system that detects and classifies three red watermelon varieties, Red Export, Orchid Sweet, and Dixie Queen, with imageprocessingalgorithms. The researchers utilized Canny Edge Detection for the dataset's preprocessing phase and Convolutional Neural Network (CNN) for its classification. The Raspberry Pi is also applied to this study. Moreover, the researchers created and collected their datasets for testing and validation data. The model used in this study has acquired 84.71% overall accuracy.
This article explores the application of deep learning (DL) algorithms in power system load forecasting. With the continuous advancement of the construction of new power systems, traditional load forecasting models de...
详细信息
The development of computer vision systems stimulates the development of various applications in the field of image recognition. Methods and algorithms for image recognition in document processingsystems play a cruci...
详细信息
This paper aims to investigate a system that uses various machine-learning algorithms to predict symptoms and deep-learning techniques for imageprocessing that leads to early disease prediction, an essential aspect o...
详细信息
暂无评论