Using MRI to reliably diagnose brain tumors is important but it is often time-consuming. The study uses an automated method for brain tumor detection and classification using imageprocessing techniques and convention...
详细信息
The growing demand for real-timeimageprocessing on edge devices calls for novel approaches that balance computational efficiency with high performance. This paper introduces an integrated solution combining ShuffleN...
详细信息
The intersection of deep learning and programmable logic controllers (PLCs) can lead to innovative applications in automation. One of the exciting application areas are gesture-based control systems for Automated Guid...
详细信息
ISBN:
(纸本)9781510673199;9781510673182
The intersection of deep learning and programmable logic controllers (PLCs) can lead to innovative applications in automation. One of the exciting application areas are gesture-based control systems for Automated Guided Vehicles (AGVs). AGVs are used in various industries for material handling, logistics, warehouse automation, etc. Traditionally, these vehicles are controlled using predefined routes or remote controls, but with gesture-based control, operators can communicate more naturally and efficiently. The incorporation of YOLO-Pose in YOLO versions 7 and 8 has elevated the YOLO algorithm to a leading tool for creating gesture recognition models. The YOLO algorithm employs convolutional neural networks (CNN) to detect objects in real-time. These latest YOLO models offer significantly improved accuracy, speed, and reduced training times. This paper presents the comparative results of 2D gesture recognition transfer learning models created using the YOLO v5, v7, and v8 models, along with the steps taken to implement the model in a PLC-controlled AGV. Over 14,000 images were collected to build the models. A semi-automated approach was used to annotate them. Five models were created: two Keypoint models and three object detection models using transfer learning techniques with the same hyperparameters.
With the advancement of technology and due to the recent pandemic situation, the education sector has turned to the online teaching method. But the main problem here is the inconvenience and irregularities in the stud...
详细信息
image classifiers for domain-specific tasks like Synthetic Aperture Radar Automatic Target Recognition (SAR ATR) and chest X-ray classification often rely on convolutional neural networks (CNNs). These networks, while...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
image classifiers for domain-specific tasks like Synthetic Aperture Radar Automatic Target Recognition (SAR ATR) and chest X-ray classification often rely on convolutional neural networks (CNNs). These networks, while powerful, experience high latency due to the number of operations they perform, which can be problematic in real-time applications. Many image classification models are designed to work with both RGB and grayscale datasets, but classifiers that operate solely on grayscale images are less common. Grayscale image classification has critical applications in fields such as medical imaging and SAR ATR. In response, we present a novel grayscale image classification approach using a vectorized view of images. By leveraging the lightweight nature of Multi-Layer Perceptrons (MLPs), we treat images as vectors, simplifying the problem to grayscale image classification. Our approach incorporates a single graph convolutional layer in a batch-wise manner, enhancing accuracy and reducing performance variance. Additionally, we develop a customized accelerator on FPGA for our model, incorporating several optimizations to improve performance. Experimental results on benchmark grayscale image datasets demonstrate the effectiveness of our approach, achieving significantly lower latency (up to 16x less on MSTAR) and competitive or superior performance compared to state-of-the-art models for SAR ATR and medical image classification.
This research investigates the implementation of real-time aerial image edge detection using the Canny edge detection algorithm with the MicroWatt Power Instruction Set Architecture (ISA)-Open Core Processor on Field ...
详细信息
With the continuous development of digital imageprocessing technology, edge detection technology is playing an increasingly important role in the field of imageprocessing. The Canny algorithm is a classical gradient...
详细信息
ISBN:
(纸本)9798400707032
With the continuous development of digital imageprocessing technology, edge detection technology is playing an increasingly important role in the field of imageprocessing. The Canny algorithm is a classical gradient-based edge detection algorithm with excellent performance and robustness. However, due to its large computational amount and poor real-time performance, the traditional software implementation method has been unable to meet the demands of modern high-speed imageprocessing. Therefore, the Canny algorithm hardware is turned into a popular research direction. As a programmable logic device, FPGA has the advantages of high flexibility, short development period and strong parallel computing power, which is widely used in the field of digital signal processing. At present, there has been a lot of research work on FPGA in implementing the Canny algorithm, but most of the schemes have some problems, such as slow speed and high resource occupancy rate. Therefore, this paper presents an improved scheme for the hardware design of Canny algorithm based on FPGA, aiming to improve the speed and efficiency of imageprocessing while reducing the utilization of hardware resources.
Fire smoke needs early detection and accurate identification, so as to protect people's lives and property, while manual control method has problems such as large time consumption, subjective misjudgment, so an ef...
详细信息
In this paper, the development of an immersive virtual conference room designed to replicate the experience of a physical meeting environment has been presented. Utilizing Unity software, authors have created a virtua...
详细信息
ISBN:
(纸本)9798350391893;9798350391886
In this paper, the development of an immersive virtual conference room designed to replicate the experience of a physical meeting environment has been presented. Utilizing Unity software, authors have created a virtual space equipped with several simple screens instead of holograms, each attached with individual cameras. Live camera feeds seamlessly integrate real-time participant interactions within virtual environments, enhancing telepresence and fostering immersive collaboration akin to face-to-face meetings. While the current implementation operates locally, our future focus is on enabling remote connectivity, facilitating collaboration among individuals across different geographic locations, and later a hologram-based virtual conference. This innovative approach aims to enhance remote collaboration experiences and bridge the gap between virtual interactions and physical presence.
This paper tackles the problem of mixed Gaussian and impulsive noise suppression in color images. The proposed method comprises two essential steps. Firstly, we detect impulsive noise through an approach based on the ...
详细信息
ISBN:
(纸本)9781510673878;9781510673861
This paper tackles the problem of mixed Gaussian and impulsive noise suppression in color images. The proposed method comprises two essential steps. Firstly, we detect impulsive noise through an approach based on the concept of digital path exploring the local pixel neighborhood. Each pixel is assigned a cost of a path connecting the boundary of a local processing window with its center. When the central pixel exhibits a high value of the path with lowest cost, it is identified as an impulse. To achieve this, we use a thresholding procedure for detecting corrupted pixels. Analyzing the distribution of minimum path costs, we employ the k-means technique to classify pixels into three distinct categories: those nearly undistorted, those corrupted by Gaussian noise, and those affected by impulsive noise. Subsequently, we employ the Laplace interpolation technique to restore the impulsive pixels - a fast and effective method yielding satisfactory denoising results. In the second step, we address the residual Gaussian noise using the Non-Local Means method, which selectively considers pixels from the local window that have not been flagged as impulsive. The experimental results confirm that our proposed hybrid method consistently yields superior outcomes compared to state-of-the-art denoising techniques. Moreover, its computational complexity remains low, rendering it suitable for real-time applications.
暂无评论