This paper investigates the optimization and deployment of YOLOv7 deep learning model on NVIDIA Jetson Nano, an AI-focused edge computing platform for object detection in various computervision applications. The work...
详细信息
ISBN:
(数字)9798331527549
ISBN:
(纸本)9798331527556
This paper investigates the optimization and deployment of YOLOv7 deep learning model on NVIDIA Jetson Nano, an AI-focused edge computing platform for object detection in various computervision applications. The work leverages TensorRT and quantization techniques for model acceleration for good detection accuracy. Further it examines performance metrics such as speed, accuracy, and resource utilization for image dataset. The model is trained using 80 different classes of objects and demonstrates the use of 6 classes. The average detection accuracy obtained 92.35% and the average processing time is 117.8ms. This work supports AI by demonstrating the feasibility of running deep learning models on edge devices and provides insight into the challenges and opportunities of optimizing AI models for energy-efficient, real-time operations on edge devices for various computervision applications.
In recent years, Artificial Intelligent (AI) is rapidly developed. The image caption is attracting the attention of many scientists. It is very interesting work. Automatically generating image caption into the natural...
详细信息
In this paper, a robust image watermarking scheme based on all phase sine biorthogonal transform (APSBT), singular value decomposition (SVD) and dynamic stochastic resonance (DSR) is presented. Firstly, the cover imag...
详细信息
ISBN:
(纸本)9789813292918;9789813292901
In this paper, a robust image watermarking scheme based on all phase sine biorthogonal transform (APSBT), singular value decomposition (SVD) and dynamic stochastic resonance (DSR) is presented. Firstly, the cover image is transformed by APSBT and then a gray-scale logo is embedded through the singular value decomposition. After the authentication process which essentially resolves the false-positive extraction of SVD in watermarking, a phenomenon based on dynamic stochastic resonance is deployed for the logo extraction from the watermarked image. The simulation results demonstrate that the proposed scheme has better performance in the aspect of robustness and invisibility.
This paper presents an innovative framework that employs camera-captured visual data to detect and suggest optimal sitting postures. The framework consists of two crucial components: a video capture object and an obje...
This paper presents an innovative framework that employs camera-captured visual data to detect and suggest optimal sitting postures. The framework consists of two crucial components: a video capture object and an object detection system that incorporates Deep Learning to enhance efficiency and reliability. The camera initially captures the user’s posture image, which is then subjected to video processing to extract video metadata. Subsequently, the object is created by extracting the image from the video, and the object detection algorithm is applied to provide posture recommendations. The algorithm continually monitors posture correctness and provides suggestions to improve it as necessary, ultimately benefiting the user’s daily life and mitigating potential long-term problems. Additionally, the algorithm can be further developed to recognize posture patterns and suggest corrective exercises or techniques. Furthermore, the algorithm’s efficiency can be enhanced by optimizing landmark detection for more effective outcomes. This cutting-edge framework offers immense potential for improving posture and overall health, and its development can significantly enhance the quality of life for individuals.
Down syndrome is a genetic disorder that affects 1 in every 1000 babies born worldwide. The cases of Down syndrome have increased in the past decade. It has been observed that humans with Down syndrome generally tend ...
详细信息
ISBN:
(纸本)9789813290884;9789813290877
Down syndrome is a genetic disorder that affects 1 in every 1000 babies born worldwide. The cases of Down syndrome have increased in the past decade. It has been observed that humans with Down syndrome generally tend to have distinct facial features. This paper proposes a model to identify people suffering from Down syndrome based on their facial features. Deep representation from different parts of the face is extracted and combined with the aid of Deep Convolutional Neural Networks. The combined representations are then classified using a Random Forest-based pipeline. The model was tested on a dataset of over 800 individuals suffering from Down syndrome and was able to achieve a recognition rate of 98.47%.
With the booming development of imageprocessing technology and computervision technology, scene detection and imageprocessing in special weather has become an important research direction in this field. Among them,...
详细信息
With the booming development of imageprocessing technology and computervision technology, scene detection and imageprocessing in special weather has become an important research direction in this field. Among them, images taken in foggy days are easily affected by fog or haze, resulting in blurred details and low contrast to the loss of important image information, and to solve such problems image defogging algorithms are born. To address these challenges, a lightweight convolutional neural network based on multi-scale dense connectivity, called MSDL, is proposed in this paper for reconstructing blurred images. DehazeNet is the End-to-End defogging system that takes the fogged image as input, its transmission map as output, and then uses an atmospheric scattering model for image reduction. The proposed MSDL uses the transformed atmospheric scattering model to jointly estimate the transmission map and atmospheric light. In addition, a novel feature extraction module MSDB is proposed. Finally, extensive experiments are carried out using synthetic and natural hazy images. The experimental results show superiority over both non-deep learning and deep learning methods in both qualitative and quantitative evaluation. The PSNR, SSIM and MSE metrics were measured on different datasets, and the advantages were obtained on NYU2 dataset.
With the rapid development of technology, computer technology is gradually maturing. In the computervision field, imageprocessing technology in specific weather has become an important research field. For those phot...
With the rapid development of technology, computer technology is gradually maturing. In the computervision field, imageprocessing technology in specific weather has become an important research field. For those photos taken in foggy weather, the mist will affect natural light leading to a certain impact on the visual effect making pictures seem unclear. Additionally, the useful information in the pictures will decrease. To solve the image blurring problem caused by natural weather such as haze, this paper adopts the histogram equalization method. By averaging the gray histogram of the image, the blurred areas are magnified and dispersed, and a clear state is visually presented. The retinex algorithm is used to process the histogram equalization result image to enhance the useful image information and further process the result details to defog the image and obtain the final defogged image.
image caption or description generation is a fundamental problem of artificial intelligence. It requires both knowledge, natural language processing, and computervision together. It automatically produces description...
详细信息
The images of construction work receipts are filled with various noises, such as stains, scratches, handwritten handwriting, and seal images, which result in poor performance of traditional text detection methods. Thi...
详细信息
Windstorms, foggy winters, and sand storms generally degrade image quality and have an impact on computervision applications, which could be a safety concern for drivers because of light dispersing and retention by d...
详细信息
暂无评论