The advancement in on demand Multimedia Streaming Applications (MAS) enables faster video transmission as per the user request in various fields. This system suffers from poor speed, flexibility and efficiency in acce...
详细信息
The advancement in on demand Multimedia Streaming Applications (MAS) enables faster video transmission as per the user request in various fields. This system suffers from poor speed, flexibility and efficiency in accessing and presenting the multimedia contents from the archive. It mostly undergoes delay, packet loss and congestion during data delivery. Hence, the requirement of manual annotation is required for access and retrieval but it suffers from poor retrieval accuracy over large databases. The need of automatic annotation in MAS takes the lead for increased retrieval accuracy on most similar image retrieval systems based on various low-level features. Thus, it eliminates the gap between the high-level semantics and low-level feature representation. The approach on automated annotation of images is considered dependent on the accuracy of a model while detecting edges, color, texture, shape and spatial information. In this paper, we develop an automated annotation model that retrieves visually similar images from online multimedia streams with optimal feature extraction. The automated annotation model is designed with a Multi-modal Active learning (MAL) that uses Convolutional Recurrent Neural Network (CRNN) for automatic annotation of labels based on visually similar contents or features like edges, color, texture, shape and spatial information. Further, a deep Reinforcement learning (DRL) algorithm is used that increases the performance of the retrieval engine based on validating the visually extracted features. The simulation of MAL-CNN is conducted over large online streaming databases and it is then validated by DRL on an online real-time streaming. The performance is validated in terms of its retrieval accuracy, sensitivity, specificity, f-measure, geometric mean and mean absolute percentage error (MAPE). The results confirm the accuracy of the proposed MAL-DRL model against conventional machine learning, reinforcement learning and deeplearning automati
It is important to accurately identify and measure in-focus droplets from shadowgraph droplet images that typically contain a large number of defocused droplets for the research of multiphase flow. However, convention...
详细信息
It is important to accurately identify and measure in-focus droplets from shadowgraph droplet images that typically contain a large number of defocused droplets for the research of multiphase flow. However, conventional in-focus droplet identification methods are time-consuming and laborious due to the noise and background illumination in experimental data. In this paper, a deeplearning-based method called focus-droplet generative adversarial network (FocGAN) is developed to automatically detect and characterize the focused droplets in shadow images. A generative adversarial network framework is adopted by our model to output binarized images containing only in-focus droplets, and inception blocks are used in the generator to enhance the extraction of multi-scale features. To emulate the real shadow images, an algorithm based on the Gauss blur method is developed to generate paired datasets to train the networks. The detailed architecture and performance of the model were investigated and evaluated by both the synthetic data and spray experimental data. The results show that the present learning-based method is far superior to the traditional adaptive threshold method in terms of effective extraction rate and accuracy. The comprehensive performance of FocGAN, including detection accuracy and robustness to noise, is higher than that of the model based on a convolutional neural network. Moreover, the identification results of spray images with different droplet number densities clearly exhibit the feasibility of FocGAN in real experiments. This work indicates that the proposed learning-based approach is promising to be widely applied as an efficient and universal tool for processing particle shadowgraph images.
This study presents an empirical assessment of identifying human blood groups using imageprocessing assisted by deeplearning principles, specifically employing a cascaded Convolutional Neural Network (CNN) and Light...
详细信息
Lane detection technology plays a pivotal role in enabling autonomous navigation in vehicles. However, existing systems primarily cater to well-structured roads with clear lane markings, rendering them ineffective in ...
详细信息
deeplearning models have been a huge success in image recognition which hence can be used for the purpose of text generation. In the field of imaging science, captioning images and videos is regarded as an intellectu...
详细信息
Natural gas extraction systems often encounter manufacturing defects or develop defects over time, leading to gas leaks. These leaks pose challenges, causing revenue losses and environmental pollution. Detecting gas l...
ISBN:
(纸本)9781959025030
Natural gas extraction systems often encounter manufacturing defects or develop defects over time, leading to gas leaks. These leaks pose challenges, causing revenue losses and environmental pollution. Detecting gas leaks in the vast array of extraction, transfer, and storage equipment within these systems can be arduous, allowing leaks to persist unnoticed. Additionally, natural gas leaks are not visible to naked eyes, further complicating their detection. We developed a novel deeplearningimageprocessing model that utilizes videos captured by a specialized Optical Gas Imaging (OGI) camera to detect natural gas leaks. The temporal deeplearning algorithm is designed to identify patterns associated with gas leaks and improve its performance through supervised learning. Our model incorporates algorithms to detect background environments, motion, equipment, and classify gas leaks. Our model employs leak identification algorithms to determine the presence of gas leaks. These algorithms calculate the probability of detected motion indicating a gas leak based on long-term and short-term background subtraction, detected motion, motion duration, equipment location, and telemetry data. To minimize false positives, we have developed image segmentation and object detection models to identify known objects, such as equipment, people, and cars, within the video footage. To train our model we collect more than 10,000 short videos from real fields and include simulated data with known rate controlled gas release in different situations. Data consist of wide range of weather situations including different temperature, wind speed, humidity in sunny, rainy, and snowy fields. We validated our model by conducting experiments involving actual footage from the field. The model achieved a 98% true positive rate, and a 100% true negative rate, correctly refraining from sending an alarm for all non-releases. Additionally, we developed a postprocessing algorithm capable of estimating the
Nighttime detection and harvesting are key issues for achieving all-day operation of tomato-picking robots. Currently, most general detection algorithms are limited to natural daylight conditions, with significantly r...
详细信息
The proceedings contain 12 papers. The topics discussed include: real-time detection of maize crop disease via a deeplearning-based smartphone app;parallel artificial neural networks using wavelet-based features for ...
ISBN:
(纸本)9781510635791
The proceedings contain 12 papers. The topics discussed include: real-time detection of maize crop disease via a deeplearning-based smartphone app;parallel artificial neural networks using wavelet-based features for classification of remote-sensing hyperspectral images;no-reference image quality assessment based on residual neural networks (ResNets);coverless image steganography framework using distance local binary pattern and convolutional neural network;the combined denoising of images on the optical and thermal range onboard the UAV;portable flow device using Fourier ptychography microscopy and deeplearning for detection of biosignatures;parallel color image watermarking scheme for multiple picture object based on multithreading coding;and performance analysis of semantic segmentation algorithms trained with JPEG compressed datasets.
The latest spinoffs in the field of Autonomous Vehicles have paved way for a revolution in mobility and transportation;particularly in the warehousing and distribution sector. AMRs, Autonomous Mobile Robots, are being...
详细信息
ISBN:
(数字)9781905824694
ISBN:
(纸本)9781905824694
The latest spinoffs in the field of Autonomous Vehicles have paved way for a revolution in mobility and transportation;particularly in the warehousing and distribution sector. AMRs, Autonomous Mobile Robots, are being deployed to assist in warehousing activities as they present multiple advantages. In this paper, an AMR coupled with imageprocessing and deeplearning is introduced as a novel approach to solve a two-fold problem: surveillance and disinfection. deeplearning will make use of real-time data collected by the AMR's camera as a smart surveillance method for abnormal event detection. YOLOv4 is used to train a custom dataset for object detection on five different classes. The latter obtained a 74.40% accuracy. The vehicle will also be used to diffuse disinfecting agents as a mean to sanitize the stores and stocks against Covid-19. Moreover, autonomous navigation of the AMR will be based on imageprocessing techniques for path track detection.
Character recognition methods are applied in many fields, greatly improving work efficiency in daily life[1], such as license plate retrieval, invoice printing recognition, lottery betting codes, tax reports, etc. Dig...
详细信息
ISBN:
(纸本)9781665416061
Character recognition methods are applied in many fields, greatly improving work efficiency in daily life[1], such as license plate retrieval, invoice printing recognition, lottery betting codes, tax reports, etc. Digital recognition has been widely used in the field of computer vision and image recognition, and deeplearning algorithms are currently popular image recognition algorithms. deeplearning has been widely studied and applied in target recognition and speech content recognition. With the rapid increase in production requirements and computer data processing speed, the application of character recognition in actual production and life is becoming more and more common[2]. It is also extremely important for automatic retrieval and real-time, fast and accurate character input. However, traditional pattern recognition and feature extraction algorithms cannot well meet the requirements of real-time and correctness in production. At the same time, due to the vigorous development of deeplearning, character recognition technology based on deeplearning has advantages that traditional recognition algorithms cannot match. This paper proposes a barcode recognition algorithm based on a deep neural network combined with a global optimization method. It uses a convolutional recurrent network to extract the characteristics of each character in the barcode and classify it. Compared with the traditional method, it has stronger adaptability and generalization. Chemical energy.
暂无评论