Facial video re-targeting is a challenging problem aiming to modify the facial attributes of a target subject in a seamless manner by a driving monocular sequence. We leverage the 3D geometry of faces and Generative A...
详细信息
The super-resolving images investigation has advanced in recent years through the use of cutting-edge deeplearning-based architectures. Numerous previously documented super-resolution-based solutions need the most ad...
The super-resolving images investigation has advanced in recent years through the use of cutting-edge deeplearning-based architectures. Numerous previously documented super-resolution-based solutions need the most advanced and top-tier Graphics processing Units (GPUs) to execute picture super-resolution. With the growing advancement of technology, this research focuses on suggesting the needed quantity of convolutional layers for creating the realtime super resolution of single image using Convolutional Neural Network. The proposed Six Convolutional Layered deep Convolutional Neural Network (6CL-DCNN) to predict the super resolution of images with high accuracy and ideal Peak Signal-to-Noise Ratio. The dataset extracted from ***(http://***/Research/Projects/CS/vision/grouping/BSR/BSR_***). The dataset contains 800 images with the combination of low resolution and high-resolution images having the image resolution of 300 * 300 pixels. The proposed 6CL-DCNN built with single input layer followed by six convolutional layers followed by single lambda optimized output layer that predicts the super resolution of both high resolution images and low resolution images. Python was used through 500 training iterations and a 64-bit block size on a NVidia Geforce Tesla V100 GPU workstation. The processed low resolution images and high-resolution images are applied with proposed 6CL-DCNN model and also the performance is compared using Peak Signal to Noise Ratio with other optimized output layers. Experimental results shows that the proposed model 6CL-DCNN shows the maximum Peak Signal to Noise Ratio of 48 dB when compared to other optimized output layers.
The realm of face detection has become a focal point of extensive research, driven by its diverse applications spanning computer vision, communication, and automatic control systems. realizing real-time recognition of...
详细信息
In this paper, in order to solve various problems occurring in the workspace, a deeplearning-based workspace identification module was designed, and the performance was analyzed through an experiment on the recogniti...
详细信息
ISBN:
(纸本)9781728176383
In this paper, in order to solve various problems occurring in the workspace, a deeplearning-based workspace identification module was designed, and the performance was analyzed through an experiment on the recognition accuracy according to the configuration of the training dataset and the number of training. The data model of the designed deeplearning module is ResNet18, and after setting up three dataset strategies, a dataset using five types of workspaces of the manufacturing industry was selected. In terms of the average top 5 and all training, strategy 2 was 812% and 76.4%, respectivel, confirming that it was the best among the 3 strategies. In the future, after upgrading the designed module, it is planned to implement a module with real-time workspace identification performance level of practical use in a mobile environment with an image input device installed.
Quality analysis of the polarizer of a production line can be performed using imageprocessing technology. The existing method of detecting defective images based on deeplearning can ensure accurate classification;ho...
详细信息
Quality analysis of the polarizer of a production line can be performed using imageprocessing technology. The existing method of detecting defective images based on deeplearning can ensure accurate classification;however, its detection speed is low, the model requires a large amount of memory, and it is difficult to meet the real-time requirements of online detection systems when hardware resources are limited. Therefore, in this study a lightweight polarizer defect detection network, called DDN, was developed based on deeplearning. First, a parallel module was designed to build the network. This module has two main advantages. First, it mixes different convolution template sizes, and can fuse the features of different scales and extract more defect features than the traditional convolution layer. Second, depthwise separable convolution is used to replace full convolution in this module, which significantly reduces the number of parameters and the multiply-accumulate operations. Finally, a global average pooling (GAP) layer is used instead of a fully connected layer. The GAP layer has no parameters to optimize, which substantially reduces the number of network parameters. Experimental results show that the proposed method is better than existing methods in terms of classification speed, precision, and memory consumption for polarizer detection, and can satisfy real-time requirements.
In Laser Powder Bed Fusion (LPBF), it is a major challenge to obtain detailed spatial information on different powder bed defects in real-time and simultaneously. deeplearning (DL) algorithms under the field of Machi...
详细信息
The rapid spread of the COVID-19 pandemic, in early 2020, has radically changed the lives of people. In our daily routine, the use of a face (surgical) mask is necessary, especially in public places, to prevent the sp...
详细信息
The rapid spread of the COVID-19 pandemic, in early 2020, has radically changed the lives of people. In our daily routine, the use of a face (surgical) mask is necessary, especially in public places, to prevent the spread of this disease. Furthermore, in crowded indoor areas, the automated recognition of people wearing a mask is a requisite for the assurance of public health. In this direction, imageprocessing techniques, in combination with deeplearning, provide effective ways to deal with this problem. However, it is a common phenomenon that well-established datasets containing images of people wearing masks are not publicly available. To overcome this obstacle and to assist the research progress in this field, we present a publicly available annotated image database containing images of people with and without a mask on their faces, in different environments and situations. Moreover, we tested the performance of deeplearning detectors in images and videos on this dataset. The training and the evaluation were performed on different versions of the YOLO network using Darknet, which is a state-of-the-art real-time object detection system. Finally, different experiments and evaluations were carried out for each version of YOLO, and the results for each detector are presented.
Defocus blur detection, as an important pre-processing step of imageprocessing, has attracted more and more attention. Albeit great success has been made, there are still several challenges for accurate defocus blur ...
详细信息
Classification of fruit and vegetable freshness plays an essential role in the food industry. Freshness is a fundamental measure of fruit and vegetable quality that directly affects the physical health and purchasing ...
详细信息
Classification of fruit and vegetable freshness plays an essential role in the food industry. Freshness is a fundamental measure of fruit and vegetable quality that directly affects the physical health and purchasing motivation of consumers. In addition, it is a significant determinant of market price;thus, it is imperative to study the freshness of fruits and vegetables. Owing to similarities in color, texture, and external environmental changes, such as shadows, lighting, and complex backgrounds, the automatic recognition and classification of fruits and vegetables using machine vision is challenging. This study presents a deep-learning system for multiclass fruit and vegetable categorization based on an improved YOLOv4 model that first recognizes the object type in an image before classifying it into one of two categories: fresh or rotten. The proposed system involves the development of an optimized YOLOv4 model, creating an image dataset of fruits and vegetables, data argumentation, and performance evaluation. Furthermore, the backbone of the proposed model was enhanced using the Mish activation function for more precise and rapid detection. Compared with the previous YOLO series, a complete experimental evaluation of the proposed method can obtain a higher average precision than the original YOLOv4 and YOLOv3 with 50.4%, 49.3%, and 41.7%, respectively. The proposed system has outstanding prospects for the construction of an autonomous and real-time fruit and vegetable classification system for the food industry and marketplaces and can also help visually impaired people to choose fresh food and avoid food poisoning.
Medium-scale traveling ionospheric disturbances (MSTIDs) are observed as parallelly arrayed wavelike perturbations of Total Electron Content (TEC) in ionospheric F region leading to satellite navigation error and comm...
详细信息
Medium-scale traveling ionospheric disturbances (MSTIDs) are observed as parallelly arrayed wavelike perturbations of Total Electron Content (TEC) in ionospheric F region leading to satellite navigation error and communication signal scintillation. The observation method for MSTIDs, detrended TEC (dTEC) map, summarizes the perturbation component of TEC having the merits of full-time and two-dimensional. However, previous automatic processing methods for dTEC map cannot discriminate MSTIDs from other irregular ionospheric perturbations intelligently. With the development of artificial intelligence in recent years, deeplearning approach is expecting to clarify the controversy of MSTID external dependence (season and solar/geomagnetic activity) under debating for decades. Therefore, this research proposes a real-timeprocessing algorithm for dTEC maps based on Mask Region-Convolutional Neural Network (R-CNN) model of deeplearning instance segmentation to detect wavelike perturbations intelligently with an accuracy of about 80% and a processing speed of about 8 fps. Then isolated perturbations are eliminated and only MSTID waveforms are chosen to obtain statistical characteristics of MSTIDs. With this algorithm, we analyzed up to 1,209,600 dTEC maps from 1997 to 2019 over Japan automatically and established a database of hourly averaged MSTID characteristics. This research introduces the partial correlation coefficient for the first time to clarify the solar/geomagnetic activity dependence of MSTID characteristics which is independent with each other.
暂无评论