image illumination correction has been a long standing topic for research in the Computer Vision problem. However, all previous literature on this topic has either been statistical in nature in the sense that a specif...
详细信息
image illumination correction has been a long standing topic for research in the Computer Vision problem. However, all previous literature on this topic has either been statistical in nature in the sense that a specified algorithm has been developed for approaching a particular case of illumination normalization, or involves extremely complex deeplearning methods for illumination correction of either one of over illuminated or under illuminated images. We present here a very simple deeplearning based image illumination correction architecture which works on color images of paintings irrespective of whether they are under or over illuminated. We have tested the results using a synthetic database as well as on real world painting images of diverse nature. (C) 2020 Elsevier B.V. All rights reserved.
During invasive surgery, the use of deeplearning techniques to acquire depth information from lesion sites in real-time is hindered by the lack of endoscopic environmental datasets. This work aims to develop a high-a...
详细信息
During invasive surgery, the use of deeplearning techniques to acquire depth information from lesion sites in real-time is hindered by the lack of endoscopic environmental datasets. This work aims to develop a high-accuracy three-dimensional (3D) simulation model for generating image datasets and acquiring depth information in real-time. Here, we proposed an end-to-end multi-scale supervisory depth estimation network (MMDENet) model for the depth estimation of pairs of binocular images. The proposed MMDENet highlights a multi-scale feature extraction module incorporating contextual information to enhance the correspondence precision of poorly exposed regions. A multi-dimensional information-guidance refinement module is also proposed to refine the initial coarse disparity map. Statistical experimentation demonstrated a 3.14% reduction in endpoint error compared to state-of-the-art methods. With a processingtime of approximately 30fps, satisfying the requirements of real-time operation applications. In order to validate the performance of the trained MMDENet in actual endoscopic images, we conduct both qualitative and quantitative analysis with 93.38% high precision, which holds great promise for applications in surgical navigation.
In humanoid robot soccer, the capacity to precisely track a ball is a crucial problem that is made challenging by processing limits and the subsequent inability to interpret all data from a high-definition image. This...
详细信息
ISBN:
(纸本)9781450397117
In humanoid robot soccer, the capacity to precisely track a ball is a crucial problem that is made challenging by processing limits and the subsequent inability to interpret all data from a high-definition image. This research suggests a method for locating and sizing balls in a computationally effective field setting. This research presents an enhanced, Faster Region-Based CNN-based deeplearning architecture for multi-class ball and goal recognition. The proposed framework incorporates improved Faster RCNN model development, data argumentation, ball and goal image library building, and performance assessment. This study is a pioneer in employing 1000 real-world photographs to build a multi-labeled image class ball and goal. The convolutional and pooling layers are also improved for more precise and quick identification. The test findings reveal that the suggested method outperformed conventional detectors regarding detecting accuracy and processing speed. It has excellent potential for use in developing an autonomous, real-timeimage recognition system for humanoid robots.
Dentists judge the quality of root canal therapy for each patient very time-consuming, and inefficient, lack of quantitative evaluation criteria, easy to cause judgment errors. At the same time, the traditional method...
详细信息
timely diagnosis and accurate identification of grape leaf diseases are decisive for controlling the spread of disease and ensuring the healthy development of the grape *** objective of this research was to propose a ...
详细信息
timely diagnosis and accurate identification of grape leaf diseases are decisive for controlling the spread of disease and ensuring the healthy development of the grape *** objective of this research was to propose a simple and efficient approach to improve grape leaf disease identification accuracy with limited computing resources and scale of training image dataset based on deep transfer learning and an improved MobileNetV3 model(GLD-DTL).A pre-training model was obtained by training MobileNetV3 using the imageNet dataset to extract common features of the grape *** the last convolution layer of the pre-training model was modified by adding a batch normalization function.A dropout layer followed by a fully connected layer was used to improve the generalization ability of the pre-training model and realize a weight matrix to quantify the scores of six diseases,according to which the Softmax method was added as the top layer of the modified networks to give probability distribution of six ***,the grape leaf diseases dataset,which was constructed by processing the image with data augmentation and image annotation technologies,was input into the modified networks to retrain the networks to obtain the grape leaf diseases recognition(GLDR)*** showed that the proposed GLD-DTL approach had better performance than some recent *** identification accuracy was as high as 99.84%while the model size was as small as 30 MB.
Wheat is an important cereal crop and is the second most consumed cereal after rice globally. It is a staple food for more than one-third of the world’s population. The production of wheat depends on various factors,...
Wheat is an important cereal crop and is the second most consumed cereal after rice globally. It is a staple food for more than one-third of the world’s population. The production of wheat depends on various factors, such as climate, temperature, soil, pests, bacteria, and other biotic and abiotic factors. However, diseases can have a significant impact on wheat production. Various wheat diseases can affect crop yields,including leaf rust, leaf spot, spike infection, virus, bacterial back chaff, bacterial spike blight, and aphids. Leaf rust, in particular, is known to cause severe damage to wheat leaves. To combat these diseases, researchers have been investigating the use of advanced technologies such as deeplearning and image-processing approaches for plant disease recognition. The process of disease detection involves several steps, including image preprocessing, segmentation, feature extraction, and classification. The accuracy of these steps directly affects the reliability and accuracy of the classification algorithms used to identify plant diseases. Recent studies have reviewed the state-of-the-art techniques used in this context and evaluated their effectiveness in real life applications. Overall, the use of advanced technologies such as deeplearning and imageprocessing for disease detection in wheat crops holds great promise in improving crop yields and reducing losses due to diseases. By identifying diseases at an early stage, farmers can take appropriate measures to control the spread of the disease and prevent significant crop damage. This research work presents a comparative analysis of the state-of-the-art work that has been carried out in this context and the effectiveness of the techniques in real-time.
SAR target Automatic Target Recognition (ATR) is indispensable in SAR image interpretation. Recently, deeplearning technology has been widely used in SAR target recognition tasks. Most networks achieve incremental im...
详细信息
ISBN:
(数字)9798331515669
ISBN:
(纸本)9798331515676
SAR target Automatic Target Recognition (ATR) is indispensable in SAR image interpretation. Recently, deeplearning technology has been widely used in SAR target recognition tasks. Most networks achieve incremental improvements in target recognition by modifying their structures to extract visual features of targets. However, due to the unique imaging mechanism, relying solely on visual features often leads to the loss of target information. In contrast, the ASC model, which captures the electromagnetic scattering characteristics of the target, plays a crucial role in target recognition tasks. Unfortunately, traditional parameter estimation methods for extracting the ASC model are computationally expensive and time-consuming, making them impractical for real-world applications. To address these issues, we propose a novel target recognition method based on electromagnetic scattering features in this paper. First, a lightweight network-based feature extraction module is designed. Then, the target ASC image is used as the ground truth for guidance, with image intensity and target structure serving as the loss functions during training. Finally, an ASC model-guided feature fusion network is designed, utilizing the fused features for target recognition. On the MSTAR dataset, a visual assessment experiment demonstrated that the proposed feature extraction module effectively extracts electromagnetic scattering features under various operating conditions. Subsequently, in downstream classification tasks, the inclusion of the proposed module resulted in improved accuracy compared to other networks. Additionally, a visualization analysis of the classification network showed that, under the guidance of electromagnetic scattering features, the network achieved good interpretability.
Visual tracking remains a challenging research problem because of appearance variations of the object over time, changing cluttered background and requirement for real-time speed. In this paper, we investigate the pro...
详细信息
Visual tracking remains a challenging research problem because of appearance variations of the object over time, changing cluttered background and requirement for real-time speed. In this paper, we investigate the problem of real-time accurate tracking in a instance-level tracking-by-verification mechanism. We propose a multi-stream deep similarity learning network to learn a similarity comparison model purely off-line. Our loss function encourages the distance between a positive patch and the background patches to be larger than that between the positive patch and the target template. Then, the learned model is directly used to determine the patch in each frame that is most distinctive to the background context and similar to the target template. Within the learned feature space, even if the distance between positive patches becomes large caused by the interference of background clutter, impact from hard distractors from the same class or the appearance change of the target, our method can still distinguish the target robustly using the relative distance. Besides, we also propose a complete framework considering the recovery from failures and the template updating to further improve the tracking performance without taking too much computing resource. Experiments on visual tracking benchmarks show the effectiveness of the proposed tracker when comparing with several recent real-time-speed trackers as well as trackers already included in the benchmarks.
Semantic segmentation is a pixel-level prediction task to classify each pixel of the input image. deeplearning models, such as convolutional neural networks (CNNs), have been extremely successful in achieving excelle...
详细信息
Based on deep-learning approaches, we developed a real-time violence detector for surveillance video systems. In the model given here (overall generality-accuracy-fast response time), CNN serves as a space feature ext...
详细信息
暂无评论