Superimposing Electronic Navigational Chart (ENC) data on marine radar images can enrich information for navigation. However, direct image superposition is affected by the performance of various instruments such as Gl...
详细信息
Superimposing Electronic Navigational Chart (ENC) data on marine radar images can enrich information for navigation. However, direct image superposition is affected by the performance of various instruments such as Global Navigation Satellite Systems (GNSS) and compasses and may undermine the effectiveness of the resulting information. We propose a data fusion algorithm based on deeplearning to extract robust features from radar images. By deeplearning in this context we mean employing a class of machine learning algorithms, including artificial neural networks, that use multiple layers to progressively extract higher level features from raw input. We first exploit the ability of deeplearning to perform target detection for the identification of marine radar targets. Then, imageprocessing is performed on the identified targets to determine reference points for consistent data fusion of ENC and marine radar information. Finally, a more intelligent fusion algorithm is built to merge the marine radar and electronic chart data according to the determined reference points. The proposed fusion is verified through simulations using ENC data and marine radar images from real ships in narrow waters over a continuous period. The results suggest a suitable performance for edge matching of the shoreline and real-time applicability. The fused image can provide comprehensive information to support navigation, thus enhancing important aspects such as safety.
The sorghum panicle is an important trait related to grain yield and plant development. Detecting and counting sorghum panicles can provide significant information for plant phenotyping. Current deep-learning-based ob...
The sorghum panicle is an important trait related to grain yield and plant development. Detecting and counting sorghum panicles can provide significant information for plant phenotyping. Current deep-learning-based object detection methods for panicles require a large amount of training data. The data labeling is time-consuming and not feasible for real application. In this paper, we present an approach to reduce the amount of training data for sorghum panicle detection via semi-supervised learning. Results show we can achieve similar performance as supervised methods for sorghum panicle detection by only using 10% of original training data.
Face recognition is one of the most challenging research fields in imageprocessing, pattern recognition and artificial intelligence, and its practical application has a bright future. Face is vulnerable to illuminati...
详细信息
Text detection from natural scene images is an active research area for computer vision, signal, and imageprocessing because of several real-time applications such as driving vehicles automatically and tracing person...
详细信息
Text detection from natural scene images is an active research area for computer vision, signal, and imageprocessing because of several real-time applications such as driving vehicles automatically and tracing person behaviors during sports or marathon events. In these situations, there is a high probability of missing text information due to the occlusion of different objects/persons while capturing images. Unlike most of the existing methods, which focus only on text detection by ignoring the effect of missing texts, this work detects and predicts missing texts so that the performance of the OCR improves. The proposed method exploits the property of DCT for finding significant information in images by selecting multiple channels. For chosen DCT channels, the proposed method studies texture distribution based on statistical measurement to extract features. We propose to adopt Bayesian classifier for categorizing text pixels using extracted features. Then a deeplearning model is proposed for eliminating false positives to improve text detection performance. Further, the proposed method employs a Natural Language processing (NLP) model for predicting missing text information by using detected and recognition texts. Experimental results on our dataset, which contains texts occluded by objects, show that the proposed method is effective in predicting missing text information. To demonstrate the effectiveness and objectiveness of the proposed method, we also tested it on the standard datasets of natural scene images, namely, ICDAR 2017-MLT, Total-Text, and CTW1500.
The problem of transport is one of the fundamental problems in today's large cities and one of the fundamental pillars on which work is being done under the paradigm of smart cities. Although the use of public tra...
详细信息
Existing lane detection methods have achieved remarkable performance in complex real-world scenarios, but many of them still have some problems, such as poor anti-interference ability, poor real-time performance, and ...
Existing lane detection methods have achieved remarkable performance in complex real-world scenarios, but many of them still have some problems, such as poor anti-interference ability, poor real-time performance, and difficult deployment at the vehicle end. In this work, we combine the characteristics of deeplearning and traditional algorithms and propose a two-stage lane detection method: Firstly, the model uses the output of backbone based on line-anchor for feature pooling, and at the same time, an attention mechanism based on line-anchor is applied to extract global features. Then, the pooled features are integrated with the global features to solve the problems such as the occlusion of the lane lines. Then, the fused features are input into the classification branch and regression branch, and then the relevant position information such as the confidence degree and offset of the lane line are output. Finally, according to the characteristics of lane detection, a special predictive post-processing module, Post-processing of Prediction Results (PPR) is designed based on the output lane information. PPR can improve the detection performance without increasing the number of model training parameters. The experimental results show that the recognition time of one frame image by our method is only 0.012s, meanwhile the accuracy rate can reach 95.6%.
Nowadays, deeplearning is a current and a stimulating field of machine learning. deeplearning is the most effective, supervised, time and cost efficient machine learning approach. deeplearning is not a restricted l...
详细信息
Nowadays, deeplearning is a current and a stimulating field of machine learning. deeplearning is the most effective, supervised, time and cost efficient machine learning approach. deeplearning is not a restricted learning approach, but it abides various procedures and topographies which can be applied to an immense speculum of complicated problems. The technique learns the illustrative and differential features in a very stratified way. deeplearning methods have made a significant breakthrough with appreciable performance in a wide variety of applications with useful security tools. It is considered to be the best choice for discovering complex architecture in high-dimensional data by employing back propagation algorithm. As deeplearning has made significant advancements and tremendous performance in numerous applications, the widely used domains of deeplearning are business, science and government which further includes adaptive testing, biological image classification, computer vision, cancer detection, natural language processing, object detection, face recognition, handwriting recognition, speech recognition, stock market analysis, smart city and many more. This paper focuses on the concepts of deeplearning, its basic and advanced architectures, techniques, motivational aspects, characteristics and the limitations. The paper also presents the major differences between the deeplearning, classical machine learning and conventional learning approaches and the major challenges ahead. The main intention of this paper is to explore and present chronologically, a comprehensive survey of the major applications of deeplearning covering variety of areas, study of the techniques and architectures used and further the contribution of that respective application in the real world. Finally, the paper ends with the conclusion and future aspects.
Applications in which researchers aim to extract a single land type from remotely sensed data are quite common in practical scenarios: extract the urban footprint to make connections with socio-economic factors;map th...
详细信息
Applications in which researchers aim to extract a single land type from remotely sensed data are quite common in practical scenarios: extract the urban footprint to make connections with socio-economic factors;map the forest extent to subsequently retrieve biophysical variables and detect a particular crop type to successively calibrate and deploy yield prediction models. In this scenario, the (positive) targeted class is well defined, while the negative class is difficult to describe. This one-class classification setting is also referred to as positive unlabelled learning (PUL) in the general field of machine learning. To deal with this challenging setting, when satellite imagetime series data are available, we propose a new framework named positive and unlabelled learning of satellite imagetime series (PUL-SITS). PUL-SITS involves two different stages: In the first one, a recurrent neural network autoencoder is trained to reconstruct only positive samples with the aim to higight reliable negative ones. In the second stage, both labelled and unlabelled samples are exploited in a semi-supervised manner to build the final binary classification model. To assess the quality of our approach, experiments were carried out on a real-world benchmark, namely Haute-Garonne, located in the southwest area of France. From this study site, we considered two different scenarios: a first one in which the process has the objective to map Cereals/Oilseeds cover versus the rest of the land cover classes and a second one in which the class of interest is the Forest land cover. The evaluation was carried out by comparing the proposed approach with recent competitors to deal with the considered positive and unlabelled learning scenarios.
With the rapid development of artificial intelligence, the emergence of intelligent vehicles has brought disruptive changes to traditional survey technology. In response to the limitations of geological environmental ...
详细信息
ISBN:
(纸本)9798400709272
With the rapid development of artificial intelligence, the emergence of intelligent vehicles has brought disruptive changes to traditional survey technology. In response to the limitations of geological environmental factors on the scope and search progress of field exploration, this article utilizes a modular design concept to develop a system structure framework and specific solutions for each functional module of a multi-functional intelligent survey vehicle. An intelligent control platform based on the Nvidia Jetson TX2 and ROS systems has been built, and software programs for eye movement control and 3D dense reconstruction have been written. A system that can assist motion control through line of sight has been designed, A small, low-cost intelligent survey vehicle that can simultaneously perform real-time scene 3D reconstruction. This survey vehicle combines IoT technology, and on the basis of traditional survey vehicle functions, it has functions such as eye tracking, scene 3D reconstruction, remote object detection, etc. The obtained information is transmitted back to the PC through remote communication for users to view the survey results in real-time. At the same time, the operation of the survey vehicle can be controlled according to the direction of the line of sight, providing intelligent interaction ability and multi-dimensional exploration results for exploration work.
暂无评论