Deep learning models can obtain state-of-the-art performance across many speech and imageprocessing tasks, often significantly outperforming earlier methods. In this paper, we attempt to further improve the performan...
详细信息
ISBN:
(纸本)9781479959341
Deep learning models can obtain state-of-the-art performance across many speech and imageprocessing tasks, often significantly outperforming earlier methods. In this paper, we attempt to further improve the performance of these models by introducing multi-task training, in which a combined deep learning model is trained for two inter-related tasks. We show that by introducing a secondary task (such as shape identification in the object classification task) we are able to significantly improve the performance of the main task for which the model is trained. Using public datasets we evaluated our approach on two image understanding tasks, image segmentation and object classification. On the image segmentation task, we observed that the multi-task model almost doubled the accuracy of segmentation at the pixel-level (from 18.7% to 35.6%) compared to the single task model, and improved the performance of face-detection by 10.2% (from 70.1% to 80.3%). For the object classification task, we observed a 2.1% improvement in classification accuracy (from 91.6% to 93.7%) compared to a single-task model. The proposed multi-task models obtained significantly higher accuracies than previously published results on these datasets, obtaining 22.0% and 6.2% higher accuracies on the face-detetction and object classification tasks respectively. These results demonstrate the effectiveness of multi-task training of deep learning models for image understanding tasks.
image registration is an important topic in many fields including industrial image analysis systems, medical and remotesensing. To improve the registration accuracy, an image registration method that combines scale i...
详细信息
ISBN:
(纸本)9783662456439;9783662456422
image registration is an important topic in many fields including industrial image analysis systems, medical and remotesensing. To improve the registration accuracy, an image registration method that combines scale invariant feature transform and individual entropy correlation coefficient (SIFT-IECC) is proposed in this paper. First, scale invariant feature transform algorithm is applied to extract feature points to construct a transformation model. Then, a rough registration image is obtained according to the transformation model. The individual entropy correlation coefficient is used as the similarity measure to refine the rough registration image. Finally, the experimental results show the superior performance of the proposed SIFT-IECC registration method by comparing with the state-of-the-art methods.
In this paper, we propose a graph cuts (GC) based probability propagation approach to automatically extract road network from complex remotesensingimages. First, the support vector machine (SVM) classifier with a si...
详细信息
The paper presents accuracy comparison of subpixel classification based on medium resolution Landsat images, performed using machine learning algorithms built on decision and regression trees method (C.5.0/Cubist and ...
详细信息
ISBN:
(纸本)9781628413076
The paper presents accuracy comparison of subpixel classification based on medium resolution Landsat images, performed using machine learning algorithms built on decision and regression trees method (C.5.0/Cubist and Random Forest) and artificial neural networks. The aim of the study was to obtain the pattern of percentage impervious surface coverage, valid for the period of 2009-2010. Imperviousness index map generation was a two-stage procedure. The first step was classification, which divided the study area into categories: i) completely permeable (imperviousness index less than 1%) and ii) fully or partially impervious areas. For pixels classified as impervious, the percentage of impervious surface coverage in pixel area was estimated. The root mean square errors (RMS) of determination of the percentage of the impervious surfaces within a single pixel were 11.0% for C.5.0/Cubist method, 11.3% for Random Forest method and 12.6% using artificial neural networks. The introduction of the initial hard classification into completely permeable areas (with imperviousness index <1%) and impervious areas, allowed to improve the accuracy of imperviousness index estimation on poorly urbanized areas covering large areas of the Dobczyce Reservoir catchment. The effect is also visible on final imperviousness index maps.
Due to the characteristic of remotesensingimage, we propose a novel method based on K-means algorithm also with the improved multi-phrase level set model. Comparing with the classical multi-phase C-V model, the impr...
详细信息
This paper takes Jiuzhaigou area for example, selecting Landsat remotesensingimages in phase time about years of 1999, 2002, 2006 and 2007, using ENVI and ArcGIS software for image data processing, making spatial an...
详细信息
ISBN:
(纸本)9783038352877
This paper takes Jiuzhaigou area for example, selecting Landsat remotesensingimages in phase time about years of 1999, 2002, 2006 and 2007, using ENVI and ArcGIS software for image data processing, making spatial and temporal characteristics and distribution pattern analysis of vegetation index NDVI of Jiuzhaigou forest. The result shows that NDVI of Jiuzhaigou forest has obvious characteristics of spatial and temporal variation. The paper provides the analysis process and reference to the forest vegetation research of other areas.
Similarity or distance measures play an important role in various patternrecognition applications such as classification, clustering, change detection, information retrieval, energy minimization and optimization prob...
详细信息
Active learning(AL) is an effective method in definition of samples, especially when labeled sample number is small. In this paper, we propose two active learning algorithms, which are Random Sampling (RS) and Margin ...
详细信息
ISBN:
(纸本)9783319093390;9783319093383
Active learning(AL) is an effective method in definition of samples, especially when labeled sample number is small. In this paper, we propose two active learning algorithms, which are Random Sampling (RS) and Margin Sampling(MS) algorithms, the two techniques achieve semiautomatic definition of training samples in remotesensingimage classification, starting with a small and representative data set, then according to query criterion, the experts select informative samples to add training data set, the model builds the optimal set of samples which minimizes the classification error. Compared with traditional sample selection methods, the results denote the effectiveness of the proposed AL methods.
We introduce the new notion of multivalued component-tree, that extends the classical component-tree initially devoted to grey-level images, in the mathematical morphology framework. We prove that multivalued componen...
详细信息
We introduce the new notion of multivalued component-tree, that extends the classical component-tree initially devoted to grey-level images, in the mathematical morphology framework. We prove that multivalued component-trees can model images whose values are hierarchically organized. We also show that they can be efficiently built from standard component-tree construction algorithms, and involved in antiextensive filtering procedures. The relevance and usefulness of multivalued component-trees is illustrated by an applicative example on hierarchically classified remotesensingimages.
In this paper, we propose a graph cuts (GC) based probability propagation approach to automatically extract road network from complex remotesensingimages. First, the support vector machine (SVM) classifier with a si...
详细信息
ISBN:
(纸本)9781479957521
In this paper, we propose a graph cuts (GC) based probability propagation approach to automatically extract road network from complex remotesensingimages. First, the support vector machine (SVM) classifier with a sigmoid model is applied to assign each pixel a posterior probability of being labelled as road class, which avoids the weaknesses of hard labels in general SVM. Then a GC based probability propagation algorithm is employed to keep the extracted road results smooth and coherent, which can reduce the connections between roads and road-like objects. Finally, a road-geometrical prior is considered to refine the extraction result, so that the non-road objects in images can be removed. Experimental results on two remotesensingimage datasets indicate the validity and effectiveness of our method by comparing with two other approaches.
暂无评论