In this paper, we briefly summarize the first challenge on moving object detection and tracking in satellite videos (SatVideoDT). this challenge has three tracks related to satellite video analysis, including moving o...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
In this paper, we briefly summarize the first challenge on moving object detection and tracking in satellite videos (SatVideoDT). this challenge has three tracks related to satellite video analysis, including moving object detection (Track 1), single object tracking (Track 2), and multiple-object tracking (Track 3). 123, 89, and 70 participants successfully registered, while 37, 42, and 29 teams submitted their final results on the test datasets for Tracks 1-3, respectively. the top-performing methods and their results in each track are described with details. this challenge establishes a new benchmark for satellite video analysis.
Zero-shot learning is a challenging problem in many tasks due to the lack of training samples of the unseen classes. the radical-based zero-shot Chinese character recognition methods treat Chinese characters as a comb...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
Zero-shot learning is a challenging problem in many tasks due to the lack of training samples of the unseen classes. the radical-based zero-shot Chinese character recognition methods treat Chinese characters as a combination of radicals and structures, and recognize Chinese characters by identifying the radicals and structures contained in them. Current approaches generally treat the contribution of all radicals to Chinese character recognition as the same, and the recognition results rely on the network's ability to recognize radicals and their corresponding position information, ignoring the potential value of radicals themselves in eliminating the uncertainty of Chinese characters. In this paper, we model the problem of radical-based Chinese character recognition as an uncertainty elimination problem and propose a Critical Radical Analysis Network (CRAN) to explore the Ideographic Description Sequence (IDS) information for zero-shot Chinese character recognition. Specifically, we propose a novel method to compute the critical values of radicals based on information theory using the predefined Chinese character IDS dictionary. In recognition, we use an iterative approach to translate the predicted radical sequence to target Chinese characters. that is, the radicals of the predicted sequence are sorted in descending order of the critical value, and then the radicals are continuously selected in this order as the information obtained to eliminate the uncertainty of the Chinese character until the character is recognized. We conduct experiments on the CTW, CASIA-AHCDB, and CASIA-HWDB datasets. the experimental results show that the proposed method improves the ability of recognizing unseen Chinese characters, demonstrating the effectiveness of the proposed method.
the proceedings contain 50 papers. the topics discussed include: urban local climate zone classification with a residual convolutional neural network and multi-seasonal sentinel-2 images;an elegant end-to-end fully co...
ISBN:
(纸本)9781538684795
the proceedings contain 50 papers. the topics discussed include: urban local climate zone classification with a residual convolutional neural network and multi-seasonal sentinel-2 images;an elegant end-to-end fully convolutional network (E3FCN) for green tide detection using MODIS data;preliminary investigation on single remotesensing image inpainting through a modified GAN;collaborative classification of hyperspectral and LIDAR data using unsupervised image-to-image CNN;neighbor consistency based unsupervised manifold alignment for classification of remotesensing image;feature fusion through multitask CNN for large-scale remotesensing image segmentation;inversion of heavy metal content in a copper mining area based on extreme learning machine optimized by particle swarm algorithm;a method of interactively extracting region objects from high-resolution remotesensing image based on full connection CRF;and multi-branch regression network for building classification using remotesensing images.
the proceedings contain 9 papers. the special focus in this conference is on patternrecognition of Social Signals in Human-Computer Interaction. the topics include: A first-person vision dataset of office activities;...
ISBN:
(纸本)9783030209834
the proceedings contain 9 papers. the special focus in this conference is on patternrecognition of Social Signals in Human-Computer Interaction. the topics include: A first-person vision dataset of office activities;perceptual judgments to detect computer generated forged faces in social media;combining deep and hand-crafted features for audio-based pain intensity classification;deep learning algorithms for emotion recognition on low power single board computers;improving audio-visual speech recognition using gabor recurrent neural networks;evolutionary algorithms for the design of neural network classifiers for the classification of pain intensity;visualizing facial expression features of pain and emotion data.
We perform unsupervised domain adaptation for classification of remotesensing images by exploiting unsupervised manifold alignment approach. Manifold alignment method utilized corresponding points between domains to ...
详细信息
ISBN:
(纸本)9781538684795
We perform unsupervised domain adaptation for classification of remotesensing images by exploiting unsupervised manifold alignment approach. Manifold alignment method utilized corresponding points between domains to align data manifolds of source and target domains, where the corresponding points can be constructed by labeled information. Supposing labeled samples are not available in target domain, we proposed neighbor consistency (NC) constraint to select some target points that have reliable predictions. these points and labeled source data are then used to construct corresponding relationships, resulting in unsupervised manifold alignment. the neighbor consistency based unsupervised manifold alignment is denoted as NCUMA in this paper. Both multispectral and hyperspectral remotesensing data have been used to demonstrate the effectiveness of the NCUMA approach.
Automatic matching of multi-modal remotesensing images remains a challenging task in remotesensing image analysis due to significant non-linear radiometric differences between these images. this paper introduces the...
详细信息
ISBN:
(纸本)9781538684795
Automatic matching of multi-modal remotesensing images remains a challenging task in remotesensing image analysis due to significant non-linear radiometric differences between these images. this paper introduces the phase congruency model with illumination and contrast invariance for image matching, and extends the model to a novel image registration method, named as multi-scale phase consistency (MS-PC). the Euclidean distance between MS-PC descriptors is used as similarity metric to achieve correspondences. the proposed method is evaluated with four pairs of multi-model remotesensing images. the experimental results show that MS-PC is more robust to the radiation differences between images, and performs better than two popular method (i.e. SIFT and SAR-SIFT) in both registration accuracy and tie points number.
In recent years, Fully Convolutional Networks (FCN) has been widely used in various semantic segmentation tasks, including multi-modal remotesensing imagery. How to fuse multi-modal data to improve the segmentation p...
详细信息
ISBN:
(纸本)9781538684795
In recent years, Fully Convolutional Networks (FCN) has been widely used in various semantic segmentation tasks, including multi-modal remotesensing imagery. How to fuse multi-modal data to improve the segmentation performance has always been a research hotspot. In this paper, a novel end-to-end fully convolutional neural network is proposed for semantic segmentation of natural color, infrared imagery and Digital Surface Models (DSM). It is based on a modified DeepUNet and perform the segmentation in a multi-task way. the channels are clustered into groups and processed on different task pipelines. After a series of segmentation and fusion, their shared features and private features are successfully merged together. Experiment results show that the feature fusion network is efficient. And our approach achieves good performance in ISPRS Semantic Labeling Contest (2D).
In man-made structures regularities and repetitions prevails. In particular in building facades lattices are common in which windows and other elements are repeated as well in vertical columns as in horizontal rows. I...
详细信息
ISBN:
(纸本)9781538684795
In man-made structures regularities and repetitions prevails. In particular in building facades lattices are common in which windows and other elements are repeated as well in vertical columns as in horizontal rows. In very-high-resolution space-borne radar images such lattices appear saliently. Even untrained arbitrary subjects see the structure instantaneously. However, automatic perceptual grouping is rarely attempted. this contribution applies a new lattice grouping method to such data. Utilization of knowledge about the particular mapping process of such radar data is distinguished from the use of Gestalt laws. the latter are universally applicable to all kinds of pictorial data. An example with so called permanent scatterers in the city of Berlin shows what can be achieved with automatic perceptual grouping alone, and what can be gained using domain knowledge.
As is well known to all, the training of deep learning model is time consuming and complex. therefore, in this paper, a very simple deep learning model called PCANet is used to extract image features from multi-focus ...
详细信息
ISBN:
(数字)9783030209841
ISBN:
(纸本)9783030209841;9783030209834
As is well known to all, the training of deep learning model is time consuming and complex. therefore, in this paper, a very simple deep learning model called PCANet is used to extract image features from multi-focus images. First, we train the two-stage PCANet using ImageNet to get PCA filters which will be used to extract image features. Using the feature maps of the first stage of PCANet, we generate activity level maps of source images by using nuclear norm. then, the decision map is obtained through a series of post-processing operations on the activity level maps. Finally, the fused image is achieved by utilizing a weighted fusion rule. the experimental results demonstrate that the proposed method can achieve state-of-the-art fusion performance in terms of both objective assessment and visual quality.
暂无评论