Building roof type detection from remotely sensed images is a crucial task for many remotesensing applications, including urban planning and disaster management. In recent years, deep learning-based object detection ...
详细信息
ISBN:
(纸本)9798350343557
Building roof type detection from remotely sensed images is a crucial task for many remotesensing applications, including urban planning and disaster management. In recent years, deep learning-based object detection approaches have demonstrated outstanding performance in this field. However, most of these approaches assume that the training and testing data are sampled from the same distribution. When there are differences between the distributions of training and test data, known as domain shift, the performance significantly degrades. In this paper, we proposed a domain generalization method to address domain shift at the instance and image level for roof type detection from remotesensingimages. Furthermore, we evaluated our proposed method with IEEE Data Fusion Contest 2023 dataset. The proposed approach is the first of its kind in terms of domain generalization for remotesensing object detection.
Detecting building change in bitemporal remotesensing (RS) imagery requires a model to highlight the changes in buildings and ignore the irrelevant changes of other objects and sensing conditions. Buildings have comp...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Detecting building change in bitemporal remotesensing (RS) imagery requires a model to highlight the changes in buildings and ignore the irrelevant changes of other objects and sensing conditions. Buildings have comparatively less diverse textures than other objects and appear as repetitive visual patterns on RS images. In this paper, we propose Gabor Feature Network (GFN) to extract the distinctive repetitive texture features of buildings. Furthermore, we also design Feature Fusion Module (FFM) to fuse the extracted multiscale features from GFN with the features from a Transformer-based encoder to pass on the texture features to different parts of the model. Using GFN and FFM, we design a Transformer-based model, called GabFormer for building change detection. Experimental results on the LEVIR-CD and WHU-CD datasets indicate that GabFormer outperforms other SOTA models and in particular show significant improvement in the generalization capability. Our code is available on https://***/Ayana-Inria/GabFormer.
remotesensingimages are widely used in various fields, such as geographic information systems, environmental monitoring, and agricultural management. However, remotesensingimages are often corrupted by various noi...
详细信息
remotesensingimage target detection is one of the key technologies in the field of intelligent interpretation of remotesensingimages, and it has significant application value in various areas, including military d...
详细信息
Feature representations are the fundamental building blocks for various image analysis tasks like retrieval and classification. Traditionally, the Bag-of-Visual-Words (BoVW) framework has been a well-established appro...
详细信息
ISBN:
(纸本)9798350351491;9798350351484
Feature representations are the fundamental building blocks for various image analysis tasks like retrieval and classification. Traditionally, the Bag-of-Visual-Words (BoVW) framework has been a well-established approach for feature extraction. However, BoVW exhibits limitations in capturing the complex spectral-spatial relationships and fine-grained details within remotesensingimages (RSIs). This paper proposes a novel deep learning approach that addresses these shortcomings and fits well with the BoVW framework for RSI analysis. Our method leverages a hybrid architecture that combines a Convolutional Neural Network (CNN) and a Graph Convolutional Network (GCN). The CNN extracts high-level spatial features from RSIs capturing prominent visual patterns and spatial arrangements within the image. These extracted features are then fed into the GCN, which operates on a graph structure constructed from the image data. The GCN excels at learning complex relationships between image elements based on their spatial proximity and spectral information. processing the CNN outputs through the GCN aims to extract and enhance the features that encapsulate both the spatial layout and the intricate spectral-spatial relationships within RSIs. This potentially translates to improved performance in various RSI analysis tasks that demand a nuanced understanding of the image content.
remotesensingimages (RSIs) are usually degraded by haze, losing the spectral fidelity and texture details. Most previous dehazing works adopted a unified dehazing strategy to process the entire image, ignoring the d...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
remotesensingimages (RSIs) are usually degraded by haze, losing the spectral fidelity and texture details. Most previous dehazing works adopted a unified dehazing strategy to process the entire image, ignoring the discrepancies of haze density, spectral information and texture complexity among different regions in one RSI, and thus resulted in the distorted spectral and blurred texture. In this paper, we propose a skip-connected network based on haze density estimation and saliency-driven dual channel fusion, realizing a differentiated defogging method. First, we propose a haze density estimation model, generating the haze density map. Second, we design a saliency-driven dual channel fusion to distinguish the spectral and texture features of different regions in the input, generating the saliency map. The two maps above jointly serve as guidance for the network to perform differentiated dehazing. Finally, an attention-based skip-connected block is introduced, which helps to fuse multi-scale information and obtain more refined dehazing results. A mixed loss function is also constructed to retain more texture details. The efficiency of the proposed method is validated by comparing its performance with other SOTA schemes.
Due to the rich details of residential areas and the characteristics of remotesensingimage sharpness vulnerable to haze, it will not only consume a lot of labor costs but also be very difficult to produce a large-sc...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Due to the rich details of residential areas and the characteristics of remotesensingimage sharpness vulnerable to haze, it will not only consume a lot of labor costs but also be very difficult to produce a large-scale dataset with strong labels. Therefore, the limited-sample dataset has become a hotspot in recent years. To address this issue, we proposed a semantic segmentation method for residential areas by phase learning. The main task of the first stage is to generate a joint saliency map by reducing the interference of haze noise through the feature comparison similarity sorting algorithm and combine them to generate initial pixel-level pseudo labels for the next stage of training. In the second stage, we proposed to construct a group feature interactive perception module to achieve image group semantic co-segmentation. Comprehensive evaluations with 2 datasets and the comparison with 7 methods validate the superiority of the proposed model.
The improvement in imaging resolution and the increase in imaging swath of remotesensing satellites have enabled the acquisition of vast and complex remotesensingimage data. Effectively and comprehensively extracti...
详细信息
Change detection plays a crucial role in remotesensing tasks. However, current deep learning-based change detection methods suffer from issues such as misclassified pixels and unclear segmentation result on edges. To...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Change detection plays a crucial role in remotesensing tasks. However, current deep learning-based change detection methods suffer from issues such as misclassified pixels and unclear segmentation result on edges. To address these challenges, we propose a novel approach called Mixed-feature Attention Siamese Network (MAS-Net). MAS-Net adopts an encoder-decoder structure, where the encoder part effectively fuses image features from different time points, thereby preserving more cross-time information in the feature map. In the decoder part, we introduce the Feature Global Attention Module (FGAM) to leverage the global attention mechanism for extracting deep semantic information from the fused feature map. By incorporating these proposed modules and strategies, MAS-Net achieves fewer misclassified pixels and clearer edges in the resulting change detection maps. Experimental evaluations on the LEVIR-CD and CDD datasets demonstrate that MAS-Net outperforms state-of-the-art models by 0.15% (91.88% vs. 91.73%) and 1.3% (97.5% vs. 96.2%) in terms of F1-Score, respectively, thus establishing a solid baseline for change detection.
The dominant method of processing sonar data is using image-based representations, requiring the preprocessing of image data on autonomous systems. We propose an alternative data processing method for remotesensing a...
详细信息
暂无评论