The proceedings contain 33 papers. The topics discussed include: stereo matching of remotesensingimages using deep stereo matching;object detection with noisy annotations in high-resolution remotesensingimages usi...
ISBN:
(纸本)9781510645684
The proceedings contain 33 papers. The topics discussed include: stereo matching of remotesensingimages using deep stereo matching;object detection with noisy annotations in high-resolution remotesensingimages using robust EfficientDet;few shot object detection in remotesensingimages;fire segmentation using a squeezesegv2;deep-learning-based remotesensing video super-resolution for Jilin-1 satellite;useable machine learning for Sentinel-2 multispectral satellite imagery;self-supervised multi-task learning for semantic segmentation of urban scenes;impact of different compression rates for hyperspectral data compression based on a convolutional autoencoder;and hyperspectral image classification using spectral-spatial hypergraph convolution neural network.
Few-shot learning has been extensively applied in current remotesensingimage classification, enabling rapid identification of new classes by leveraging prior knowledge effectively. However, current methods mainly re...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Few-shot learning has been extensively applied in current remotesensingimage classification, enabling rapid identification of new classes by leveraging prior knowledge effectively. However, current methods mainly rely on image modality to address the issue of low intra-class similarity and high interclass similarity, while the utilization of multimodal methods in remotesensing tasks remains largely unexplored. Therefore, we propose a novel framework for few-shot remotesensingimage classification, named multi-view image-text perception (MVITP). Specifically, it leverages maximum mutual information across multiple views to train an image encoder and generate image features. A text encoder is employed to generate text features. Next, we introduce a multimodal fusion encoder to capture the similarity between image features and text features. Finally, class predictions are further made by computing the similarity between the support set and the query set. We conduct experiments on three remotesensing datasets, demonstrating the outstanding performance of MVITP.
Multi-source remotesensingimage matching is crucial for remotesensing technology applications. However, the variations in factors such as grayscale, perspective, and sensors between multi-source images present cert...
详细信息
ISBN:
(数字)9798350350920
ISBN:
(纸本)9798350350920
Multi-source remotesensingimage matching is crucial for remotesensing technology applications. However, the variations in factors such as grayscale, perspective, and sensors between multi-source images present certain challenges for image matching. In response to the challenges in matching multi-source remotesensingimages, a matching method based on texture-enhanced region features is proposed. Initially, Gabor filters and the gray-level co-occurrence matrix (GLCM) are used to obtain the texture energy maps, followed by the extraction of maximally stable extremal regions (MSER) on the texture energy maps to acquire region features. Subsequently, the contour descriptors of the features are computed using Fourier descriptors. Finally, feature matching and refinement of the matching results are conducted in conjunction with the fast sample consensus (FSC). We conducted experimental region feature matching on multiple pairs of multi-source remotesensingimages, and the results validate the effectiveness of our method.
Haze causes information loss and quality degradation in remotesensingimages. Unsupervised learning-based dehazing methods aim to reduce reliance on paired hazy images and their labels. However, complex mapping relat...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Haze causes information loss and quality degradation in remotesensingimages. Unsupervised learning-based dehazing methods aim to reduce reliance on paired hazy images and their labels. However, complex mapping relationships often increase the difficulty in network convergence, resulting in color distortion and loss of texture details in remotesensingimages. To address these issues, we propose an unsupervised haze removal method based on saliency-guided transmission refinement for remotesensingimages. Firstly, we introduce a saliency-guided transmission refinement method, which decomposes and recombines two transmission maps obtained under different conditions, guided by saliency information. Secondly, we propose a loss function comprising energy loss and texture loss. The energy loss provides an energy reference based on the coarse transmission estimation, while the texture loss enhances the preservation of texture details. Experimental results demonstrate that our method achieves comparable performance to several supervised methods.
As the annotation of remotesensingimages requires domain expertise, it is difficult to construct a large-scale and accurate annotated dataset. image-level annotation data learning has become a research hotspot. In a...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
As the annotation of remotesensingimages requires domain expertise, it is difficult to construct a large-scale and accurate annotated dataset. image-level annotation data learning has become a research hotspot. In addition, due to the difficulty in avoiding mislabeling, label noise cleaning is also a concern. In this paper, a semantic segmentation method for remotesensingimages based on uncertainty perception with noisy labels is proposed. The main contributions are three-fold. First, a label cleaning method based on iterative learning is presented to handle noise labels such as missing or incorrect annotations. Second, a two-stage semantic segmentation model is proposed for image-level annotation, which eliminates the need for post-processing steps during testing. Lastly, a complementary uncertainty perception function is introduced to improve the utilization of dataset features and enhance the accuracy of segmentation. The effectiveness of this method was verified through comprehensive evaluation with 7 models on four datasets.
Haze obscures remotesensingimages, hindering valuable information extraction. To this end, we propose RSHazeNet, an encoder-minimal and decoder-minimal framework for efficient remotesensingimage dehazing. Specific...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Haze obscures remotesensingimages, hindering valuable information extraction. To this end, we propose RSHazeNet, an encoder-minimal and decoder-minimal framework for efficient remotesensingimage dehazing. Specifically, regarding the process of merging features within the same level, we develop an innovative module called intra-level transposed fusion module (ITFM). This module employs adaptive transposed self-attention to capture comprehensive context-aware information, facilitating the robust context-aware feature fusion. Meanwhile, we present a cross-level multi-view interaction module (CMIM) to enable effective interactions between features from various levels, mitigating the loss of information due to the repeated sampling operations. In addition, we propose a multi-view progressive extraction block (MPEB) that partitions the features into four distinct components and employs convolution with varying kernel sizes, groups, and dilation factors to facilitate view-progressive feature learning. Extensive experiments demonstrate the superiority of our proposed RSHazeNet. We release the source code and all pre-trained models at https://***/chdwyb/RSHazeNet.
remotesensing scene classification (RSSC) seeks to allocate correct semantic labels to remotesensingimages. Recently, numerous algorithms have made significant contributions to enhancing the accuracy of RSSC. Howev...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
remotesensing scene classification (RSSC) seeks to allocate correct semantic labels to remotesensingimages. Recently, numerous algorithms have made significant contributions to enhancing the accuracy of RSSC. However, models with high parameters and computational complexity still dominate. To address this issue, we propose a lightweight network architecture, namely Two-Stage TripletNet. In this proposed algorithm, we employ a two-stage optimizing strategy involving label optimization and loss function optimization. First, a KD-tree generated by remotesensingimage features is utilized to produce visual labels. Secondly, we establish the triplet sampling method based on the visual and semantic labels of the images. Finally, the triplet loss and cross-entropy loss are jointly applied to train our model. Experimental results on mainstream datasets demonstrate the effectiveness of our proposed framework. Meanwhile, the two-stage optimizing strategy renders our model more competitive compared to other state-of-the-art algorithms.
remotesensingimage generation is of great value for virtual environment creation and adversarial learning for fake news detection. It could also address the learning sample shortage in the region of interest. Howeve...
详细信息
remotesensingimage dehazing is essential for preprocessing, but most methods overlook the joint occlusion by clouds and haze. Furthermore, generative model-based restoration struggles with haze, weakening edge recov...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
remotesensingimage dehazing is essential for preprocessing, but most methods overlook the joint occlusion by clouds and haze. Furthermore, generative model-based restoration struggles with haze, weakening edge recovery for targets obscured by both elements. In this paper, we propose a new diffusion model based on weight-tuned overlap refinement for clouds and haze co-removal in remotesensingimages. Firstly, we integrate the diffusion model into clouds and haze co-removal task, offering a lightweight solution effective even under limited samples. Secondly, we propose a weight-tuned overlap refinement method (WTOR) to guide noise estimation updating in overlaps in the backward-sampling process. It takes the reciprocal of the variance of each overlap as the weight factor, adjusts the backward-sampling update frequency to improve the model's ability of clouds and haze co-removal. Finally, we introduce the gaussian filter feature extraction block (GFAB) to enhance the learning ability of the model for structural information, which can detect local changes and extract features with precision. The experimental results demonstrate that the proposed method is capable of co-removing clouds and haze with reserving rich color and texture details.
The technique of semantic segmentation (SS) holds significant importance in the domain of remotesensingimage (RSI) processing. The current research primarily encompasses two problems: 1) RSIs are easily affected by ...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
The technique of semantic segmentation (SS) holds significant importance in the domain of remotesensingimage (RSI) processing. The current research primarily encompasses two problems: 1) RSIs are easily affected by clouds and haze;2) SS based on strong annotation requires vast human and time costs. In this paper, we propose a weakly supervised semantic segmentation (WSSS) method for hazy RSIs based on saliency-aware alignment strategy. Firstly, we design alignment network (AN) and target network (TN) with the same structure. After training the AN with clear images, we extract the class activation maps of the two networks and construct a consistency loss to train the TN with hazy images. Secondly, we design a multi-scale channel-spatial attention module in the two classification networks to solve the problems of unclear foreground-background boundary and blurred target texture in hazy images. Finally, the pseudo-labels generated by the TN are utilized to train a feedback saliency analysis network, which is subsequently employed for obtaining segmentation results during the testing phase. The experiment results demonstrate that our approach achieves superior SS performance for hazy RSIs.
暂无评论