To efficiently remove haze in unmanned aerial vehicle (UAV) remotesensingimages, a novel attention-based feedback dehazing network (AFDN) is proposed, which is constructed by feedback connections and attention-based...
详细信息
ISBN:
(纸本)9781665441155
To efficiently remove haze in unmanned aerial vehicle (UAV) remotesensingimages, a novel attention-based feedback dehazing network (AFDN) is proposed, which is constructed by feedback connections and attention-based feedback blocks (AFBs). It has three major advantages compared with other dehazing algorithms: 1) The feedback connections, which allow network to use previous state to improve current performance, can effectively help the proposed AFDN generate clear remotesensing scenes progressively. 2) The AFBs are specially designed to extract global residual features, in which the dual attention block can usefully reduce redundant information and improve the fitting ability of network. 3) To obtain abundant texture information from UAV remotesensingimages and restore real ground surfaces, an energy loss is employed for texture features learning. Experiments on synthetic datasets and real UAV remotesensingimages verify the superiority of AFDN over several state-of-the-art methods in terms of qualitative and quantitative analysis.
Traditional object detection methods suffer from excessively high false alarm rates in scenarios with scarce training samples. To address this issue, this paper proposes a few-shot optical remotesensing object detect...
详细信息
ISBN:
(数字)9798331515669
ISBN:
(纸本)9798331515676
Traditional object detection methods suffer from excessively high false alarm rates in scenarios with scarce training samples. To address this issue, this paper proposes a few-shot optical remotesensing object detection method based on shape matching. This method first obtains the sliced mask image of the object within the predicted bounding box in the pre-detection process. Then, the shape context algorithm is utilized to perform template matching between the template image and the sliced mask image, thereby eliminating false alarm objects and enhancing the object detection performance. To examine the capability of the proposed method, two public datasets, namely FAIR1M datasets and MAR20 datasets, are selected for experimentation. The experimental results show that our method achieves exceptional effectiveness, yielding F1 scores of 0.705 and 0.866 on the FAIR1M and MAR20 datasets, respectively.
With the emergence and advances of the concept 'Digital Earth', more applications require to use remotesensingimages for analyzing or modeling observed scenes. remotesensingimage classification thus become...
详细信息
Ship image target detection has important applications for ship management. In recent years, target detection based on deep learning has been widely studied in visual ship target detection. However, due to the differe...
详细信息
Haze obscures remotesensingimages, hindering valuable information extraction. To this end, we propose RSHazeNet, an encoder-minimal and decoder-minimal framework for efficient remotesensingimage dehazing. Specific...
Haze obscures remotesensingimages, hindering valuable information extraction. To this end, we propose RSHazeNet, an encoder-minimal and decoder-minimal framework for efficient remotesensingimage dehazing. Specifically, regarding the process of merging features within the same level, we develop an innovative module called intra-level transposed fusion module (ITFM). This module employs adaptive transposed self-attention to capture comprehensive context-aware information, facilitating the robust context-aware feature fusion. Meanwhile, we present a cross-level multi-view interaction module (CMIM) to enable effective interactions between features from various levels, mitigating the loss of information due to the repeated sampling operations. In addition, we propose a multi-view progressive extraction block (MPEB) that partitions the features into four distinct components and employs convolution with varying kernel sizes, groups, and dilation factors to facilitate view-progressive feature learning. Extensive experiments demonstrate the superiority of our proposed RSHazeNet. We release the source code and all pre-trained models at https://***/chdwyb/RSHazeNet.
SAR (synthetic aperture radar) uses platform motion to observe stationary objects and synthesizes a larger antenna aperture to improve resolution, among which spotlight SAR further improves resolution through a longer...
详细信息
Superpixel segmentation with deep learning has been proposed in recent years and is widely employed to reduce the input image primitives for subsequent computer vision tasks. In this paper, we propose a Triple Multi-S...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Superpixel segmentation with deep learning has been proposed in recent years and is widely employed to reduce the input image primitives for subsequent computer vision tasks. In this paper, we propose a Triple Multi-Scale Attention based Network (TMANet) for superpixel segmentation. First, aiming to extract more detailed context information, we design a Triple Multi-Scale Attention (TMA) to adapt the varying object scale and reduce inevitable redundant information in the encoder. Moreover, we observe that the dark parts with low probability in the association map generated by the TMANet are closer to the superpixel boundary. Therefore, we devise Boundary Association (BA) loss based on the association map to obtain fine boundaries and contours. Extensive experiments on public datasets show that TMANet outperforms the state-of-the-art methods to a certain extent. The application in saliency object detection of remotesensing also demonstrates the superiority of the proposed method.
Hashing is very popular for remotesensingimage search. This article proposes a multiview hashing with learnable parameters to retrieve the queried images for a large-scale remotesensing dataset. Existing methods al...
详细信息
Hashing is very popular for remotesensingimage search. This article proposes a multiview hashing with learnable parameters to retrieve the queried images for a large-scale remotesensing dataset. Existing methods always neglect that real-world remotesensing data lies on a low- dimensional manifold embedded in high-dimensional ambient space. Unlike previous methods, this article proposes to learn the consensus compact codes in a view-specific low-dimensional subspace. Furthermore, we have added a hyperparameter learnable module to avoid complex parameter tuning. In order to prove the effectiveness of our method, we carried out experiments on three widely used remotesensing data sets and compared them with seven state-of-the-art methods. Extensive experiments show that the proposed method can achieve competitive results compared to the other method.
Due to the diverse geographical environments, intricate landscapes, and high-density settlements, the automatic identification of urban village boundaries using remotesensingimages remains a highly challenging task....
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Due to the diverse geographical environments, intricate landscapes, and high-density settlements, the automatic identification of urban village boundaries using remotesensingimages remains a highly challenging task. This paper proposes a novel and efficient neural network model called UV-Mamba for accurate boundary detection in high-resolution remotesensingimages. UV-Mamba mitigates the memory loss problem in lengthy sequence modeling, which arises in state space models (SSM) with increasing image size, by incorporating deformable convolutions (DCN). Its architecture utilizes an encoder-decoder framework and includes an encoder with four deformable state space augmentation (DSSA) blocks for efficient multi-level semantic extraction and a decoder to integrate the extracted semantic information. We conducted experiments on two large datasets showing that UV-Mamba achieves state-of-the-art performance. Specifically, our model achieves 73.3% and 78.1% IoU on the Beijing and Xi’an datasets, respectively, representing improvements of 1.2% and 3.4% IoU over the previous best model while also being 6× faster in inference speed and 40× smaller in parameter count. Source code and pre-trained models are available at https://***/Devin-Egber/UV-Mamba.
High-quality remotesensingimages are difficult to obtain due to limited conditions and high cost for data acquisition. With the development of machine vision and deep learning, some image generation methods (e.g., G...
详细信息
ISBN:
(纸本)9781728176055
High-quality remotesensingimages are difficult to obtain due to limited conditions and high cost for data acquisition. With the development of machine vision and deep learning, some image generation methods (e.g., GANs) are introduced into this field, but it's still hard to generate images with good texture details and structural dependencies. We establish Skip Attention Mechanism to deal with this problem, which learns dependencies between local points on low-resolution feature maps, and then upsample the attention map and combine it with high-resolution feature maps. With this method, long-range dependencies learned from low-resolution are used for generating remotesensingimages with more structural details. We name this method as Skip Attention GAN, which is the first method applying cross-scale attention mechanism for unsupervised remotesensingimage generation. Experiments show that our method outperforms previous methods under several metrics. Visual and ablation results of attention layers show that Skip Attention has learned long-distance structural dependencies between similar targets.
暂无评论