With the widespread application of remotesensing technology, accurate and rapid detection and identification of targets in remotesensingimages have become an important research area. Traditional target detection me...
详细信息
Fusing low spatial resolution hyperspectral (LR HS) and high spatial resolution multispectral (HR MS) images from different modalities aim to obtain high spatial resolution hyperspectral (HR HS) images. However, most ...
详细信息
Fusing low spatial resolution hyperspectral (LR HS) and high spatial resolution multispectral (HR MS) images from different modalities aim to obtain high spatial resolution hyperspectral (HR HS) images. However, most deep neural network (DNN)-based methods overlook the correlation between the spatial domain and spectral domain, leading to limited fusion performance. To solve this problem, we propose the spatial-spectral unfolding network with mutual guidance (SMGU-Net). Specifically, the information of different modalities in the source images is treated as mutual complementary components to derive the reconstruction model. Then, the model is optimized using half-quadratic splitting and gradient descent algorithms and is unfolded into a network that leverages the powerful learning capabilities of DNNs to explore more potential information in the deep feature space. In this way, the network achieves the interaction and supplementarity of cross-modality information generate fused images. Experiments are conducted on four benchmark datasets to demonstrate the effectiveness of SMGU-Net. The code can be downloaded from https://***/yansql/SMGU-Net.
Wild fish recognition is a fundamental problem of ocean ecology research and contributes to the understanding of biodiversity. Given the huge number of wild fish species and unrecognized category, the essence of the p...
详细信息
Wild fish recognition is a fundamental problem of ocean ecology research and contributes to the understanding of biodiversity. Given the huge number of wild fish species and unrecognized category, the essence of the problem is an open set fine-grained recognition. Moreover, the unrestricted marine environment makes the problem even more challenging. Deep learning has been demonstrated as a powerful paradigm in image classification tasks. In this article, the wild fish recognition deep neural network (termed WildFishNet) is proposed. Specifically, an open set fine-grained recognition neural network with a fused activation pattern is constructed to implement wild fish recognition. First, three different reciprocal inverted residual structural modules are combined by neural structure search to obtain the best feature extraction performance for fine-grained recognition;next, a new fusion activation pattern of softmax and openmax functions is designed to improve the recognition ability of open set. Then, the experiments are implemented on the WildFish dataset that consists of 54 459 unconstrained images, which includes 685 known classes and 1 open set unrecognized category. Finally, the experimental results are analyzed comprehensively to demonstrate the effectiveness of the proposed method. The in-depth study also shows that artificial intelligence can empower marine ecosystem research.
Haze significantly hinders the application of autonomous driving, traffic surveillance, and remotesensing. image dehazing serves as a key technology to enhance the clarity of images captured in hazy conditions. Howev...
详细信息
Haze significantly hinders the application of autonomous driving, traffic surveillance, and remotesensing. image dehazing serves as a key technology to enhance the clarity of images captured in hazy conditions. However, the lack of paired annotated training data significantly limits the performance of deep learning-based dehazing methods in real-world scenarios. In this work, we propose a self-supervised polarization image dehazing framework based on frequency domain generative adversarial networks. By incorporating a polarization calculation module into the generator, the Stokes parameters of airlight are accurately estimated, which are used to reconstruct the synthesized hazy image by combining the dehazed image generated via a densely connected encoder-decoder. Furthermore, we optimize the discriminator with frequency domain features extracted by frequency decomposition module and introduce a pseudo airlight coefficient supervision loss to enhance the selfsupervised training. By discriminating between synthetic hazy images and real hazy images, we achieve adversarial training without the need for paired data. Simultaneously, supervised by the atmospheric scattering model, our network can iteratively generate more realistic dehazed images. Extensive experiments conducted on the constructed multi-view polarization datasets demonstrate that our method achieves state-of-the-art performance without requiring real-world ground truth.
We propose SAM-Road, an adaptation of the Segment Anything Model (SAM) [27] for extracting large-scale, vectorized road network graphs from satellite imagery. To predict graph geometry, we formulate it as a dense sema...
详细信息
ISBN:
(纸本)9798350365474
We propose SAM-Road, an adaptation of the Segment Anything Model (SAM) [27] for extracting large-scale, vectorized road network graphs from satellite imagery. To predict graph geometry, we formulate it as a dense semantic segmentation task, leveraging the inherent strengths of SAM. The image encoder of SAM is fine-tuned to produce probability masks for roads and intersections, from which the graph vertices are extracted via simple non-maximum suppression. To predict graph topology, we designed a lightweight transformer-based graph neural network, which leverages the SAM image embeddings to estimate the edge existence probabilities between vertices. Our approach directly predicts the graph vertices and edges for large regions without expensive and complex post-processing heuristics and is capable of building complete road network graphs spanning multiple square kilometers in a matter of seconds. With its simple, straightforward, and minimalist design, SAM-Road achieves comparable accuracy with the state-of-the-art method RNGDet++[57], while being 40 times faster on the City-scale dataset. We thus demonstrate the power of a foundational vision model when applied to a graph learning task. The code is available at https://***/htcr/sam_road.
Deep learning is increasingly being applied in the field of remotesensingimageprocessing. However, researchers often face limitations in establishing high-quality datasets for target recognition in high-resolution ...
详细信息
remote assistance by users is a common means in current robot applications, which can improve the efficiency of transfer tasks. However, the current mainstream methods are costly, relying on auxiliary equipment, and r...
详细信息
ISBN:
(纸本)9798400716607
remote assistance by users is a common means in current robot applications, which can improve the efficiency of transfer tasks. However, the current mainstream methods are costly, relying on auxiliary equipment, and result in a significant workload for users throughout the task. To reduce the cost and workload of remote assistance and improve user interaction with robots. This paper has undertaken the following work using available visual image information. Firstly, proposing a remote guidance method based on facial information, which is derived from analyzing user facial information for gaze estimation and combined with image detection. Secondly, proposing an autonomous transfer method based on position inference, which combines target information in images with the real-time state of the robot. Finally, by integrating the above methods, proposing a vision-based remote assistance method and applyingit in real-world experiments of object transfer tasks.
With the ongoing advancements in artificial intelligence technology, Convolutional Neural Networks (CNNs) have found widespread application in the realm of remotesensingimageprocessing. They prove instrumental in t...
详细信息
remotesensingimage change detection technology is rapidly advancing under the impetus of deep *** this study, a remotesensingimage change detection method based on hybrid backbone and high and low frequency attent...
详细信息
The proceedings contain 140 papers. The topics discussed include: digital multi-scale visual planning model of spatial-geographical landscape pattern of smart parks;visual question answering model based on fusing glob...
ISBN:
(纸本)9781510667563
The proceedings contain 140 papers. The topics discussed include: digital multi-scale visual planning model of spatial-geographical landscape pattern of smart parks;visual question answering model based on fusing global-local feature;imageprocessing of the special sensor microwave/imager based on passive microwave remotesensing;imageprocessing of the special sensor microwave/imager based on passive microwave remotesensing;graptolite image classification based on feature transfer and mixup data enhancement;an image classification method based on few-shot learning;fine-grained imagerecognition based on multi-branch and multi-scale learning;research on road extraction model of remotesensingimage based on the fused convolutional module and attention mechanism;unsupervised aircraft detection in SAR images with image-level domain adaption from optical images;the role of echocardiography segmentation evaluation metrics in clinical diagnosis;and machine vision-based measurement of air compressor crankshaft journal dimensions.
暂无评论