The transformer-based image captioning models have shown remarkable performance based on the powerful sequence modeling capability. However, most of them focus only on learning deterministic mappings from image space ...
详细信息
In this paper, we wish to explain the contradiction of quality assessments of pansharpening carried out at full and reduced spatial scales. It seems that at full scale, methods based on component substitution (CS) are...
详细信息
ISBN:
(纸本)9781510666955;9781510666962
In this paper, we wish to explain the contradiction of quality assessments of pansharpening carried out at full and reduced spatial scales. It seems that at full scale, methods based on component substitution (CS) are quantitatively poorer than the other methods, but this depends on the intrinsic space varying misregistration between the two datasets. At reduced scale, the local shifts are divided by the MS-to-Pan scale ratio and thus they tend to vanish. The problem of full-scale quality indexes is that they were originally validated on aerial multispectral (MS) data, with synthetic panchromatic (Pan) and thus total absence of misregistration. In the presence of local misregistration due to inaccurate information of the height of the imaged surface, CS methods locally align the lowpass MS components towards the sharpening Pan, thereby preserving the geometry of the scene;all the other methods produce fading contours because of shifts. The favorable property of CS, however, impacts against the (spectral) consistency property of Wald's protocol, developed when the misalignments between MS and Pan was a small fraction of the pixel size, and hence negligible. In this perspective, methods that do not shift the original MS information are better, even though the visual quality of fading contours is worse. After exposing and explaining the contradiction between full- and reduced-scale assessments, we perform an in-depth analysis of the spectral and spatial consistency indexes of three widespread full-scale protocols: QNR, KQNR and HQNR. We investigate the robustness to shifts of all consistency indexes and propose to couple the spectral index and the spatial index that are least sensitive to shifts. In this way, the ranking of methods of reduced-scale assessments is preserved in full-scale assessments.
Currently, existing methods in land cover recognition in open-pit coal mining areas face the issue of insufficient accuracy due to multiscale and blurred boundaries when processingremotesensingimages. This paper in...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Currently, existing methods in land cover recognition in open-pit coal mining areas face the issue of insufficient accuracy due to multiscale and blurred boundaries when processingremotesensingimages. This paper introduces a remotesensingimage semantic segmentation network, DSDN-Net, to tackle the issues. DSDN-Net adopts MobileNetV2 as the backbone, and a spatial pyramid pooling structure, Dynamic Snake Dense-ASPP (DSDN-ASPP), which enhances the model’s receptive field based on cross-layer connections is designed to increase the model’s receptive field by introducing dynamic snake convolution and utilizing depthwise convolutions with different kernel sizes, allowing the model to focus more on spatial features in remotesensingimages. To handle blurred boundaries in remotesensingimage, the decoder of DSDN-Net incorporates a Convolutional Block Attention Module (CBAM) to enhance precision. A dataset containing 5440 remotesensingimages from the open-pit coal mining area is constructed using high-resolution remotesensingimage. Experimental results on the open-pit coal mining area dataset demonstrate that the proposed DSDN-Net outperforms existing methods in multiple performance metrics.
The proceedings contain 26 papers. The special focus in this conference is on Computer, Communication, and signalprocessing. The topics include: Forecast of Movie Sentiment Based on Multi Label Text Classification on...
ISBN:
(纸本)9783031398100
The proceedings contain 26 papers. The special focus in this conference is on Computer, Communication, and signalprocessing. The topics include: Forecast of Movie Sentiment Based on Multi Label Text Classification on Rotten Tomatoes Using Multiple Machine and Deep Learning Technique;an Ensemble Approach to Hostility Detection in Hindi Tweets;an Optimized Framework for Diabetes Mellitus Diagnosis Using Grid Search Based Support Vector Machine;inquisition of Vision Transformer for Content Based Satellite image Retrieval;impact of Spectral Domain Features for Small Object Detection in remotesensing;application of Phonocardiogram and Electrocardiogram signal Features in Cardiovascular Abnormality Recognition;analyzing Cricket Biomechanical Parameters Through Keypoint Detection and Tracking;multi-feature Based Sea–land Segmentation for Multi-spectral and Panchromatic remote-sensingimagery;deoxyribonucleic Acid Cryptography Based Least Significant Byte Steganography;detection and Estimation of Diameter of Retinal Vessels;a Simple Hybrid Local Search Algorithm for Solving Optimization Problems;Performance Study of RIS Assisted NOMA Based Wireless Network with Passive IoT Communication;multi-channel Man-in-the-Middle Attacks Against Protected Wi-Fi Networks and Their Attack Signatures;a Study in Analysing the Critical Determinants of Internet of Things (IoT) Based Smart processing for Sustainable Supply Chain Management;IOT Enabled Rover for remote Survey of Archeological Areas;an Embedded System for Smart Farming Using Machine Learning Approaches;an Intrusion Detection System for Securing IoT Based Sensor Networks from Routing Attacks;automated Summarization of Gastrointestinal Endoscopy Video;interpretation of Feature Contribution Towards Diagnosis of Diabetic Retinopathy from Exudates in Retinal images;false Positive Reduction in Mammographic Mass Detection;Collaborative CNN with Multiple Tuning for Automated Coral Reef Classification.
The proceedings contain 11 papers. The topics discussed include: accurate shadow height measurement technology of the SAR image;millimeter wave radar fall detection algorithm based on improved transformer;an end-to-en...
ISBN:
(纸本)9798400700040
The proceedings contain 11 papers. The topics discussed include: accurate shadow height measurement technology of the SAR image;millimeter wave radar fall detection algorithm based on improved transformer;an end-to-end learning based covolutional neural network for single image defogging algorithm;ornaments and barlines recognition of numbered musical notation using YOLOv5;study on hyperspectral remotesensingimages of GF-5 de-blurring based on sparse representation;design and implementation of target tracking system in low illumination environment based on FPGA;SAR image geometry correction technology based on block parallel signalprocessing;speech recognition method based on deep learning of artificial intelligence: an example of BLSTM-CTC model;and high precision reference measurement technology for mechanical scanning radar.
Because back-projection algorithm (BPA) is independent of the azimuth-invariant hypothesis of the echo signal, it has significant advantages for synthetic aperture radar (SAR) imaging with arbitrary trajectory. Althou...
详细信息
Few-shot learning has been extensively applied in current remotesensingimage classification, enabling rapid identification of new classes by leveraging prior knowledge effectively. However, current methods mainly re...
Few-shot learning has been extensively applied in current remotesensingimage classification, enabling rapid identification of new classes by leveraging prior knowledge effectively. However, current methods mainly rely on image modality to address the issue of low intra-class similarity and high interclass similarity, while the utilization of multimodal methods in remotesensing tasks remains largely unexplored. Therefore, we propose a novel framework for few-shot remotesensingimage classification, named multi-view image-text perception (MVITP). Specifically, it leverages maximum mutual information across multiple views to train an image encoder and generate image features. A text encoder is employed to generate text features. Next, we introduce a multimodal fusion encoder to capture the similarity between image features and text features. Finally, class predictions are further made by computing the similarity between the support set and the query set. We conduct experiments on three remotesensing datasets, demonstrating the outstanding performance of MVITP.
Comparing with the multispectral remotesensingimage, hyperspectral image (HSI) has higher spectral resolution, a near continuous spectral signature, thus can represent fine spectral variations that occurred in the t...
详细信息
ISBN:
(纸本)9781510655386;9781510655379
Comparing with the multispectral remotesensingimage, hyperspectral image (HSI) has higher spectral resolution, a near continuous spectral signature, thus can represent fine spectral variations that occurred in the temporal domain. This allows more spectral changes to be detected, especially major changes that reflected on the overall spectral signature (associating with the abrupt land-cover transitions), as well as subtle changes that reflect only on a portion of the spectral signature (associating with the change of physicochemical properties of the land-cover classes). Currently, there are some available hyperspectral change detection (CD) data sets. However, they have the following drawbacks. First, there is a lack of diversity in the data source;all data sets were created using the Hyperion sensor mounted on the E0-1 satellite. Second, these data sets mainly concentrate on the river and agriculture scenes, which lose their diversity for representing different land-covers. In this paper, we construct three new change detection data sets by using the multitemporal images acquired by the China's new generation of hyperspectral satellites, i.e., OHS, GF-5 and ZY1-02D. These data sets present various event-driven land-cover changes, such as new building construction, crop replacements, and the expansion of energy facilities. Then a novel unsupervised hyperspectral change detection approach is proposed based on the intrinsic image decomposition (IID). Experimental results confirmed the effectiveness of the proposed approach in terms of higher overall accuracy by comparing with the reference techniques.
The proceedings contain 33 papers. The topics discussed include: dangerous goods detection in x-ray security inspection images based on improved YOLOv7;a lightweight anchor-free detector using cross-dimensional intera...
ISBN:
(纸本)9798400709272
The proceedings contain 33 papers. The topics discussed include: dangerous goods detection in x-ray security inspection images based on improved YOLOv7;a lightweight anchor-free detector using cross-dimensional interactive feature balance for SAR ship detection;network intrusion detection based on federated learning with inherited private models;deep learning-based experimental system design for fatigue driving detection;combined classification of hyperspectral and LiDAR data based on dual-channel cross-transformer;a video classification algorithm based on visual-audio cross-attention;research on hyperspectral image classification based on improved deep cross-domain few-shot learning;multi-object detection and classification in construction sites based on YOLOv5;and joint classification of multi-source remotesensing data based on multi-scale features and attention mechanism.
With the rapid development of various satellite sensor techniques, remotesensingimagery has been an important source of data in change detection applications. This paper aims to propose an unsupervised change detect...
详细信息
ISBN:
(纸本)9781665405409
With the rapid development of various satellite sensor techniques, remotesensingimagery has been an important source of data in change detection applications. This paper aims to propose an unsupervised change detection method based on Object-based Markov Random Filed (OMRF) and Inception UNet (IUNet). Our method first utilizes a difference image (DI) obtained from two bi-temporal images as the initial feature, and proposes the OMRF algorithm based on homogeneous region to pre-classify the DI thus derive the coarse change map. The IUNet is then constructed to extract the points with high confidence from the coarse change map for training. Eventually, the trained model is fed to classify the original feature, then the final change map is obtained. Experimental results indicate that our method yields great detection results even without supervision.
暂无评论