Multi-angle Synthetic Aperture Radar (SAR) is a new SAR working mode to acquire multi-dimensional information of target. By observing the target area from different observation angles, the multi-angle SAR has the abil...
详细信息
Terrain image classification is an important research direction in the field of remotesensing and computer vision, aiming to realize automatic recognition and classification of different landform features through the...
详细信息
ISBN:
(纸本)9798331529314
Terrain image classification is an important research direction in the field of remotesensing and computer vision, aiming to realize automatic recognition and classification of different landform features through the analysis and processing of terrain images. In this paper, a deep learning algorithm based on Vision Transformer (ViT) is used to classify terrain images, and the performance of the algorithm in this task is systematically evaluated. In the process of model construction, we first imported the Vision Transformer model and made corresponding parameter Settings to ensure its adaptability and effectiveness. After training, it is observed that the loss function of the training set decreases from the initial value of 2.84 to 0.35, a decrease of 2.49, indicating that the model tends to converge in continuous optimization. At the same time, the accuracy is also significantly improved, from 55.9% to 86.8%, an increase of 30.9%, showing the enhancement of the model's learning ability. For the validation set, loss also decreased from 0.78 to 0.47, a decrease of 0.31, while accuracy increased from 60.1% to 83.5%, an increase of 23.4%. These results further prove the good performance of the model on different data sets and its convergence trend. In addition, through the evaluation of the test set, we get more specific performance indicators: The accuracy of the terrain image classification model based on Vision Transformer on the test set reaches 89.9%, the Precision is 0.9615, the Recall is 0.9494, and the F1-score is 0.9554. These indicators show that the model not only has high classification accuracy, but also performs well in generalization ability. To sum up, this research demonstrates the effectiveness of Vision Transformer deep learning algorithm in terrain image classification, and provides a new solution idea and method for related fields. Through continuous optimization and adjustment, the algorithm is expected to achieve more extensive promotion in pract
In this paper, we present a new framework for view synthesis of novel view based on Neural Radiance Fields(NeRF). We aim to address two main limitations of NeRF. Firstly, we propose to combine multi-view stereo into N...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
In this paper, we present a new framework for view synthesis of novel view based on Neural Radiance Fields(NeRF). We aim to address two main limitations of NeRF. Firstly, we propose to combine multi-view stereo into NeRF to help construct general neural radiance fields across different scenes. Specifically, We build a MVS-Encoding Feature Volume with average group-wise correlation to aggregate the multi-view appearance and geometry feature for every source view. And then we use an MLP to encode neural radiance fields by using the scene-dependent features interpolated from the MVS-Encoding Feature Volumes. This makes our model can be applied to other unseen scenes without any per-scene fine-tuning, and render realistic images with few images. If more training images are provided, our method can be fine-tuned quickly to render more realistic images. In fine-tuning phase, we propose a depth priors guided sampling method, which can make the model represent more accurate geometry for corresponding scenes and so render high-quality images of novel view. We evaluate our method on three common datasets. The experiment results show that our method performs better than other baselines, neither without or with fine-tuning. And the depth priors guided sampling method can be easily applied on other methods based on Neural Radiance Fields to further improve the quality of rendered images.
Recent advances in automated image analysis have lead to an increased number of proposed datasets in remotesensing applications. This permits the successful employment of data hungry state-of-the-art deep neural netw...
详细信息
ISBN:
(纸本)9781728188089
Recent advances in automated image analysis have lead to an increased number of proposed datasets in remotesensing applications. This permits the successful employment of data hungry state-of-the-art deep neural networks. However, the Earth is not covered equally by semantically meaningful classes. Thus, many land cover datasets suffer from a severe class imbalance. We show that by taking appropriate measures, the performance in the minority classes can be improved by up to 20 percent without affecting the performance in the majority classes strongly. Additionally, we investigate the use of an attribute encoding scheme to represent the inherent class hierarchies commonly observed in land cover analysis.
Precipitation detection using infrared (IR) brightness temperature (BT) temporal flux data is a challenging problem. Other sensors, such as microwave (MW), have reliable and more robust predictive performance, but lac...
详细信息
Waterbody extraction is essential for monitoring surface changes and supporting disaster response. However, differences in morphology, dimensions, and spectral reflectance, make it problematic to segregate waterbodies...
详细信息
Currently, deep neural networks have been widely used in the field of SAR target recognition. Many researchers found that deep neural networks have an ability of denoising. In many cases, there is no need to denoise i...
详细信息
ISBN:
(纸本)9781665403696
Currently, deep neural networks have been widely used in the field of SAR target recognition. Many researchers found that deep neural networks have an ability of denoising. In many cases, there is no need to denoise in pre-process. But the denoising ability of deep neural networks can take place of conventional denoising algorithm or not is doubtful. In this article, we explore the effect of image denoising algorithms to SAR target recognition methods based on deep neural networks. Firstly, seven traditional denoising algorithms are selected to process two SAR datasets. And these data are utilized to train two kinds of deep neural networks. After comparing and analyzing the training processes and results, we find that 1) The effect of denoising algorithms is influenced by architectures of neural networks and quality of datasets. It is difficult to find a SAR image denoising algorithm, which can improve the accuracy of any recognition network. Sometimes they even drag down the performance of recognition networks. 2) The deep networks with more layers will have better denoising ability, so the effect of denoising algorithms will decrease. For ResNet, there is no need to add the denoising processing.
High-resolution remotesensingimages can finely express rich surface information. Using the macroscopic and spatial-temporal full coverage advantages of high-resolution remotesensingimages for urban building object...
详细信息
image registration is a basic problem in image analysis and imageprocessing. image registration has important applications in aerial image fusion, patternrecognition, three-dimensional reconstruction and other field...
ISBN:
(纸本)9781450397148
image registration is a basic problem in image analysis and imageprocessing. image registration has important applications in aerial image fusion, patternrecognition, three-dimensional reconstruction and other fields. Aiming at the problem of low registration accuracy and mismatching in remotesensingimage registration, this paper proposes to use the Involution kernel to improve the ResNext network in the feature extraction stage and combines the SPANet attention mechanism with an improved ResNext network to improve the feature extraction ability of the network. In the feature matching stage, an enhanced matching method is proposed, which uses cross-correlation and nearest neighbor to second nearest neighbor ratio to filter out mismatched points to cope with complex images and background interference. The experimental results show that the proposed algorithm can achieve superior results in a variety of indexes compared with other algorithms, which proves that the proposed algorithm is effective.
The proceedings contain 88 papers. The special focus in this conference is on Soft Computing and patternrecognition. The topics include: A Survey on Ensemble Multi-label Classifiers;semantic Type Detection in Tabular...
ISBN:
(纸本)9783031275234
The proceedings contain 88 papers. The special focus in this conference is on Soft Computing and patternrecognition. The topics include: A Survey on Ensemble Multi-label Classifiers;semantic Type Detection in Tabular Data via Machine Learning Using Semi-synthetic Data;Handwritten Character recognition Using Deep LSTM Approach;two Phase Ensemble Learning Based Extractive Summarization for Short Documents;spider Monkey Based K-Means Dynamic Collaborative Filtering for Movie Recommendation Systems;Fuzzy Bi-GRU Based Hybrid Extractive and Abstractive Text Summarization for Long Multi-documents;a Genomic Signal processing-Based Coronavirus Classification Model Using Deep Learning with Web-Based Console;study on Drowsiness Detection System Using Deep Learning;convolutional Neural Networks for Crack Detection on Flexible Road Pavements;sexism Classification in Social Media Using Machine Learning Algorithms;improving the Performance of Classification via Clustering on the Students’ Academic Performance using Stacking Algorithm;deep Learning and Machine Learning-Based Lung Nodule Detection Systems – An Analysis;breast Cancer Classification Techniques – An Investigation;A Study on Student Performance Prediction and Intervention Mechanisms in MOOC;Bi-stage QWOA-Based Efficient Feature Selection for Enhanced Depression Detection Based on Facial Cues;some Cases of Prediction and Inference with Uncertainty;interpretable Approaches to Predict Evapotranspiration;deep Convolutional Neural Network Model for Classifying Alzheimer’s Disease;comparative Hybrid Deep Convolutional Learning Framework with Transfer Learning for Diagnosis of Lung Cancer;Review of Supply Chain Management Operational Resilience, Risk, and Disruption in COVID-19 Pandemic;adaptable Fog Computing Framework for Healthcare 4.0;ear images Classification Based on Data Augmentation and ResNeXt50;machine Learning Approaches for Crop Identification from remotesensingimagery: A Review;emotion recognition of Speech.
暂无评论