Search Results - Inner Mongolia University Library

Multi-SUAV Collaboration and Low-Altitude Remote Sensing Technology-Based Image Registration and Change Detection Network of Garbage Scattered Areas in Nature Reserves

Remote Sensing, 2022, Vol. 14, No. 24, p. 6352

Authors: Yan, Kai; Dong, Yaxin; Yang, Yang; Xing, Lin. Affiliations: Yunnan Normal Univ, Sch Informat Sci & Technol, Kunming 650500, Peoples R China; Yunnan Normal Univ, Lab Pattern Recognit & Artificial Intelligence, Kunming 650500, Peoples R China; Yunnan Normal Univ, Sch Phys & Elect Informat, Kunming 650500, Peoples R China

Change detection is an important task in remote sensing image processing and analysis. However, due to position errors and wind interference, bi-temporal low-altitude remote sensing images collected by SUAVs often suffer from different viewing angles. The existing methods need to use an independent registration network for registration before change detection, which greatly reduces the integrity and speed of the task. In this work, we propose an end-to-end network architecture RegCD-Net to address change detection problems in the bi-temporal SUAVs' low-altitude remote sensing images. We utilize global and local correlations to generate an optical flow pyramid and realize image registration through layer-by-layer optical flow fields. Then we use a nested connection to combine the rich semantic information in deep layers of the network and the precise location information in the shallow layers and perform deep supervision through the combined attention module to finally achieve change detection in bi-temporal images. We apply this network to the task of change detection in the garbage-scattered areas of nature reserves and establish a related dataset. Experimental results show that our RegCD-Net outperforms several state-of-the-art CD methods with more precise change edge representation, relatively few parameters, fast speed, and better integration without additional registration networks.

Keywords: image registration with optical flow; end-to-end change detection; multi-SUAV; low-altitude remote sensing

Advancing Generalizable Remote Physiological Measurement through the Integration of Explicit and Implicit Prior Knowledge

IEEE Transactions on Image Processing, 2025, Vol. 34, pp. 3764-3778

Authors: Zhang, Yuting; Lu, Hao; Liu, Xin; Chen, Yingcong; Wu, Kaishun. Affiliations: Hong Kong University of Science and Technology (Guangzhou), Information Hub, Guangzhou 511400, China; Lappeenranta-Lahti University of Technology LUT, Computer Vision and Pattern Recognition Laboratory, School of Engineering Science, Lappeenranta 53850, Finland

Remote photoplethysmography (rPPG) is a promising technology for capturing physiological signals from facial videos, with potential applications in medical health, affective computing, and biometric recognition. The demand for rPPG tasks has evolved from achieving high performance in intra-dataset testing to excelling in cross-dataset testing (i.e., domain generalization). However, most existing methods have overlooked the incorporation of prior knowledge specific to rPPG, leading to limited generalization capabilities. In this paper, we propose a novel framework that effectively integrates both explicit and implicit prior knowledge into the rPPG task. Specifically, we conduct a systematic analysis of noise sources (e.g., variations in cameras, lighting conditions, skin types, and motion) across different domains and embed this prior knowledge into the network design. Furthermore, we employ a two-branch network to disentangle physiological feature distributions from noise through implicit label correlation. Extensive experiments demonstrate that the proposed method not only surpasses state-of-the-art approaches in RGB cross-dataset evaluation but also exhibits strong generalization from RGB datasets to NIR datasets. The code is publicly available at https://***/keke-nice/Greip. © 2025 IEEE.

Keywords: domain generalization; explicit prior knowledge; implicit prior knowledge; remote heart rate measurement; rPPG

Fusion Classification of HSI and MSI Using a Spatial-Spectral Vision Transformer for Wetland Biodiversity Estimation

Remote Sensing, 2022, Vol. 14, No. 4, p. 850

Authors: Gao, Yunhao; Song, Xiukai; Li, Wei; Wang, Jianbu; He, Jianlong; Jiang, Xiangyang; Feng, Yinyin. Affiliations: Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China; Shandong Marine Resources & Environm Res Inst, Shandong Prov Key Lab Restorat Marine Ecol, Yantai 264006, Peoples R China; Minist Nat Resources, Inst Oceanog 1, Lab Marine Phys & Remote Sensing, Qingdao 266061, Peoples R China

The rapid development of remote sensing technology provides a wealth of data for earth observation. Land-cover mapping indirectly achieves biodiversity estimation at a coarse scale. Therefore, accurate land-cover mapping is the precondition of biodiversity estimation. However, the wetland environment is complex, and the vegetation is mixed and patchy, so land-cover recognition based on remote sensing is challenging. This paper constructs a systematic framework for multisource remote sensing image processing. Firstly, the hyperspectral image (HSI) and multispectral image (MSI) are fused by a CNN-based method to obtain a fused image with high spatial-spectral resolution. Secondly, considering the sequentiality of spatial distribution and spectral response, the spatial-spectral vision transformer (SSViT) is designed to extract sequential relationships from the fused images. After that, an external attention module is utilized for feature integration, and then pixel-wise prediction is achieved for land-cover mapping. Finally, land-cover mapping and benthos data at the sites are analyzed jointly to reveal the distribution rule of benthos. Experiments on ZiYuan1-02D data of the Yellow River estuary wetland are conducted to demonstrate the effectiveness of the proposed framework compared with several related methods.

Keywords: coastal wetlands; multisource remote sensing; land-cover mapping; biodiversity estimation; spatial-spectral vision transformer

MIPPR 2023: Multispectral Image Acquisition, Processing, and Analysis

SPIE 12th International Symposium on Multispectral Image Processing and Pattern Recognition, MIPPR 2023

ISBN: (print) 9781510674912

The proceedings contain 16 papers. The topics discussed include: fully electrically controlled light-field camera via electrowetting liquid lens and liquid-crystal microlens array; study on transmission and nanofocusing characteristics of surface array micronano metasurface; tuning of near-field optical properties based on magneto-tip array super-surfaces; ice area and 3D ice shape measurement method based on polarized light imaging; study on the polarization response of aluminum gratings with graphene; toroidal composite liquid crystal microlens array co-driven by four independent signal voltages; semi-supervised polarimetric SAR images classification based on FixMatch; and overview of remote sensing image fusion based on deep learning.

Coarse-to-Fine Task-Driven Inpainting for Geoscience Images

IEEE Transactions on Circuits and Systems for Video Technology, 2023, Vol. 33, No. 12, pp. 7170-7182

Authors: Sun, Huiming; Ma, Jin; Guo, Qing; Zou, Qin; Song, Shaoyue; Lin, Yuewei; Yu, Hongkai. Affiliations: Cleveland State Univ, Washkewicz Coll Engn, EECS Dept, Cleveland, OH 44115, USA; ASTAR, Ctr Frontier AI Res CFAR, Singapore 138632, Singapore; ASTAR, Inst High Performance Comp IHPC, Singapore 138632, Singapore; Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China; Beijing Univ Technol, Fac Informat Technol, Beijing 100021, Peoples R China; Brookhaven Natl Lab, Upton, NY 11973, USA

The processing and recognition of geoscience images have wide applications. Most existing research focuses on understanding high-quality geoscience images by assuming that all the images are clear. However, in many real-world cases, geoscience images might contain occlusions introduced during image acquisition. This problem amounts to the image inpainting problem in computer vision and multimedia. As far as we know, all existing image inpainting algorithms learn to repair the occluded regions for better visualization quality; they are excellent for natural images but not good enough for geoscience images, and they never consider the subsequent geoscience task when developing inpainting methods. This paper aims to repair the occluded regions for better geoscience task performance and advanced visualization quality simultaneously, without changing the currently deployed deep-learning-based geoscience models. Because of the complex context of geoscience images, we propose a coarse-to-fine encoder-decoder network with the help of designed coarse-to-fine adversarial context discriminators to reconstruct the occluded image regions. Due to the limited data of geoscience images, we propose a MaskMix-based data augmentation method, which augments inpainting masks instead of augmenting original images, to exploit the limited geoscience image data. The experimental results on three public geoscience datasets for remote sensing scene recognition, cross-view geolocation, and semantic segmentation tasks, respectively, show the effectiveness and accuracy of the proposed method. The code is available at: https://***/HMS97/Task-driven-Inpainting.

Keywords: image recognition; Maintenance engineering; image inpainting; geoscience images; coarse-to-fine; task-driven

Remote Sensing Image Scene Classification via Multi-Level Representation Learning

26th International Conference on Pattern Recognition / 8th International Workshop on Image Mining - Theory and Applications (IMTA)

Authors: Fu, Wei; Yang, Lishuang. Affiliation: Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Peoples R China

ISBN: (digital) 9781665490627

ISBN: (print) 9781665490627

Remote sensing image scene classification (RSSC), which assigns semantic labels to remote sensing images, is very important for remote sensing image interpretation. Thanks to the rapid development of deep learning, RSSC has achieved significant breakthroughs through the use of convolutional neural networks (CNNs). However, a CNN relies on local receptive fields and has difficulty capturing long-range and global scene information. Moreover, the information of salient objects, which helps discriminate the category of scenes (e.g., airplanes indicate the airport scene), should also be exploited. To address this issue, a deep learning method, named multi-level representation learning (MLRL), is proposed to collaboratively extract pixel-level, patch-level, and object-level features, which respectively contain local, global, and object-oriented information. Specifically, pixel-level features are obtained by pixel-wise convolution operations within a CNN. Patch-level features are obtained by a patch-wise self-attention network. Object-level features are acquired by applying a CNN to a cropped sub-image, which conveys important information about salient objects. To this end, a three-branch network structure is built to extract these features respectively. Finally, a decision fusion method is adopted to integrate the multi-level features and produce refined classification results. Experiments conducted on widely used datasets demonstrate the effectiveness of the proposed method.
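
The abstract does not say which decision fusion rule is used. As a minimal sketch only, assuming each of the three branches outputs per-class scores and that fusion is a plain average followed by an argmax, the idea could look like the following. The function name `fuse_decisions` and the example scores are hypothetical and are not taken from the paper.

```python
def fuse_decisions(branch_scores):
    """Average per-class scores over branches; return (label, fused scores).

    Assumption: simple score averaging. The paper's actual fusion rule is
    not described in the abstract and may differ.
    """
    n_branches = len(branch_scores)
    n_classes = len(branch_scores[0])
    fused = [
        sum(scores[k] for scores in branch_scores) / n_branches
        for k in range(n_classes)
    ]
    # The predicted label is the class with the highest fused score.
    label = max(range(n_classes), key=lambda k: fused[k])
    return label, fused


# Hypothetical scores from the pixel-, patch-, and object-level branches.
pixel_level = [0.6, 0.3, 0.1]
patch_level = [0.2, 0.5, 0.3]
object_level = [0.1, 0.7, 0.2]

label, fused = fuse_decisions([pixel_level, patch_level, object_level])
print(label, fused)
```

In this toy case the pixel-level branch alone would pick class 0, while the fused decision follows the two branches that agree on class 1.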

Keywords: Deep learning; Representation learning; Measurement; Image analysis; Fuses; Semantics; Feature extraction

Embedding Spatial Relations in Visual Question Answering for Remote Sensing

26th International Conference on Pattern Recognition / 8th International Workshop on Image Mining - Theory and Applications (IMTA)

Authors: Faure, Maxime; Lobry, Sylvain; Kurtz, Camille; Wendling, Laurent. Affiliation: Univ Paris Cite, LIPADE, F-75006 Paris, France

ISBN: (digital) 9781665490627

ISBN: (print) 9781665490627

Remote sensing images carry a wealth of information that is not easily accessible to end-users as it requires strong technical skills and knowledge. Visual Question Answering (VQA), a task that aims at answering an open-ended question in natural language from an image, can provide an easier access to this information. Considering the geographical information contained in remote sensing images, questions often embed an important spatial aspect, for instance regarding the relative position of two objects. Our objective is to better model the spatial relations in the construction of a ground-truth database of image/question/answer triplets and to assess the capacity a VQA model has to answer these questions. In this article, we propose to use histograms of forces to model the directional spatial relations between geo-localized objects. This allows a finer modeling of ambiguous relationships between objects and to provide different levels of assessment of a relation (e.g. object A is slightly/strictly to the west of object B). Using this new dataset, we evaluate the performances of a classical VQA model and propose a curriculum learning strategy to better take into account the varying difficulty of questions embedding spatial relations. With this approach, we show an improvement in the performances of our model, highlighting the interest of embedding spatial relations in VQA for remote sensing applications.

Keywords: Training; Visualization; Histograms; Feature extraction; Question answering (information retrieval); Spatial databases; Pattern recognition

Real-time Image Recognition System based on Ultra-Low-Latency Spiking Neural Networks

2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

Authors: Lan, Bin; Zhang, Xi; Dong, Heng; Xu, Ming. Affiliations: Beijing Institute of Technology, Chongqing Innovation Center, Chongqing 401120, China; Beijing Institute of Technology, Beijing 100081, China; Beijing Institute of Technology, Innovation Equipment Research Institute, Sichuan TianFu New Area, Chengdu 610299, China; Beijing 100081, China

ISBN: (print) 9798331515669

Remote sensing technology plays an important role in many tasks such as natural disaster detection, weather and climate monitoring, and military defense. Currently, remote sensing image processing predominantly relies on Convolutional Neural Networks (CNNs) and Transformers, which require a huge number of multiplication and addition operations, putting pressure on deployment to satellite platforms with strict power consumption and computing power constraints. A spiking neural network (SNN), where the signal is encoded as a spike train instead of analog values, significantly reduces energy consumption compared to traditional artificial neural networks. This paper proposes an all-input parallel deployment approach that minimizes or eliminates interactions with off-chip memory, thereby accelerating inference speed and reducing power consumption. We deployed an ultra-low-latency LeNet SNN on the Xilinx VC709 Board. Experimental results demonstrate that the proposed accelerator achieves a processing speed of 1240 frames per second, an accuracy rate of 98.14%, and a power consumption of only 0.745 Watts, making it well-suited for the low power and real-time requirements of remote sensing tasks. © 2024 IEEE.
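
To illustrate what encoding a signal as a spike train can mean, here is a minimal sketch of a single integrate-and-fire neuron. It is a generic textbook-style model and is not the neuron model, network, or hardware design of the paper. The function name `integrate_and_fire`, the threshold of 1.0, and the subtractive reset are all assumptions.

```python
def integrate_and_fire(inputs, threshold=1.0):
    """Convert a sequence of analog input values into a binary spike train.

    Assumption: a non-leaky integrate-and-fire neuron with subtractive
    reset. The membrane potential accumulates the input at each time step
    and the neuron emits a spike (1) whenever it reaches the threshold.
    """
    potential = 0.0
    spikes = []
    for x in inputs:
        potential += x
        if potential >= threshold:
            spikes.append(1)
            potential -= threshold  # subtractive ("soft") reset
        else:
            spikes.append(0)
    return spikes


# A constant input of 0.5 makes the neuron fire on every second step.
print(integrate_and_fire([0.5, 0.5, 0.5, 0.5]))  # [0, 1, 0, 1]
```

A larger input produces a denser spike train, so the analog value is carried by the firing rate, and downstream layers can work with additions triggered by spikes instead of multiplications.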

Keywords: Convolutional neural networks

Remote sensing image semantic segmentation network based on ENet

Journal of Engineering (JOE), 2022, Vol. 2022, No. 12, pp. 1219-1227

Author: Wang, Yiqin. Affiliation: Jinzhong Univ, Sch Informat Technol & Engn, 199 Wenhua St, Jinzhong 030619, Shanxi, Peoples R China

The current image semantic segmentation methods cannot meet the requirements of high precision and high speed for remote sensing image analysis. The ENet network model builds a semantic segmentation network, which has the characteristics of few network parameters and fast operation speed. The attention mechanism module is integrated with the ENet network model, which can deeply mine image features in remote sensing datasets and ensure the accuracy of semantic segmentation. The author combines the ENet network with the attention mechanism to construct a new semantic segmentation network model. The model first constructed a remote sensing image semantic segmentation network model based on the ENet network, and simplified the model to further improve the speed of image segmentation and recognition. Then, the attention mechanism module is fused with the ENet network model, which can conduct deep and orderly mining of the image features of the remote sensing image data set. It can meet the accuracy requirements of remote sensing image semantic analysis. Simulations are performed based on three general datasets, and the experimental results show high accuracy and high speed.

Keywords: image features; network parameters; remote sensing image analysis; image segmentation; remote sensing image semantic segmentation network model; geophysical image processing; attention mechanism module; ENet network model; remote sensing image semantic analysis; remote sensing; remote sensing datasets; remote sensing image data; current image semantic segmentation methods; feature extraction

An Enhanced Semi-Supervised Support Vector Machine Algorithm for Spectral-Spatial Hyperspectral Image Classification

Pattern Recognition and Image Analysis, 2024, Vol. 34, No. 1, pp. 199-211

Authors: He, Ziping; Xia, Kewen; Zhang, Jiangnan; Wang, Sijie; Yin, Zhixian. Affiliations: Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China; Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China

Hyperspectral image (HSI) classification has become an important issue in remote sensing due to the significant amount of spectral information in HSI. The costly and time-consuming annotation of HSIs limits the number of labeled samples. To address this problem, we propose an enhanced semi-supervised support vector machine algorithm for spectral-spatial HSI classification. To fully capture the spectral and spatial information of HSI, we use local binary patterns to obtain spatial features. The captured spatial features are concatenated with the spectral features to yield hybrid spectral-spatial features. A self-training mechanism is then adopted to gradually select confident unlabeled samples with their pseudo-labels and add them to the labeled set. To further improve the classification performance of the semi-supervised support vector machine, we choose a cuckoo search algorithm based on the chaotic catfish effect to find its optimal combination of parameters. The experimental results on two publicly available HSI datasets show that the proposed model achieves excellent classification accuracy for each category in hyperspectral images, and also has superior overall accuracy compared with other algorithms. Extensive experiments and analysis illustrate the promising potential of our proposed model for HSI classification.
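
As a rough illustration of the feature construction step, the sketch below computes a basic 8-neighbour local binary pattern (LBP) code for the centre pixel of a 3x3 patch and appends it to that pixel's spectral vector. It is a simplification under stated assumptions: the helper names `lbp_code` and `spectral_spatial_feature`, the clockwise neighbour order, and the use of a single raw code (instead of, for example, an LBP histogram over a window) are illustrative choices, not details from the paper.

```python
def lbp_code(patch):
    """Basic 8-neighbour LBP code for the centre pixel of a 3x3 patch.

    Each neighbour contributes one bit: 1 if its value is greater than or
    equal to the centre value, else 0. Neighbours are visited clockwise
    from the top-left pixel (an assumed convention).
    """
    center = patch[1][1]
    offsets = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
    code = 0
    for bit, (r, c) in enumerate(offsets):
        if patch[r][c] >= center:
            code |= 1 << bit
    return code


def spectral_spatial_feature(spectrum, patch):
    """Concatenate a pixel's spectral vector with its LBP spatial feature."""
    return list(spectrum) + [lbp_code(patch)]


# Hypothetical data: a flat 3x3 patch from one band and a 2-band spectrum.
flat_patch = [[5, 5, 5], [5, 5, 5], [5, 5, 5]]
print(spectral_spatial_feature([0.1, 0.2], flat_patch))  # [0.1, 0.2, 255]
```

The hybrid vector can then be passed to any classifier; the self-training and parameter search steps described in the abstract are outside the scope of this sketch.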

Keywords: hyperspectral image classification; semi-supervised support vector machine; self-training; cuckoo search algorithm; chaotic catfish effect
