Multi-source remote sensing images exhibit large differences in texture and gray level, so mismatches and low recognition accuracy easily occur when identifying targets. Thus, in this paper, a target recognition algorithm for multi-source remote sensing images based on IoT vision is investigated. Infrared sensors and SAR radars are placed in the visual perception layer of the iVIOT, and this layer transmits the collected remote sensing image information to the application layer through wireless networks. The data processing module in the application layer uses normalized central moments to extract features from the multi-source remote sensing images. A two-level Contourlet decomposition is then performed on the feature-extracted images to realize multi-scale, multi-directional feature fusion. A two-step coarse-to-fine method is used to match the fused features, and the random sample consensus (RANSAC) algorithm is used to eliminate false matches and obtain correct match pairs. After feature matching is completed, the BVM target detection operator is used to complete target recognition in the multi-source remote sensing image. Experimental results show that using IoT vision to recognize the desired remote sensing image targets incurs low communication overhead, and recognition accuracy reaches 99%.
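As a rough illustration only (not the authors' implementation), the sketch below shows two steps named in the pipeline above: normalized central moments as region features and RANSAC-based removal of false match pairs, using OpenCV. The function names and the RANSAC reprojection threshold are assumptions.

```python
import cv2
import numpy as np

def normalized_central_moments(gray_patch):
    """Return the normalized central moments nu_pq of a grayscale patch."""
    m = cv2.moments(gray_patch)
    # OpenCV exposes the normalized central moments directly as nu20 ... nu03.
    return np.array([m["nu20"], m["nu11"], m["nu02"],
                     m["nu30"], m["nu21"], m["nu12"], m["nu03"]])

def filter_matches_ransac(pts_src, pts_dst, thresh=3.0):
    """Keep only match pairs consistent with a single homography (RANSAC)."""
    pts_src = np.float32(pts_src).reshape(-1, 1, 2)
    pts_dst = np.float32(pts_dst).reshape(-1, 1, 2)
    H, mask = cv2.findHomography(pts_src, pts_dst, cv2.RANSAC, thresh)
    inliers = mask.ravel().astype(bool)
    return pts_src[inliers], pts_dst[inliers], H
```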
Spatial data mining is an important approach for extracting useful information from big datasets, especially remotely sensed images. This study tackles issues in environmental monitoring and management using sophisticated image processing. The Horse Herd Optimization-based VGG19 (HHO-VGG19) is proposed to improve land cover classification, object recognition, change detection, and anomaly detection. The study used the BCDD dataset, with images scaled to 512 x 512 pixels, then applied Z-score normalization and extracted features using Principal Component Analysis (PCA). The VGG19 architecture was enhanced with Horse Herd Optimization to improve image classification efficiency. The HHO-VGG19 model surpasses conventional techniques, with an F1-score of 92%, a recall of 94%, an accuracy of 98.5%, and a 30-second reduction in execution time. The findings indicate the efficiency of integrating sophisticated image processing with spatial data mining, providing an effective tool for remote sensing image processing in environmental applications, including ecosystem tracking and natural resource management.
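The preprocessing chain described above (Z-score normalization followed by PCA feature extraction) can be sketched as follows. This is a minimal illustration, not the paper's code; the input shape and the number of retained components are assumptions, and the Horse Herd Optimization of the VGG19 stage is not reproduced here.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA

def preprocess(images, n_components=128):
    """images: array of shape (N, 512, 512); returns PCA feature vectors."""
    X = images.reshape(len(images), -1).astype(np.float32)
    X = StandardScaler().fit_transform(X)                  # Z-score normalization
    X = PCA(n_components=n_components).fit_transform(X)    # assumed component count
    return X
```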
ISBN (print): 9781510673991; 9781510673984
Event-based sensors (EBS) consist of a pixelated focal plane array in which each pixel is an independent asynchronous change detector. The analog asynchronous array is read by a synchronous digital readout and written to disk. As a result, EBS pixels consume minimal power and bandwidth unless the scene changes. Furthermore, the change detectors have a very large dynamic range (~120 dB) and rapid response time (~20 us). A framing camera with comparable speed requires roughly 3 orders of magnitude more power and roughly 2 orders of magnitude higher bandwidth. These features make EBS an appealing technology for proliferation detection applications. Remote sensing deployed in the field requires low power, low bandwidth, and low-complexity algorithms. EBS inherently allows for low power and low bandwidth, but a drawback of event-based sensors is the lack of mature image analysis algorithms. While analysis of conventional imagers draws from decades of image processing algorithms, EBS data is a fundamentally different format: a series of x, y, asynchronous time, and polarity of change (increase/decrease), as opposed to x, y, and intensity at a regularly sampled framerate. To leverage the advantages of EBS over conventional imagers, our team has worked to develop and refine image processing algorithms that use EBS data directly. We will discuss these efforts, including frequency and phase detection. We will also discuss field applications of these algorithms, such as degraded visual environments (e.g., fog) and defeating laser dazzling attempts.
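As a loose illustration of frequency detection on event data (not the algorithm the authors describe), one simple approach is to histogram event timestamps into an event-rate signal and take the dominant FFT bin. The event-stream layout, bin width, and function name below are assumptions.

```python
import numpy as np

def dominant_frequency(event_times, duration, bin_width=1e-4):
    """Estimate the strongest modulation frequency (Hz) in a stream of event times (s)."""
    bins = np.arange(0.0, duration + bin_width, bin_width)
    rate, _ = np.histogram(event_times, bins=bins)       # event-rate signal
    spectrum = np.abs(np.fft.rfft(rate - rate.mean()))   # mean removal drops the DC term
    freqs = np.fft.rfftfreq(len(rate), d=bin_width)
    return freqs[np.argmax(spectrum)]
```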
Remote network teaching has gained significant importance in recent times, with video images serving as a crucial medium for delivering educational content. Ensuring accurate face recognition in these video images is a key challenge. To address this, we present a face recognition algorithm based on an improved frame difference method. The algorithm focuses on enhancing the accuracy of face recognition specifically in remote network teaching video images. By leveraging a generative adversarial network, we enhance image resolution as a preprocessing step. Subsequently, our proposed image target detection algorithm effectively identifies the face region through foreground and background segmentation. We employ an improved local ternary pattern for face feature extraction, concentrating on the face target region. These features are then input into an integrated neural network face recognition model. Experimental results demonstrate the algorithm's efficacy in clarity enhancement, facial object detection, and feature extraction for remote teaching video images. Notably, the proposed method achieves an average gradient of detail below 0.1 and attains a facial feature matching degree of 0.98, establishing the high accuracy of facial recognition results in remote teaching video images.
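The local ternary pattern feature mentioned above can be sketched as follows; this is a textbook-style simplification, not the paper's improved operator, and the threshold value and neighborhood encoding are assumptions.

```python
import numpy as np

def local_ternary_pattern(gray, t=5):
    """Return upper/lower LTP codes over the 8-neighborhood of each interior pixel."""
    g = gray.astype(np.int16)
    c = g[1:-1, 1:-1]
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]
    upper = np.zeros_like(c, dtype=np.int32)
    lower = np.zeros_like(c, dtype=np.int32)
    for k, (dy, dx) in enumerate(offsets):
        n = g[1 + dy:g.shape[0] - 1 + dy, 1 + dx:g.shape[1] - 1 + dx]
        upper |= (n - c >= t).astype(np.int32) << k   # neighbor clearly brighter than center
        lower |= (c - n >= t).astype(np.int32) << k   # neighbor clearly darker than center
    return upper.astype(np.uint8), lower.astype(np.uint8)
```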
ISBN (digital): 9783031127007
ISBN (print): 9783031126994; 9783031127007
With the ongoing development of deep learning techniques in recent years, convolutional neural networks (CNNs) have shown remarkable performance breakthroughs in remote sensing image scene classification. However, the performance of these deep models largely depends on the number of available training samples or labeled images. Although knowledge transfer and pre-training techniques can handle such situations, they may become ineffective due to domain differences. On the other hand, existing data augmentation approaches often produce training samples with too little diversity to improve performance. To address these issues, in this work we propose PReLim, a novel modeling paradigm for remote sensing scene classification under the limited-labeled-samples scenario. PReLim is based on the notion of local and global filtering of scene fragment mixtures, which overcomes both the sample diversity and the domain difference issues. Experimental analyses on the benchmark UCMerced and SIRI-WHU datasets demonstrate the effectiveness of PReLim in achieving state-of-the-art accuracy using a limited number of training samples.
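The abstract does not spell out PReLim's local and global filtering, so the sketch below shows only a generic scene-fragment mixing augmentation to give intuition for a fragment mixture; the grid size, equal-sized inputs, and sampling strategy are assumptions, and this is not the PReLim algorithm.

```python
import numpy as np

def mix_scene_fragments(images, grid=4, rng=None):
    """Tile random fragments from several same-class scene images into one sample."""
    rng = rng or np.random.default_rng()
    out = images[0].copy()                 # assumes all inputs share the same shape
    h, w = out.shape[:2]
    fh, fw = h // grid, w // grid
    for i in range(grid):
        for j in range(grid):
            src = images[rng.integers(len(images))]   # pick a donor scene
            out[i * fh:(i + 1) * fh, j * fw:(j + 1) * fw] = \
                src[i * fh:(i + 1) * fh, j * fw:(j + 1) * fw]
    return out
```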
To improve the application efficiency of RGB remote sensing images in agricultural land resource surveys, a cultivated land segmentation algorithm based on kernel space non-uniform regularization classification and im...
ISBN (print): 9783031821554; 9783031821561
The escalating use of Unmanned Aerial Vehicles (UAVs) as remote sensing platforms has garnered considerable attention, proving invaluable for ground object recognition. While satellite remote sensing images face limitations in resolution and weather susceptibility, UAV remote sensing, employing low-speed unmanned aircraft, offers enhanced object resolution and agility. The advent of advanced machine learning techniques has propelled significant strides in image analysis, particularly in semantic segmentation for UAV remote sensing images. This paper evaluates the effectiveness and efficiency of SegFormer, a semantic segmentation framework, for the semantic segmentation of UAV images. SegFormer variants, ranging from real-time (B0) to high-performance (B5) models, are assessed using the UAVid dataset tailored for semantic segmentation tasks. The research details the architecture and training procedures specific to SegFormer in the context of UAV semantic segmentation. Experimental results showcase the model's performance on the benchmark dataset, highlighting its ability to accurately delineate objects and land cover features in diverse UAV scenarios, achieving both high efficiency and strong performance.
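For readers who want to try a SegFormer variant directly, a hedged usage sketch with the HuggingFace transformers library is given below. The ADE20K-pretrained B0 checkpoint and the input file name are stand-ins, since the paper fine-tunes SegFormer on UAVid itself.

```python
import torch
from PIL import Image
from transformers import SegformerImageProcessor, SegformerForSemanticSegmentation

# ADE20K-pretrained B0 checkpoint used as a stand-in for a UAVid-fine-tuned model.
ckpt = "nvidia/segformer-b0-finetuned-ade-512-512"
processor = SegformerImageProcessor.from_pretrained(ckpt)
model = SegformerForSemanticSegmentation.from_pretrained(ckpt)

image = Image.open("uav_frame.png").convert("RGB")   # hypothetical input frame
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits                   # (1, num_classes, H/4, W/4)
pred = logits.argmax(dim=1)[0]                        # per-pixel class indices
```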
ISBN (print): 9798400707032
Hyperspectral object detection (HOD) aims to identify and locate multiple objects in a scene using hyperspectral images (HSIs). While much research has focused on hyperspectral target detection (HTD) at the pixel level, HOD remains underexplored. Traditional HTD methods rely heavily on prior spectral information of the target and simple pixel neighborhood relationships, leading to accuracy issues when targets are occluded. Inspired by advances in RGB image detection, we propose a compact and efficient cloud-robust hyperspectral object detection network (CR-HODNet) using 3D convolution to extract spatial and spectral features jointly. We further enhance these features with channel and spatial attention mechanisms and address cloud occlusion challenges using transformer-based multi-head attention. Our method is validated on real airborne hyperspectral images with synthetic cloud occlusion, showing robust performance in challenging scenarios.
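A rough PyTorch analogue of the joint spatial-spectral feature extraction described above (3D convolution followed by channel attention) is sketched below; the layer sizes and kernel shapes are assumptions, and this is not the CR-HODNet implementation.

```python
import torch.nn as nn

class SpectralSpatialBlock(nn.Module):
    """3D convolution over (bands, H, W) followed by SE-style channel attention."""
    def __init__(self, in_ch=1, out_ch=16, reduction=4):
        super().__init__()
        self.conv3d = nn.Sequential(
            nn.Conv3d(in_ch, out_ch, kernel_size=(7, 3, 3), padding=(3, 1, 1)),
            nn.BatchNorm3d(out_ch), nn.ReLU(inplace=True))
        self.se = nn.Sequential(                       # channel attention weights
            nn.AdaptiveAvgPool3d(1), nn.Flatten(),
            nn.Linear(out_ch, out_ch // reduction), nn.ReLU(inplace=True),
            nn.Linear(out_ch // reduction, out_ch), nn.Sigmoid())

    def forward(self, x):                              # x: (B, 1, bands, H, W)
        f = self.conv3d(x)
        w = self.se(f).view(f.size(0), -1, 1, 1, 1)
        return f * w                                   # reweight spectral-spatial features
```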
ISBN (print): 9798350353006
Recent advances in unsupervised learning have demonstrated the ability of large vision models to achieve promising results on downstream tasks by pre-training on large amounts of unlabelled data. Such pre-training techniques have also been explored recently in the remote sensing domain due to the availability of large amounts of unlabelled data. Different from standard natural image datasets, remote sensing data is acquired from various sensor technologies and exhibits a diverse range of scale variations as well as modalities. Existing satellite image pre-training methods either ignore the scale information present in remote sensing imagery or restrict themselves to a single type of data modality. In this paper, we revisit transformer pre-training and leverage multi-scale information that is effectively utilized with multiple modalities. Our proposed approach, named SatMAE++, performs multi-scale pre-training and utilizes convolution-based upsampling blocks to reconstruct the image at higher scales, making it extensible to include more scales. Compared to existing works, the proposed SatMAE++ with multi-scale pre-training is equally effective for both optical and multi-spectral imagery. Extensive experiments on six datasets reveal the merits of the proposed contributions, leading to state-of-the-art performance on all datasets. SatMAE++ achieves a mean average precision (mAP) gain of 2.5% for the multi-label classification task on the BigEarthNet dataset. Our code and pre-trained models are available at https://***/techmn/satmae_pp.
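The convolution-based upsampling blocks mentioned above can be pictured with the minimal PyTorch sketch below, which doubles the spatial resolution of decoder features and maps them back to image channels; the layer choices are assumptions, and the released SatMAE++ code at the linked repository is the authoritative reference.

```python
import torch.nn as nn

class UpsampleReconstruct(nn.Module):
    """Doubles the spatial resolution of decoder features and maps them to image channels."""
    def __init__(self, in_ch, out_ch, img_ch=3):
        super().__init__()
        self.up = nn.Sequential(
            nn.ConvTranspose2d(in_ch, out_ch, kernel_size=2, stride=2),
            nn.BatchNorm2d(out_ch), nn.GELU())
        self.to_img = nn.Conv2d(out_ch, img_ch, kernel_size=1)

    def forward(self, feat):
        feat = self.up(feat)                 # 2x spatial upsampling
        return feat, self.to_img(feat)       # features for the next scale + reconstruction
```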
Tailing ponds are used to store tailings or industrial waste discharged after beneficiation. Identifying these ponds in advance can help prevent pollution incidents and reduce their harmful impacts on ecosystems. Tailing ponds are traditionally identified via manual inspection, which is time-consuming and labor-intensive. Therefore, tailing pond identification based on computer vision is of practical significance for environmental protection and safety. In the context of identifying tailings ponds in remote sensing, a significant challenge arises from high-resolution images, which capture extensive feature details (such as shape, location, and texture) complicated by the mixing of tailings with other waste materials. This results in substantial intra-class variance and limited inter-class variance, making accurate recognition more difficult. Therefore, to monitor tailing ponds, this study utilized an improved version of DeepLabv3+, a widely recognized deep learning model for semantic segmentation. We introduced the multi-scale attention modules ResNeSt and SENet into the DeepLabv3+ encoder. The split-attention module in ResNeSt captures multi-scale information when processing multiple sets of feature maps, while the SENet module focuses on channel attention, improving the model's ability to distinguish tailings ponds from other materials in images. Additionally, the tailing pond semantic segmentation dataset NX-TPSet was established based on Gaofen-6 (GF-6) imagery. The ablation experiments show that the recognition accuracy (intersection over union, IoU) of the RST-DeepLabV3+ model was improved by 1.19% over DeepLabV3+, to 93.48%. The multi-attention modules enable the model to integrate multi-scale features more effectively, which not only improves segmentation accuracy but also directly contributes to more reliable and efficient monitoring of tailings ponds. The proposed approach achieves top performance on two benchmark datasets, NX-TPSet and ...
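The SENet-style channel attention inserted into the DeepLabv3+ encoder is, in essence, a squeeze-and-excitation block; a textbook PyTorch version is sketched below for reference (the reduction ratio is an assumption), and it is not the authors' RST-DeepLabV3+ code.

```python
import torch.nn as nn

class SEBlock(nn.Module):
    """Squeeze-and-excitation channel attention applied to a (B, C, H, W) feature map."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels), nn.Sigmoid())

    def forward(self, x):
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w                          # reweight encoder channels
```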