检索结果-内蒙古大学图书馆

7th Chinese conference on pattern recognition and Computer Vision

作者： Song, Shitao Liu, Ye Su, Jintao Nanjing Univ Posts & Telecommun Sch Automat & Artificial Intelligence Nanjing 210003 Peoples R China

ISBN: (纸本)9789819785049;9789819785056

Vision Mamba (VMamba) has recently attracted great research attention due to its ability to obtain a global receptive field with linear computational complexity. However, similar to Vision Transformer (ViT), due to its mechanism of dividing patches, it also faces the issue of insufficient description ability of local details. To address this issue, we design in this paper a dual-stream network that combines VMamba and CNN, aiming to enable the network to possess both the global receptive field of VMamba and the local detail description capability of CNN. Both of the two characteristics are crucial for remote sensing image semantic segmentation. The two streams are supervised and trained through independent loss functions. On the other hand, to enable sufficient information exchange between the two branches, we introduce an auto-scaling fusion module aiming at bridging the semantic gap between VMamba and CNN. Experiments demonstrate that the method proposed in this paper outperforms state-of-the-art methods on multiple remote sensing semantic segmentation datasets.

关键词： Vision Mamba CNN Auto-scaling Semantic segmentation remote sensing

来源：评论

学校读者我要写书评

暂无评论

Knowledge Enhancement and Optimization Strategies for remote sensing image Captioning Using Contrastive Language image Pre-training and Large Language Models 24

Knowledge Enhancement and Optimization Strategies for Remote...

引用

International conference on Machine Intelligence and Digital Applications (MIDA)

作者： Wang, Xinren Wan, Tengfei Song, Jianning Huang, Jingmeng Jiangsu Special Equipment Safety Supervis & Inspe Wuxi Jiangsu Peoples R China Hohai Univ Coll Comp & Informat Nanjing Jiangsu Peoples R China

ISBN: (纸本)9798400718144

In this study, we propose an innovative multimodal learning approach that integrates Contrastive Language image Pre-training and large language models to enhance the recognition efficiency of remote sensing images and their capacity to generate related professional information. This method has effectively achieved integration of image processing and text generation at a technical level, exhibiting significant application advantages in fields such as automated Geographic Information Systems construction, environmental monitoring, disaster assessment, and geographic science education. The research underscores the advancements of the Contrastive Language image Pre-training model in visual-textual understanding and the technical strengths of large language models in handling complex text tasks. By designing an integrated fusion layer, we have efficiently combined visual features with textual information and conducted a comprehensive evaluation of the model's recognition accuracy and text generation quality on the dataset. Experimental results show that our model achieved a recognition accuracy of 73.7% and a text quality score of 26.6, validating its efficacy and powerful capability in dealing with the complexity and diversity of remote sensing images. Through the deep integration of Contrastive Language image Pre-training and large language models, this research not only further advances multimodal learning technologies but also opens new perspectives and possibilities for the research and application of remote sensing image recognition and related information generation.

关键词： Multimodal Learning remote sensing image recognition Knowledge Enhancement Large Language Models Caption Generation

来源：评论

学校读者我要写书评

暂无评论

Atmospheric Correction of High Resolution remote sensing images with Automatic Data Acquisition by Network 24

Atmospheric Correction of High Resolution Remote Sensing Ima...

引用

1st International conference on image processing Machine Learning and pattern recognition

作者： Wan, Xingyu Chen, Xingfeng Li, Kaitao Wang, Yanping Xiao, Qian Wan, Wei Liu, Shumin Zhao, Limin Jiangxi Univ Sci & Technol Sch Software Engn Ganzhou Peoples R China Chinese Acad Sci Aerosp Informat Res Inst Beijing Peoples R China Space Engn Univ Sch Space Informat Beijing Peoples R China Inst Disaster Prevent Sanhe Peoples R China Space Star Technol Co Ltd Beijing Peoples R China Peking Univ Inst Remote Sensing & GIS Sch Earth & Space Sci Beijing Peoples R China

ISBN: (纸本)9798400707032

Atmospheric parameters are necessary inputs for atmospheric correction, but obtaining these parameters is difficult. To address this challenge, a solution for atmospheric parameter acquisition based on NNAeroG and networked automatic matching was proposed. This solution, combined with QUAAC, enables the atmospheric correction of GF images, thereby achieving full process automation of atmospheric correction. This scheme effectively simplifies the tedious process of obtaining AOD in existing methods and greatly improves the efficiency of atmospheric correction. The atmospheric parameters provided by this program can support multiple atmospheric correction methods, reduce labor-intensive operations, and offer efficient tools for large-scale atmospheric radiation production and research.

关键词： Atmospheric correction Automated program Aerosol optical depth Geostationary satellite

来源：评论

学校读者我要写书评

暂无评论

Rotated Multi-Scale Interaction Network for Referring remote sensing image Segmentation

Rotated Multi-Scale Interaction Network for Referring Remote...

引用

IEEE/CVF conference on Computer Vision and pattern recognition (CVPR)

作者： Liu, Sihan Ma, Yiwei Zhang, Xiaoqing Wang, Haowei Ji, Jiayi Sun, Xiaoshuai Ji, Rongrong Xiamen Univ Key Lab Multimedia Trusted Percept & Efficient Co Minist Educ China Xiamen 361005 Peoples R China

ISBN: (纸本)9798350353006

Referring remote sensing image Segmentation (RRSIS) is a new challenge that combines computer vision and natural language processing. Traditional Referring image Segmentation (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery, leading to suboptimal segmentation results. To address these challenges, we introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS. RMSIN incorporates an Intra-scale Interaction Module (IIM) to effectively address the fine-grained detail required at multiple scales and a Cross-scale Interaction Module (CIM) for integrating these details coherently across the network. Furthermore, RMSIN employs an Adaptive Rotated Convolution ARC) to account for the diverse orientations of objects, a novel (contribution that significantly enhances segmentation accuracy. To assess the efficacy of RMSIN, we have curated an expansive dataset comprising 17,402 image-caption-mask triplets, which is unparalleled in terms of scale and variety. This dataset not only presents the model with a wide range of spatial and rotational scenarios but also establishes a stringent benchmark for the RRSIS task, ensuring a rigorous evaluation of performance. Experimental evaluations demonstrate the exceptional performance of RM-SIN, surpassing existing state-of-the-art models by a significant margin. Datasets and code are available at https://***/Lsan2401/RMSIN.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

Scientific School of Academician VA Soifer in the Field of processing, Analysis, and recognition of images and Optical Signals

引用

pattern recognition AND image ANALYSIS 2023年第4期33卷 1080-1103页

作者： Ilyasova, N. Yu. Sergeyev, V. V. Demin, N. S. Russian Acad Sci Image Proc Syst Inst Branch Fed Sci Res Ctr Crystallog & Photon Samara 443001 Russia Samara Univ Samara 443080 Russia

This article is the first in a series of publications dedicated to the leading scientific school of Academician V.A. Soifer in the field of processing, analysis, and recognition of images and optical signals. The article briefly describes the creation and development of the Samara scientific school of computer image processing. Examples of obtained fundamental results and solved applied problems are given. The most significant publications of the scientific school are listed and analyzed.

关键词： computer optics image processing nanophotonics diffraction optics Earth remote sensing biomedical research

来源：评论

学校读者我要写书评

暂无评论

Detection and Classification of Satellite remote sensing images Using Hybrid Segmentation and Feature Extraction with Effective Algorithms 2

Detection and Classification of Satellite Remote Sensing Ima...

引用

2nd International conference on Distributed Computing and Optimization Techniques, ICDCOT 2024

作者： Vinuja, G. Devi, N. Bharatha Saveetha School of Engineering Saveetha Institute of Medical and Technical Sciences Saveetha University Department of Cse Chennai India

ISBN: (纸本)9798350382952

The remote sensing image analysis, classification, and pattern recognition processes all depend on image segmentation. In this research, a search-based convolutional neural network (SBCNN) is used to identification method for remote sensing images. Prior to applying the image data to the SKFCM with PeSOA segmentation step, the image data must first undergo pre-processing. When pre-processing satellite images for road networks, noise is removed using an improved median filtering technique. The image is then segmented using the SKFCM with PeSOA Segmentation technique to have inverse shape determination with lowest energy usage. Using an intensity constraint, it is possible to identify the segments of a building and vegetation, a road, and a barren area of land. Following segmentation, MLBP with DWT feature extraction is performed on the road satellite images, and the SBCNN is then used to categorize the images. After associated with obtainable methods, the findings of the suggested technique display excellent precision of 98.6%. © 2024 IEEE.

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

International conference on image, Signal processing, and pattern recognition, ISPP 2024

International Conference on Image, Signal Processing, and Pa...

引用

2024 International conference on image, Signal processing, and pattern recognition, ISPP 2024

ISBN: (纸本)9781510680425

The proceedings contain 263 papers. The topics discussed include: improved YOLOv5's remote sensing image detection algorithm;wavelet transform based polarized image fusion detection of underwater targets;CLIP-driven hierarchical fusion for referring image segmentation;research on continuous monitoring of subdural hematoma based on optical image mapping feature extraction method;generalized image denoising based on MLP denoiser and diffusion model;CAM: consistency adversarial model for image generation with high-frequency image details;DBE-net: double-level boundary enhanced network for temporomandibular joint CBCT images segmentation;an enhanced feature matching multi-temporal port remote sensing image registration network E-SuperGlue;and application of 3D laser scanning and tilt photography technology in digital landscape surveying and mapping.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Advances in Deep Learning Applications for Plant Disease and Pest Detection: A Review

引用

remote sensing 2025年第4期17卷 698-698页

作者： Wang, Shaohua Xu, Dachuan Liang, Haojian Bai, Yongqing Li, Xiao Zhou, Junyuan Su, Cheng Wei, Wenyu Hainan Aerosp Informat Res Inst Key Lab Earth Observat Hainan Prov Sanya 572029 Peoples R China Chinese Acad Sci Aerosp Informat Res Inst State Key Lab Remote Sensing & Digital Earth Beijing 100101 Peoples R China Lanzhou Jiaotong Univ Fac Geomat Lanzhou 730070 Peoples R China Univ Chinese Acad Sci Coll Resources & Environm Beijing 100049 Peoples R China Lanzhou Jiaotong Univ Sch Architecture & Urban Planning Lanzhou 730070 Peoples R China

Traditional methods for detecting plant diseases and pests are time-consuming, labor-intensive, and require specialized skills and resources, making them insufficient to meet the demands of modern agricultural development. To address these challenges, deep learning technologies have emerged as a promising solution for the accurate and timely identification of plant diseases and pests, thereby reducing crop losses and optimizing agricultural resource allocation. By leveraging its advantages in image processing, deep learning technology has significantly enhanced the accuracy of plant disease and pest detection and identification. This review provides a comprehensive overview of recent advancements in applying deep learning algorithms to plant disease and pest detection. It begins by outlining the limitations of traditional methods in this domain, followed by a systematic discussion of the latest developments in applying various deep learning techniques-including image classification, object detection, semantic segmentation, and change detection-to plant disease and pest identification. Additionally, this study highlights the role of large-scale pre-trained models and transfer learning in improving detection accuracy and scalability across diverse crop types and environmental conditions. Key challenges, such as enhancing model generalization, addressing small lesion detection, and ensuring the availability of high-quality, diverse training datasets, are critically examined. Emerging opportunities for optimizing pest and disease monitoring through advanced algorithms are also emphasized. Deep learning technology, with its powerful capabilities in data processing and pattern recognition, has become a pivotal tool for promoting sustainable agricultural practices, enhancing productivity, and advancing precision agriculture.

关键词： deep learning disease detection plant diseases and pests image classification object detection convolutional neural network

来源：评论

学校读者我要写书评

暂无评论

RSSFormer: Foreground Saliency Enhancement for remote sensing Land-Cover Segmentation

引用

IEEE TRANSACTIONS ON image processing 2023年 32卷 1052-1064页

作者： Xu, Rongtao Wang, Changwei Zhang, Jiguang Xu, Shibiao Meng, Weiliang Zhang, Xiaopeng Chinese Acad Sci Inst Automat Sch Artificial Intelligence Natl Lab Pattern Recognit Beijing 100190 Peoples R China Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing 100876 Peoples R China

High spatial resolution (HSR) remote sensing images contain complex foreground-background relationships, which makes the remote sensing land cover segmentation a special semantic segmentation task. The main challenges come from the large-scale variation, complex background samples and imbalanced foreground-background distribution. These issues make recent context modeling methods sub-optimal due to the lack of foreground saliency modeling. To handle these problems, we propose a remote sensing Segmentation framework (RSSFormer), including Adaptive TransFormer Fusion Module, Detail-aware Attention Layer and Foreground Saliency Guided Loss. Specifically, from the perspective of relation-based foreground saliency modeling, our Adaptive Transformer Fusion Module can adaptively suppress background noise and enhance object saliency when fusing multi-scale features. Then our Detail-aware Attention Layer extracts the detail and foreground-related information via the interplay of spatial attention and channel attention, which further enhances the foreground saliency. From the perspective of optimization-based foreground saliency modeling, our Foreground Saliency Guided Loss can guide the network to focus on hard samples with low foreground saliency responses to achieve balanced optimization. Experimental results on LoveDA datasets, Vaihingen datasets, Potsdam datasets and iSAID datasets validate that our method outperforms existing general semantic segmentation methods and remote sensing segmentation methods, and achieves a good compromise between computational overhead and accuracy.

关键词： remote sensing Transformers Semantic segmentation Task analysis Buildings Background noise Convolution remote sensing segmentation foreground saliency enhancement transformer

来源：评论

学校读者我要写书评

暂无评论

Cultivated land segmentation in RGB remote sensing images: nonuniform regularization with kernel space and graph cut

Cultivated land segmentation in RGB remote sensing images: n...

引用

2024 International conference on image, Signal processing, and pattern recognition, ISPP 2024

作者： Wu, Wangsheng College of Computer and Information Science Chongqing Normal University Chongqing401331 China

ISBN: (纸本)9781510680425

To improve the application efficiency of RGB remote sensing images in agricultural land resource surveys, a cultivated land segmentation algorithm based on kernel space non-uniform regularization classification and improved graph cut was proposed. Firstly, extracting texture and color features of remote sensing images using Local Binary pattern algorithm (LBP), Gabor filters, and RGB, HSV color space, respectively. Next, introducing a kernel method to map data from low-dimension to high-dimension, and construct a kernel space-based non-uniform regularization sparse representation model to classify and segment images in pixel level. Finally, an innovative graph cut algorithm is enhanced by incorporating a Gaussian distribution to redefine the penalty term for homogeneous regions and introducing a new color gradient measure to define the penalty term for boundaries. This approach effectively removes scatter and restricts the segmentation boundary. The average classification accuracy and average F1 score of the classifier proposed in this paper are about 2% and 3% higher than those of recent regularized subspace classifiers, respectively. Compared with the Graph cut algorithm, the proposed improved graph cut algorithm has an average mIoU improvement of about 9%. The average accuracy of the whole segmentation algorithm is 95.43%, and the average mIoU is 88.56%. Compared with the comparison algorithm, the proposed algorithm has higher segmentation accuracy, which proves that the proposed algorithm can adapt to the cultivated land segmentation scene of remote sensing images and is effective. © 2024 SPIE.

关键词： RGB color model

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：