ISBN (print): 9789819784929; 9789819784936
Remote sensing image change captioning (RSICC) faces significant challenges in effectively identifying and articulating changes between bi-temporal images. Traditional approaches often rely on standalone text decoders, which may neither capture the subtleties of visual changes nor fully exploit advanced language modeling capabilities. To overcome these limitations, we propose Change-Aware Adaption (Chareption), a novel framework that effectively leverages pre-trained large language models (LLMs) to enhance both the accuracy and detail of change captions. Central to Chareption is a change-aware module designed to selectively identify and utilize the tokens that most strongly represent changes, thus avoiding the redundancy that plagues methods relying solely on class tokens or on indiscriminate use of all patch tokens. Additionally, Chareption introduces a lightweight change adapter module, seamlessly integrated into both the vision backbone and the LLM, that requires minimal learnable parameters while optimally adjusting representations for the RSICC task. Our experiments on the LEVIR-CC dataset demonstrate that Chareption significantly outperforms existing methods in caption accuracy and contextual relevance while also reducing training overhead. This establishes Chareption as a pioneering solution that sets a new direction in RSICC by harnessing the rich representational power of LLMs for improved multimodal understanding.
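The change-aware token selection described above can be illustrated with a minimal sketch: score each patch token by how much its bi-temporal features differ, and keep only the top-k. The cosine-distance scoring rule, function name, and parameters here are illustrative assumptions, not the paper's actual module.

```python
import numpy as np

def select_change_tokens(tokens_t1, tokens_t2, k):
    """Keep the k patch tokens whose bi-temporal features differ most.

    tokens_t1, tokens_t2: (N, D) patch-token arrays from the two dates.
    Returns the indices of the k most-changed tokens and their feature
    differences. A toy stand-in for a change-aware selection module;
    the actual scoring rule in Chareption is not specified here.
    """
    # Cosine similarity between corresponding patch tokens.
    a = tokens_t1 / np.linalg.norm(tokens_t1, axis=1, keepdims=True)
    b = tokens_t2 / np.linalg.norm(tokens_t2, axis=1, keepdims=True)
    sim = (a * b).sum(axis=1)                 # (N,) values in [-1, 1]
    change_score = 1.0 - sim                  # high score = strong change
    idx = np.argsort(change_score)[::-1][:k]  # top-k changed patches
    return idx, tokens_t2[idx] - tokens_t1[idx]
```

Feeding only the selected tokens (rather than all patch tokens) to the language model is what avoids the redundancy the abstract mentions.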
In recent years, convolutional neural networks (CNNs) have excelled in remote sensing image super-resolution reconstruction (RSISR) tasks, becoming the predominant algorithms in this domain. However, these models prim...
High-resolution (HR) remote sensing is essential for remote sensing image interpretation, but challenges in super-resolution (SR) stem from scale and texture differences within images, neglecting high-dimensional deta...
The article considers the problem of testing the hypothesis of independence of random variables given large amounts of statistical data. Solving this problem is necessary when estimating probability densities of random variables and synthesizing information-processing algorithms. A nonparametric procedure is proposed for testing the hypothesis of independence of random variables in a sample containing a large amount of statistical data. The procedure involves compressing the initial statistical data by decomposing the range of values of the random variables. The generated data array consists of the centers of the sampling intervals and the corresponding frequencies of observations belonging to the original sample. The obtained data were used to construct a nonparametric pattern recognition algorithm corresponding to the maximum likelihood criterion. The distribution laws in the classes were evaluated under the assumptions of independence and of dependence of the compared random variables. When recovering the distribution laws of random variables in the classes, regression estimates of probability densities were used. Under these conditions, the probabilities of pattern recognition errors in the classes were estimated, and the decision about independence or dependence of the random variables was made according to their minimum value. The procedure was applied to the analysis of remote sensing data on forest areas; linear and nonlinear relationships between the spectral features under study were determined.
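The compression-then-decision idea above can be sketched in a few lines: bin the sample into interval centers with frequencies, then compare the likelihood of the data under the dependent (joint) and independent (product-of-marginals) laws. The binned plug-in estimators and the fixed decision threshold are simplifying assumptions; the article's regression density estimates and error-probability criterion are not reproduced here.

```python
import numpy as np

def compress_and_test_independence(x, y, bins=8):
    """Simplified sketch of a compression-based independence check.

    The sample is compressed into a bins x bins grid (interval centers
    plus observation frequencies); the joint frequencies are then
    compared against the product of the marginals via the average
    log-likelihood gap, which equals the mutual information of the
    compressed sample. Threshold and estimators are illustrative.
    """
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    p_joint = joint / joint.sum()              # compressed joint law
    p_x = p_joint.sum(axis=1, keepdims=True)   # marginal law of x
    p_y = p_joint.sum(axis=0, keepdims=True)   # marginal law of y
    p_indep = p_x * p_y                        # law under independence
    mask = p_joint > 0
    # Mean log-likelihood of the data under each hypothesis.
    ll_dep = np.sum(p_joint[mask] * np.log(p_joint[mask]))
    ll_ind = np.sum(p_joint[mask] * np.log(p_indep[mask]))
    return "dependent" if ll_dep - ll_ind > 0.05 else "independent"
```

For large samples the likelihood gap concentrates near the true mutual information, so the small positive threshold absorbs the estimation bias of the binned estimator.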
Remote sensing image change detection is an important task in the field of remote sensing image analysis, and it is widely used in urban planning, disaster detection, environmental protection and other fields. A U-Net...
ISBN (print): 9798350353013; 9798350353006
To synthesize high-fidelity samples, diffusion models typically require auxiliary data to guide the generation process. However, it is impractical to procure the painstaking patch-level annotation effort required in specialized domains like histopathology and satellite imagery; it is often performed by domain experts and involves hundreds of millions of patches. Modern-day self-supervised learning (SSL) representations encode rich semantic and visual information. In this paper, we posit that such representations are expressive enough to act as proxies for fine-grained human labels. We introduce a novel approach that trains diffusion models conditioned on embeddings from SSL. Our diffusion models successfully project these features back to high-quality histopathology and remote sensing images. In addition, we construct larger images by assembling spatially consistent patches inferred from SSL embeddings, preserving long-range dependencies. Augmenting real data by generating variations of real images improves downstream classifier accuracy for patch-level and larger, image-scale classification tasks. Our models are effective even on datasets not encountered during training, demonstrating their robustness and generalizability. Generating images from learned embeddings is agnostic to the source of the embeddings. The SSL embeddings used to generate a large image can either be extracted from a reference image, or sampled from an auxiliary model conditioned on any related modality (e.g. class labels, text, genomic data). As proof of concept, we introduce the text-to-large image synthesis paradigm, where we successfully synthesize large pathology and satellite images out of text descriptions.
ISBN (print): 9798350320107
This work serves as a demonstrator of how low/medium-quality UAV data can be integrated for agricultural pattern classification with a convolutional neural network (CNN). The study also illustrates the potential sources of error in spectral and texture information that arise during image acquisition and processing, which can be mitigated through image processing and the correct choice of mosaicking parameters. A CNN classifies six agricultural patterns of interest (weed-infested area, dry and vital crop area, dry and vital lodged crop area, bare soil area) in corn, rapeseed, winter wheat and spring barley fields. The performance of the classification is assessed on images with different units (reflectance and DN) and on images with different sun illumination conditions, shadows and 'blur' effects (moderate/low-quality data).
ISBN (print): 9798350301298
Performing super-resolution of a depth image using the guidance from an RGB image is a problem that concerns several fields, such as robotics, medical imaging, and remote sensing. While deep learning methods have achieved good results in this problem, recent work highlighted the value of combining modern methods with more formal frameworks. In this work, we propose a novel approach which combines guided anisotropic diffusion with a deep convolutional network and advances the state of the art for guided depth super-resolution. The edge transferring/enhancing properties of the diffusion are boosted by the contextual reasoning capabilities of modern networks, and a strict adjustment step guarantees perfect adherence to the source image. We achieve unprecedented results on three commonly used benchmarks for guided depth super-resolution. The performance gain compared to other methods is largest at larger scales, such as ×32 scaling. Code for the proposed method is available to promote reproducibility of our results.
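The two ingredients named above, guide-driven anisotropic diffusion and a strict adjustment step, can be sketched without the deep network: conductances derived from guide-image gradients block smoothing across edges, and after every step each upsampled block is re-centered on its source pixel so the low-res input is reproduced exactly. All parameter names and values below are illustrative assumptions.

```python
import numpy as np

def guided_diffusion_sr(depth_lr, guide, scale, iters=200, lam=0.1, sigma=0.1):
    """Toy guided anisotropic diffusion for depth super-resolution.

    depth_lr: (h, w) low-res depth; guide: (h*scale, w*scale) grayscale
    guide in [0, 1]. Strong guide edges suppress diffusion (anisotropy);
    the adjustment step enforces that every scale x scale block averages
    back to its source pixel, mimicking strict adherence to the source.
    """
    d = np.kron(depth_lr, np.ones((scale, scale)))   # nearest-neighbour init
    # Conductances: near 1 in flat guide regions, near 0 across edges.
    gx = np.exp(-(np.diff(guide, axis=1) / sigma) ** 2)
    gy = np.exp(-(np.diff(guide, axis=0) / sigma) ** 2)
    h, w = depth_lr.shape
    for _ in range(iters):
        flux_x = gx * np.diff(d, axis=1)
        flux_y = gy * np.diff(d, axis=0)
        upd = np.zeros_like(d)
        upd[:, :-1] += flux_x; upd[:, 1:] -= flux_x
        upd[:-1, :] += flux_y; upd[1:, :] -= flux_y
        d += lam * upd                               # explicit diffusion step
        # Adjustment: force each block mean to equal the source pixel.
        blocks = d.reshape(h, scale, w, scale)
        means = blocks.mean(axis=(1, 3), keepdims=True)
        blocks += depth_lr.reshape(h, 1, w, 1) - means
        d = blocks.reshape(h * scale, w * scale)
    return d
```

In the paper's full method the diffusion coefficients come from a learned network rather than a fixed exponential of the guide gradients; the sketch keeps only the variational skeleton.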
Fusing a low-spatial-resolution hyperspectral (LR HS) image and a high-spatial-resolution multispectral (HR MS) image from different modalities aims to obtain a high-spatial-resolution hyperspectral (HR HS) image. However, most deep neural network (DNN)-based methods overlook the correlation between the spatial and spectral domains, leading to limited fusion performance. To solve this problem, we propose the spatial-spectral unfolding network with mutual guidance (SMGU-Net). Specifically, the information from the different modalities in the source images is treated as mutually complementary components to derive the reconstruction model. The model is then optimized using half-quadratic splitting and gradient descent algorithms and is unfolded into a network that leverages the powerful learning capabilities of DNNs to explore more latent information in the deep feature space. In this way, the network achieves the interaction and complementarity of cross-modality information to generate fused images. Experiments are conducted on four benchmark datasets to demonstrate the effectiveness of SMGU-Net. The code can be downloaded from https://***/yansql/SMGU-Net.
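The reconstruction model that such unfolding networks are built on is the classical HS/MS data-fidelity term: the HR HS estimate should reproduce the LR HS image when spatially downsampled and the HR MS image when spectrally degraded. One gradient-descent step of that model, the piece each unfolded stage iterates, can be sketched as follows; the block-average downsampling, spectral response R, and step size are assumptions, and SMGU-Net's learned guidance modules are omitted.

```python
import numpy as np

def fusion_gradient_step(X, Y_hs, Y_ms, R, scale, eta=0.5):
    """One gradient step on the standard HS/MS fusion data-fidelity term.

    X: (B, H, W) current HR HS estimate; Y_hs: (B, H/s, W/s) LR HS image
    (modelled as block-average downsampling); Y_ms: (b, H, W) HR MS
    image; R: (b, B) spectral response matrix.
    """
    B, H, W = X.shape
    # Spatial term: downsampling residual, sent back via the adjoint
    # of block averaging (replicate, then divide by the block size).
    Xd = X.reshape(B, H // scale, scale, W // scale, scale).mean(axis=(2, 4))
    r_sp = np.kron(Y_hs - Xd, np.ones((scale, scale))) / scale**2
    # Spectral term: response residual, sent back via R^T band-wise.
    resid = Y_ms - np.einsum('cb,bhw->chw', R, X)
    r_spec = np.einsum('cb,chw->bhw', R, resid)
    return X + eta * (r_sp + r_spec)   # gradient descent on both terms
```

In the unfolded network, a learned module replaces or follows this step at every stage (the proximal update from half-quadratic splitting), which is where the cross-modality mutual guidance enters.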
Wild fish recognition is a fundamental problem in ocean ecology research and contributes to the understanding of biodiversity. Given the huge number of wild fish species and the unrecognized category, the problem is in essence one of open-set fine-grained recognition. Moreover, the unrestricted marine environment makes the problem even more challenging. Deep learning has been demonstrated to be a powerful paradigm for image classification tasks. In this article, the wild fish recognition deep neural network (WildFishNet) is proposed. Specifically, an open-set fine-grained recognition neural network with a fused activation pattern is constructed to implement wild fish recognition. First, three different reciprocal inverted residual structural modules are combined by neural architecture search to obtain the best feature extraction performance for fine-grained recognition; next, a new fused activation pattern of softmax and openmax functions is designed to improve open-set recognition ability. The experiments are conducted on the WildFish dataset, which consists of 54,459 unconstrained images covering 685 known classes and 1 open-set unrecognized category. Finally, the experimental results are analyzed comprehensively to demonstrate the effectiveness of the proposed method. The in-depth study also shows that artificial intelligence can empower marine ecosystem research.
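The softmax/openmax fusion idea can be illustrated with a toy open-set scorer: each known-class logit is shaved by a membership weight that decays with distance to that class's mean activation vector, the shaved mass becomes a pseudo-logit for the unknown class, and a standard softmax is applied over all C+1 outcomes. The exponential-decay calibration below is a stand-in assumption for openmax's Weibull fitting; names and parameters are illustrative.

```python
import numpy as np

def fused_open_set_scores(av, class_means, tau=5.0):
    """Toy fusion of softmax with an openmax-style unknown score.

    av: (C,) activation vector for one image; class_means: (C, C) mean
    activation vector of each known class. Returns (C+1,) probabilities,
    where the last entry is the open-set 'unknown' category.
    """
    dists = np.linalg.norm(class_means - av, axis=1)   # distance to each class
    w = np.exp(-dists / tau)                           # membership in (0, 1]
    known = av * w                                     # recalibrated logits
    unknown = np.sum(av * (1.0 - w))                   # mass moved to open set
    z = np.concatenate([known, [unknown]])
    e = np.exp(z - z.max())                            # stable softmax
    return e / e.sum()
```

An input with high logits that nevertheless sits far from every class's mean activation vector ends up assigned to the unknown category, which is the behavior a pure softmax cannot express.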