Search Results - Inner Mongolia University Library

Fine-Grained Species Recognition With Privileged Pooling: Better Sample Efficiency Through Supervised Attention

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Vol. 45, No. 12, pp. 14575-14589

Authors: Rodriguez, Andres C.; D'Aronco, Stefano; Schindler, Konrad; Wegner, Jan Dirk (Swiss Fed Inst Technol EcoVis Lab Photogrammetry & Remote Sensing CH-8092 Zurich Switzerland; Univ Zurich Inst Computat Sci CH-8006 Zurich Switzerland)

We propose a scheme for supervised image classification that uses privileged information, in the form of keypoint annotations for the training data, to learn strong models from small and/or biased training sets. Our main motivation is the recognition of animal species for ecological applications such as biodiversity modelling, which is challenging because of long-tailed species distributions due to rare species, and strong dataset biases such as repetitive scene background in camera traps. To counteract these challenges, we propose a visual attention mechanism that is supervised via keypoint annotations that highlight important object parts. This privileged information, implemented as a novel privileged pooling operation, is only required during training and helps the model to focus on regions that are discriminative. In experiments with three different animal species datasets, we show that deep networks with privileged pooling can use small training sets more efficiently and generalize better.

Keywords: Camera trap images; fine-grained species recognition; privileged pooling; supervised attention; training set bias

CSVD: a cross-scenario vehicle dataset for multi-object tracking

2024 International Conference on Image, Signal Processing, and Pattern Recognition, ISPP 2024

Authors: Li, Xiaolei; Zhou, Juefan; Xiao, Xingjie; Lin, Jiayu; Yang, Siyuan; Sha, Zongyao; Tu, Jianguang (School of Remote Sensing and Information Engineering Wuhan University Hubei Province 430079 China; Wuhan ALLPRS Remote Sensing Data Technology Co. Ltd. Hubei Province 430079 China)

ISBN: (print) 9781510680425

Leveraging visual sensing technologies for the detection and tracking of vehicles represents a critical application domain for unmanned aerial vehicles (UAVs), notably in challenging operational *** study focuses on enhancing UAV functionalities in intricate environments through the development of a specialized dataset, derived from battlefield scenarios, to facilitate advanced research on vehicle detection and multi-target tracking under complex conditions. A comprehensive collection of vehicular movement videos spanning diverse scenarios was amassed and manually annotated, culminating in the creation of the "Cross-Scenario Vehicle Detection" (CSVD) *** dataset encompasses a wide array of environmental settings, featuring urban landscapes, plains, and forests, across the four seasons, resulting in a total of 13,025 meticulously annotated *** several state-of-the-art deep learning models, we established robust benchmarks for object ***, an extensive evaluation and performance validation were conducted using cutting-edge multi-object tracking algorithms on the CSVD dataset, incorporating diverse assessment *** conducted experiments demonstrate the dataset's robust applicability and versatility, endorsing its effectiveness for the development and evaluation of UAV-based vehicle detection and multi-target tracking systems in complex settings. © 2024 SPIE.

Keywords: Aircraft detection

Remote Sensing Image Denoising Based on Deep and Shallow Feature Fusion and Attention Mechanism

Remote Sensing, 2022, Vol. 14, No. 5, p. 1243

Authors: Han, Lintao; Zhao, Yuchen; Lv, Hengyi; Zhang, Yisa; Liu, Hailong; Bi, Guoling (Chinese Acad Sci Changchun Inst Opt Fine Mech & Phys Changchun 130033 Peoples R China; Univ Chinese Acad Sci Coll Mat Sci & Optoelect Technol Beijing 100049 Peoples R China)

Optical remote sensing images are widely used in the fields of feature recognition, scene semantic segmentation, and others. However, the quality of remote sensing images is degraded due to the influence of various noises, which seriously affects the practical use of remote sensing images. As remote sensing images have more complex texture features than ordinary images, this will lead to the previous denoising algorithm failing to achieve the desired result. Therefore, we propose a novel remote sensing image denoising network (RSIDNet) based on a deep learning approach, which mainly consists of a multi-scale feature extraction module (MFE), multiple local skip-connected enhanced attention blocks (ECA), a global feature fusion block (GFF), and a noisy image reconstruction block (NR). The combination of these modules greatly improves the model's use of the extracted features and increases the model's denoising capability. Extensive experiments on synthetic Gaussian noise datasets and real noise datasets have shown that RSIDNet achieves satisfactory results. RSIDNet can improve the loss of detail information in denoised images in traditional denoising methods, retaining more of the higher-frequency components, which can have performance improvements for subsequent image processing.

Keywords: image denoising; neural network; feature fusion; attention mechanism; remote sensing

Attribute-Guided Generative Adversarial Network With Improved Episode Training Strategy for Few-Shot SAR Image Generation

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, Vol. 16, pp. 1785-1801

Authors: Sun, Yuanshuang; Wang, Yinghua; Hu, Liping; Huang, Yuanyuan; Liu, Hongwei; Wang, Siyuan; Zhang, Chen (Xidian Univ Natl Lab Radar Signal Proc Xian 710071 Peoples R China; Beijing Inst Environm Features Sci & Technol Electromagnet Scattering Lab Beijing 100854 Peoples R China)

Deep-learning-based models usually require a large amount of data for training, which guarantees the effectiveness of the trained model. Generative models are no exception, and sufficient training data are necessary for the diversity of generated images. However, for synthetic aperture radar (SAR) images, data acquisition is expensive. Therefore, SAR image generation under a few training samples is still a challenging problem to be solved. In this article, we propose an attribute-guided generative adversarial network (AGGAN) with an improved episode training strategy for few-shot SAR image generation. First, we design the AGGAN structure, and spectral normalization is used to stabilize the training in the few-shot situation. The attribute labels of AGGAN are designed to be the category and aspect angle labels, which are essential information for SAR images. Second, an improved episode training strategy is proposed according to the characteristics of the few-shot generative task, and it can improve the quality of generated images in the few-shot situation. In addition, we explore the effectiveness of the proposed method when using different auxiliary data for training and use the Moving and Stationary Target Acquisition and Recognition benchmark dataset and a simulated SAR dataset for verification. The experimental results show that AGGAN and the proposed improved episode training strategy can generate images of better quality when compared with some existing methods, which have been verified through visual observation, image similarity measures, and recognition experiments. When applying the generated images to the 5-shot SAR image recognition problem, the average recognition accuracy can be improved by at least 4%.

Keywords: Few-shot image generation; generative adversarial network (GAN); meta-learning; synthetic aperture radar (SAR); transfer learning

Application of Remote Sensing Image Processing for Classification and Recognition

International Conference on Advanced Infocomm Technology (ICAIT)

Authors: Xiaolong Shi; Dan Huang; Wenjie Li; Xianjie Wang (College of Computer Science and Engineering Chongqing University of Technology Chongqing China; China Research and Development Academy of Machinery Equipment Beijing China; Chongqing high-tech Zone Pegasus Innovation Institute Chongqing University of Technology Chongqing China)

Aiming at the difficulties in object detection and recognition in remote sensing images caused by high background complexity, large scale variations of targets, and the presence of numerous small objects, an improved method for remote sensing image object detection based on YOLOv7-tiny is proposed. This method combines the loss function based on normalized Gaussian Wasserstein distance (NWD) with the CIoU loss function to address the problem of sensitivity to positional deviation of small objects by IoU-Loss. The addition of a global attention mechanism (GAM) in the backbone network reduces information diffusion and enhances the interaction at the global dimension to mitigate the interference of complex backgrounds in remote sensing images on the model, enabling the model to focus on the feature extraction of the desired targets. Finally, the coupled detection head (Coupled Head) of the model is replaced with a decoupled detection head (Decoupled Head), allowing the classification and regression tasks to output from different branches to achieve decoupling and avoid the decrease in detection accuracy caused by conflicts between classification and regression. The experimental results of this method on the public dataset DIOR achieved 88.73% accuracy, which is an improvement of 1.78% compared to the unimproved method's accuracy of 86.95%. Furthermore, compared to other researchers' methods tested on DIOR, the proposed method also shows improvement, thus validating its effectiveness.

Spatial planning method of urban landscape architecture distribution pattern based on evolutionary algorithm

INTERNATIONAL JOURNAL OF ENVIRONMENTAL TECHNOLOGY AND MANAGEMENT, 2024, Vol. 27, No. 1-2, pp. 66-78

Authors: Feng, Jingjing; Chen, Yuxiong (Guangdong Nanfang Inst Technol Sch Informat Jiangmen 529000 Peoples R China)

Aiming at the problems of low planning accuracy and long planning time in the traditional spatial planning method of urban landscape architecture distribution pattern, a spatial planning method of urban landscape architecture distribution pattern based on evolutionary algorithm was proposed. First, we acquire urban landscape remote sensing images through ETM+ and Landsat TM/OLI images, and use ENVI software to conduct geometric correction, image enhancement and other image processing. Then, we acquire spatial data of landscape distribution pattern from urban landscape green space types, patch area size, number and other aspects. We then use differential evolution algorithm to calculate the fitness value corresponding to the initialised population, extract landscape features, and use mutation operators. The optimal solution is obtained through the three steps of crossover operator and selection operation, which is the optimal spatial planning strategy. The simulation results show that the proposed method has higher precision and shorter planning time in spatial planning of urban landscape architecture distribution pattern.

Keywords: evolutionary algorithm urban landscape architecture distribution pattern spatial planning landscape characteristics

A Novel Real-Time Text-to-Speech System Using Raspberry Pi for Assisting the Visually Impaired

TRAITEMENT DU SIGNAL, 2024, Vol. 41, No. 6, pp. 3183-3192

Authors: Ben Atitallah, Ahmed; Kammoun, Manel; Atitallah, Mohamed Amin Ben; Albekairi, Mohammed; Said, Yahia; Boudabous, Anis; Kaaniche, Khaled; Atri, Mohamed (Jouf Univ Coll Engn Dept Elect Engn Sakaka 72388 Saudi Arabia; Univ Sfax LETI ENIS Sfax 3029 Tunisia; Gustave Eiffel Univ Lab Informat Gaspard Monge CNRS A3SIESIEE Paris BP 99 F-93162 Noisy Le Grand France; Northern Border Univ Coll Engn Remote Sensing Unit Ar Ar 91431 Saudi Arabia; Univ Monastir Lab Elect & Microelect LR99ES30 Monastir 5019 Tunisia; Jouf Univ Coll Comp & Informat Sci Dept Comp Engn & Networks Sakaka 72388 Saudi Arabia; King Khalid Univ Coll Comp Sci Abha 62529 Saudi Arabia)

Visual impairment is one of the most significant challenges facing humanity, especially in an era where information is frequently conveyed through text rather than voice. To address this, the proposed system is designed to assist individuals with visual impairments. This paper presents the development of a real-time Text-to-Speech (TTS) embedded system based on the Raspberry Pi 4. Our system incorporates a novel approach to enhance the accuracy of text recognition using Optical Character Recognition (OCR) from images. Specifically, a series of preprocessing steps are employed, selected dynamically by a decision-making process based on the content of the image. The image processing is handled using OpenCV2, while the conversion of text to speech is achieved through the pyttsx3 Python library. The entire system is implemented and tested on a Raspberry Pi 4, connected to a USB Full HD camera for high-resolution image acquisition, and controlled via the Traffic HAT-LED module. Experimental results demonstrate that our system achieves a minimum accuracy of 88.33% in text recognition from images.

Keywords: image preprocessing; visual impairment; Raspberry Pi 4; text-to-speech; optical character recognition; real-time processing

Research on target recognition based on optical neural networks

2024 International Conference on Remote Sensing, Mapping, and Image Processing, RSMIP 2024

Authors: Zhang, Yixuan; Feng, Yuxiang; Chang, Hong (Beijing Aerospace Institute for Metrology and Measurement Technology Beijing China)

ISBN: (print) 9781510680012

Due to the advantages of high throughput, low latency, and low power consumption, optical neural networks hold great promise in addressing the challenges of energy consumption and computational efficiency faced by current artificial intelligence technologies. Consequently, they have become a research hotspot in both academia and industry in recent years. The goal of optical neural networks is to use photons as the physical carrier to construct the basic computational units of artificial neural network algorithms, thus achieving high-performance novel computing architectures and applying them to solve practical problems. This paper introduces the working principles and characteristics of optical neural networks and discusses relevant research on target recognition based on optical neural network architectures. © 2024 SPIE.

Keywords: Optical data processing

Research on Object-Oriented Classification Technology for Remote Sensing Imagery of Coastal Zone

11th International Conference on Signal and Information Processing, Network and Computers, ICSINC 2023

Authors: Yize, Dong; Rui, Zhang; Haitao, Wang; Chao, Wang; Xianglei, Kong; Lele, Yao (Institute of Spacecraft Application System Engineering CAST No. 104 Youyi Road Haidian Beijing China)

ISBN: (print) 9789819721191

Terrain identification of coastal is of great significance for coastal development activities and coastal terrain survey in overseas areas. However, due to the complex characteristics of coastal features, the use of remote sensing images for automatic feature classification and recognition has become a current research hotspot. Utilizing the homogeneous and homogeneous spectral characteristics of hyperspectral remote sensing images and the characteristics of coastal zone elements, an object-oriented shoreland classification technique is proposed after performing operations such as atmospheric correction and image enhancement preprocessing on hyperspectral remote sensing data, which enables automatic identification and extraction of feature types and geomorphological information in the coastal zone. This method overcomes the limitation of the traditional hyperspectral remote sensing classification method, which takes image element as the processing unit, and synthesizes the spatial characteristics and spectral characteristics of the features, which greatly reduces the classification spots and significantly improves the classification effect. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

Keywords: Coastal zones

Hyperspectral and LiDAR Data Classification Based on Structural Optimization Transmission

IEEE TRANSACTIONS ON CYBERNETICS, 2023, Vol. 53, No. 5, pp. 3153-3164

Authors: Zhang, Mengmeng; Li, Wei; Zhang, Yuxiang; Tao, Ran; Du, Qian (Beijing Inst Technol Sch Informat & Elect Beijing 100081 Peoples R China; Beijing Inst Technol Beijing Key Lab Fract Signals & Syst Beijing 100081 Peoples R China; Mississippi State Univ Dept Elect & Comp Engn Starkville MS 39762 USA)

With the development of the sensor technology, complementary data of different sources can be easily obtained for various applications. Despite the availability of adequate multisource observation data, for example, hyperspectral image (HSI) and light detection and ranging (LiDAR) data, existing methods may lack effective processing on structural information transmission and physical properties alignment, weakening the complementary ability of multiple sources in the collaborative classification task. The complementary information collaboration manner and the redundancy exclusion operator need to be redesigned for strengthening the semantic relatedness of multisources. As a remedy, we propose a structural optimization transmission framework, namely, structural optimization transmission network (SOT-Net), for collaborative land-cover classification of HSI and LiDAR data. Specifically, the SOT-Net is developed with three key modules: 1) cross-attention module; 2) dual-modes propagation module; and 3) dynamic structure optimization module. Based on above designs, SOT-Net can take full advantage of the reflectance-specific information of HSI and the detailed edge (structure) representations of multisource data. The inferred transmission plan, which integrates a self-alignment regularizer into the classification task, enhances the robustness of the feature extraction and classification process. Experiments show consistent outperformance of SOT-Net over baselines across three benchmark remote sensing datasets, and the results also demonstrate that the proposed framework can yield satisfying classification result even with small-size training samples.

Keywords: Laser radar; Feature extraction; Optimization; Indexes; Hyperspectral imaging; Collaboration; Task analysis; Collaborative classification; convolutional neural network (CNN); deep learning; hyperspectral image (HSI); light detection and ranging (LiDAR) data; pattern recognition; remote sensing
