Ultra-high-definition (UHD) image restoration is becoming a critical research area due to the increasing demand for high-quality visual content in various applications, including autonomous driving, remotesensing, di...
详细信息
Currently, oriented object detection, as an emerging subfield within object detection, has garnered significant attention. Besides encompassing directional information, datasets of oriented objects exhibit notable cha...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Currently, oriented object detection, as an emerging subfield within object detection, has garnered significant attention. Besides encompassing directional information, datasets of oriented objects exhibit notable characteristics, including significant variations in object scales and a wide range of aspect ratios for ground-truth bounding boxes. Nevertheless, the current state-of-the-art two-stage rotating object detection models have not sufficiently addressed these characteristics, leading to inherent limitations in accuracy. In response to these challenges, we introduce the Rotated RCNN. Our model is the first to introduce trainable anchors in the field of oriented object detection to achieve anchor distributions similar to the ground truth boxes in the oriented object dataset. Furthermore, considering the distinctive traits of oriented ground truth boxes, we have devised a novel strategy for assigning labels to more effectively choose positive and negative samples specifically designed for oriented objects. In the regression phase of the RPN, we introduce shape constraints to alleviate accuracy losses stemming from mismatches between the encoding method and oriented objects. We comprehensively evaluate our model on the DOTAv1.0 and HRSC2016 datasets, demonstrating the effectiveness of our meticulously designed model.
The proceedings contain 52 papers. The topics discussed include: improvement of remotesensingimage target detection algorithm based on YOLO V5;A Study of Chan-Vese model with the introduction of edge information;rea...
The proceedings contain 52 papers. The topics discussed include: improvement of remotesensingimage target detection algorithm based on YOLO V5;A Study of Chan-Vese model with the introduction of edge information;real-time monitoring algorithm of muscle state based on sEMG signal;lane detection network with direction context;anomaly pixel detection via dual-branch uncertainty metrics;high precision license plate recognition algorithm in open scene;implementation and design of metro process quality inspection system based on imageprocessing technology;the research on remotesensingimage change detection based on deep learning;research on aircraft wheel hub pose detection method based on machine vision;lunar dome detection method based on few-shot object detection;and image enhancement algorithm of foggy sky with sky based on sky segmentation.
U-Net is widely lightened to achieve fast semantic segmentation of medical images and remotesensingimages. The cheap operation of the GhostNet series provides new lightening ideas. Currently, the GhostNet series has...
详细信息
We are studying in-orbit real-time object detection for remotesensing satellites. Due to the small object size of remotesensingimages, it is hard to achieve high detection accuracy, especially for resource-constrai...
详细信息
ISBN:
(纸本)9781510655386;9781510655379
We are studying in-orbit real-time object detection for remotesensing satellites. Due to the small object size of remotesensingimages, it is hard to achieve high detection accuracy, especially for resource-constrained spacecraft computers. Lightweight object detection models such as YOLO and SSD are feasible choices to achieve acceptable detection speed on board. This study proposes an accuracy-improvement method for the lightweight neural networks with an upscaling ratio estimator without retraining the model. The estimator exploits a scaling ratio that determines how much the image should be resized. With our scaling estimator, we have achieved 10.09% higher accuracy than the original YOLOv4-Tiny models with a 40% detection speed overhead.
In order to limit the interference of cloud noise on ground scene information, cloud detection has been a hot issue in research on remotesensingimageprocessing. Cloud detection labels the clouds in remotesensing i...
详细信息
Satellites are equipped with diverse sensors, capable of capturing detailed information across a multitude of wavelengths. The fusion of multispectral data is pivotal to amplify the visual representation of the area o...
详细信息
ISBN:
(纸本)9781510673816;9781510673809
Satellites are equipped with diverse sensors, capable of capturing detailed information across a multitude of wavelengths. The fusion of multispectral data is pivotal to amplify the visual representation of the area of interest. The improvement of information representation allows for enhanced processing, analysis, and other crucial tasks for numerous fields of study, including remotesensing, defense, and material characterization. Previous solutions often utilize traditional signalprocessing techniques, including principal component analysis (PCA), to accomplish data fusion. By performing fusion on a feature level, extracted information about the area of interest texture and boundaries are combined. The introduction of neural network techniques improved the reconstruction of data similar to the results obtained by conventional inference of humans. For example, the use of deep learning algorithms in conjunction with PCA allowed for refined reduction of redundancy and distortion of spectral data, in comparison to traditional methods alone. The introduction of the Vision Transformer (ViT) architecture, originally developed for two-dimensional image data, has revolutionized imageprocessing tasks, vastly improving performance at the cost of a large quantity of trainable parameters. Recent experimentation has proven that optimizing ViT for efficiency allows for comparable or even superior performance while lessening the computational cost. The transition from 2D to 3D information via utilization of additional depth and spatial data has also led to superior results as the added information allows for better representation of terrain features, making it invaluable for satellite imagery analysis. Combining the principles of ViT and 3D information to process complex satellite data can result in more effective data fusion to achieve a superior level of data visualization of multispectral satellite imagery in an efficient manner.
To address the multi-scale problem and interference of differences between data in remotesensingimage segmentation, a multi-scale Siamese dual decoding network is proposed. The twin network is used as the backbone n...
详细信息
This paper proposes a remotesensingimage-based method for the extraction of fire trails. Firstly, by acquiring multispectral images of the forest in the study area and pre-processing the multispectral images. A vege...
详细信息
Robust sea–land segmentation in optical remote-sensingimages is difficult task because of complex sea-land environment. Low contrast difference between sea and land in the case of panchromatic images and water color...
详细信息
暂无评论