Object detection is a fundamental component of computer vision, playing a pivotal role in various applications. However, the accurate detection of small and densely distributed objects remains a challenging problem in...
详细信息
ISBN:
(纸本)9798350359329;9798350359312
Object detection is a fundamental component of computer vision, playing a pivotal role in various applications. However, the accurate detection of small and densely distributed objects remains a challenging problem in this field. This challenge is particularly exacerbated in the context of aerial imagery, characterized by its distinctive bird's-eye view, intricate backgrounds, and the variability in object appearances. This paper addresses these persistent challenges in object detection, with a focus on the specific difficulties posed by aerial images. We propose a deformable end-to-end object detection with transformers (DETR)-based framework to enhance small object detection accuracy, ultimately contributing to improved computer vision capabilities in domains like remote sensing, surveillance, and autonomous aerial systems. Firstly, in order to aggregate the entire input sequence information in the backbone network and improve the detection accuracy of small objects, we propose DMCA based on deformable features and attention mechanisms. Secondly, in order to capture and model the relationships between samples for dense pixel-level representations in small objects and improve the detection accuracy of small objects, we try to introduce batchencoder by implementing an encoder in the batch dimension. Experimental results show that, compared to the baseline, our method significantly improves the accuracy of small object detection in aerial images. The processing and analysis of a large amount of remote sensing data require powerful computing power. The computing power Internet can integrate the scattered computing resources to provide flexible and scalable computing power to meet the computing needs of different sizes and types.
暂无评论