Search Results - Inner Mongolia University Library

An Efficient Detection Framework for Aerial Imagery Based on Uniform Slicing Window

REMOTE SENSING, 2023, Vol. 15, No. 17

Authors: Yang, Xin; Song, Yong; Zhou, Ya; Liao, Yizhao; Yang, Jinqi; Huang, Jinxiang; Huang, Yiqian; Bai, Yashuo. Affiliations: Beijing Inst Technol, Sch Opt & Photon, Beijing 100081, Peoples R China; Beijing Inst Technol, Beijing Key Lab Precis Optoelect Measurement Instr, Beijing 100081, Peoples R China

Drone object detection faces numerous challenges, such as dense overlapping clusters, scale diversity, and long-tail distributions. Tiling inference with a uniform sliding window is an effective way of enlarging tiny objects while remaining efficient for real-world applications. However, merely partitioning input images may result in heavy truncation and an unexpected performance drop on large objects. Therefore, in this work, we strive to develop an improved tiling detection framework with both competitive performance and high efficiency. First, we formulate the tiling inference and training pipeline with a mixed data strategy. To avoid truncation and handle objects at all scales, we simultaneously perform global detection on the original image and local detection on the corresponding sub-patches, employing appropriate patch settings. Correspondingly, the training data includes both original images and patches generated by random online anchor-cropping, which ensures the effectiveness of the patches and enriches the image scenarios. Furthermore, a scale filtering mechanism assigns objects at diverse scales to the global and local detection tasks, which preserves the scale invariance of the detector and yields optimal fused predictions. As most of the additional operations are performed in parallel, the tiling inference remains highly efficient. Additionally, we devise two augmentations customized for tiling detection that effectively increase valid annotations; they generate more challenging drone scenarios and simulate realistic overlapping clusters, especially for rare categories. Comprehensive experiments on both public drone benchmarks and our customized real-world images demonstrate that, in comparison to other drone detection frameworks, the proposed tiling framework can significantly improve the performance of general detectors in drone scenarios at a lower additional computational cost.
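The tiling idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the overlap parameter, and the area-based scale filter are all hypothetical choices made for illustration, since the abstract does not give the patch settings or the filtering rule.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels


def uniform_tiles(width: int, height: int, tile: int, overlap: int) -> List[Box]:
    """Cover a width x height image with square tiles on a fixed stride."""
    if not 0 <= overlap < tile:
        raise ValueError("overlap must satisfy 0 <= overlap < tile")
    stride = tile - overlap

    def starts(length: int) -> List[int]:
        if length <= tile:
            return [0]
        s = list(range(0, length - tile, stride))
        s.append(length - tile)  # last tile sits flush with the image border
        return s

    return [(x, y, min(x + tile, width), min(y + tile, height))
            for y in starts(height) for x in starts(width)]


def to_image_coords(box: Box, tile_box: Box) -> Box:
    """Shift a box predicted inside a tile back into full-image coordinates."""
    ox, oy = tile_box[0], tile_box[1]
    return (box[0] + ox, box[1] + oy, box[2] + ox, box[3] + oy)


def keep_by_scale(box: Box, source: str, area_threshold: int) -> bool:
    """Assumed filter: large boxes come from the global pass, small from tiles."""
    area = (box[2] - box[0]) * (box[3] - box[1])
    return area >= area_threshold if source == "global" else area < area_threshold


tiles = uniform_tiles(width=1000, height=700, tile=400, overlap=100)
```

In a full pipeline, a detector would run once on the original image and once per tile, tile predictions would be shifted with `to_image_coords`, both sets would be filtered with something like `keep_by_scale`, and the survivors would be merged with a suppression step.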

Keywords: aerial object detection; sliding window; augmentation; unmanned aerial vehicles


Gaussian Synthesis for High-Precision Location in Oriented Object Detection


IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, Vol. 61, pp. 1-1

Authors: Li, Zhonghua; Hou, Biao; Wu, Zitong; Ren, Bo; Ren, Zhongle; Jiao, Licheng. Affiliations: Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ China, Xian 710071, Peoples R China; Xidian Univ, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Peoples R China

In aerial image scenes, objects have arbitrary orientations, a large range of scales, and dense distributions. Thus, the object detector uses an oriented bounding box (OBB) to locate objects, which is more complex and challenging than a horizontal bounding box (HBB) detector. Mainstream OBB detectors mostly use a one-to-many label assignment strategy to predict multiple bounding boxes for the same object and filter out repeated predictions by nonmaximum suppression (NMS). NMS ranks boxes by confidence and drops any detection box whose intersection over union (IoU) with a higher-ranked box exceeds the threshold, which makes it easy to end up with a locally optimal result. The clustered synthesis method gets more accurate results than the original NMS, but applying it to the OBB detector leads to border shift, which arises from the angular discontinuity problem. Therefore, we use Gaussian OBB (G-OBB) to deal with the angular discontinuity and thus eliminate the offset generated by direct synthesis. However, G-OBB is not an easy representation to understand or describe. For this reason, we analyze the properties of G-OBB and design a decoding method to convert a G-OBB to a rotated rectangular box, further discussing its conditions. Based on the decoding method, we propose a Gaussian synthesis (GauS) algorithm, which transforms the OBB into Gaussian space, followed by synthesis, and finally transforms the synthesis result back into a new OBB. We have derived the synthesis and decoding methods and further verified their effectiveness. The extensive experiments on several existing models show that GauS takes very little computation and improves the detector's high-precision performance. Extensive experiments verify the effectiveness, stability, and universality of the proposed algorithm. In addition, the RTMDet using GauS achieves a performance of 81.61 AP50 and gains a 0.39% improvement in mean average precision (mAP), which achieves the state-of-the-art (SOTA) performance. Our implementation is available a
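The round trip between an OBB and a Gaussian that the abstract above describes can be sketched as follows. This is an illustration under assumptions, not the paper's GauS algorithm: it uses the convention common in Gaussian-based rotated-box losses (mean at the box center, covariance R diag(w²/4, h²/4) Rᵀ), the function names are hypothetical, and the synthesis step itself is not reproduced because the abstract does not specify it.

```python
import numpy as np


def obb_to_gaussian(cx, cy, w, h, theta):
    """Encode an OBB (center, size, angle in radians) as a 2-D Gaussian."""
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, -s], [s, c]])
    scale = np.diag([w ** 2 / 4.0, h ** 2 / 4.0])
    return np.array([cx, cy]), rot @ scale @ rot.T


def gaussian_to_obb(mean, cov):
    """Decode a Gaussian back to an OBB; the longer side is returned as w."""
    vals, vecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    major = vecs[:, 1]                # axis of the largest eigenvalue
    w = 2.0 * np.sqrt(vals[1])
    h = 2.0 * np.sqrt(vals[0])
    theta = np.arctan2(major[1], major[0])
    # An eigenvector's sign is arbitrary, so fold the angle into [-pi/2, pi/2).
    theta = (theta + np.pi / 2) % np.pi - np.pi / 2
    return mean[0], mean[1], w, h, theta


mean, cov = obb_to_gaussian(50.0, 40.0, 20.0, 10.0, 0.3)
decoded = gaussian_to_obb(mean, cov)

# The same rectangle written with swapped sides and a 90-degree offset
# maps to the same covariance, so the encoding has no angular jump.
_, cov_swapped = obb_to_gaussian(50.0, 40.0, 10.0, 20.0, 0.3 + np.pi / 2)
```

The decode is only well defined when the two eigenvalues differ; for a square box every direction is an eigenvector, which is one of the conditions a decoding method has to address.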

Keywords: Detectors; object detection; Prediction algorithms; Decoding; Shape; Task analysis; Filtering algorithms; aerial object detection; Gaussian synthesis (GauS); nonmaximum suppression (NMS); oriented object detection; postprocess


Towards Feature Decoupling for Lightweight Oriented Object Detection in Remote Sensing Images


REMOTE SENSING, 2023, Vol. 15, No. 15

Authors: Deng, Chenwei; Jing, Donglin; Han, Yuqi; Deng, Zhiyuan; Zhang, Hong. Affiliations: Beijing Inst Technol, Chongqing Innovat Ctr, Chongqing 401135, Peoples R China; Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China; Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China

Recently, improvements in detection performance on remote sensing images have relied on deeper convolutional layers and more complex convolutional structures, which significantly increases the storage space and computational complexity of the detector. Although previous work has designed various novel lightweight convolutions, when these convolutional structures are applied to remote sensing detection tasks, the inconsistency between features and targets, as well as between features and tasks, in the detection architecture is often ignored: (1) The features extracted by convolution sliding in a fixed direction make it difficult to effectively model targets with arbitrary direction distribution, so the detector needs more parameters to encode direction information and the network parameters become highly redundant; (2) The detector shares features from the backbone, but the classification task requires rotation-invariant features while the regression task requires rotation-sensitive features. This inconsistency between tasks can lead to inefficient convolutional structures. Therefore, this paper proposes a detector that uses Feature Decoupling for Lightweight Oriented Object Detection (FDLO-Det). Specifically, we construct a rotational separable convolution that extracts rotation-equivariant features while significantly reducing network parameters and computational complexity through highly shared parameters. Next, we introduce an orthogonal polarization transformation module that decomposes rotation-equivariant features along the horizontal and vertical orthogonal directions and uses polarization functions to filter out the features required for the classification and regression tasks, effectively improving detector performance. Extensive experiments on DOTA, HRSC2016, and UCAS-AOD show that the proposed detector can achieve the best performance and an effective balance between computational complexity and detection accuracy.

Keywords: aerial object detection; convolutional neural network; deep compression; lightweight network


Complete Rotated Localization Loss Based on Super-Gaussian Distribution for Remote Sensing Images


IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, Vol. 61, pp. 1-1

Authors: Li, Zhonghua; Hou, Biao; Wu, Zitong; Guo, Zhengxi; Ren, Bo; Guo, Xianpeng; Jiao, Licheng. Affiliations: Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ China, Xian 710071, Peoples R China; Xidian Univ, Joint Int Res Lab Intelligent Percept & Computat, Xian 710071, Peoples R China

Localization regression in oriented object detection tasks has long faced boundary discontinuity and angular discontinuity problems induced by periodic angles. These problems were successfully resolved by using a 2-D Gaussian distribution to model the oriented bounding box (OBB). However, the angular information of square-like objects is lost when they are converted to a 2-D Gaussian distribution, which is a systematic problem. The fundamental reason is that when the aspect ratio of the object tends to 1, the equiprobability curve of the 2-D Gaussian distribution degenerates from an ellipse to a circle, thus losing the orientation information of the rotated object. As a result, the bounding boxes of such square-like objects are not learned effectively. To resolve this problem, we used the Lamé curve (or superellipse) to modify the existing 2-D Gaussian function and designed a super-Gaussian distribution. This distribution can maintain anisotropy at arbitrary aspect ratios, thus preserving the angular information of the oriented object. We used the Kullback-Leibler (KL) divergence to measure the distance between two super-Gaussian distributions and converted it into a localization loss (SGKLD) by a function. SGKLD is an improved version of the KLD loss. By modifying the form of the probability distribution, we elegantly fix the missing-angle problem of the traditional Gaussian distribution. We validated the effectiveness of the proposed algorithm on several datasets and obtained state-of-the-art (SOTA) performance. Our algorithm achieves a mean average precision (mAP) of 80.07, 76.59, 62.27, and 90.55/98.13 on the DOTA-v1.0, DOTA-v1.5, DOTA-v2.0, and HRSC2016 datasets, respectively.
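The degeneration described in the abstract above is easy to check numerically. The sketch below assumes the usual Gaussian encoding of an OBB, with covariance R diag(w²/4, h²/4) Rᵀ; the function name is hypothetical, and the super-Gaussian distribution itself is not reproduced because the abstract does not give its form.

```python
import numpy as np


def obb_covariance(w, h, theta):
    """Covariance of the 2-D Gaussian that encodes a w x h box at angle theta."""
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, -s], [s, c]])
    return rot @ np.diag([w ** 2 / 4.0, h ** 2 / 4.0]) @ rot.T


# A square box: rotating it leaves the covariance unchanged (a circle).
square_a = obb_covariance(10.0, 10.0, 0.0)
square_b = obb_covariance(10.0, 10.0, np.pi / 6)

# An elongated box: the same rotation changes the covariance (an ellipse).
rect_a = obb_covariance(20.0, 10.0, 0.0)
rect_b = obb_covariance(20.0, 10.0, np.pi / 6)

angle_lost_for_square = np.allclose(square_a, square_b)
angle_kept_for_rectangle = not np.allclose(rect_a, rect_b)
```

When w equals h the scale matrix is a multiple of the identity, so R cancels against Rᵀ and the angle drops out of the encoding entirely. A loss computed on these covariances therefore provides no gradient with respect to the angle of a square box.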

Keywords: Gaussian distribution; Task analysis; Location awareness; Shape; Probability distribution; object detection; Systematics; aerial object detection; Kullback-Leibler (KL) divergence; oriented object detection; super-Gaussian distribution; superellipse


SAFDet: A Semi-Anchor-Free Detector for Effective Detection of Oriented Objects in Aerial Images


REMOTE SENSING, 2020, Vol. 12, No. 19, pp. 1-16

Authors: Fang, Zhenyu; Ren, Jinchang; Sun, He; Marshall, Stephen; Han, Junwei; Zhao, Huimin. Affiliations: Guangdong Polytech Normal Univ, Sch Comp Sci, Guangzhou 510665, Peoples R China; Univ Strathclyde, Dept Elect & Elect Engn, Glasgow G1 1XQ, Lanark, Scotland; Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China; Northwestern Polytech Univ, Sch Automat, Xian 710109, Peoples R China

An oriented bounding box (OBB) is preferable to a horizontal bounding box (HBB) for accurate object detection. Most existing works utilize a two-stage detector that locates the HBB and the OBB in turn, and they suffer from misaligned horizontal proposals and interference from complex backgrounds. To tackle these issues, region of interest transformers and attention models were proposed, yet they are extremely computationally intensive. To this end, we propose a semi-anchor-free detector (SAFDet) for object detection in aerial images, where a rotation-anchor-free-branch (RAFB) is used to enhance the foreground features by precisely regressing the OBB. Meanwhile, a center-prediction-module (CPM) is introduced to enhance object localization and suppress background noise. Both RAFB and CPM are deployed only during training, which avoids any increase in the computational cost of inference. Evaluation on the DOTA and HRSC2016 datasets validates that our approach achieves a good balance between accuracy and computational cost.

Keywords: rotate region convolutional neural network; anchor free; aerial object detection
