版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Guangzhou Univ Sch Elect & Commun Engn Guangzhou 510006 Peoples R China
出 版 物:《IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS》 (IEEE Trans. Intell. Transp. Syst.)
年 卷 期:2023年第24卷第12期
页 面:14914-14926页
核心收录:
学科分类:0808[工学-电气工程] 08[工学] 0814[工学-土木工程] 0823[工学-交通运输工程]
基 金:Guangzhou Science and Technology Project [2023A03J0110] Guangdong Basic and Applied Basic Research Foundation [2023A1515030118] Key Discipline Project of Guangzhou Education Bureau
主 题:Stereo matching autonomous driving cost construction re-parameterization edge AI
摘 要:3D perception is an essential capability of autonomous vehicles. Most state-of-the-art stereo matching networks pursue higher prediction accuracy at the cost of the inference speed. However, high demand on computational resource pushes the cost of hardware, hindering practical applications of stereo matching. In this paper, we propose RMCNet, a novel re-parameterized coarse-to-fine network for stereo matching. RMCNet achieves real time on edge devices while outputs accurate disparity maps. To reduce the computing complexity of 3D convolution in cost aggregation, we propose Mixed Cost Volumes that consist of 4D and 3D cost volumes, enabling efficient optimization of the initial disparity map for real-time inference. An Efficient Feature Matching module is proposed to provide absolute disparity candidates for 3D cost volume. To increase the reliability of 4D and 3D cost volumes, we propose a simple but efficient 3D cost aggregation module and a pyramid 2D cost aggregation module for the aggregation of Mixed Cost Volumes. Moreover, we propose a Re-parameterized Channel Sensing module and a Re-parameterized Disparity-Aware module to replace 2D and 3D convolutional blocks with residual connections. Being evaluated on KITTI 2012 and KITTI 2015, the comprehensive performance of the proposed network is improved by the re-parameterization approach. Outstanding trade-off between accuracy and speed on edge devices over existing SOTA works is achieved. With fast average inference speed on NVIDIA Jetson Nano of 33.74 FPS, the accuracy in terms of 3px-all on KITTI 2012 test set and D1-all on KITTI 2015 test set are 3.01% and 3.41%, respectively.