咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >ES6D: A Computation Efficient ... 收藏
arXiv

ES6D: A Computation Efficient and Symmetry-Aware 6D Pose Regression Framework

作     者:Mo, Ningkai Gan, Wanshui Yokoya, Naoto Chen, Shifeng 

作者机构:ShenZhen Key Lab of Computer Vision and Pattern Recognition Shenzhen Institute of Advanced Technology Chinese Academy of Sciences China The University of Tokyo Japan RIKEN Japan 

出 版 物:《arXiv》 (arXiv)

年 卷 期:2022年

核心收录:

主  题:Computational efficiency 

摘      要:In this paper, a computation efficient regression framework is presented for estimating the 6D pose of rigid objects from a single RGB-D image, which is applicable to handling symmetric objects. This framework is designed in a simple architecture that efficiently extracts point-wise features from RGB-D data using a fully convolutional network, called XYZNet, and directly regresses the 6D pose without any post refinement. In the case of symmetric object, one object has multiple ground-truth poses, and this one-to-many relationship may lead to estimation ambiguity. In order to solve this ambiguity problem, we design a symmetry-invariant pose distance metric, called average (maximum) grouped primitives distance or A(M)GPD. The proposed A(M)GPD loss can make the regression network converge to the correct state, i.e., all minima in the A(M)GPD loss surface are mapped to the correct poses. Extensive experiments on YCB-Video and TLESS datasets demonstrate the proposed framework s substantially superior performance in top accuracy and low computational cost. The relevant code is available in https://***/GANWANSHUI/***. Copyright © 2022, The Authors. All rights reserved.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分