Search Results - Inner Mongolia University Library

Augmented Autoencoders: Implicit 3D Orientation Learning for 6D Object Detection

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, Vol. 128, No. 3, pp. 714-729

Authors: Sundermeyer, Martin; Marton, Zoltan-Csaba; Durner, Maximilian; Triebel, Rudolph (German Aerosp Ctr DLR, D-82234 Wessling, Germany; Tech Univ Munich, D-80333 Munich, Germany)

We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Our pipeline achieves state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D domain. We also evaluate on the LineMOD dataset where we can compete with other synthetically trained approaches. We further increase performance by correcting 3D orientation estimates to account for perspective errors when the object deviates from the image center and show extended results. Our code is available at https://***/dLR-RM/AugmentedAutoencoder.

Keywords: 6D object detection; Pose estimation; Domain randomization; Autoencoder; Synthetic data; Symmetries
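The abstract's phrase "an implicit representation of object orientations defined by samples in a latent space" can be read as a lookup: latent codes of known views are stored, and a test code is matched to its most similar stored code. The following is only a minimal sketch of that reading, not the authors' implementation. The function names, the 3-dimensional toy codes, and the `view_*` labels are hypothetical, and cosine similarity is an assumed choice of similarity measure.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest_orientation(query_code, codebook):
    """Return the label of the codebook entry most similar to query_code.

    codebook is a list of (latent_code, orientation_label) pairs.
    """
    best = max(codebook, key=lambda entry: cosine_similarity(query_code, entry[0]))
    return best[1]


# Toy codebook: hypothetical latent codes with made-up orientation labels.
codebook = [
    ([1.0, 0.0, 0.0], "view_000"),
    ([0.0, 1.0, 0.0], "view_090"),
    ([0.0, 0.0, 1.0], "view_180"),
]

# A hypothetical latent code produced for a test image crop.
result = nearest_orientation([0.1, 0.9, 0.2], codebook)
print(result)
```

Because the orientation is read off from the nearest stored sample rather than regressed directly, views that look identical can share near-identical codes, which is one plausible way to understand the claim that symmetries are handled inherently.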

Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

15th European Conference on Computer Vision (ECCV)

Authors: Sundermeyer, Martin; Marton, Zoltan-Csaba; Durner, Maximilian; Brucker, Manuel; Triebel, Rudolph (German Aerosp Ctr DLR, D-82234 Wessling, Germany; Tech Univ Munich, D-80333 Munich, Germany)

ISBN: (print) 9783030012311; 9783030012304

We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Experiments on the T-LESS and LineMOD datasets show that our method outperforms similar model-based approaches and competes with state-of-the-art approaches that require real pose-annotated images.

Keywords: 6D object detection; Pose estimation; Domain randomization; Autoencoder; Synthetic data; Pose ambiguity; Symmetries

Object Pose Estimation Based on Multi-precision Vectors and Seg-driven PnP

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, Vol. 133, No. 5, pp. 2620-2634

Authors: Wang, Yulin; Li, Hongli; Luo, Chen (Southeast Univ, Sch Mech Engn, Nanjing 211189, Jiangsu, Peoples R China)

Object pose estimation based on a single RGB image has wide application potential but is difficult to achieve. Existing pose estimation involves various inference pipelines. One popular pipeline is to first use Convolutional Neural Networks (CNN) to predict 2D projections of 3D keypoints in a single RGB image and then calculate the 6D pose via a Perspective-n-Point (PnP) solver. Due to the gap between synthetic data and real data, a model trained on synthetic data has difficulty predicting the 6D pose accurately when applied to real data. To address this problem, we propose a two-stage pipeline of object pose estimation based upon multi-precision vectors and segmentation-driven (Seg-driven) PnP. In the keypoint localization stage, we first develop a CNN-based three-branch network to predict multi-precision 2D vectors pointing to 2D keypoints. Then we introduce an accurate and fast Keypoint Voting scheme of Multi-precision vectors (KVM), which computes low-precision 2D keypoints using low-precision vectors and refines 2D keypoints on mid- and high-precision vectors. In the pose calculation stage, we propose Seg-driven PnP to refine the 3D translation of poses and get the optimal pose by minimizing the non-overlapping area between segmented and rendered masks. The Seg-driven PnP leverages 2D segmentation trained on real images to improve the accuracy of pose estimation trained on synthetic data, thereby reducing the synthetic-to-real gap. Extensive experiments show our approach materially outperforms state-of-the-art methods on LM and HB datasets. Importantly, our proposed method works reasonably well for weakly textured and occluded objects in diverse scenes.

Keywords: 6D object detection; Pose estimation; Monocular RGB; Synthetic data; Keypoint; PnP
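The selection criterion named in the abstract, "minimizing the non-overlapping area between segmented and rendered masks", can be illustrated with a toy search. This is a sketch under strong assumptions, not the paper's Seg-driven PnP: the "renderer" is replaced by a hypothetical `shift_mask` helper that slides a fixed template mask horizontally, and the candidate set is a handful of integer pixel shifts standing in for candidate translations.

```python
def non_overlap_area(mask_a, mask_b):
    """Count pixels where exactly one of the two binary masks is set."""
    return sum(
        a != b
        for row_a, row_b in zip(mask_a, mask_b)
        for a, b in zip(row_a, row_b)
    )


def shift_mask(mask, dx):
    """Hypothetical stand-in for rendering: shift a mask dx pixels to the right,
    filling vacated pixels with 0."""
    width = len(mask[0])
    return [
        [row[x - dx] if 0 <= x - dx < width else 0 for x in range(width)]
        for row in mask
    ]


# Toy segmented mask (as if predicted from a real image).
segmented = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 1, 0],
    [0, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

# Toy mask "rendered" at the initial pose estimate.
template = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

# Try a few candidate shifts and keep the one with the smallest non-overlap.
candidates = range(-2, 3)
best_dx = min(candidates, key=lambda dx: non_overlap_area(segmented, shift_mask(template, dx)))
print(best_dx, non_overlap_area(segmented, shift_mask(template, best_dx)))
```

In a real pipeline the candidates would be 3D translations and each mask would come from rendering the object model at that pose; the scoring step would be the same comparison against the segmentation.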

Evaluation of the use of box size priors for 6D plane segment tracking from point clouds with applications in cargo packing

EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2024, Vol. 2024, No. 1, p. 17

Authors: Camacho-Munoz, Guillermo A.; Rodriguez, Sandra Esperanza Nope; Loaiza-Correa, Humberto; Lima, Joao Paulo Silva do Monte; Roberto, Rafael Alves (Univ Valle, Elect Engn Dept, Calle 13 100-00, Cali 760042, Valle Del Cauca, Colombia; Univ Fed Rural Pernambuco, Dept Computacao, Visual Comp Lab, Rua Dom Manoel Medeiros S-N, BR-52171900 Recife, PE, Brazil; Univ Fed Pernambuco, Ctr Informat, Voxar Labs, Ave Jornalista Anibal Fernandes S-N, BR-50740560 Recife, PE, Brazil)

This paper addresses the problem of 6D pose tracking of plane segments from point clouds acquired from a mobile camera. This is motivated by manual packing operations, where an opportunity exists to enhance performance, aiding operators with instructions based on augmented reality. The approach takes point clouds as input because of their advantages for extracting geometric information relevant to estimating the 6D pose of rigid objects. The proposed algorithm begins with a RANSAC fitting stage on the raw point cloud. It then implements strategies to compute the 2D size and 6D pose of plane segments from geometric analysis of the fitted point cloud. Redundant detections are combined using a new quality factor that predicts point cloud mapping density and allows the selection of the most accurate detection. The algorithm is designed for dynamic scenes, employing a novel particle concept in the point cloud space to track detections' validity over time. A variant of the algorithm uses box size priors (available in most packing operations) to filter out irrelevant detections. The impact of this prior knowledge is evaluated through an experimental design that compares the performance of a plane segment tracking system, considering variations in the tracking algorithm and camera speed (onboard the packing operator). The tracking algorithm varies at two levels: algorithm ($A_{wpk}$), which integrates prior knowledge of box sizes, and algorithm ($A_{woutpk}$), which assumes ignorance of box pro

Keywords: Visual 6D tracking; Plane tracking; Manual packing of cargo; 6D object detection; Visual tracking on dynamic environment; Multi-object tracking; Integration of size priors; Spatial mapping
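The abstract states that the algorithm "begins with a RANSAC fitting stage on the raw point cloud". A generic RANSAC plane fit, as it is commonly described, might look like the sketch below. It is not the paper's implementation: the function names, the inlier threshold, the iteration count, and the synthetic cloud are all assumptions made for illustration.

```python
import math
import random


def plane_from_points(p1, p2, p3):
    """Return (unit_normal, d) of the plane n.x + d = 0 through three points,
    or None if the points are (nearly) collinear."""
    u = [b - a for a, b in zip(p1, p2)]
    v = [b - a for a, b in zip(p1, p3)]
    n = [
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    ]
    length = math.sqrt(sum(c * c for c in n))
    if length < 1e-12:
        return None
    n = [c / length for c in n]
    d = -sum(c * p for c, p in zip(n, p1))
    return n, d


def ransac_plane(points, threshold=0.01, iterations=200, seed=0):
    """Fit a plane by repeatedly sampling 3 points and keeping the candidate
    with the most points within `threshold` of the plane."""
    rng = random.Random(seed)
    best_plane, best_inliers = None, []
    for _ in range(iterations):
        plane = plane_from_points(*rng.sample(points, 3))
        if plane is None:
            continue  # degenerate sample, try again
        n, d = plane
        inliers = [
            p for p in points
            if abs(sum(c * x for c, x in zip(n, p)) + d) <= threshold
        ]
        if len(inliers) > len(best_inliers):
            best_plane, best_inliers = plane, inliers
    return best_plane, best_inliers


# Synthetic cloud: a 5 x 4 grid on the plane z = 1, plus three outliers.
cloud = [(float(x), float(y), 1.0) for x in range(5) for y in range(4)]
cloud += [(0.5, 0.5, 3.0), (2.0, 1.0, 5.0), (3.5, 2.5, -2.0)]

(normal, offset), inliers = ransac_plane(cloud)
print(normal, offset, len(inliers))
```

The later stages mentioned in the abstract (computing 2D size and 6D pose of each segment, merging redundant detections, filtering by box size priors) would operate on the inlier sets that a stage like this produces.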
