We introduce a lightweight and accurate localization method that only utilizes the geometry of 2D-3D lines. Given a pre-captured 3D map, our approach localizes a panorama image, taking advantage of the holistic 360...
详细信息
ISBN:
(数字)9798350353006
ISBN:
(纸本)9798350353013
We introduce a lightweight and accurate localization method that only utilizes the geometry of 2D-3D lines. Given a pre-captured 3D map, our approach localizes a panorama image, taking advantage of the holistic 360° view. The system mitigates potential privacy breaches or domain discrepancies by avoiding trained or hand-crafted visual descriptors. However, as lines alone can be ambiguous, we express distinctive yet compact spatial contexts from relationships between lines, namely the dominant directions of parallel lines and the intersection between non-parallel lines. The resulting representations are efficient in processing time and memory compared to conventional visual descriptor-based methods. Given the groups of dominant line directions and their intersections, we accelerate the search process to test thousands of pose candidates in less than a millisecond without sacrificing accuracy. We empirically show that the proposed 2D-3D matching can localize panoramas for challenging scenes with similar structures, dramatic domain shifts or illumination changes. Our fully geometric approach does not involve extensive parameter tuning or neural network training, making it a practical algorithm that can be readily deployed in the real world. Project page including the code is available through this link: https://***/fgpl/.
One-class classification aims to learn one-class models from only in-class training samples. Because of lacking out-of-class samples during training, most conventional deep learning based methods suffer from the featu...
详细信息
A computational framework, which relies on the discontinuous Galerkin time-domain scheme, is proposed to simulate transient lasing generated by interactions of light with an active medium. The proposed scheme solves a...
A computational framework, which relies on the discontinuous Galerkin time-domain scheme, is proposed to simulate transient lasing generated by interactions of light with an active medium. The proposed scheme solves a coupled system of the Maxwell and the rate equations. The active medium inside the laser region is described quantum-mechanically by the rate equations to account for the atomic transitions of a multi-level system, while electromagnetic field interactions are described classically by the Maxwell equations. Numerical examples are provided to demonstrate the accuracy and the applicability of the proposed framework.
Pupillometry measures pupil size, and several open-source algorithms are available to analyse pupillometry data. However, only a few studies compared these algorithms’ accuracy and computational resources. This study...
详细信息
Ensemble learning has gained attention due to its ability to improve predictive performance in classification and regression problems. However, there are several issues related to optimizing accuracy, diversity, and e...
详细信息
We present Reusable Motion prior (ReMP), an effective motion prior that can accurately track the temporal evolution of motion in various downstream tasks. Inspired by the success of foundation models, we argue that a ...
详细信息
ISBN:
(数字)9798331510831
ISBN:
(纸本)9798331510848
We present Reusable Motion prior (ReMP), an effective motion prior that can accurately track the temporal evolution of motion in various downstream tasks. Inspired by the success of foundation models, we argue that a robust spatio-temporal motion prior can encapsulate underlying 3D dynamics applicable to various sensor modalities. We learn the rich motion prior from a sequence of complete parametric models of posed human body shape. Our prior can easily estimate poses in missing frames or noisy measurements despite significant occlusion by employing a temporal attention mechanism. More interestingly, our prior can guide the system with incomplete and challenging input measurements to quickly extract critical information to estimate the sequence of poses, significantly improving the training efficiency for mesh sequence recovery. ReMP consistently outperforms the baseline method on diverse and practical 3D motion data, including depth point clouds, LiDAR scans, and IMU sensor data. Project page is available in https://***/ReMP.
This paper introduces a novel approach for power system inspection using Mask-RCNN, an advanced model for instance segmentation. The accuracy of our model in identifying and diagnosing damaged low- to medium-voltage i...
详细信息
In offshore aquaculture operations, personnel equipped with diving gear are often necessary to inspect the underwater net cages for damage, particularly on the sea floor. This manual inspection process is time-consumi...
详细信息
ISBN:
(数字)9798331530839
ISBN:
(纸本)9798331530846
In offshore aquaculture operations, personnel equipped with diving gear are often necessary to inspect the underwater net cages for damage, particularly on the sea floor. This manual inspection process is time-consuming and complex. To overcome this problem, this paper proposes a computer vision solution for identifying damage in underwater net cages to address the inefficiencies and challenges of traditional manual inspections. The proposed scheme utilizes a high-performance multi-branch computational architecture designed based on ShuffleNet architecture to detect net cage damage more efficiently. Experimental results demonstrate that this work performs well on the ImageNet ILSVRC-2010 dataset and achieves an accuracy of 88.54% in underwater net damage detection.
暂无评论