检索结果-内蒙古大学图书馆

P1AC: Revisiting Absolute Pose From a Single Affine Correspondence

学校读者我要写书评

暂无评论

P1AC: Revisiting Absolute Pose From a Single Affine Correspo...

International Conference on computer vision (ICCV)

作者： Jonathan Ventura Zuzana Kukelova Torsten Sattler Dániel Baráth Department of Computer Science & Software Engineering Cal Poly San Luis Obispo Visual Recognition Group Faculty of Electrical Engineering Czech Technical University in Prague Czech Institute of Informatics Robotics and Cybernetics Czech Technical University in Prague Computer Vision and Geometry Group ETH Zürich

Affine correspondences have traditionally been used to improve feature matching over wide baselines. While recent work has successfully used affine correspondences to solve various relative camera pose estimation problems, less attention has been given to their use in absolute pose estimation. We introduce the first general solution to the problem of estimating the pose of a calibrated camera given a single observation of an oriented point and an affine correspondence. The advantage of our approach (P1AC) is that it requires only a single correspondence, in comparison to the traditional point-based approach (P3P), significantly reducing the combinatorics in robust estimation. P1AC provides a general solution that removes restrictive assumptions made in prior work and is applicable to large-scale image-based localization. We propose a minimal solution to the P1AC problem and evaluate our novel solver on synthetic data, showing its numerical stability and performance under various types of noise. On standard image-based localization benchmarks we show that P1AC achieves more accurate results than the widely used P3P algorithm. Code for our method is available at https://***/jonathanventura/P1AC/.

关键词：

Relative pose of three calibrated and partially calibrated cameras from four points using virtual correspondences

学校读者我要写书评

暂无评论

arXiv 2023年

作者： Tzamos, Charalambos Kocur, Viktor Barath, Daniel Haladová, Zuzana Berger Sattler, Torsten Kukelova, Zuzana Visual Recognition Group Faculty of Electrical Engineering Czech Technical University in Prague Czech Republic Faculty of Mathematics Physics and Informatics Comenius University in Bratislava Slovakia ETH Zürich Computer Vision and Geometry Group Switzerland Czech Institute of Informatics Robotics and Cybernetics Czech Technical University in Prague Czech Republic

We study challenging problems of estimating the relative pose of three cameras and propose novel efficient solutions to the configurations (1) of four points in three calibrated cameras (the 4p3v problem), and (2) of four points in three cameras with unknown shared focal length (the 4p3vf problem). Our solutions are based on the simple idea of generating one or two additional virtual point correspondences in two views by using the information from the locations of the input correspondences. We generate such correspondences using a very simple and efficient strategy, where the new points are the mean points of three corresponding input points. The new solvers are efficient and easy to implement, since they are based on existing efficient minimal solvers, i.e., the well-known 5-point and 6-point relative pose solvers and the P3P solver. Extensive experiments on real data show that our solvers achieve state-of-the-art results. We also present a simple network that can improve the precision of the mean-point correspondences, showing the potential to learn better virtual point correspondences. Copyright © 2023, The Authors. All rights reserved.

关键词： Cameras

Partially calibrated semi-generalized pose from hybrid point correspondences

学校读者我要写书评

暂无评论

arXiv 2022年

作者： Bhayani, Snehal Larsson, Viktor Sattler, Torsten Heikkilä, Janne Kukelova, Zuzana Center for Machine Vision and Signal Analysis University of Oulu Finland Czech Institute of Informatics Robotics and Cybernetics Czech Technical University Prague Czech Republic Computer Vision and Geometry Group Department of Computer Science ETH Zürich Switzerland Visual Recognition Group Faculty of Electrical Engineering Czech Technical University Prague Czech Republic

In this paper we study the problem of estimating the semi-generalized pose of a partially calibrated camera, i.e., the pose of a perspective camera with unknown focal length w.r.t. a generalized camera, from a hybrid set of 2D-2D and 2D-3D point correspondences. We study all possible camera configurations within the generalized camera system. To derive practical solvers to previously unsolved challenging configurations, we test different parameterizations as well as different solving strategies based on state-of-the-art methods for generating efficient polynomial solvers. We evaluate the three most promising solvers, i.e., the H51f solver with five 2D-2D correspondences and one 2D-3D correspondence viewed by the same camera inside generalized camera, the H32f solver with three 2D-2D and two 2D-3D correspondences, and the H13f solver with one 2D-2D and three 2D-3D correspondences, on synthetic and real data. We show that in the presence of noise in the 3D points these solvers provide better estimates than the corresponding absolute pose solvers. Copyright © 2022, The Authors. All rights reserved.

关键词： Cameras

Few-Shot Medical Image Segmentation with High-Fidelity Prototypes

学校读者我要写书评

暂无评论

arXiv 2024年

作者： Tang, Song Yan, Shaxu Qi, Xiaozhi Gao, Jianxin Ye, Mao Zhang, Jianwei Zhu, Xiatian IMI Group School of Health Sciences and Engineering University of Shanghai for Science and Technology Shanghai China TAMS Group Department of Informatics Universität Hamburg Hamburg Germany School of Computer Science and Engineering University of Electronic Science and Technology of China Chengdu China Surrey Institute for People-Centred Artificial Intelligence Centre for Vision Speech and Signal Processing University of Surrey Guildford United Kingdom Shenzhen Key Laboratory of Minimally Invasive Surgical Robotics and System Shenzhen Institute of Advanced Technology Chinese Academy of Sciences China

Few-shot Semantic Segmentation (FSS) aims to adapt a pretrained model to new classes with as few as a single labelled training sample per class. Despite the prototype based approaches have achieved substantial success, existing models are limited to the imaging scenarios with considerably distinct objects and not highly complex background, e.g., natural images. This makes such models suboptimal for medical imaging with both conditions invalid. To address this problem, we propose a novel Detail Self-refined Prototype Network (DSPNet) to constructing high-fidelity prototypes representing the object foreground and the background more comprehensively. Specifically, to construct global semantics while maintaining the captured detail semantics, we learn the foreground prototypes by modelling the multi-modal structures with clustering and then fusing each in a channel-wise manner. Considering that the background often has no apparent semantic relation in the spatial dimensions, we integrate channel-specific structural information under sparse channel-aware regulation. Extensive experiments on three challenging medical image benchmarks show the superiority of DSPNet over previous state-of-the-art methods. The code and data are available at https://***/tntek/DSPNet. Copyright © 2024, The Authors. All rights reserved.

关键词： Semantic Segmentation

Towards robust monocular visual odometry for flying robots on planetary missions

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Wudenka, Martin Müller, Marcus G. Demmel, Nikolaus Wedler, Armin Triebel, Rudolph Cremers, Daniel Stürzl, Wolfgang Institute of Robotics and Mechatronics German Aerospace Center DLR Computer Vision Group Department of Informatics Technical University of Munich Germany Autonomous Systems Lab ETH Zurich Switzerland

In the future, extraterrestrial expeditions will not only be conducted by rovers but also by flying robots. The technical demonstration drone Ingenuity, that just landed on Mars, will mark the beginning of a new era of exploration unhindered by terrain traversability. Robust self-localization is crucial for that. Cameras that are lightweight, cheap and information-rich sensors are already used to estimate the ego-motion of vehicles. However, methods proven to work in man-made environments cannot simply be deployed on other planets. The highly repetitive textures present in the wastelands of Mars pose a huge challenge to descriptor matching based approaches. In this paper, we present an advanced robust monocular odometry algorithm that uses efficient optical flow tracking to obtain feature correspondences between images and a refined keyframe selection criterion. In contrast to most other approaches, our framework can also handle rotation-only motions that are particularly challenging for monocular odometry systems. Furthermore, we present a novel approach to estimate the current risk of scale drift based on a principal component analysis of the relative translation information matrix. This way we obtain an implicit measure of uncertainty. We evaluate the validity of our approach on all sequences of a challenging real-world dataset captured in a Mars-like environment and show that it outperforms state-of-the-art approaches. The source code is publicly available at: https://***/DLR-RM/granite. Copyright © 2021, The Authors. All rights reserved.

关键词： Principal component analysis

Calibrated and partially calibrated semi-generalized homographies

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Bhayani, Snehal Sattler, Torsten Barath, Daniel Beliansky, Patrik Heikkilä, Janne Kukelova, Zuzana Center for Machine Vision and Signal Analysis University of Oulu Finland Czech Institute of Informatics Robotics and Cybernetics Czech Technical University in Prague Computer Vision and Geometry Group Department of Computer Science ETH Zürich Faculty of Mathematics and Physics Charles University Prague Czech Republic Visual Recognition Group Faculty of Electrical Engineering Czech Technical University in Prague

In this paper, we propose the first minimal solutions for estimating the semi-generalized homography given a perspective and a generalized camera. The proposed solvers use five 2D-2D image point correspondences induced by a scene plane. One group of solvers assumes the perspective camera to be fully calibrated, while the other estimates the unknown focal length together with the absolute pose parameters. This setup is particularly important in structurefrom-motion and visual localization pipelines, where a new camera is localized in each step with respect to a set of known cameras and 2D-3D correspondences might not be available. Thanks to a clever parametrization and the elimination ideal method, our solvers only need to solve a univariate polynomial of degree five or three, respectively a system of polynomial equations in two variables. All proposed solvers are stable and efficient as demonstrated by a number of synthetic and real-world experiments. Copyright © 2021, The Authors. All rights reserved.

关键词： Cameras

Calibrated and Partially Calibrated Semi-Generalized Homographies

学校读者我要写书评

暂无评论

Calibrated and Partially Calibrated Semi-Generalized Homogra...

International Conference on computer vision (ICCV)

作者： Snehal Bhayani Torsten Sattler Daniel Barath Patrik Beliansky Janne Heikkilä Zuzana Kukelova Center for Machine Vision and Signal Analysis University of Oulu Finland Czech Institute of Informatics Robotics and Cybernetics Czech Technical University in Prague Computer Vision and Geometry Group ETH Zürich Faculty of Mathematics and Physics Charles University Prague Visual Recognition Group Faculty of Electrical Engineering Czech Technical University in Prague

ISBN: (纸本)9781665428132

In this paper, we propose the first minimal solutions for estimating the semi-generalized homography given a perspective and a generalized camera. The proposed solvers use five 2D-2D image point correspondences induced by a scene plane. One group of solvers assumes the perspective camera to be fully calibrated, while the other estimates the unknown focal length together with the absolute pose parameters. This setup is particularly important in structure-from-motion and visual localization pipelines, where a new camera is localized in each step with respect to a set of known cameras and 2D-3D correspondences might not be available. Thanks to a clever parametrization and the elimination ideal method, our solvers only need to solve a univariate polynomial of degree five or three, respectively a system of polynomial equations in two variables. All proposed solvers are stable and efficient as demonstrated by a number of synthetic and real-world experiments.

关键词： Location awareness Visualization computer vision Pipelines Focusing Cameras

OTE: Optimal Trustworthy EdgeAI solutions for smart cities

学校读者我要写书评

暂无评论

OTE: Optimal Trustworthy EdgeAI solutions for smart cities

IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID)

作者： Vasileios Mygdalis Lorenzo Carnevale Jose Ramiro Martí nez-De-Dios Dmitriy Shutin Giovanni Aiello Massimo Villari Ioannis Pitas Department of Informatics Aristotle University of Thessaloniki Thessaloniki Greece Department of Mathematical and Computer Science Physics and Hearth Sciences University of Messina Messina Italy Gruppo Nazionale per il Calcolo Scientifico (GNCS) Istituto Nazionale di Alta Matematica (INdAM) &#x201C F. Severi&#x201D Rome Italy Robotics Vision and Control Group University of Seville Seville Spain Institute of Communications and Navigation German Aerospace Center (DLR) Wessling Germany Research and Development Lab. Engineering Ingegneria Informatica S.p.A Rome Italy

This work studies and defines the problem of providing extensive and opportunistic Edge AI-based area coverage in smart city application scenarios, by researching and determining the optimal configuration of sensing and computational resources for minimizing the environmental/technology footprint of the solution. A typical smart city computing continuum consists of statically installed multimodal sensing Internet-of-Things (IoT) nodes at various city locations, accompanied by interconnected computational Cloud/Edge/IoT nodes. This paper presents Optimal Trustworthy EdgeAI (OTE), an entirely novel research pipeline, that complements existing smart city infrastructure with intelligent drone Edge/IoT nodes (in the form of modularly equipped unmanned aerial vehicles), capable of autonomous repositioning according to individual/collective sensing and coverage criteria. Thereby, we envisage the emerging cutting-edge technologies of trustworthy sensing, perceiving, modelling technologies for predicting the behavior of moving targets (e.g., citizens/vehicles/objects), understanding natural phenomena (e.g., sea wave motion, urban flora/fauna, biodiversity) in order to anticipate events (people's bad habits, environmental changes), by exploiting novel continuous data processing services across the whole span of the enhanced Cloud-Edge-IoT computing continuum.

关键词： Cloud computing Privacy Smart cities Pipelines Semantics Robot sensing systems Software

Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings

学校读者我要写书评

暂无评论

arXiv 2022年

作者： Bano, Sophia Casella, Alessandro Vasconcelos, Francisco Qayyum, Abdul Benzinou, Abdesslam Mazher, Moona Meriaudeau, Fabrice Lena, Chiara Cintorrino, Ilaria Anita De Paolis, Gaia Romana Biagioli, Jessica Grechishnikova, Daria Jiao, Jing Bai, Bizhe Qiao, Yanyan Bhattarai, Binod Gaire, Rebati Raman Subedi, Ronast Vazquez, Eduard Plotka, Szymon Lisowska, Aneta Sitek, Arkadiusz Attilakos, George Wimalasundera, Ruwan David, Anna L. Paladini, Dario Deprest, Jan De Momi, Elena Mattos, Leonardo S. Moccia, Sara Stoyanov, Danail Department of Computer Science University College London United Kingdom Department of Advanced Robotics Istituto Italiano di Tecnologia Italy Department of Electronics Information and Bioengineering Politecnico di Milano Italy The BioRobotics Institute Department of Excellence in Robotics and AI Scuola Superiore Sant'Anna Italy Fetal Medicine Unit Elizabeth Garrett Anderson Wing University College London Hospital United Kingdom EGA Institute for Women's Health Faculty of Population Health Sciences University College London United Kingdom Department of Development and Regeneration University Hospital Leuven Belgium Department of Fetal and Perinatal Medicine Istituto Giannina Gaslini Italy ENIB UMR CNRS 6285 LabSTICC 29238 France Department of Computer Engineering and Mathematics University Rovira i Virgili Spain ImViA Laboratory University of Bourgogne Franche-Comté France Physics Department Lomonosov Moscow State University Russia Fudan University China Medical Computer Vision and Robotics Group Department of Mathematical and Computational Sciences University of Toronto Canada Co. Ltd China NepAL Applied Mathematics and Informatics Institute for Research Nepal Redev Technology United Kingdom Sano Center for Computational Medicine Poland Quantitative Healthcare Analysis Group Informatics Institute University of Amsterdam Amsterdam Netherlands Center for Advanced Medical Computing and Simulation Massachusetts General Hospital Harvard Medical School BostonMA United States

Fetoscopy laser photocoagulation is a widely adopted procedure for treating Twin-to-Twin Transfusion Syndrome (TTTS). The procedure involves photocoagulation pathological anastomoses to restore a physiological blood exchange among twins. The procedure is particularly challenging, from the surgeon's side, due to the limited field of view, poor manoeuvrability of the fetoscope, poor visibility due to amniotic fluid turbidity, and variability in illumination. These challenges may lead to increased surgery time and incomplete ablation of pathological anastomoses, resulting in persistent TTTS. computer-assisted intervention (CAI) can provide TTTS surgeons with decision support and context awareness by identifying key structures in the scene and expanding the fetoscopic field of view through video mosaicking. Research in this domain has been hampered by the lack of high-quality data to design, develop and test CAI algorithms. Through the Fetoscopic Placental Vessel Segmentation and Registration (FetReg2021) challenge, which was organized as part of the MICCAI2021 Endoscopic vision (EndoVis) challenge, we released the first large-scale multi-center TTTS dataset for the development of generalized and robust semantic segmentation and video mosaicking algorithms with a focus on creating drift-free mosaics from long duration fetoscopy videos. For this challenge, we released a dataset of 2060 images, pixel-annotated for vessels, tool, fetus and background classes, from 18 in-vivo TTTS fetoscopy procedures and 18 short video clips of an average length of 411 frames for developing placental scene segmentation and frame registration for mosaicking techniques. Seven teams participated in this challenge and their model performance was assessed on an unseen test dataset of 658 pixel-annotated images from 6 fetoscopic procedures and 6 short clips. The challenge provided an opportunity for creating generalized solutions for fetoscopic scene understanding and mosaicking. In this paper,

关键词： Pixels