This paper presents an integrated region-based and gradient-based supervised method for segmentation of a patient's magnetic resonance images (MRI) of the shoulder joint. The method is noninvasive, anatomy-based and req...
ISBN: 9780769527864 (print)
As an extension to classical structured lighting techniques, the use of bi-dimensional pseudo-random color codes is explored to perform range sensing with variable density from a calibrated stereo rig and a projector. Pseudo-random codes are used to create artificial textures on a scene, which are extracted and grouped in a confidence map to ensure reliable feature matching between pairs of images taken from two cameras. Depth estimation is performed on corresponding points with progressive refinement as the pseudo-random pattern projection is marched over the scene to increase the density of matched features and achieve dense 3D reconstruction. The potential of bi-dimensional pseudo-random color patterns for structured lighting is demonstrated in terms of pattern computation, ease of extraction, matching confidence level, and density of depth estimation for 3D reconstruction.
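A property commonly required of bi-dimensional pseudo-random codes is that every small window of the pattern is unique, so a patch seen by a camera identifies its position in the projected pattern. The sketch below illustrates that property only. It is not the authors' pattern computation: the generate-and-check strategy, the function names, and the grid size, color count, and window size are all assumptions chosen for illustration.

```python
import random


def windows(grid, size):
    """Yield every size x size sub-window of the grid as a flat tuple."""
    rows, cols = len(grid), len(grid[0])
    for r in range(rows - size + 1):
        for c in range(cols - size + 1):
            yield tuple(grid[r + i][c + j]
                        for i in range(size) for j in range(size))


def all_windows_unique(grid, size=3):
    """True if no size x size window occurs twice in the grid."""
    seen = list(windows(grid, size))
    return len(seen) == len(set(seen))


def make_pattern(rows=12, cols=12, n_colors=3, size=3, seed=0, max_tries=1000):
    """Draw random color grids until one has only unique windows.

    Hypothetical brute-force generator; color indices stand in for
    projector colors.
    """
    rng = random.Random(seed)
    for _ in range(max_tries):
        grid = [[rng.randrange(n_colors) for _ in range(cols)]
                for _ in range(rows)]
        if all_windows_unique(grid, size):
            return grid
    raise RuntimeError("no unique-window pattern found; "
                       "try more colors or a larger window")


pattern = make_pattern()
```

With window uniqueness in place, a 3 x 3 patch decoded in either camera image maps to a single position in the pattern, which is what makes matching between the two views unambiguous in this simplified setting.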
ISBN: 9780769527864 (print)
We show how a greedy approach to visual search, i.e., directly moving to the most likely location of the target, can be suboptimal if the target object is hard to detect. Instead, it is more efficient and leads to higher detection accuracy to first look for other related objects that are easier to detect. These provide contextual priors for the target that make it easier to find. We demonstrate this in simulation using POMDP models, focussing on two special cases: where the target object is contained within the related object, and where the target object is spatially adjacent to the related object.
ISBN: 9780769527864 (print)
In this paper we introduce the Fourier tag, a synthetic fiducial marker used to visually encode information and provide controllable positioning. The Fourier tag is a synthetic target akin to a bar-code that specifies multi-bit information which can be efficiently and robustly detected in an image. Moreover, the Fourier tag has the beneficial property that the bit string it encodes has variable length as a function of the distance between the camera and the target. This follows from the fact that the effective resolution decreases as an effect of perspective. This paper introduces the Fourier tag, describes its design, and illustrates its properties experimentally.
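The idea of a bit string whose usable length shrinks with resolution can be illustrated with a one-dimensional toy model. The sketch below is an assumption-laden illustration, not the tag design described in the paper: it assigns bit k to a cosine of k + 1 cycles across the signal, decodes by thresholding discrete Fourier coefficients, and does not model perspective or blur. The function names and the 0.25 threshold are hypothetical.

```python
import cmath
import math


def encode(bits, n_samples=64):
    """Sum one unit-amplitude cosine per set bit (bit k -> k + 1 cycles)."""
    return [sum(b * math.cos(2 * math.pi * (k + 1) * t / n_samples)
                for k, b in enumerate(bits))
            for t in range(n_samples)]


def decode(signal, n_bits):
    """Recover the first n_bits by thresholding DFT magnitudes."""
    n = len(signal)
    bits = []
    for k in range(1, n_bits + 1):
        coeff = sum(s * cmath.exp(-2j * math.pi * k * t / n)
                    for t, s in enumerate(signal))
        # A unit cosine contributes magnitude n / 2 at its own frequency,
        # so the normalized value is about 0.5 when present, 0 when absent.
        bits.append(1 if abs(coeff) / n > 0.25 else 0)
    return bits


message = [1, 0, 1, 1, 0, 1]
signal = encode(message)
```

In this toy model, `decode(signal, 6)` reads the full message, while `decode(signal, 3)` reads only the three lowest-frequency bits, which is the portion an observer limited to low frequencies could still recover.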
ISBN: 0769527868 (print)
The proceedings contain 63 papers. The topics discussed include: extrinsic recalibration in camera networks; screen camera calibration using a spherical mirror; a simple operator for very precise estimation of ellipses; training database adequacy analysis for learning-based super-resolution; extracting salient objects from operator-framed images; learning saccadic gaze control via motion prediction; efficient camera motion and 3D recovery using an inertial sensor; can Lucas-Kanade be used to estimate motion parallax in 3D cluttered scenes; local graph matching for object category recognition; efficient registration of 3D SPHARM surfaces; computing view-normalized body parts trajectories; automated detection of mitosis in embryonic tissues; computer assisted detection of polycystic ovary morphology in ultrasound images; automatic detection and clustering of actor faces based on spectral clustering techniques; and Petri net-based cooperation in multi-agent systems.
ISBN: 9780769527864 (print)
We propose a new and flexible hierarchical multi-baseline stereo algorithm that features a non-uniform spatial decomposition of the disparity map. The visibility computation and refinement of the disparity map are integrated into a single iterative framework that does not add extra constraints to the cost function. This makes it possible to use a standard efficient stereo matcher during each iteration. The level of refinement is increased automatically where it is needed in order to preserve a good localization of boundaries. While two graph-theoretic stereo matchers are used in our experiments, our framework is general enough to be applied to many others. The validity of our framework is demonstrated using real imagery with ground truth.
ISBN: 9780769527864 (print)
The success of NASA's Mars Exploration Rovers has demonstrated the important benefits that mobility adds to planetary exploration. Very soon, mission requirements will dictate that planetary exploration rovers drive autonomously in unknown terrain. This will require an evolution of the methods and technologies currently used. This paper presents our approach to 3D terrain reconstruction from large sparse range data sets, and the data reduction achieved through decimation. The outdoor experimental results demonstrate the effectiveness of the reconstructed terrain model for different types of terrain. We also present a first attempt to classify the terrain based on the properties of the scans.
In this paper we present a system for automatic annotation of humans passing a surveillance camera. Each human has 4 associated annotations: the primary color of the clothing, the height, and focus of attention. The a...
ISBN: 9780769527864 (print)
To perform as desired in a dynamic environment, a vision system must adapt to a variety of operating conditions by selecting vision modules, tuning their parameters, and controlling image acquisition. Knowledge-based (KB) controller-agents that reason over explicitly represented knowledge and interact with their environment can be used for this task; however, the lack of a unifying methodology and development tools makes KB controllers difficult to create, maintain, and reuse. This paper presents the INVICON toolkit, based on the IndiGolog agent programming language with elements from control theory. It provides a basic methodology, a vision module declaration template, a suite of control components, and support tools for KB controller development. We have evaluated INVICON in two case studies that involved controlling vision-based pose estimation systems. The case studies show that INVICON reduces the effort needed to build KB controllers for challenging domains and improves their flexibility and robustness.
ISBN: 9780769527864 (print)
This paper introduces a number of innovative no-reference algorithms to assess the perceived quality of real-time analog and digital television and video streams. A prototype system is developed to locate and measure the impact of three types of impairments that commonly affect television and video signals. Analog sequences are tested for the presence of random noise. In the case of digital signals, two fundamental types of errors are of interest. The first is the blocking artifact that is so pervasive among DCT-based compression schemes such as MPEG. The second category includes errors caused by random changes to the bit stream of a signal. Of the various forms that these distortions may take on, only those that appear as "colored blocks" are detected by this system. Ideas to address the remaining issues are discussed.
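To make the notion of a no-reference blocking measure concrete, here is a minimal sketch of one simple approach: compare luminance jumps across block boundaries with jumps inside blocks. This is not the algorithm from the paper. The 8-pixel block size, the restriction to horizontal jumps, the alignment of blocks to column 0, and the function name are all assumptions made for illustration.

```python
def blockiness(image, block=8):
    """Ratio of mean horizontal jumps at block boundaries to those elsewhere.

    `image` is a list of rows of luminance values. Values near 1 suggest
    no visible block structure; large values suggest blocking artifacts.
    Assumes the image is wider than one block.
    """
    boundary, interior = [], []
    for row in image:
        for x in range(1, len(row)):
            jump = abs(row[x] - row[x - 1])
            # Columns that are multiples of `block` start a new block.
            (boundary if x % block == 0 else interior).append(jump)
    mean_boundary = sum(boundary) / len(boundary)
    mean_interior = sum(interior) / len(interior)
    return mean_boundary / (mean_interior + 1e-9)  # avoid division by zero


# Hypothetical test images: a smooth ramp and a piecewise-constant image
# whose value changes only at 8-pixel block boundaries.
ramp = [[x for x in range(32)] for _ in range(4)]
blocky = [[16 * (x // 8) for x in range(32)] for _ in range(4)]
```

The measure needs no reference image: the ramp scores close to 1 because its jumps are the same everywhere, while the piecewise-constant image scores very high because all of its variation sits on block boundaries.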