This paper presents a robust and accurate vision-based augmented reality system for surgical navigation. The key point of our system is a robust and real-time monocular vision algorithm to estimate the 3D pose of surg...
详细信息
This paper presents a robust and accurate vision-based augmented reality system for surgical navigation. The key point of our system is a robust and real-time monocular vision algorithm to estimate the 3D pose of surgical tools, utilizing specially designed code markers and Kalman filter-based position updating. The vision system is not impaired by occlusion and rapid change of illumination. The augmented reality system superimposes the 3D object wireframe onto the live viewing image taken from the surgical microscope as well as displaying other useful navigation information, while allowing the surgeons to freely change its room and focus for viewing. The experimental results verified the robustness and usefulness of the system, and acquired the image registration error less than 2 mm.
This demonstration presents the methods for 3D scene geometry recovery/refinement and pose estimation from motion imagery in two representative scenarios. Ar-st, we present the method for pose estimation and scene geo...
详细信息
This demonstration presents the methods for 3D scene geometry recovery/refinement and pose estimation from motion imagery in two representative scenarios. Ar-st, we present the method for pose estimation and scene geometry recovery from extended sequences without a prior knowledge of the scene. Second, we discuss how to recover camera poses when a rough scene model is provided. We show how to extend and refine the scene model using the recovered poses. Finally, we present applications of above techniques for 3D imagery manipulation such as enhanced visualization for the video and 3D insertion of synthetic objects in the imagery.
A novel approach for real-time skirt segmentation in video sequences is described. The approach enables reliable skin segmentation despite wide variation in illumination during tracking. An explicit second order Marko...
详细信息
A novel approach for real-time skirt segmentation in video sequences is described. The approach enables reliable skin segmentation despite wide variation in illumination during tracking. An explicit second order Markov model is used to predict evolution of the skin color (HSV) histogram over time. Histograms are dynamically updated based on feedback from the current segmentation and based on predictions of the Markov model. The evolution of the skin color distribution at each frame is parameterized by translation, scaling and rotation in color space. Consequent changes in geometric parameterization of the distribution ni-e propagated by warping and re-sampling the histogram. The parameters of the discrete-time dynamic Markov model are Estimated using Maximum Likelihood Estimation, and also evolve over time. Quantitative evaluation of the method rc as conducted on labeled ground-truth video sequences taken from papular movies.
image segmentation has traditionally been thought of as a low/mid-level vision process incorporating no high level constraints. However, in complex and uncontrolled environments, such bottom-up strategies have drawbac...
详细信息
image segmentation has traditionally been thought of as a low/mid-level vision process incorporating no high level constraints. However, in complex and uncontrolled environments, such bottom-up strategies have drawbacks that lead to large misclassification rates. Remedies to this situation include taking into account (1) contextual and application constraints, (2) user input and feedback to incrementally improve the performance of the system. Here we attempt to incorporate these in the context of pipeline segmentation in industrial images. This problem is of practical importance for the 3D reconstruction of factory environments. However it poses several fundamental challenges mainly due to shading, highlights and textural variations, etc. Our system performs pipe segmentation by fusing methods from physics-based vision, edge and texture analysis, probabilistic learning and the use of the graph-cut formalism.
In this paper, we developed a color model to cancel the dependency between color channels, which enables us to separate spectral processing.from spatial processing. We introduced Independent Component Analysis (ICA) t...
详细信息
In this paper, we developed a color model to cancel the dependency between color channels, which enables us to separate spectral processing.from spatial processing. We introduced Independent Component Analysis (ICA) transformation in the wavelet domain to decorrelate the subband color joint statistics. The decorrelated joint color conditional histograms display scaling of variance. Gaussian Scale Mixture (GSM) was used to model the subband color statistics and a normalization scheme was adapted to cancel the pair-wise color subband statistical dependency. This color model was combined with the Portilla/Simoncelli texture model to construct the color texture model. Based on this model, features were extracted and the corresponding color texture synthesis scheme was developed.
image-based-interpolation creates smooth and photorealistic views between two view points. The concept of joint view triangulation (JVT) has been proven to be an efficient multi-view representation to handle visibilit...
详细信息
image-based-interpolation creates smooth and photorealistic views between two view points. The concept of joint view triangulation (JVT) has been proven to be an efficient multi-view representation to handle visibility issue. However;the existing JVT built only on a regular sampling grid, often produces undesirable artifacts for artificial objects. To tackle these problems, a new edge-constrained joint view triangulation is developed in this paper to integrate contour points and artificial rectilinear objects as triangulation constraints. Also a super-sampling technique is introduced to refine visible boundaries. The new algorithm is successfully demonstrated on many real image pairs.
We present contour based techniques, for automatic object recognition, that avoid the difficulties that arise as a consequence of translation, rotation and scaling, Our techniques do not require the use of any represe...
详细信息
ISBN:
(纸本)078036466X
We present contour based techniques, for automatic object recognition, that avoid the difficulties that arise as a consequence of translation, rotation and scaling, Our techniques do not require the use of any representation of shape. The first technique uses the scale-space filtered coordinate functions of the contours of the objects to be recognized as well as what we define as the "largest diameter" of the contour. The second technique uses the Hotelling transform of the vector representations of the points of contours. We describe and use to advantage some interesting properties this transform that make it an important tool for imageprocessing.
The analysis of human action captured in video sequences has been a topic of considerable interest in computer vision. Much of the previous work has focused on the problem of action or activity recognition, but ignore...
详细信息
ISBN:
(纸本)0769506623
The analysis of human action captured in video sequences has been a topic of considerable interest in computer vision. Much of the previous work has focused on the problem of action or activity recognition, but ignored the problem of detecting action boundaries in a video sequence containing unfamiliar and arbitrary visual actions. This paper presents an approach to this problem based on detecting temporal discontinuities of the spatial pattern of image motion that captures the action. We represent frame to frame optical-flow in terms of the coefficients of the most significant principal components computed from all the flow-fields within a given video sequence. We then detect the discontinuities in the temporal trajectories of these coefficients based on three different measures. We compare our segment boundaries against those detected by human observers on the same sequences in a recent independent psychological study of human perception of visual events. We show experimental results on the two sequences that were used in this study. Our experimental results are promising both from visual evaluation and when compared against the results of the psychological study.
作者:
Tieu, KViola, PMIT
Artificial Intelligence Lab Cambridge MA 02139 USA
We present an approach for image retrieval using a very large number of highly selective features and efficient online learning. Our approach is predicated on the assumption that each image is generated by a sparse se...
详细信息
ISBN:
(纸本)0769506623
We present an approach for image retrieval using a very large number of highly selective features and efficient online learning. Our approach is predicated on the assumption that each image is generated by a sparse set of visual "causes" and that images which are visually similar share causes. We propose a mechanism for computing a very large number of highly selective features which capture some aspects of this causal structure (in our implementation there are or er 45,000 highly selective features). At query time a user selects a few example images, and a technique known as "boosting" is used to learn a classification function in this feature space. By construction, the boosting procedure learns a simple classifier which only relies on 20 of the features. As a result a very large database of images can be scanned rapidly perhaps a million images per second. Finally we will describe a set of experiments performed using our retrieval system on a database of 3000 images.
A novel approach for estimating articulated body posture and motion from monocular video sequences is proposed. Human pose is defined as the instantaneous two dimensional configuration (i.e.,the projection onto the im...
详细信息
A novel approach for estimating articulated body posture and motion from monocular video sequences is proposed. Human pose is defined as the instantaneous two dimensional configuration (i.e.,the projection onto the image plane) of a single articulated body in terms of the position of a predetermined set of joints. First, statistical segmentation of the human bodies from the background is performed and low-level visual features are found given the segmented body shape. The goal is to be able to map these, generally low level, visual features to body configurations. The system estimates different mappings, each one with a specific cluster in the visual feature space. Given a set of body motion sequences for training, unsupervised clustering is obtained via the Expectation Maximization algorithm. For each of the clusters, a function is estimated to build the mapping between low-level features to 2D pose. Given new visual features, a mapping from each cluster is performed to yield a set of possible poses. From this set, the system selects the most likely pose given the learned probability distribution and the visual feature similarity, between hypothesis and input. Performance of the proposed approach is characterized using real and artificially generated body postures, showing promising results.
暂无评论