Range sensing technology allows the photorealistic modeling of large-scale scenes, such as urban structures. The generated3d representations, after automated registration, are useful for urban planning, historical pr...
详细信息
ISBN:
(纸本)9780769528250
Range sensing technology allows the photorealistic modeling of large-scale scenes, such as urban structures. The generated3d representations, after automated registration, are useful for urban planning, historical preservation, or virtual reality applications. One major issue in 3d modeling of complex large-scale scenes is that the final result is a dense complicated mesh. Significant, in some cases manual, post-processing (mesh simplification, hole filling) is required to make this representation usable by graphics or CAd applications. This paper presents a 3d modeling approach that models large planar scene areas of the scene with planar primitives (extracted via a segmentation pre-process), and non-planar areas with mesh primitives. In that respect, the final model is significantly compressed. Also, lines of intersection between neighboring planes are modeled as such. These steps bring the model closer to graphics/CAd applications. We present results from experiments with complex range scans from urban structures and from the interior of a large-scale landmark urban building (Grand Central Terminal, NYC).
Self-calibration is one of the most active issues concerning vision-based3d measurements. However, in the case of the light sectioning method, there has been little research conducted on self-calibration techniques. ...
详细信息
ISBN:
(纸本)9780769528250
Self-calibration is one of the most active issues concerning vision-based3d measurements. However, in the case of the light sectioning method, there has been little research conducted on self-calibration techniques. In this paper we study the problem of self-calibration for an active vision system which uses line lasers and a single camera. The problem can be defined as the estimation of multiple laser planes from the curves of laser reflections observed from a sequence of images captured by a single camera. The constraints of the problem can be obtained from observed intersection points between the curves. In this condition, the problem is formulated as simultaneous polynomial equations, in which the number of equations is larger than the number of variables. Approximated solutions of the equations can be computed by using Grobner bases. By refining them using nonlinear optimization, the final result can be obtained. We developed an actual 3d measurement system using the proposed method, which consists of only a laser projector with two line lasers and a single camera. Users are just required to move the projector freely so that the projected lines sweep across the surface of the scene to get the 3d shape.
When an observer moves through a rigid3d scene, points that are near to the observer move with a different image velocity than points that are far away. The difference between image velocity vectors is the direction ...
详细信息
ISBN:
(纸本)9780769528250
When an observer moves through a rigid3d scene, points that are near to the observer move with a different image velocity than points that are far away. The difference between image velocity vectors is the direction of motion parallax. This direction vector points towards the observer's translation direction. Hence estimates of the direction of motion parallax are useful for estimating the observer's translation direction. Standard ways to compute the direction of motion parallax either rely on precomputed optical flow, or rely on motion compensation to remove the local image shift caused by observer rotation. Here we present a simple Fourier-based method for estimating the direction of motion parallax directly, that does not require optical flow and motion compensation. The method is real-time and performs accurately for image regions in which multiple motions are present.
A common approach to model-based tracking is to use a model of the object to predict what will be observed, and then to compare that with real observations. For methods that make use of the object39;s photometric pr...
详细信息
ISBN:
(纸本)9780769528250
A common approach to model-based tracking is to use a model of the object to predict what will be observed, and then to compare that with real observations. For methods that make use of the object's photometric properties (appearance) in their measurements, illumination inconsistencies between the modeled and actual scene can cause tracking problems. In this paper we address one case: model-based tracking of Lambertian objects under directional light sources. We present an iterative optimization method that uses a Kalman filter to simultaneously refine estimates of the object motion, the illumination, and the model texture. We model the illumination variance between the real and predicted observation using the intensity ratios of corresponding surface points, which we then use to make model-based image predictions consistent with the real lighting. To demonstrate the effectiveness of our method we present experimental results using both synthetic (controlled) and real image sequences.
Motivated by the requirements of modern archaeologists, we are developing a documentation system based on structured light for acquisition of ceramics. Fragments of ceramics are daily finds at excavations and importan...
详细信息
ISBN:
(纸本)9780769528250
Motivated by the requirements of modern archaeologists, we are developing a documentation system based on structured light for acquisition of ceramics. Fragments of ceramics are daily finds at excavations and important for archaeological research, because their shape leads to information about ancient cultures. The shapes used for documentation are called profile lines and estimated by a vertical cross-section of orientated fragments. As ceramics have been produced using rotational plates for several thousands of years, the rotational axis can be used for orientation. Therefore we conducted experiments using existing methods for estimation of the rotational axis. The drawbacks of these methods are the requirement of either complete objects or industrialized quality of symmetry. Therefore we show a new method using circle templates, which has been inspired by the manual method of archaeologists. In this work we present results using previous and related work in comparison with the estimation of the rotational axis using circle templates. The results of the presented methods are shown for synthetic data, well-known fragments and real data acquired at an archaeological excavation. Finally a conclusion and an outlook is given.
In this paper, we address the problem of finding depth from defocus in a fundamentally new way. Most previous methods have used an approximate model in which blurring is shift invariant and pixel area is negligible. O...
详细信息
ISBN:
(纸本)9780769528250
In this paper, we address the problem of finding depth from defocus in a fundamentally new way. Most previous methods have used an approximate model in which blurring is shift invariant and pixel area is negligible. Our model avoids these assumptions. We consider the area in the scene whose radiance is recorded by a pixel on the sensor and relate the size and shape of that area to the scene's position with respect to the plane of focus. This is the notion of reverse projection, which allows us to illustrate that, when out of focus, neighboring pixels will record light from overlapping regions in the scene. This overlap results in a measurable change in the correlation between the pixels' intensity values. We demonstrate that this relationship can be characterized in such a way as to recover depth from defocused images. Experimental results show the ability of this relationship to accurately predict depth from correlation measurements.
In this paper we present a prototype system for image based localization in urban environments. Given a database of views of city street scenes tagged by GPS locations, the system computes the GPS location of a novel ...
详细信息
ISBN:
(纸本)9780769528250
In this paper we present a prototype system for image based localization in urban environments. Given a database of views of city street scenes tagged by GPS locations, the system computes the GPS location of a novel query view. We first use a wide-baseline matching technique based on SIFT features to select the closest views in the database. Often due to a large change of viewpoint and presence of repetitive structures, a large percentage of matches (> 50%) are not correct correspondences. The subsequent motion estimation between the query view and the reference view, is then handled by a novel and efficient robust estimation technique capable of dealing with large percentage of outliers. This stage is also accompanied by a model selection step among the fundamental matrix and the homography. Once the motion between the closest reference views is estimated, the location of the query view is then obtained by triangulation of translation directions. Approximate solutions for cases when triangulation cannot be obtained reliably are also described. The presented system is tested on the dataset used in ICCV 2005 Computer Vision Contest and is shown to have higher accuracy than previous reported results.
Extended Kalman filters (EKF) have been proposed to estimate ego-motion and to recursively update scene structure in the form of 3-d positions of selected prominent features from motion and stereo sequences. Previous ...
详细信息
ISBN:
(纸本)9780769528250
Extended Kalman filters (EKF) have been proposed to estimate ego-motion and to recursively update scene structure in the form of 3-d positions of selected prominent features from motion and stereo sequences. Previous methods typically accommodate no more than a few dozen features for real-time processing. To maintain motion estimation accuracy, this calls for high contrast images to compute image feature locations with precision. Within manmade environments, various prominent corner points exist that can be extracted and tracked with required accuracy. However, prominent features are more difficult to localize precisely in natural scenes. Statistically, more feature points become necessary to maintain the same level of motion estimation accuracy and robustness. However, this imposes a computational burden beyond the capability of EKF-based techniques for real-time processing. A sequential dual EKF estimator utilizing stereo data is proposed for improved computation efficiency. Two important issues, unbiased estimation and stochastic stability are addressed. Furthermore, the dynamic feature set is handled in a more effective, efficient and robust way. Experimental results to demonstrate the merits of the new theoretical and algorithmic developments are presented.
3d city modeling using computer vision is very challenging. A typical city contains objects which are a nightmare for some vision algorithms, while other algorithms have been designed to identify exactly these parts b...
详细信息
ISBN:
(纸本)9780769528250
3d city modeling using computer vision is very challenging. A typical city contains objects which are a nightmare for some vision algorithms, while other algorithms have been designed to identify exactly these parts but, in their turn, suffer from other weaknesses which limit their application. For instance, moving cars with metallic surfaces can degrade the results of a 3d city reconstruction algorithm which is primarily based on the assumption of a static scene with diffuse reflection properties. On the other hand, a specialized object recognition algorithm could be able to detect cars, but also yields too many false positives without the availability of additional scene knowledge. In this paper, the design of a cognitive loop which intertwines both aforementioned algorithms is demonstrated for 3d city modeling, proving that the whole can be much more than the simple sum of its parts. A cognitive loop is the mutual transfer of higher knowledge between algorithms, which enables the combination of algorithms to overcome the weaknesses of any single algorithm. We demonstrate the promise of this approach on a real-world city modeling task using video data recorded by a survey vehicle. Our results show that the cognitive combination of algorithms delivers convincing city models which improve upon the degree of realism that is possible from a purely reconstruction-based approach.
Many applications in computer graphics need high level shape descriptions, in order to benefit from a global understanding of shapes. Topological approaches enable pertinent surface decompositions, providing structura...
详细信息
ISBN:
(纸本)9780769528250
Many applications in computer graphics need high level shape descriptions, in order to benefit from a global understanding of shapes. Topological approaches enable pertinent surface decompositions, providing structural descriptions of 3d polygonal Meshes;but in practice, their use raises several difficulties. In this paper we present a novel method for the construction of invariant high level Reeb graphs, topological entities that give a good overview of the shape structure. With this aim, we propose an accurate and straightforward feature point extraction algorithm for the computation of an invariant and meaningful quotient function. Moreover, we propose a new graph construction algorithm, based on an analysis of the connectivity evolutions of discrete level lines. This algorithm brings a practical solution for the suppression of non-significant critical points over piecewise continuous functions, providing meaningful Reeb graphs. Presented method gives accurate results, with satisfactory execution times and without input parameter The geometrical invariance of resulting graphs and their robustness to variation in model pose and mesh sampling make them good candidates for several applications, like shape deformation (experimented in this paper), recognition, compression, indexing, etc.
暂无评论