This paper proposes a new technique for modeling and matching three-dimensional rigid objects by encoding the fluctuation of the surface and the variation of its normal around an oriented point on the surface as the s...
详细信息
ISBN:
(纸本)9780769528250
This paper proposes a new technique for modeling and matching three-dimensional rigid objects by encoding the fluctuation of the surface and the variation of its normal around an oriented point on the surface as the surface expands. The surface of the object is encoded into two two-dimensional curves as the surface signature on each point, and then the collection of the signatures are used to model and match the object. The signatures implicitly encode the curvature and symmetry of the surface around an oriented point. This modeling technique is robust to scale, orientation, noise, patch resolution, occlusion, and cluttering.
For AR applications the 3d position anddirection of the users view have to be determined in real time. At the same time, the augmentation, one or more 3d objects, have to be rendered. This article describes a distrib...
详细信息
ISBN:
(纸本)9780769528250
For AR applications the 3d position anddirection of the users view have to be determined in real time. At the same time, the augmentation, one or more 3d objects, have to be rendered. This article describes a distributed mobile AR-System designed for industrial service applications. By distributing the different tasks involved in AR, the computational load of the portable system can be minimized such that a small and lightweight computer can be used. The portable computer is connected to a backend server via WLAN. Minimisation of the bandwidth usage of the system ensures that 20 or more systems can operate on one standard WLAN access point. The main focus of this article is to minimise the computational load of the mobile system and the network load simultaneously.
This paper presents a new technique for estimating a multiview reconstruction given pair-wise Euclidean reconstructions up to rotations, translations and scales. The partial reconstructions are glued by the following ...
详细信息
ISBN:
(纸本)9780769528250
This paper presents a new technique for estimating a multiview reconstruction given pair-wise Euclidean reconstructions up to rotations, translations and scales. The partial reconstructions are glued by the following three step procedure: (i) Camera rotations consistent with all reconstructions are estimated linearly. (ii) All the pair-wise reconstructions are modified according to the new rotations and refined by bundle adjustment while keeping the corresponding rotations same. (iii) The refined rotations are used to estimate both camera translations and3d points using Second Order Cone Programming by minimizing the L-infinity-norm. We introduce a new criterion for evaluating importance of an epipolar geometry in influence on the overall 3d geometry. The estimated importance is used to reweight data in the above algorithm to better handle unequiponderantly captured images. The performance of the proposed method is demonstrated on difficult wide base-line image sets.
We present a 3d outdoor scene scanner for the acquisition of kilometers-deep scenes in night conditions. Its imaging system is based on a compact and low-cost pulsed laser illuminator and a light-intensifier equipped ...
详细信息
ISBN:
(纸本)9780769528250
We present a 3d outdoor scene scanner for the acquisition of kilometers-deep scenes in night conditions. Its imaging system is based on a compact and low-cost pulsed laser illuminator and a light-intensifier equipped CCd camera. By precisely synchronizing both the illuminator and the camera shutter it is possible to acquire "slices" of the scene at specific known distances. We show that even with large laser pulses and without megahertz-capable electronics, the thirddimension can be recovered for the whole range of the scene by processing only two images acquired in specific conditions. As the pixel intensities of the images produced by active imaging systems vary with the square of the range, anddue to the limiteddynamics of image sensors, scanning long-range scenes with shorter "slices" allows the camera gain to be adjusted with respect to the range and the accuracy to be enhanced. The imaging system as well as the different image processing steps are detailed in this paper and an example of typical results is given.
As the next generation of media, 3d video is attracting increased attention. 3d video is a sequence of three dimensional mesh models, captured and generated for a real dynamic object. In this paper, we present a simpl...
详细信息
ISBN:
(纸本)9780769528250
As the next generation of media, 3d video is attracting increased attention. 3d video is a sequence of three dimensional mesh models, captured and generated for a real dynamic object. In this paper, we present a simple framework of motion editing in 3d video database to re-use 3d video data. Our system is composed of two modules. In the first module, a motion database is automatically set up from original 3d video sequences off-line by analyzing the feature vectors of each frame. It is observed that our original 3d video sequences have a two-level temporal structure. A fine-to-coarse method is proposed to extract such a structure. 3d video is segmented into the fine-level structure by a three reference frame strategy and then is clustered into the coarse-level structure. In the second module, users can synthesize the motions to edit a new 3d video sequence online. A cost function is optimized to transit between two motions with the users' requirements. All the algorithms in the system are based on the analysis in feature vector space and the edited3d video sequence is played using OpenGL.
3d shape features are inherently scale-dependent. For instance, on a 3d model of a human body, the top of the head and a fingertip can both be detected as comer points, however, at entirely different scales. In this p...
详细信息
ISBN:
(纸本)9780769528250
3d shape features are inherently scale-dependent. For instance, on a 3d model of a human body, the top of the head and a fingertip can both be detected as comer points, however, at entirely different scales. In this paper, we present a method for extracting and integrating 3d shape features in the discrete scale-space of a triangular mesh model. We first parameterize the surface of the mesh model on a 2d plane and then construct a dense surface normal map. In general, the parametrization is not isometric. To account for this, we compute the relative stretch of the original edge lengths. Next, we compute a dense distortion map which is used to approximate the geodesic distances on the normal map. Then, we construct a discrete scale-space of the original 3d shape by successively convolving the normal map with distortion-adapted Gaussian kernels of increasing standarddeviation. We derive comer and edge detectors to extract 3d features at each scale in the discrete scale-space. Furthermore, we show how to combine the detector responses from different scales to form a unified representation of the 3d features.
This paper presents a novel object recognition approach based on range images. due to its insensitivity to illumination, range data is well suited for reliable silhouette extraction. Silhouette or contour descriptions...
详细信息
ISBN:
(纸本)9780769528250
This paper presents a novel object recognition approach based on range images. due to its insensitivity to illumination, range data is well suited for reliable silhouette extraction. Silhouette or contour descriptions are good sources of information for object recognition. We propose a complete object recognition system, based on a 3d laser scanner reliable contour extraction with floor interpretation, feature extraction using a new, fast Eigen-CSS method, and a supervised learning algorithm. The recognition system was successfully tested on range images acquired with a mobile robot, and the results are compared to standard techniques, i.e., Geometric features, Hu and Zernike moments, the Border Signature method and the Angular Radial Transformation. An evaluation using the receiver operating characteristic analysis completes this paper The Eigen-CSS method has proved to be comparable in detection performance to the top competitors, yet faster than the best one by an order of magnitude in feature extraction time.
A fully automatic 3d face recognition algorithm is presented. Several novelties are introduced to make the recognition robust to facial expressions and efficient. These novelties include: (1) Automatic 3d face detecti...
详细信息
ISBN:
(纸本)9780769528250
A fully automatic 3d face recognition algorithm is presented. Several novelties are introduced to make the recognition robust to facial expressions and efficient. These novelties include: (1) Automatic 3d face detection by detecting the nose;(2) Automatic pose correction and normalization of the 3d face as well as its corresponding 2d face using the Hotelling Transform;(3) A Spherical Face Representation and its use as a rejection classifier to quickly reject a large number of candidate faces for efficient recognition;and (4) Robustness to facial expressions by automatically segmenting the face into expression sensitive and insensitive regions. Experiments performed on the FRGC Ver 2.0 dataset (9,500 2d/3d faces) show that our algorithm outperforms existing 3d recognition algorithms. We achieved verification rates of 99.47% and 94.09% at 0.001 FAR and identification rates of 98.03% and 89.25% for probes with neutral and non-neutral expression respectively.
This paper demonstrates that, for axial non-central optical systems, the equation of a 3d line can be estimated using only four points extracted from a single image of the line. This result, which is a direct conseque...
详细信息
ISBN:
(纸本)9780769528250
This paper demonstrates that, for axial non-central optical systems, the equation of a 3d line can be estimated using only four points extracted from a single image of the line. This result, which is a direct consequence of the lack of vantage point, follows from a classic result in enumerative geometry: there are exactly two lines in 3-space which intersect four given lines in general position. We present a simple algorithm to reconstruct the equation of a 3d line from four image points. This algorithm is based on computing the Singular Value decomposition (SVd) of the matrix of Plucker coordinates of the four corresponding rays. We evaluate the conditions for which the reconstruction fails, such as when the four rays are nearly coplanar Preliminary experimental results using a spherical catadioptric camera are presented. We conclude by discussing the limitations imposed by poor calibration and numerical errors on the proposed reconstruction algorithm.
This paper addresses the problem of camera self-calibration, bundle adjustment and3d reconstruction from line segments in two images of poorly-textured indoor scenes. First, we generate line segment correspondences, ...
详细信息
ISBN:
(纸本)9780769528250
This paper addresses the problem of camera self-calibration, bundle adjustment and3d reconstruction from line segments in two images of poorly-textured indoor scenes. First, we generate line segment correspondences, using an extended version of our previously proposed matching scheme. The first main contribution is a new method to identify polyhedral junctions resulting from the intersections of the line segments. At the same time, the images are segmented into planar polygons. This is done using an algorithm based on a Binary Space Partitioning (BSP) tree. The junctions are matched end points of the detected line segments and hence can be used to obtain the epipolar geometry. The essential matrix is considered for metric camera calibration. For better stability, the second main contribution consists in a bundle adjustment on the line segments and the camera parameters that reduces the number of unknowns by a maximum flow algorithm. Finally, a piecewise-planar 3d reconstruction is computed based on the segmentation of the BSP tree. The system's performance is tested on some challenging examples.
暂无评论