2d or 3d shapes are the most important visual information that we use to recognize an object. We propose a unified framework "ShapeLab" to search similar 2d or 3d shapes from an existing database. Users can ...
详细信息
ISBN:
(纸本)9780769528250
2d or 3d shapes are the most important visual information that we use to recognize an object. We propose a unified framework "ShapeLab" to search similar 2d or 3d shapes from an existing database. Users can search 3d shapes with a 2d input, and vice versa. ShapeLab is composed of four key components: (1) pose determination for 3d models;(2) 2d orthogonal view generation based on multiple levels of detail;(3) similarity measurement between 2d shapes;and (4) freehand sketch-based user interface. Key algorithms supporting the above components are briefly described. Experiments show ShapeLab can provide a better performance such as high accuracy, flexibility and scalability compared to the available methods.
We propose a technique for 3d face registration and matching using a novel representation called "isoradius contours". An isoradius contour is the contour on the 3d facial surface that is a known fixeddista...
详细信息
ISBN:
(纸本)9780769528250
We propose a technique for 3d face registration and matching using a novel representation called "isoradius contours". An isoradius contour is the contour on the 3d facial surface that is a known fixeddistance relative to some pre-defined reference point (the tip of the nose). A 3d face representation contains many isoradius contours with different radii and the first major benefit of the technique is that the shape of the contours is independent of the facial pose, due to the infinite rotational symmetry of a sphere. The second major benefit of the technique is that registration, alignment and matching can be implemented using a simple process of Id correlation. Our results have shown that registration and alignment is of comparable accuracy to ICP (iterative closest points), but is fast, non iterative, and is robust to the presence of outliers.
In this paper, we propose a method for 3d-model retrieval from one or more photos. This method provides an "optimal" selection of 2d views to represent a 3d-model, and a probabilistic Bayesian method for 3d-...
详细信息
ISBN:
(纸本)9780769528250
In this paper, we propose a method for 3d-model retrieval from one or more photos. This method provides an "optimal" selection of 2d views to represent a 3d-model, and a probabilistic Bayesian method for 3d-model retrieval from realistic photos and sketches using these views. The characteristic view selection algorithm is based on an adaptive clustering algorithm and uses statistical model distribution scores to select the optimal number of views. We also introduce a Bayesian approach to score the probability of correspondence between the queries and the 3d-models. We present our results on the Princeton 3d Shape Benchmark database (1814 3d-models) and 50 photos (real photographs, sketches, synthesised images). A practical on-line 3d-model retrieval system based on our approach is available on the web to asset our results [1].
A method for recovery of a 3d model of a planet-sized cloud-like structure that is in motion anddeforming but approximately governed by magnetic field properties is described. The method allows recovery of the model ...
详细信息
ISBN:
(纸本)9780769528250
A method for recovery of a 3d model of a planet-sized cloud-like structure that is in motion anddeforming but approximately governed by magnetic field properties is described. The method allows recovery of the model from a single intensity image in which the structure's silhouette can be observed. The method exploits envelope theory and a magnetic field model. Given one intensity image and the segmented silhouette in the image, the method proceeds without human intervention to produce the 3d model. In addition to allowing 3d model synthesis, the method's capability to yield a very compact description offers further utility. Application of the method to real-worlddata is also demonstrated.
We propose a mathematical approach for quantifying shape complexity of 3d surfaces based on perceptual principles of visual saliency. Our curvature variation measure (CVM), as a 3d feature, combines surface curvature ...
详细信息
ISBN:
(纸本)9780769528250
We propose a mathematical approach for quantifying shape complexity of 3d surfaces based on perceptual principles of visual saliency. Our curvature variation measure (CVM), as a 3d feature, combines surface curvature and information theory by leveraging bandwidth-optimized kernel density estimators. Using a part decomposition algorithm for digitized3d objects, represented as triangle meshes, we apply our shape measure to transform the low level mesh representation into a perceptually informative form. Further, we analyze the effects of noise, sensitivity to digitization, occlusions, anddescriptiveness to demonstrate our shape measure on laser-scanned real world3d objects.
3dtransmission over unreliable networks needs to take into account the possibility of packet loss. In this work we describe a perceptually motivated strategy for joint transmission of texture and mesh over unreliable...
详细信息
ISBN:
(纸本)9780769528250
3dtransmission over unreliable networks needs to take into account the possibility of packet loss. In this work we describe a perceptually motivated strategy for joint transmission of texture and mesh over unreliable networks. The approach is described initially considering regular mesh structure, to show the utility of optimizing the texture-mesh tradeoff. In order to generalize our approach to arbitrary meshes we consider stripification of the mesh, combined with a strategy that does not need texture or vertex packets to be re-transmitted. Only the valence (connectivity) packets need to be re-transimitted;however, storage of valence information requires only 10% space compared to vertices and even less compared to photo-realistic texture. Thus, only less than 5% of the packets may need to be re-transmitted in the worst case to allow our algorithm to successfully reconstruct an acceptable object under severe packet loss. Results showing the implementation of the proposed approach are described.
detection of articulated objects such as humans is an important task in computer vision. We present a system that incorporates a variety of constraints in a unified multiview framework to automatically detect humans i...
详细信息
ISBN:
(纸本)9780769528250
detection of articulated objects such as humans is an important task in computer vision. We present a system that incorporates a variety of constraints in a unified multiview framework to automatically detect humans in possibly crowded scenes. These constraints include the kinematic constraints, the occlusion of one part by another and the high correlation between the appearance of parts such as the two arms. The graphical structure (non-tree) obtained is optimized in a nonparametric belief propagation framework using prior based search.
Interactive network-based navigation over large urban environments raises difficult problems due to the size and complexity of these scenes. In this paper, we present a client-server system allowing navigation over 3d...
详细信息
ISBN:
(纸本)9780769528250
Interactive network-based navigation over large urban environments raises difficult problems due to the size and complexity of these scenes. In this paper, we present a client-server system allowing navigation over 3d cities in real time. due to a novel progressive and hierarchical representation of 3d models of densely built urban areas, only perceptible details for all the regions visible from a given viewpoint are progressively streamed to visualisation clients. Furthermore, efficient coding methods are used to compress the representation data allowing quick start-up of the interactive visualisation with a highly-detailed model. This is achieved through a set of dedicated algorithms allowing a very large city model to be structured into a multi-resolution representation. The method efficiently exploits the fact that most automated modelling techniques of urban scenes provides 2d1/2 models (building footprint, height, altitude, So as to efficiently and faithfully model complex buildings, a procedural representation for roofs and facades is proposed. Finally, we present an MPEG4 compatible implementation based on the introduction of new node types with the associated bitstream.
In this paper we report our progress in building a system for the acquisition, analysis, andvisualization of a collection of Native Californian baskets from the Phoebe A. Hearst Museum of Anthropology. Our project di...
详细信息
ISBN:
(纸本)9780769528250
In this paper we report our progress in building a system for the acquisition, analysis, andvisualization of a collection of Native Californian baskets from the Phoebe A. Hearst Museum of Anthropology. Our project differs from existing cultural heritage applications in terms of its focus: to build tools and techniques for visualizing and studying a large number of related objects - in this case, baskets. We present our progress in the following system components: (i) laser-scanning of baskets, (ii) construction andprocessing of 3d models, and (iii) building virtual exhibits. We conclude the paper with our experiences and a summary of challenges we anticipate in building a completely automated system for processing and analyzing a large set of models - such as might be encountered when digitizing a large museum collection. Efficient retrieval andvisualization of artifact collections are important to a number of communities, including anthropology researchers, Native American tribes, and the general public.
As more and more human motion data are widely used to animate computer graphics figures in many applications, there is an imperative need to compress motion data for compact storage and fast transmission. We propose a...
详细信息
ISBN:
(纸本)9780769528250
As more and more human motion data are widely used to animate computer graphics figures in many applications, there is an imperative need to compress motion data for compact storage and fast transmission. We propose a data-driven method for efficient compression of human motion sequences by exploiting both spatial and temporal coherences of the data. We first segment a motion sequence into subsequences such that the poses within a subsequence lie near a low dimensional linear space. We then compress each segment using the principal component analysis. Further compression is achieved by storing only the key frames' projections to the principal component space and interpolating the other frames in-between the keyframes via spline functions. The experimental results show that our method can achieve significant compression rate with low reconstruction errors.
暂无评论