A natural human-computer interface requires the integration of realistic audio and visual information for perception and display. An example of such an interface is an animated talking head displayed on the computer screen in the form of a human-like computer agent. This system converts text to acoustic speech with synchronized animation of mouth movements. The talking head is based on a generic 3D human head model, but to improve realism, natural-looking personalized models are necessary. In this paper, we report a semi-automatic method for adapting a generic head model to 3D range data of a human head obtained from a 3D laser range scanner. This personalized model is incorporated into the talking head system. With texture mapping, the personalized model offers a more natural and realistic look than the generic model. The model created with the proposed method compares favorably to generic models.
A nonlinear ranked-order filter based on a content model of similarity is proposed for colour image processing. Simulation results indicate that the new filter suppresses impulsive as well as Gaussian noise and preserves edges and details.
We consider a distributed system where sensors make location parameter estimates using their observations. A central processor collects the local estimates and declares a final estimate based on them. We present a simple study of the convergence properties of three structures where the empirical mean and M-estimates are used in various combinations. It is shown that when occasional outliers exist, decentralized estimators that provide robustness at the stages where data corruption occurs perform best. (C) 1997 Elsevier Science B.V.
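As a rough illustration of why robustness at the corrupted stage matters, the sketch below compares two fusion structures on made-up sensor readings: empirical mean at both stages, and a Huber M-estimate at each sensor followed by a mean at the central processor. The Huber estimator, the tuning constant `k`, the MAD-based scale, and the data are all assumptions chosen for illustration; the abstract does not say which M-estimator or which three structures the paper studies.

```python
import statistics


def huber_location(samples, k=1.5, iterations=50):
    """Huber M-estimate of location via iteratively reweighted means.

    The scale is fixed to the normalized median absolute deviation
    (an assumption; other scale choices are possible).
    """
    mu = statistics.median(samples)
    mad = statistics.median([abs(x - mu) for x in samples])
    scale = 1.4826 * mad if mad > 0 else 1.0
    for _ in range(iterations):
        weights = []
        for x in samples:
            residual = abs(x - mu)
            # Full weight inside the threshold, down-weighted outside it.
            weights.append(1.0 if residual <= k * scale else k * scale / residual)
        mu = sum(w * x for w, x in zip(weights, samples)) / sum(weights)
    return mu


# Hypothetical observations of a true location of 10.0.
# The third sensor has one corrupted reading (60.0).
sensors = [
    [9.8, 10.1, 10.0, 9.9, 10.2],
    [10.3, 9.7, 10.0, 10.1, 9.9],
    [9.9, 10.0, 10.1, 9.8, 60.0],
]

# Structure A: empirical mean at the sensors and at the central processor.
mean_then_mean = statistics.mean([statistics.mean(s) for s in sensors])

# Structure B: M-estimate at the sensors (where corruption occurs),
# empirical mean at the central processor.
robust_then_mean = statistics.mean([huber_location(s) for s in sensors])

print(f"mean/mean:  {mean_then_mean:.3f}")
print(f"Huber/mean: {robust_then_mean:.3f}")
```

With these readings, the single outlier pulls the mean-only structure well away from 10.0, while the structure that is robust at the sensor stage stays close to it.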
This paper focuses on the representation and view generation of three-dimensional (3-D) scenes. In contrast to existing methods that construct a full 3-D model or those that exploit geometric invariants, our representation consists of dense depth maps at several preselected viewpoints from an image sequence. Furthermore, instead of using multiple calibrated stationary cameras or range scanners, we derive our depth maps from image sequences captured by an uncalibrated camera with only approximately known motion. We propose an adaptive matching algorithm that assigns various confidence levels to different regions in the depth maps. Nonuniform bicubic spline interpolation is then used to fill in low-confidence regions in the depth maps. Once the depth maps are computed at preselected viewpoints, the intensity and depth at these locations are used to reconstruct arbitrary views of the 3-D scene. Specifically, the depth maps are regarded as vertices of a deformable 2-D mesh, which are transformed in 3-D, projected to 2-D, and rendered to generate the desired view. Experimental results are presented to verify our approach.
A 1.5 V resistive fuse for image smoothing and segmentation using bulk-driven MOSFETs is presented. The circuit switches on only if the differential voltage applied across its input terminals is less than a set voltage; it switches off if the differential voltage is higher than the set value. The useful operation range of the circuit is 0.4 V with a supply voltage of 1.5 V and threshold voltages of V_Tn = 0.828 V and V_Tp = -0.56 V for n- and p-channel MOSFETs, respectively.
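The on/off behaviour described above can be sketched with an idealized fuse model: ohmic below the set voltage, open circuit at or above it. The code below is a behavioural illustration only, not a model of the bulk-driven MOSFET circuit; the function names, the unit conductance, the relaxation step, the 0.4 set voltage used in the example, and the 1-D test signal are all assumptions.

```python
def fuse_current(v, v_set, conductance=1.0):
    """Ideal resistive fuse: conducts when |v| < v_set, otherwise open."""
    return conductance * v if abs(v) < v_set else 0.0


def smooth(signal, v_set, step=0.25, iterations=200):
    """Relax a 1-D chain of nodes linked by ideal fuses.

    Neighbours whose difference is below v_set exchange current and
    are smoothed; larger differences (edges) are left untouched.
    """
    u = list(signal)
    for _ in range(iterations):
        nxt = list(u)
        for i in range(len(u)):
            flow = 0.0
            if i > 0:
                flow += fuse_current(u[i - 1] - u[i], v_set)
            if i < len(u) - 1:
                flow += fuse_current(u[i + 1] - u[i], v_set)
            nxt[i] = u[i] + step * flow
        u = nxt
    return u


# Hypothetical noisy step: two flat regions separated by an edge of about 1.0.
noisy = [0.0, 0.1, 0.0, 0.1, 1.0, 1.1, 1.0, 1.1]
smoothed = smooth(noisy, v_set=0.4)
print([round(x, 3) for x in smoothed])
```

In this toy network each flat region settles to its own average while the step between the regions is preserved, which is the smoothing-with-segmentation effect that resistive fuse networks are used for.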
This work concentrates on comparing the performance of the minimum distance classifier and the maximum-likelihood classifier for texture analysis. A tree-structured wavelet transform has been used for extracting the features, and the comparison is based on the correct classification percentage. The results indicate that the maximum-likelihood classifier performs marginally better than the Mahalanobis distance for some feature sets. The Euclidean distance did not prove to be powerful in distinguishing the textures. The performance of various orthogonal wavelet transforms has also been compared in order to find the best wavelet for each of the classifiers considered. (C) 1997 Elsevier Science Ltd.
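To make the three decision rules concrete, the sketch below scores a feature vector against class models using squared Euclidean distance, squared Mahalanobis distance, and a Gaussian maximum-likelihood score. It assumes a diagonal covariance per class for brevity, and the two-feature "texture" data and class labels are invented; none of this reproduces the paper's wavelet features or results.

```python
import math


def fit_class(samples):
    """Per-feature mean and variance (diagonal covariance assumption)."""
    n = len(samples)
    dims = len(samples[0])
    mean = [sum(s[d] for s in samples) / n for d in range(dims)]
    var = [sum((s[d] - mean[d]) ** 2 for s in samples) / n for d in range(dims)]
    return mean, var


def euclidean_sq(x, mean, var):
    """Squared Euclidean distance to the class mean (variance ignored)."""
    return sum((a - m) ** 2 for a, m in zip(x, mean))


def mahalanobis_sq(x, mean, var):
    """Squared Mahalanobis distance under a diagonal covariance."""
    return sum((a - m) ** 2 / v for a, m, v in zip(x, mean, var))


def gaussian_ml_score(x, mean, var):
    """Negative log-likelihood up to constants: Mahalanobis term plus log-determinant."""
    return mahalanobis_sq(x, mean, var) + sum(math.log(v) for v in var)


def classify(x, models, score):
    """Pick the class whose model gives the smallest score."""
    return min(models, key=lambda label: score(x, *models[label]))


# Hypothetical training features: a tight class and a widely spread class.
models = {
    "smooth": fit_class([(0.9, 1.0), (1.1, 1.0), (1.0, 0.9), (1.0, 1.1)]),
    "rough": fit_class([(2.0, 4.0), (6.0, 4.0), (4.0, 2.0), (4.0, 6.0)]),
}

x = (2.0, 2.0)
for name, score in [("Euclidean", euclidean_sq),
                    ("Mahalanobis", mahalanobis_sq),
                    ("Gaussian ML", gaussian_ml_score)]:
    print(name, "->", classify(x, models, score))
```

The only difference between the Mahalanobis and maximum-likelihood scores here is the log-determinant term, which is one way to see why the two can perform similarly, whereas the Euclidean rule ignores class spread altogether.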
ISBN: (print) 0780341236
In this paper, we present a new system to segment and label CT/MRI brain slices using feature extraction and unsupervised clustering. In this technique, each voxel is assigned a feature pattern consisting of a scaled family of differential geometrical invariant features. The invariant feature pattern is then assigned to a specific region using a two-stage neural network system. The first stage is a self-organizing principal components analysis (SOPCA) network that is used to project the feature vector onto its leading principal axes found by using principal components analysis. This step provides an effective basis for feature extraction. The second stage consists of a self-organizing feature map (SOFM), which automatically clusters the input vectors into different regions. The optimum number of regions (clusters) is obtained by a model-fitting approach. Finally, a 3D connected component labeling algorithm is applied to ensure region connectivity. Implementation and performance of this technique are presented. Compared to other approaches, the new system is more accurate in extracting 3D anatomical structures of the brain, and can be adapted to real-time imaging scenarios.
An image warping technique based on segmented regions is introduced for the temporal prediction of videophone-type sequences. At the encoder, a set of control points are determined from the previous frame and their co...
The use of higher-order statistics in autoregressive modeling of lung sounds is presented, resulting in a characterization of their source and transmission. The lung sound source in the airway is estimated using the prediction error of an all-pole filter based on higher-order statistics (AR-HOS), while the acoustic transmission through the lung parenchyma and chest wall is modeled by the transfer function of the same AR-HOS filter. The parametric bispectrum, using the estimated a_i coefficients of the AR-HOS model, is also calculated to elucidate the frequency characteristics of the modeled system. The implementation of this approach on pre-classified lung sound segments in known disease conditions, selected from teaching tapes, was examined. Experiments have shown that a reliable estimation of lung sound characteristics, consistent with current knowledge, can be achieved using this method, even in the presence of additive Gaussian noise.
A block-matching method is generally used for disparity estimation as well as for motion estimation. However, this method suffers from limitations such as blocking artifacts in the reconstructed images and a lack of compensation ability for mismatched areas. A new disparity estimation scheme using 2-D filtering is proposed to provide more accurate estimates of the disparity vector and better compensation ability. This is accomplished by applying the left image to the reference input of the filter while using the right image as the desired output. The goal is to provide the best match for the right image using the filter output, instead of using the left image directly as in the block-matching method. A reduced-order filtering scheme is also introduced to minimize the number of filter coefficients needed for the reconstruction. The reconstructed images generated from the estimated disparity vectors and some principal filter coefficients exhibited excellent quality.
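For reference, the baseline block-matching approach that the abstract contrasts against can be sketched as a sum-of-absolute-differences (SAD) search along a scanline. This is a minimal illustration of conventional block matching, not the proposed 2-D filtering scheme; the function names, the block size, the search range, the SAD cost, and the tiny test images are assumptions.

```python
def sad(left, right, row, col, d, block):
    """Sum of absolute differences between a right-image block at (row, col)
    and the left-image block shifted d columns to the right."""
    total = 0
    for r in range(row, row + block):
        for c in range(col, col + block):
            total += abs(right[r][c] - left[r][c + d])
    return total


def block_disparity(left, right, row, col, block=2, max_disp=3):
    """Return the horizontal shift d that best predicts the right block
    from the left image, searching d = 0..max_disp within image bounds."""
    width = len(left[0])
    candidates = [d for d in range(max_disp + 1) if col + block + d <= width]
    return min(candidates, key=lambda d: sad(left, right, row, col, d, block))


# Hypothetical rectified pair: the left image is the right image
# shifted two columns to the right (zero-filled at the border).
right = [
    [10, 20, 30, 40, 50, 60],
    [15, 25, 35, 45, 55, 65],
]
left = [
    [0, 0, 10, 20, 30, 40],
    [0, 0, 15, 25, 35, 45],
]

print(block_disparity(left, right, row=0, col=0))
```

Because every pixel in a block shares one disparity, errors show up as block-shaped artifacts in the reconstruction, which is the limitation the abstract attributes to this method.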