It is generally considered that human behavior includes both regularities and habits. In this paper, the regularities and habits of behavior are called the behavioral pattern, and we wish to learn and recognize them. ...
Karhunen-Loeve transformation (KLT) is a popular method for dimensional reduction and feature extraction in image analysis, signal processing, automatic control systems, and so on, while the drawback of the KLT is exp...
ISBN: 0780385543 (print)
The merging ratio image (MRI) for realistic object class re-rendering is presented and applied to human face images. We focus on the Lambertian object class and use a uniform ratio image scheme, the merging ratio image, to merge the expression ratio image, ageing ratio image, and illumination ratio image (quotient image), which exhibits a new facial model quality. Given a single face image and other photorealistic face examples with distinct attributes, such as aged wrinkles, expressive feature motions, and particular lighting conditions, we generate facial expressions, natural ageing, rejuvenation, and varying illumination with the MRI rendering technique. Experimental results demonstrate the attractive properties of our method on face examples from the MPI Caucasian face database and the AI&R Asian face database.
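The ratio-image idea in this abstract can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes aligned grayscale images stored as nested lists, and it assumes that individual ratio images are merged by pixel-wise multiplication, which the abstract does not state. All function names are hypothetical.

```python
def ratio_image(changed, neutral, eps=1e-6):
    """Pixel-wise ratio of an example face with an attribute (lighting,
    expression, age) to the same face without it."""
    return [[c / max(n, eps) for c, n in zip(row_c, row_n)]
            for row_c, row_n in zip(changed, neutral)]


def merge_ratio_images(*ratios):
    """ASSUMPTION: merge several ratio images by pixel-wise product."""
    merged = [[1.0] * len(row) for row in ratios[0]]
    for ratio in ratios:
        merged = [[m * r for m, r in zip(row_m, row_r)]
                  for row_m, row_r in zip(merged, ratio)]
    return merged


def apply_ratio(target, ratio, max_value=255.0):
    """Re-render a target face by scaling each pixel with the ratio image,
    clipping to the maximum intensity."""
    return [[min(max_value, t * r) for t, r in zip(row_t, row_r)]
            for row_t, row_r in zip(target, ratio)]
```

Under the Lambertian assumption, the albedo cancels in the ratio of two images of the same surface, which is why the ratio can be transferred to a different face of the same object class.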
ISBN: 1586034049 (print)
In this article, we present the results of a pilot study that examined the performance of people training on the virtual-reality-based BEST-IRIS Laparoscopic Surgery Training Simulator. The performance of experienced surgeons was compared with that of residents. The purpose of the study was to validate the BEST-IRIS training simulator, which appeared to be a useful training and assessment tool.
A vision system mounted in an intelligent vehicle may require high resolution in a certain azimuth direction and a wide field of view to ensure safety in autonomous or assisted driving, but few vision systems with these characteristics exist. A true single-viewpoint, multi-resolution elliptical cone catadioptric system with a 180-degree horizontal field of view is proposed to provide higher resolution in the look-ahead direction and lower resolution in the lateral direction. An elliptical cone is used as the catoptric element and a conventional perspective camera as the dioptric element. The single viewpoint of the catadioptric system is proved, and its vertical and horizontal resolution and vertical field of view are calculated using the pinhole imaging model and the geometric-optics image formation model. Experimental results verify the multi-resolution capability of the system.
ISBN: 0780382730 (print)
The machine vision system is one of the most important components of an intelligent vehicle, and software and hardware co-realization is critical for vision-based navigation. This work presents a multi-DSP realization of an intelligent vehicle vision system and analyzes and illustrates the software and hardware co-realization. Based on the software framework and the data flow of the machine vision system, we propose a software implementation for the host computer and the DSP terminals. In particular, a road detection algorithm and its multi-DSP realization are presented. Simulated in real environments, the system is stable and its real-time performance is satisfactory.
ISBN: 078038511X (print)
Focusing on the path waiting and path circulation problems that arise in updating the context table and in the RENORME and BYTEOUT procedures of the conventional arithmetic encoder in JPEG2000, a 3-stage pipeline architecture is implemented on an FPGA to achieve high-speed encoding. A method of updating the CX table is proposed, and a short-delay circuit is implemented to detect the leading zeros of the A register. Multiplexers are adopted to accelerate the arbitrary left-shift operation, and parallel processing based on data dependency is used to optimize the RTL code and shorten the main critical path. Finally, the update logic of the context table is discussed in full. Experimental results show that the encoder can run at up to 107 MHz on Altera's EP1S25B672C7, and that the critical path is 4.6 ns when synthesized in Synopsys DC with the TSMC 0.25 μm library.
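The role of the leading-zero detector can be illustrated in software. The sketch below is an assumption-laden model rather than the paper's circuit: it treats A as a 16-bit register that is renormalized by shifting left until bit 15 is set, and it omits the C register, the shift counter, and BYTEOUT entirely. Function names are hypothetical.

```python
def renorm_shift_count(a):
    """Leading zeros of the 16-bit A register, i.e. the number of left
    shifts needed to set bit 15."""
    if not 0 < a <= 0xFFFF:
        raise ValueError("A must be a nonzero 16-bit value")
    return 16 - a.bit_length()


def renormalize_iterative(a):
    """Reference behaviour: shift one bit per loop iteration."""
    shifts = 0
    while (a & 0x8000) == 0:
        a = (a << 1) & 0xFFFF
        shifts += 1
    return a, shifts


def renormalize_single_step(a):
    """Same result in one step: detect the leading zeros, then apply the
    whole shift at once (the job of a multiplexer-based shifter)."""
    n = renorm_shift_count(a)
    return (a << n) & 0xFFFF, n
```

Replacing the data-dependent loop with a single variable shift removes the iteration count from the timing path, which is the kind of restructuring a pipelined encoder needs.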
ISBN: 0780382730 (print)
To detect lane boundaries robustly, the R and B channels of a color road image are combined to form a gray-level image. The gray image is reduced in size, and a Sobel operator with a very low threshold is applied to produce a gray edge image. In the adaptive randomized Hough transform, pixels of the gray edge image are sampled randomly with weights corresponding to their gradient magnitudes. The 3D parameter space of the parabolic curve is reduced to 2D: two parameters are estimated using the gradient direction, and the remaining parameter is then used to verify the estimates against an adaptive threshold. In this way, lane markings can be detected accurately and robustly. Experimental results under different conditions confirm the validity of the method.
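A rough sketch of the two ideas in this abstract, weighted sampling and a parameter split, follows. Several details are assumptions, since the abstract does not give them: the parabola is modelled as x = a*y**2 + b*y + c, the edge tangent slope dx/dy is taken as already derived from the gradient direction, the tuple layout is hypothetical, and a fixed tolerance stands in for the adaptive threshold.

```python
import random


def sample_edge_pixels(pixels, k, rng=random):
    """Draw k edge pixels with probability proportional to gradient magnitude.

    Each pixel is a tuple (x, y, magnitude, slope); slope is dx/dy of the
    local edge tangent (hypothetical layout).
    """
    weights = [p[2] for p in pixels]
    return rng.choices(pixels, weights=weights, k=k)


def estimate_parabola(p1, p2, tol=1.0):
    """Estimate (a, b) from the tangent slopes of two sampled pixels and use
    c as a consistency check. ASSUMED model: x = a*y**2 + b*y + c, so the
    tangent slope is dx/dy = 2*a*y + b."""
    x1, y1, _, s1 = p1
    x2, y2, _, s2 = p2
    if y1 == y2:
        return None  # the two slopes give no information about curvature
    a = (s2 - s1) / (2.0 * (y2 - y1))
    b = s1 - 2.0 * a * y1
    c1 = x1 - a * y1 * y1 - b * y1
    c2 = x2 - a * y2 * y2 - b * y2
    if abs(c1 - c2) > tol:
        return None  # the pixels do not lie on one common parabola
    return a, b, 0.5 * (c1 + c2)
```

In a full randomized Hough transform, accepted (a, b) pairs would be accumulated over many sampled pairs and the strongest cell reported as the lane boundary.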
Speaker recognition systems perform better when clean speech signals are used for the task. In the presence of high levels of background noise, speech recorded from a close-speaking microphone is degraded, and hence so is the performance of the speaker recognition system. A transducer held at the throat yields a signal that is clean even in a noisy environment. This paper discusses the prospect of using such signals for speaker recognition. We study a text-independent speaker recognition system based on features extracted from speech recorded simultaneously with a throat microphone and a close-speaking microphone, in clean and simulated noisy conditions. Autoassociative neural networks are used to model the speaker characteristics based on vocal tract system features and excitation source features, represented by weighted linear prediction cepstral coefficients and the linear prediction residual, respectively. The experimental results show that speech collected from the throat microphone can be used for tasks such as speaker recognition, especially in noisy conditions.
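For readers unfamiliar with the linear prediction residual mentioned here, the sketch below computes it with the standard autocorrelation method and the Levinson-Durbin recursion. It is a generic textbook procedure, not the authors' feature pipeline; windowing, pre-emphasis, frame handling, and the cepstral weighting are all omitted, and the function names are hypothetical.

```python
def autocorrelation(signal, max_lag):
    """Autocorrelation r[0..max_lag] of one analysis frame."""
    return [sum(signal[n] * signal[n + k] for n in range(len(signal) - k))
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Predictor coefficients a[1..order] such that the prediction is
    s_hat[n] = sum_k a[k] * s[n - k]."""
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[k] * r[i - k] for k in range(1, i))
        reflection = acc / err
        updated = a[:]
        updated[i] = reflection
        for k in range(1, i):
            updated[k] = a[k] - reflection * a[i - k]
        a = updated
        err *= 1.0 - reflection * reflection
    return a[1:]


def lp_residual(signal, coeffs):
    """Prediction error e[n] = s[n] - s_hat[n], with samples before the
    start of the frame treated as zero."""
    residual = []
    for n, sample in enumerate(signal):
        predicted = sum(c * signal[n - k]
                        for k, c in enumerate(coeffs, start=1) if n - k >= 0)
        residual.append(sample - predicted)
    return residual
```

The predictor coefficients describe the vocal tract filter, while the residual retains what the filter cannot predict, which is why it is used as an excitation source feature.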
The Karhunen-Loeve transform (KLT) is a popular method for dimensionality reduction and feature extraction in image analysis, signal processing, automatic control systems, and other fields. The drawback of the KLT is its expensive computation, so efficient updating algorithms have been proposed in signal processing and numerical linear algebra. These updating algorithms make active learning and recognition possible, but they mainly deal with zero-mean data. In this paper, we propose an updating algorithm for the KLT for nonzero-mean data and show its application to face analysis. The experimental results demonstrate the efficiency of our algorithm.
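Why nonzero-mean data needs special handling can be seen from the standard identity for combining the statistics of two data blocks. The sketch below is not the algorithm proposed in the paper, which the abstract does not describe; it only shows that when the mean shifts, the combined scatter matrix gains a rank-one correction term. The eigen-decomposition that would yield the KLT basis is omitted, and the function names are hypothetical.

```python
def mean_and_scatter(rows):
    """Sample count, mean vector, and scatter matrix sum((x-m)(x-m)^T)."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    scatter = [[sum((r[i] - mean[i]) * (r[j] - mean[j]) for r in rows)
                for j in range(d)] for i in range(d)]
    return n, mean, scatter


def merge_stats(n1, m1, s1, n2, m2, s2):
    """Combine the statistics of two blocks without revisiting the data.

    S = S1 + S2 + (n1*n2/(n1+n2)) * (m1-m2)(m1-m2)^T
    The last term vanishes only when both blocks share the same mean.
    """
    n = n1 + n2
    d = len(m1)
    mean = [(n1 * a + n2 * b) / n for a, b in zip(m1, m2)]
    diff = [a - b for a, b in zip(m1, m2)]
    w = n1 * n2 / n
    scatter = [[s1[i][j] + s2[i][j] + w * diff[i] * diff[j]
                for j in range(d)] for i in range(d)]
    return n, mean, scatter
```

Dividing the merged scatter by the sample count gives the covariance matrix whose leading eigenvectors form the KLT basis.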