A new method for the reduction of the number of colors in a digital image is proposed. The new method is based on the developed of a new neural network classifier that combines the advantages of the Growing Neural Gas...
详细信息
Motivated by the success of free-parts based representations in face recognition, we have attempted to address some of the problems associated with applying such a philosophy to the task of speaker-independent visual ...
详细信息
作者:
Lucey, SimonLucey, PatrickAdvanced Multimedia Processing Laboratory
Department of Electrical and Computer Engineering Carnegie Mellon University PittsburghPA15213 United States Speech
Audio Image and Video Research Laboratory Queensland University of Technology GPO Box 2424 Brisbane4001 Australia
Motivated by the success of free-parts based representations in face recognition [1] we have attempted to address some of the problems associated with applying such a philosophy to the task of speaker-independent auto...
详细信息
This paper presents a feature point tracking algorithm using optical flow under the non-prior training active feature model (NPT-AFM) framework. The proposed algorithm mainly focuses on analysis of deformable objects,...
详细信息
Data clustering is intensively used in signal processing in tasks such as multimedia compression, segmentation and pattern matching. In this work we extend the use of principal curves in clustering to complex multidim...
详细信息
Data clustering is intensively used in signal processing in tasks such as multimedia compression, segmentation and pattern matching. In this work we extend the use of principal curves in clustering to complex multidimensional datasets. The use of principal curve in clustering is limited for high complexity data. Automatic parameterization of the principal curve to assure good results for different datasets is a difficult task. We propose to use the tree structure to capture the general settlement of the data. Using this topology, regions of the dataset can be extracted, individually clustered using the principal curve and then optimally recombined. The experiments show the improvement of the new method over the principal curve based clustering and the good performance compared to other clustering methods.
Motivated by the success of free-parts based representations in face recognition, we have attempted to address some of the problems associated with applying such a philosophy to the task of speaker-independent visual ...
详细信息
Motivated by the success of free-parts based representations in face recognition, we have attempted to address some of the problems associated with applying such a philosophy to the task of speaker-independent visual speech recognition. A major problem with canonical area-based approaches in automatic visual speech recognition is the dependence these approaches have on locating and tracking the speaker’s region of interest (ROI) correctly. By employing a free-parts representation,we assume that the position/structure of patches within the mouth image can be relaxed so they can "freely" move to varying extents, hence reducing the influence of the front-end effect. In this paper, we show that by using a free-parts representation we gain some robustness against the problem of ROI localisation and tracking compared to current area-based feature extraction techniques such as the discrete cosine transform (DCT). Also in this paper, we expose the importance of representation for the task of visual speech recognition highlighted by the poor results current representations yield.
We present a combined real-time face region tracking and highly accurate face recognition technique for an intelligent surveillance system. High-resolution face images are very important to achieving accurate identifi...
详细信息
We present a combined real-time face region tracking and highly accurate face recognition technique for an intelligent surveillance system. High-resolution face images are very important to achieving accurate identification of a human face. Conventional surveillance or security systems, however, usually provide poor image quality because they use only fixed cameras to record scenes passively. We have implemented a real-time surveillance system that tracks a moving face using four pan-tilt-zoom (PTZ) cameras. While tracking, the region-of-interest (ROI) can be obtained by using a low-pass filter and background subtraction with the PTZ. Color information in the ROI is updated to extract features for optimal tracking and zooming. FaceIt/sup /spl reg//, which is one of the most popular face recognition software packages, is evaluated and then used to recognize the faces from the video signal. Experimentation with real human faces showed highly acceptable results in the sense of both accuracy and computational efficiency.
We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propos...
We study the problem of representing images within a multimedia Database Management System (DBMS), in order to support fast retrieval operations without compromising storage efficiency. To achieve this goal, we propose new image coding techniques which combine a wavelet representation, embedded coding of the wavelet coefficients, and segmentation of image-domain regions in the wavelet domain. A bitstream is generated in which each image region is encoded independently of other regions, without having to explicitly store information describing the regions. Simulation results show that our proposed algorithms achieve coding performance which compares favorably, both perceptually and objectively, to that achieved using state-of-the-art image/video coding techniques while additionally providing region-based support.
暂无评论