3D data has many advantages over image data. It is robust to illumination change and does not have a scaling problem caused by distance of an object. Also it can be viewed at various angles. Nowadays with advance of 3...
The region-based approach has become a popular research trend in the field of multimedia database retrieval. We present the Region Frequency and Inverse Picture Frequency (RF*IPF) weighting, a measure developed t...
ISBN: 3540426809 (print)
The proceedings contain 157 papers. The special focus in this conference is on Wearable Computing, Retrieval Techniques, Coding Techniques and Systems. The topics include: hardware platform for wearable computing research; automatic summarization of wearable video indexing subjective interest; a novel video retrieval method to support a user's recollection of past events aiming for wearable information playing; experience of immersive virtual world using cellular phone interface; toward human-centered interaction through wearable vision and visualization; face indexing and retrieval in personal digital album; an image retrieval system based on local and global color descriptors; a new shot boundary detection algorithm; an adaptive index structure for high-dimensional similarity search; combining hierarchical classifiers with video semantic indexing systems; dynamic multi-reference prediction in video coding for improved error resilience over Internet; fast and robust sprite generation for MPEG-4 video coding; improved MPEG-4 visual texture coding using perceptual dithering for transparent image coding; a real-time large vocabulary continuous recognition system for Chinese sign language; object modeling, coding, and transmission for multimedia communications; live events accessing for multi-users with free viewpoints using stereo omni-directional system; user modeling for efficient use of multimedia files; a feature-based vehicle tracking system in congested traffic video sequences; movie event detection by using audiovisual information; automatic segmentation and tracking of moving objects; interacting with 3D graphic objects in an image-based environment; and robust head pose estimation using textured polygonal model with local correlation measure.
ISBN: 0769512720 (print)
In this paper, we present a segmented linear subspace model for face recognition that is robust under varying illumination conditions. The algorithm generalizes the 3D illumination subspace model by segmenting the image into regions that have surface normals whose directions are close to each other. This segmentation is performed using a K-means clustering algorithm and requires only a few training images under different illuminations. When the linear subspace model is applied to the segmented image, recognition is robust to attached and cast shadows, and the recognition rate is equal to that of computationally more complex systems that require constructing the 3D surface of the face.
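The segmentation step described in this abstract can be illustrated with a minimal sketch. It assumes the per-pixel surface normals are already available as unit 3-vectors (the abstract estimates them from a few training images, which is not shown here), and it uses a direction-based K-means variant with a fixed, arbitrary initialization. The function names `normalize` and `kmeans_normals` are hypothetical and not taken from the paper.

```python
import math

def normalize(v):
    """Scale a non-zero 3-vector to unit length."""
    n = math.sqrt(sum(c * c for c in v))
    return tuple(c / n for c in v)

def kmeans_normals(normals, k, iters=20):
    """Group unit surface normals whose directions are close to each other.

    Assumption of this sketch: the initial centers are simply the first k
    normals. Returns one cluster label per input normal.
    """
    centers = [normalize(n) for n in normals[:k]]
    labels = [0] * len(normals)
    for _ in range(iters):
        # Assignment: the nearest center is the one with the largest dot
        # product, i.e. the smallest angle to the normal.
        labels = [
            max(range(k), key=lambda j: sum(a * b for a, b in zip(n, centers[j])))
            for n in normals
        ]
        # Update: each center becomes the normalized mean of its members.
        for j in range(k):
            members = [n for n, lab in zip(normals, labels) if lab == j]
            if not members:
                continue
            mean = tuple(sum(c) / len(members) for c in zip(*members))
            if any(abs(c) > 1e-12 for c in mean):
                centers[j] = normalize(mean)
    return labels

# Four normals: two facing the camera (+z), two facing sideways (+x).
normals = [normalize(v) for v in [(0, 0, 1), (1, 0, 0), (0.1, 0, 0.99), (0.99, 0, 0.1)]]
labels = kmeans_normals(normals, k=2)
```

Pixels sharing a label would form one region, and a separate linear illumination subspace could then be fitted per region.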
ISBN: 0819439983 (print)
Software has been developed that reconstructs the three-dimensional (3-D) shape of fracture surfaces without human assistance. It is based on computer image processing and pattern recognition techniques applied to a stereo pair of scanning electron micrographs. The processing consists of two subprocesses: searching for matching points between the two images, and computing heights from the relative shift of the matching points. With the previously developed system, some mismatches were inevitable in the search process, particularly for low-contrast SEM images such as those of striations and intergranular facets. To improve the accuracy of the search, a Genetic Algorithm (GA) was implemented in the developed system. Using the GA method, the 3-D shapes of a wide variety of fracture surfaces, including cleavage failures, intergranular cracking, dimples and fatigue striations, were successfully reconstructed with sufficient accuracy. The search by the GA method was compared with the previously developed two-step algorithm of coarse and close searching. The comparison showed that the GA method has the advantage in both search accuracy and run time. A detailed 3-D shape, comprising more than 120 x 120 reconstructed points, was thus obtained with sufficient accuracy and a relatively short run time.
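The two subprocesses named in this abstract, matching and height computation, can be sketched as follows. This is not the authors' method: an exhaustive sum-of-absolute-differences search stands in for the GA and the two-step coarse/close search, and the height formula assumes parallel projection with a stereo pair tilted symmetrically by a known total angle. The function names, the pixel size and the tilt angle are all hypothetical.

```python
import math

def best_shift(left_row, right_row, x, half, max_shift):
    """Find the horizontal shift of the patch centered at x in left_row.

    Exhaustive search that minimizes the sum of absolute differences (SAD);
    a simple stand-in for the search strategies described in the abstract.
    """
    patch = left_row[x - half : x + half + 1]
    best, best_cost = 0, float("inf")
    for s in range(max_shift + 1):
        lo, hi = x + s - half, x + s + half + 1
        if lo < 0 or hi > len(right_row):
            continue  # candidate window falls outside the image row
        cost = sum(abs(a - b) for a, b in zip(patch, right_row[lo:hi]))
        if cost < best_cost:
            best, best_cost = s, cost
    return best

def height_from_parallax(shift_px, pixel_size_um, tilt_deg):
    """Convert a relative shift (parallax) into a height in micrometers.

    Assumes parallel projection and a symmetric total tilt of tilt_deg
    between the two micrographs; neither is stated in the abstract.
    """
    parallax_um = shift_px * pixel_size_um
    return parallax_um / (2.0 * math.sin(math.radians(tilt_deg) / 2.0))

left_row = [0, 0, 5, 9, 5, 0, 0, 0, 0, 0]
right_row = [0, 0, 0, 0, 5, 9, 5, 0, 0, 0]
shift = best_shift(left_row, right_row, x=3, half=1, max_shift=4)
height = height_from_parallax(shift, pixel_size_um=0.1, tilt_deg=10.0)
```

On low-contrast rows many shifts give nearly equal SAD costs, which is the kind of mismatch the abstract reports and the motivation for a more robust search.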
ISBN: 0819441236 (print)
In this paper we review the application of twisted nematic liquid crystal displays (TN-LCD) as spatial light modulators (SLM) for image processing and diffractive optics. In general, two kinds of response are desired for these applications: amplitude-only and phase-only modulation. Users of commercially available LCDs generally do not know the optical properties of the material used, so a reverse-engineering approach is needed to optimize the LCD response. First, we show a simplified model, which we recently proposed, for the orientation of the LC molecules. The model allows the physical parameters of the LCD to be determined by means of simple intensity measurements. Second, we demonstrate that the model provides very accurate predictions of the optical transmission. We can therefore perform computer searches for the optimum orientation of the added polarizing elements to obtain the required optical transmission. We demonstrate the need to insert wave plates in front of and behind the LCD to obtain either amplitude-only or phase-only regimes. Finally, we show the application of the optimized LCD to display images and filters in optical image processing, as well as to the design of diffractive optical elements and apodizers.
Biometric systems have been the focus of concentrated research efforts in recent years. These systems can be used to identify a person or to grant a person access to something, e.g., a room. Face recognition technology has reached a level of performance at which frontal-view recognition of faces with slightly different facial expressions, view angles or head poses can be considered nearly solved. We present a novel hybrid ANN/HMM approach that recognizes a person from that person's profile view (90°), although the recognition system is trained with only a single frontal view of the person. Such a system can be useful for mugshot identification where a victim or witness has seen the criminal from the side only. Our approach uses neural methods to synthesize a profile from the frontal view, using no additional knowledge about the 3D shape and structure of a human head. The generated images are classified using a statistical HMM approach.
The modal correspondence method of L.S. Shapiro and J.M. Brady (see Image and Vision Computing, vol.10, p.283-8, 1992) aims to match point-sets by comparing the eigenvectors of a pairwise point proximity matrix. Altho...
We have developed a Chinese OCR engine for machine-printed documents. Currently, our OCR engine supports a vocabulary of 6921 characters, comprising 6707 simplified Chinese characters in GB2312-80, 12 frequently used GBK Chinese characters, 62 alphanumeric characters, and 140 punctuation marks and symbols. The supported font styles include Song, Fang Song, Kai, Hei, Yuan, LiShu, WeiBei, XingKai, etc. The average character recognition accuracy is above 99% for newspaper-quality documents, with a recognition speed of about 250 characters per second on a Pentium III-450 MHz PC and a memory footprint of less than 2 MB. We describe the key technologies we used to construct the above recognizer. Among them, we highlight three key techniques contributing to the high recognition accuracy, namely the use of Gabor features, the use of discriminative feature extraction, and the use of minimum classification error as a criterion for model training.
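The abstract names minimum classification error (MCE) as the training criterion but gives no formula. The sketch below shows one commonly cited form of the per-sample MCE loss: a sigmoid applied to a misclassification measure that compares the true-class score with a soft maximum over the competing classes. The function name `mce_loss` and the parameters `eta` and `gamma` are assumptions of this sketch, not values from the paper.

```python
import math

def mce_loss(scores, correct, eta=1.0, gamma=1.0):
    """Smoothed 0/1 classification error for a single sample.

    scores  : one discriminant value per class (higher means more likely)
    correct : index of the true class
    eta     : sharpness of the soft maximum over competing classes
    gamma   : slope of the sigmoid that smooths the error count
    """
    rivals = [g for j, g in enumerate(scores) if j != correct]
    # Soft maximum of the rival scores (log-mean-exp, shifted for stability).
    top = max(rivals)
    mean_exp = sum(math.exp(eta * (g - top)) for g in rivals) / len(rivals)
    soft_max = top + math.log(mean_exp) / eta
    # Misclassification measure: positive when rivals outscore the true class.
    d = soft_max - scores[correct]
    return 1.0 / (1.0 + math.exp(-gamma * d))

confident = mce_loss([2.0, 0.0, -1.0], correct=0)   # true class wins clearly
mistaken = mce_loss([0.0, 2.0, -1.0], correct=0)    # a rival wins
tied = mce_loss([1.0, 1.0], correct=0)              # exact tie
```

Because the loss is differentiable in the scores, it can be minimized by gradient descent over the classifier parameters, which is what makes it usable as a training criterion.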
This paper presents a progress report (after 1 year of a 3 year project) on the overall design for a flexible archive conversion system, intended eventually for widespread use as a tool to convert legacy typescript an...