The proceedings contain 590 papers. The topics discussed include: path diversity and bandwidth allocation for multimedia streaming;video streaming with diversity;quality monitoring for compressed video subjected to pa...
ISBN:
(纸本)0780379659
The proceedings contain 590 papers. The topics discussed include: path diversity and bandwidth allocation for multimedia streaming;video streaming with diversity;quality monitoring for compressed video subjected to packet loss;a framework for aligning and indexing movies with their script;anchor space for classification and similarity measurement of music;automatic singer identification;sunset scene classification using simulated image recomposition;eye gaze and speech for data entry: a comparison of different data entry methods;wearable imaging system for summarizing personal experiences;flowfield and beyond: applying pressure-sensitive multi-point touchpad interaction;visual attention based image browsing on mobile devices;a simulation based approach for incorporating virtual components IP cores into multimedia systems design;optimization of decision-timing for early termination of SSDA-based block matching;and structural risk minimization using nearest neighbor rule.
In this paper we present a blind low frequency watermarking scheme. We obtain a robustness improvement to most common imageprocessing operations by embedding the watermark in the approximation image of the original i...
详细信息
In this paper we present a blind low frequency watermarking scheme. We obtain a robustness improvement to most common imageprocessing operations by embedding the watermark in the approximation image of the original image. In order to embed the watermark with minimal loss in image fidelity, the watermark strength is modulated according to the local image characteristics. We generate a visual mask based on the texture, edge and luminance masking effects of the human visual system. Watermark detection is done without use of the original image. Experimental results show that the proposed technique is robust against most common imageprocessing operations.
WWW image retrieval has been a very challenging research topic in the last decade. Interesting, WWW images are described typically by both high-level (text) and low-level (visual) high-dimensional features. While most...
详细信息
ISBN:
(纸本)0780381858
WWW image retrieval has been a very challenging research topic in the last decade. Interesting, WWW images are described typically by both high-level (text) and low-level (visual) high-dimensional features. While most research effort has focused on effectiveness issues, the problems related to efficient retrieval from WWW image databases using an approach based on multiple features have not been seriously addressed. In this paper, we discuss the difficulties in indexing multi-feature image databases, and outline various indexing approaches for such databases, as well as consider their respective strengths and weaknesses. Corresponding query processing algorithms are also briefly discussed.
image indexing using content analysis is known as a difficult task, involving the vision research domain. Using these tools in the context of a retrieval system is generally frustrating for users, due to a lack of int...
详细信息
image indexing using content analysis is known as a difficult task, involving the vision research domain. Using these tools in the context of a retrieval system is generally frustrating for users, due to a lack of interfaces development and to the difficulty for users to understand the low-level features managed by the system. We propose in this paper a general point of view for introducing a link between such systems and potential users. This includes image features based on visual perception models, a relevance feedback model, and a graphical interface to express the information need through user-system interactions.
The ability to change illumination is a crucial factor in image-based modeling and rendering. image-based relighting offers such capability. However, the trade-off is the enormous increase of storage requirement. In t...
详细信息
CNES has launched in May 2002 a new high resolution (2.5m) and large swath (2 x 60km) optical remote sensing satellite : SPOT5. To achieve a high image acquisition capacity with this system, a large on-board mass memo...
详细信息
ISBN:
(纸本)0819450243
CNES has launched in May 2002 a new high resolution (2.5m) and large swath (2 x 60km) optical remote sensing satellite : SPOT5. To achieve a high image acquisition capacity with this system, a large on-board mass memory (100 Gbits) together with a 3:1 real-time compression are being used. The quasi-lossless and fixed output rate requirements put on the on-board image compression resulted in the development of a custom algorithm. This algorithm is based on : a DCT decorrelator, a scalar quantizer, an entropy coder and a rate regulator. It has been extensively tested before launch both in terms of quantitative performances and in terms of visual performances. The objectives of the on-orbit validation of the SPOT5 image compression function were the followings: Perform an image quality assessment in worst case conditions for the compression. In particular, the THR mode (2.5 in resolution) is potentially sensitive to compression noise and was therefore thoroughly checked for any compression artefacts. Compression noise characteristics were taken into account in the denoising stage of the ground processing for improved performances: Verify the adequacy of the compression parameters with regard to the in-flight characteristics of the instruments (MTF, radiometric spreading,...);Technological checkout of the compression unit on board the satellite. This paper will present an overview of SPOT5 mission.. the methods used for on-orbit validation of the compression and, finally, all the validation results together with the lessons learned throughout this development. On-board image compression for future CNES remote sensing missions will be addressed as a conclusion.
作者:
Hao, Wang
Visual Communications Laboratory No. 11 He Ping Li Dong Jie Beijing 100013 China
This paper describes algorithms for automatic facial feature extraction by using Active Shape Models (ASM). Based on these extracted facial features, face drawing is created, which embodies the individual features tha...
详细信息
A "Navigation Aid for visually Impaired" (NAVI) system has been developed in UMS in 2001 and improved in 2002. The NAVI system comprises of 3-part functioning units namely a digital video camera, a single bo...
详细信息
A "Navigation Aid for visually Impaired" (NAVI) system has been developed in UMS in 2001 and improved in 2002. The NAVI system comprises of 3-part functioning units namely a digital video camera, a single board computer and a headphone. The single camera limitation in providing the depth information, which is critical for navigation purposes, has prompted the extension into a stereovision system by using two cameras. This stereovision system also uses the fuzzy-based segmentation procedure as an image preprocessing package that was developed for the single camera NAVI but with an additional adaptive loop. The segmented images undergo a rule-based stereo matching procedure. From the matching features, the disparity is computed. The disparity in combination with information of focal length as well as the space between two cameras provides the information on the distance between cameras and object. This distance information is incorporated into the final processed image as four gray levels such as white, light gray, dark gray and black. The size and location of object in the visual plane is then conveyed to the blind individual by means of a structured coded sound. The distance information is represented by means of verbal sound. Preliminary experimental analysis reveals a promising new approach for developing a navigational aid for blinds through the transformation of stereo image to stereo sound.
The operation and performance of a six degree-of-freedom (DOF) shared-aperture tracking system with image overlay is described. This unique tracking technology shares the same aperture or scanned optical beam with the...
详细信息
The operation and performance of a six degree-of-freedom (DOF) shared-aperture tracking system with image overlay is described. This unique tracking technology shares the same aperture or scanned optical beam with the visual display, virtual retinal display (VRD). This display technology provides high brightness in an AR helmet-mounted display, especially in the extreme environment of a military cockpit. The VRD generates an image by optically scanning visible light directly to the viewer's eye. By scanning both visible and infrared light, the head-worn display can be directly coupled to a head-tracking system. As a result, the proposed tracking system requires minimal calibration between the user's viewpoint and the tracker's viewpoint. This paper demonstrates that the proposed shared-aperture tracking system produces high accuracy and computational efficiency. The current proof-of-concept system has a precision of +/- 0.05 and +/- 0.01 deg. in the horizontal and vertical axes, respectively. The static registration error was measured to be 0.08 +/- 0.04 and 0.03 +/- 0.02 deg. for the horizontal and vertical axes, respectively. The dynamic registration error or the system latency was measured to be within 16.67 ms, equivalent to our display refresh rate of 60 Hz. In all testing, the VRD was fixed and the calibrated motion of a robot arm was tracked. By moving the robot arm within a restricted volume, this real-time shared-aperture method of tracking was extended to six-DOF measurements. Future AR applications of our shared-aperture tracking and display system will be highly accurate head tracking when the VRD is helmet mounted and worn within an enclosed space, such as an aircraft cockpit.
In this article a new optimization of genetic snakes (GS): multiobjectives genetics snakes (MGS) faster and simpler to implement is proposed. They enable us to make converge two snakes in parallel while minimizing ene...
详细信息
In this article a new optimization of genetic snakes (GS): multiobjectives genetics snakes (MGS) faster and simpler to implement is proposed. They enable us to make converge two snakes in parallel while minimizing energies of different nature. We apply them to the modeling of mouth contours within the framework of the audio-visual speech recognition (AVSR). The proposed AVSR system implements a classifier based on the STM (sparse template matching) which simplicity makes it possible to consider a real time implementation. We evaluate the classifier performances on European database M2VTS, and compare the performances of the GS and MGS.
暂无评论