We present a reversible watermarking scheme which achieves perfect restoration of both the embedded watermark and the original image during extraction. The proposed scheme embeds data by modifying those integer DCT co...
详细信息
ISBN:
(纸本)0780385780
We present a reversible watermarking scheme which achieves perfect restoration of both the embedded watermark and the original image during extraction. The proposed scheme embeds data by modifying those integer DCT coefficients with peak amplitudes in each coefficient histogram. The integer DCT performed over the original image is a lossless 8/spl times/8 block transform with high energy concentrating ability, which guarantees reversibility and high capacity/distortion ratio for the proposed watermarking scheme. In addition, this scheme provides a wide quality (PSNR) range from around 40 dB to 60 dB for the watermarked image, and an inherent fine adjustment capability for the quality (PSNR). Some experimental results are presented to demonstrate the high performance of our scheme in terms of capacity and the quality of the watermarked image.
The paper focuses on the development of a human-computer communication method which utilizes the user's typing skills to control the facial expression of a computer-generated 3D face. The method is based on the re...
详细信息
The paper focuses on the development of a human-computer communication method which utilizes the user's typing skills to control the facial expression of a computer-generated 3D face. The method is based on the realization that the human face is a movable and deformable system with 26 degrees of freedom (the same number as the letters of the English alphabet). Therefore it is possible to create a parameterized graphical facial model confined to a set of 26 parameters each one controlled by a letter key. The method is an extension of the KUI technique (Adamo-Villani and Beni, 2004; Adamo-Villani and Beni, 2003) recently developed to encode hand gestures.
Collagen fiber bundles are important constituent parts of biological soft tissues, such as tendons, blood vessels and skin. The single collagen fibers within a bundle have a certain crimp that determines the bundle...
详细信息
Collagen fiber bundles are important constituent parts of biological soft tissues, such as tendons, blood vessels and skin. The single collagen fibers within a bundle have a certain crimp that determines the bundle's stretching characteristic, which is of particular interest in mechanobiology. A thorough understanding of the complex interrelations between mechanical factors and the associated biological responses may help to improve diagnostics, which allow disease and injury to be treated earlier. We present a method to obtain the relative stretching characteristics (RSC) of collagen (fibrous) bundles based on the analysis of microscopic tissue images. We show, how the orientation statistics of local orientations in a bundle can be directly related to the RSC. The von Mises distribution (VMD) - a circular probability distribution is used to describe the 2D orientation statistics. The k parameter of the VMD is used to obtain the mean stretching and the probability density function (PDF) of the bundles' RSC is identified.
Ubiquitous, high-resolution, large public displays offer an attractive complement to wearable displays. Unfortunately, the inherently public nature of these public displays makes them unsuitable for displaying sensiti...
详细信息
Ubiquitous, high-resolution, large public displays offer an attractive complement to wearable displays. Unfortunately, the inherently public nature of these public displays makes them unsuitable for displaying sensitive information. We present EyeGuide, a wearable system that allows the user to obtain information quickly from a public display without sacrificing privacy. To this end, EyeGuide employs a lightweight head-worn eye-tracker for hands-free object selection and an earphone for private communication. Our system supports public displays that are dynamic (e.g., a large plasma screen) and static (e.g., a large printed map). In our printed map scenario, EyeGuide whispers verbal directions via earphone to a user, based on where they are looking on the map. Using a technique we call "gaze steering," the system guides the user's eye position to specific locations. In our dynamic public display scenarios, EyeGuide presents documents (e.g., maps) that contain sensitive data in a way that preserves privacy.
Object identification from local information has recently been investigated with respect to its potential for robust recognition, e.g., in case of partial object occlusions, scale variation, noise, and background clut...
详细信息
Object identification from local information has recently been investigated with respect to its potential for robust recognition, e.g., in case of partial object occlusions, scale variation, noise, and background clutter in detection tasks. This work contributes to this research by a thorough analysis of the discriminative power of local appearance patterns and by proposing to exploit local information content for object representation and recognition. In a first processing stage, we localize discriminative regions in the object views from a posterior entropy measure, and then derive object models from selected discriminative local patterns. Object recognition is then applied to test patterns with associated low entropy using an efficient voting process. The method is evaluated by various degrees of partial occlusion and Gaussian image noise, resulting in highly robust recognition even in the presence of severe occlusion effects.
In this paper, we present a coding framework addressing image-space compression for free-viewpoint video. Our framework is based on time-varying 3D point samples which represent real-world objects. The 3D point sample...
详细信息
In this paper, we present a coding framework addressing image-space compression for free-viewpoint video. Our framework is based on time-varying 3D point samples which represent real-world objects. The 3D point samples are obtained after a geometrical reconstruction from multiple pre-recorded video sequences and thus allow for arbitrary viewpoints during playback. The encoding of the data is performed as an off-line process and is not time-critical. The decoding however, must support for real-time rendering of the dynamic 3D data. We introduce a compression framework which encodes multiple point attributes like depth and color into progressive streams. The reference data structure is aligned on the original camera input images and thus enables for easy view-dependent decoding. A novel differential coding approach permits random access in constant time throughout the entire data set and thus enables arbitrary viewpoint trajectories in both time and space.
This work introduces a methodology for evaluating the operational range of a video surveillance system in terms of robustness and reliability. We propose the generation of semi and full-synthetic video sequences under...
详细信息
This work introduces a methodology for evaluating the operational range of a video surveillance system in terms of robustness and reliability. We propose the generation of semi and full-synthetic video sequences under controlled variation of selected parameters. This data provides the necessary ground truth information for evaluating the motion detection and tracking systems. In addition, we propose several error metrics for quantitative evaluation.
The object of the paper is the development of a new human computer communication (HCC) interface for configuration and animation of a signing semantroid. The work presented in this paper builds on two recent research ...
详细信息
The object of the paper is the development of a new human computer communication (HCC) interface for configuration and animation of a signing semantroid. The work presented in this paper builds on two recent research results: (1) the concepts of semantroid (=semantic android - an avatar limited to head and hands), and semantic intensity (=semantic content); and (2) the KUI (keyboard user interface) method, a keyboard-based method of configuration and animation of 3D hand gestures and facial expressions. In particular, the paper focuses on: (1) development of a new set of keyboard-controlled facial parameters of the semantroid for animation of American Sign Language (ASL) non-manuals, and (2) design of an efficient HCC interface for input and encoding recording of ASL handshapes and facial expressions.
Fingerprint matching is a common technique for biometric authentication. Solid state sensors allow the use of fingerprint recognition in small sized embedded systems. The size of these sensors makes it necessary to st...
详细信息
Fingerprint matching is a common technique for biometric authentication. Solid state sensors allow the use of fingerprint recognition in small sized embedded systems. The size of these sensors makes it necessary to store several impressions of the same finger to provide good coverage of the entire fingertip. In order to reduce memory requirements and matching time all these impressions can be fused into a single larger image. Memory constraints imposed by embedded computers prohibit the use of images. A fingerprint is therefore represented as a set of minutiae coordinates and minutiae angles. We present a two stage approach to combine two fingerprints. First, a RANSAC based method is used to determine a rigid transformation which roughly aligns the two fingerprints. Second, the transformation is optimised using a robust least median of squares solution. The reliability of the method is demonstrated on a large synthetic dataset and real fingerprint images. The computational complexity and memory requirements allow implementation of the algorithm on embedded hardware.
Vehicle occupants that are out-of-position can be deadly injured by the deployment of the air bag in a crash situation. In recent years many different sensors and systems have been proposed to detect the type of occup...
详细信息
Vehicle occupants that are out-of-position can be deadly injured by the deployment of the air bag in a crash situation. In recent years many different sensors and systems have been proposed to detect the type of occupant and the position of the occupant's head. This paper presents a method for classification and occupant's head detection based on passive stereo vision. The proposed system uses depth surface analysis and scene statistics together with support vector machines for classification and selection of head candidates. Evaluation of the method shows 99% correct for classification and 98% correct for head detection, using large sets of image data, and image sequences recorded in a driving vehicle.
暂无评论