Digital watermarking is a new technique for digital multimedia copyright protection. The robustness and the imperceptibility are the basic requirements of the digital watermark. The key factor that affects both the ro...
详细信息
ISBN:
(纸本)9810475241
Digital watermarking is a new technique for digital multimedia copyright protection. The robustness and the imperceptibility are the basic requirements of the digital watermark. The key factor that affects both the robustness and the imperceptibility of the digital watermark is the watermarking strength. In this paper, artificial neural network (ANN) is used to model human visual system (HVS) and an ANN-based image-adaptive method for deciding watermarking strength for image DCT coefficients is presented. The experimental results show that the method can increase the watermarking strength so that the robustness of digital watermark is enhanced and that the method has very good adaptability.
In this paper, we describe methods of detecting a human face and localizing the feature area in our man-machine interaction project. We use a simplified human skin color model for pre-segmenting the input image. From ...
详细信息
ISBN:
(纸本)0780374886
In this paper, we describe methods of detecting a human face and localizing the feature area in our man-machine interaction project. We use a simplified human skin color model for pre-segmenting the input image. From candidate areas we use linear Fisher discriminate analysis (LFDA) to choose the most like face area, then in the detected face area, we localize the feature area with LFDA and linear principal component analysis (LPCA). We use the method of the image pyramid to improve the searching speed. The results are presented on images from the Chinese audio visual speech recognition database (CAVSRD 1.0).
Digital Watermarking is an effective and popular technique to discourage illegal copying and distribution of copyrighted digital image information. The important attributes are the picture quality of the watermarked i...
详细信息
Digital Watermarking is an effective and popular technique to discourage illegal copying and distribution of copyrighted digital image information. The important attributes are the picture quality of the watermarked image (similarity to the original) and robustness to attacks such as cropping. We propose a transform-domain robust digital watermarking technique which uses a pattern-based compression of the watermark image, an intelligent dynamic embedding of the signature bits and a post-watermarking content-based visual masking technique to deliver high image quality and robustness in retaining watermark content against attacks (cropping).
This paper describes a high capacity blind video watermarking system invariant to geometrical attacks such as shift, rotation, scaling and cropping. A spatial domain reference watermark is used to obtain invariance to...
详细信息
This paper describes a high capacity blind video watermarking system invariant to geometrical attacks such as shift, rotation, scaling and cropping. A spatial domain reference watermark is used to obtain invariance to geometric attacks by employing image registration techniques to determine and invert the attacks. A second, high capacity watermark, which carries the data payload, is embedded in the wavelet domain according to a human visual system (HVS) model. This is protected by a state-of-the-art error correction code (turbo code). The proposed system is invariant to scaling up to 180%, rotation up to 70/spl deg/ and arbitrary aspect ratio changes up to 200% on both axes. Furthermore, the system is virtually invariant to any shifting, cropping, or combined shifting and cropping.
We present a novel watermarking scheme to ensure the credibility of digital images. The proposed technique is able to detect malicious tampering of images even if they have been incidentally distorted by basic image p...
详细信息
We present a novel watermarking scheme to ensure the credibility of digital images. The proposed technique is able to detect malicious tampering of images even if they have been incidentally distorted by basic imageprocessing operations. Our system is based on the quantization of wavelet packet coefficients and uses characteristics of the human visual system to maximize the embedding weights while keeping good perceptual transparency. We develop an image-dependant method to evaluate, in the discrete wavelet domain, the optimal quantization steps allowing the tamper proofing of still images. The nature of multiresolution discrete wavelet decomposition allows the spatial and frequency localization of image tampering. Experimental results are presented to demonstrate the capacity of our system to detect unauthorized modification of images while staying robust to image compression.
We have designed a system with an intuitive user interface for remote camera control and image-based queries over the Internet. While searching for present solutions we realized the importance of a well-designed user ...
详细信息
We have designed a system with an intuitive user interface for remote camera control and image-based queries over the Internet. While searching for present solutions we realized the importance of a well-designed user interface. We developed a system, which enables remote observation and remote control of the JVC network camera over the Internet. The user interface is based on the combination of live video and a static panoramic view of a remote location. It provides a complete overview of a remote location and significantly simplifies the control over the Internet. By interactively moving a rectangular frame in the panoramic picture, the user locally selects the new direction of the camera. visual summaries of activities at the observed location can be generated as well as custom queries with a simple user interface over the Internet.
Mainstream automatic speech recognition has focused almost exclusively on the acoustic signal. The performance of these systems degrades considerably in the real world in the presence of noise. On the other hand, most...
详细信息
Mainstream automatic speech recognition has focused almost exclusively on the acoustic signal. The performance of these systems degrades considerably in the real world in the presence of noise. On the other hand, most human listeners, both hearing-impaired and normal hearing, make use of visual information to improve speech perception in acoustically hostile environments. Motivated by humans' ability to lipread, the visual component is considered to yield information that is not always present in the acoustic signal and enables improved accuracy over totally acoustic systems, especially in noisy environments. In this paper, we investigate the usefulness of visual information in speech recognition. We first present a method for automatically locating and extracting visual speech features from a talking person in color video sequences. We then develop a recognition engine to train and recognize sequences of visual parameters for the purpose of speech recognition. We particularly explore the impact of various combinations of visual features on the recognition accuracy. We conclude that the inner lip contour features together with the information about the visibility of the tongue and teeth significantly improve the performance over using outer contour only features in both speaker dependent and speaker independent recognition tasks.
In this paper, we propose a color segmentation algorithm based on contrast information and adaptive thresholds. Given a color image, instead of the commonly used achromatic difference and chromatic difference, we use ...
详细信息
In this paper, we propose a color segmentation algorithm based on contrast information and adaptive thresholds. Given a color image, instead of the commonly used achromatic difference and chromatic difference, we use achromatic contrast and chromatic contrast to represent the significance of boundary. To fit for human visual perception, adaptive thresholds are applied to suppress perceptually faint boundaries. A complete segmentation scheme is proposed and the simulation results demonstrate the superiority of this approach in providing reasonable and reliable color segmentation.
One of the key issues in video manipulation is video abstraction in the form of skimmed video. For this purpose, an important task is to determine the content significance of each chunk of frames in a video sequence. ...
详细信息
One of the key issues in video manipulation is video abstraction in the form of skimmed video. For this purpose, an important task is to determine the content significance of each chunk of frames in a video sequence. In this paper, we present a new computational model of motion attention and the approach to applying this model in video skimming. The effectiveness of our architecture and model is demonstrated by user studies of visual skimming experiments. The results indicate that the precision of motion attention detection is over 80%, and the user satisfaction of visual skimming is beyond 70%.
In order to transmit pre-encoded digital video over heterogeneous networks, it is necessary to employ transcoding techniques that convert pre-encoded video streams into streams having different bit rates and quality. ...
详细信息
ISBN:
(纸本)9781581136203
In order to transmit pre-encoded digital video over heterogeneous networks, it is necessary to employ transcoding techniques that convert pre-encoded video streams into streams having different bit rates and quality. The specified problem is referred to as rate shaping or rate adaptation. In this work, we propose a new rate control scheme for H.263+ based video transcoding. The proposed rate control scheme is comprised of Frame-Layer bit allocation and Macroblock-Layer rate control. At the frame layer, scene context statistics from the incoming video stream are utilized to detect scene changes and determine frame type. The bit budget is allocated to frames according to their energy and frame types. At the macroblock layer, a novel linear Rate-Quantization model is used for selecting quantization parameters for macroblocks. Implementation and experimental results show that the proposed algorithm can provide accurate bit allocation, and can effectively alleviate visual quality degradation after scene changes. This rate adaptation scheme can be used to provide flexible video bit rate adaptation for transmission of pre-encoded video over heterogeneous networks.
暂无评论