Transform domain methods have dominated the watermarking field from its early stages. In these methods some coefficients are selected and modified according to certain rules. The two most important numbers in this pro...
详细信息
ISBN:
(纸本)0819450235
Transform domain methods have dominated the watermarking field from its early stages. In these methods some coefficients are selected and modified according to certain rules. The two most important numbers in this process are the length and the position of the watermark. These are usually heuristically chosen. In order to handle this problem, an adaptive scheme for the selection of the proper coefficients is analysed in the present communication.
This demo paper gives a real-time learned image codec on FPGA. By using Xilinx VCU128, the proposed system reaches 720P@30fps codec, which is 7.76x faster than prior work.
ISBN:
(纸本)9781665475921
This demo paper gives a real-time learned image codec on FPGA. By using Xilinx VCU128, the proposed system reaches 720P@30fps codec, which is 7.76x faster than prior work.
Video transmission over Internet or wireless channels usually suffers from network congestion. However, for transmission of video coded by Motion JPEG 2000 (MJP2), the effect of network congestion can be greatly moder...
详细信息
ISBN:
(纸本)0819450235
Video transmission over Internet or wireless channels usually suffers from network congestion. However, for transmission of video coded by Motion JPEG 2000 (MJP2), the effect of network congestion can be greatly moderated by exploiting the progressive transmission. When network congestion occurs, data packets representing detailed information will be dropped selectively, and this will lead to degradation of image quality in the corresponding frames. In this paper, we present an algorithm to enhance the image quality of the degraded frames by utilizing inter-frame correlation. For a degraded frame, its detailed information is recovered from the previously decoded frame. Simulation results show that both the objective and visual quality of these frames are greatly improved.
The anisotropic wavelet packet transform is an extension of the conventional wavelet (packet) transform where the basis can have different scales in different dimensions. As there are certain kinds of images with diff...
详细信息
ISBN:
(纸本)0819450235
The anisotropic wavelet packet transform is an extension of the conventional wavelet (packet) transform where the basis can have different scales in different dimensions. As there are certain kinds of images with different behaviour in horizontal and vertical direction, anisotropic wavelet packet bases can be adapted more precisely to these images. Zero-tree image compression has already proved its efficiency on conventional wavelet transformed data as well as for wavelet packets. In this work, zero-tree methods are extended to work with anisotropic wavelet packets and coding results are shown for several types of images.
Inexpensive computer hardware and optical devices has made image/video applications available even for private individuals. This has created a huge demand for image and multimedia databases and other systems, which wo...
详细信息
ISBN:
(纸本)0819450235
Inexpensive computer hardware and optical devices has made image/video applications available even for private individuals. This has created a huge demand for image and multimedia databases and other systems, which work with visual information. Analysis of visual information has not been completely formalized and automated yet. The reason for that is a long tradition of separation of vision and knowledge subsystems. However, brain researches show that vision is a part of a larger information system that converts visual information into knowledge structures. These structures drive vision process, resolve ambiguity and uncertainty in real images via feedback projections, and provide image understanding that is an interpretation of visual information in terms of such knowledge models. It is hard to split such system apart. Vision mechanisms can never be completely understood separately from the informational processes related to knowledge and intelligence. MPEG-7 is an industry-wide effort to incorporate knowledge into image/video code. This article describes basic principles of integration low-level imageprocessing with high-level knowledge reasoning, and shows how image Understanding systems can utilize MPEG-7 standard. Such applications can add to the standard the power of image understanding.
A new video method which performs well in the segmentation of images containing multiple independently moving foreground objects is presented. It combines the strong points of both color and motion segmentation in the...
详细信息
ISBN:
(纸本)0819450235
A new video method which performs well in the segmentation of images containing multiple independently moving foreground objects is presented. It combines the strong points of both color and motion segmentation in the way expected, while the algorithm is still of such a low complexity that it could be implemented in consumer electronics hardware. Performing the motion based block resolution segmentation before refining it with pixel resolution color segmentation does provide several advantages.
This paper describes an attack on semi-fragile image authentication schemes proposed in papers In this attack, the adversary manipulates an authentic image and queries a verifier with the corrupted image. According to...
详细信息
ISBN:
(纸本)0819450235
This paper describes an attack on semi-fragile image authentication schemes proposed in papers In this attack, the adversary manipulates an authentic image and queries a verifier with the corrupted image. According to the answers from the verifier, the adversary can disclose the secret relationship graphs used to produce a signature. With the disclosed relationship graphs, the adversary can impersonate an innocent person to forge authentic images easily. A countermeasure to this attack is to change scheme parameters with the relationship edges so that the relationship graphs reconstructed by the attacker are different from the original one. Sequentially, the attacker is hard to forge an authentic image without correct relationship graphs.
Perceptual organization is the process of assigning each part of a scene to a specified association of features to be a part of the same organization. In the twenty century, Gestalt psychologists formalized how image ...
详细信息
ISBN:
(纸本)9781728180687
Perceptual organization is the process of assigning each part of a scene to a specified association of features to be a part of the same organization. In the twenty century, Gestalt psychologists formalized how image features tend to be grouped by giving a set of organizing principles. In this paper, we propose an approach for the detection of perceptual groups in an image. We are mainly interested in features grouped by the proximity law of Gestalt. We conceive an object-based model within a stochastic framework using a marked point process (MPP). We use a Bayesian learning method to extract perceptual groups in a scene. The proposed model tested on synthetic images proves the efficient detection of perceptual groups in noisy images.
This paper demonstrates a model-based reinforcement learning framework for training a self-flying drone. We implement the Dreamer proposed in a prior work as an environment model that responds to the action taken by t...
详细信息
ISBN:
(纸本)9781728185514
This paper demonstrates a model-based reinforcement learning framework for training a self-flying drone. We implement the Dreamer proposed in a prior work as an environment model that responds to the action taken by the drone by predicting the next video frame as a new state signal. The Dreamer is a conditional video sequence generator. This model-based environment avoids the time-consuming interactions between the agent and the environment, speeding up largely the training process. This demonstration showcases for the first time the application of the Dreamer to train an agent that can finish the racing task in the Airsim simulator.
In this paper, we present a novel approach to image-based rendering (IBR) that for generates an arbitrary view image with arbitrary focus for a scene consisting two approximately constant depths. The presented method ...
详细信息
ISBN:
(纸本)0819450235
In this paper, we present a novel approach to image-based rendering (IBR) that for generates an arbitrary view image with arbitrary focus for a scene consisting two approximately constant depths. The presented method differs from the conventional IBRs using multiple view images in that we acquire two differently focused images at each camera position and render parallax and focus effects on each object simply by linear filtering of the acquired images without segmentation. Experimental results on the real images acquired with 4 cameras located in parallel are presented.
暂无评论