In this paper, we propose a combined source and channel coding scheme for image transmission over noisy channels. The key component is extracting and preserving the scene information and incorporating it with unequal ...
详细信息
ISBN:
(纸本)0780365968
In this paper, we propose a combined source and channel coding scheme for image transmission over noisy channels. The key component is extracting and preserving the scene information and incorporating it with unequal error protection to combat the channel errors. After hierarchical wavelet decomposition of the image, wavelet coefficients with parent-child relationship are grouped into wavelet blocks and classified according to corresponding scenes in the original image. For each class, spatial neighborhood coefficients in the high frequency subbands are constrained so that the spatially isolated coefficients are removed and clustered coefficients are retained at the same time. All the wavelet blocks in the same class are grouped together and coded using SPIHT algorithm. High source coding efficiency is preserved even though multiple source coded bitstreams are generated since the wavelet tree structure is untact. In order to combat the channel errors, unequal error protection strategy implemented by RCPC/CRC channel coding is designed based on the bit contribution to both PSNR and human visual sensitivity. Finally, a post-processing method is developed at the receiving end to restore the degradation due to the residual error after channel decoding. Experimental results show that the proposed scheme is indeed able to provide protection for more important bits and more important visual content under noisy transmission environment. In particular, the reconstructed images illustrate consistently better visual quality.
In this work, we investigate a new class of scalable image coders. We target at the same time multiresolution (for spatial scalability), critical (for compression efficiency) and (hierarchical) segmentation based deco...
详细信息
ISBN:
(纸本)0819437034
In this work, we investigate a new class of scalable image coders. We target at the same time multiresolution (for spatial scalability), critical (for compression efficiency) and (hierarchical) segmentation based decompositions (for object based scalability). Hierarchical segmentation allows to access the description of a scene in terms of regions or objects at several resolution levels, and thus encode and transmit the objects selectively. From a coding viewpoint, it is obviously interesting to couple the multi-level segmentation with a critically decimated decomposition of the image (to avoid redundancy of representation). However, the association of object representation combined with critically sampled multiresolution decomposition has not been studied to our knowledge. In this paper, we propose new methods to perform hierarchical segmentation of an image using critically decimated non linear filter banks;the resulting decomposition embeds a hierarchical segmentation map and is therefore particularly well suited for region based coding and progressive transmission. As the segmentation map is embedded by reconstruction inside the decomposition, we do not really need to transmit it separately thus attempting to reduce the bitrate. Simulations show that a prototype coder of this type has a degradation in terms of rate/distortion tradeoff compared to a conventional wavelet based image coder, but offers in addition new perspectives for object based manipulations, coding and transmission.
Rate Control is an important component in a video encoder for date storage or real-time visual communication. In this paper, we will discuss the rate control in MPEG encoder for real-time video communication over Vari...
详细信息
ISBN:
(纸本)0819437034
Rate Control is an important component in a video encoder for date storage or real-time visual communication. In this paper, we will discuss the rate control in MPEG encoder for real-time video communication over Variable Bit Rate (VBR) channel. In interactive video communication, the video transmission is subject to both channel rate constraints and end-to-end delay constraints. Our goal in this paper is to modify the rate control in MPEG-2 encoder and satisfy the rate constraints, and study how to improve the video quality in the scenario of VBR transmission. Here, we employ Leaky-Bucket to describe the traffic parameters and monitor the encoder's output. Depending on the Rate-distortion models developed by us, we present a rate control algorithm to achieve almost uniform distortion both within a frame and between frames in a scene. With adaptive rate-distortion models and additional function of scene detecting, our method can robustly deal with scenes of different statistical characteristics. Comparing to MPEG2 TM5, in real time video communication, we could keep the constant buffer delay while maintain the decoded image quality stable. Furthermore, the bit allocation in our algorithm is more reasonable and controllable. Therefore, our method realized the advantages that advocated by VER video communication, such as small end-to-end delay, consistent image quality and high channel efficiency.
Many optical range sensors use an Equal Angle Increment (EAI) sampling. This type of sensors uses rotating mirrors with a constant angular velocity using radar and triangulation techniques, where the sensor sends and ...
详细信息
ISBN:
(纸本)0819437034
Many optical range sensors use an Equal Angle Increment (EAI) sampling. This type of sensors uses rotating mirrors with a constant angular velocity using radar and triangulation techniques, where the sensor sends and receives the modulated coherent light through the mirror. Such an EAI model generates data for surface geometrical description that has to be converted, in many applications, into data which meet the desired Equal Distance Increment (EDI) orthographic projection model. For an accurate analysis in three-dimensional (3-D) images, a 3-D interpolation scheme is needed to resample the range data into spatially equally-distance sampling data that emulate the Cartesian orthographic projection model. In this paper, a resampling approach using a B-Spline surface fitting is proposed. The first step is to select a new scale for all X, Y, Z directions based on the 3-D Cartesian coordinates of range data obtained from the sensor parameters. The size of the new range image and the new coordinates of each point are then computed according to the actual references of (X, Y, Z) coordinates and the new scale. The new range data are interpolated using a B-Spline surface fitting based on the new Cartesian coordinates. The experiments show that this 3-D interpolation approach provides a geometrically accurate solution for many industrial applications which deploy the EAI sampling sensors.
Model-based image coding has been given extensive attention due to its high subject image quality and low bit-rates. But the estimation of object motion parameter is still a difficult problem, and there is not a prope...
详细信息
ISBN:
(纸本)0819435929
Model-based image coding has been given extensive attention due to its high subject image quality and low bit-rates. But the estimation of object motion parameter is still a difficult problem, and there is not a proper error criteria for the quality assessment that are consistent with visual properties. This paper presents an algorithm of the facial motion parameter estimation based on feature point correspondence and gives the motion parameter error criteria. The facial motion model comprises of three parts. The first part is the global 3-D rigid motion of the head, the second part is non-rigid translation motion in jaw area, and the third part consists of local non-rigid expression motion in eyes and mouth areas. The feature points are automatically selected by a function of edges, brightness and end-node outside the blocks of eyes and mouth. The numbers of feature point are adjusted adaptively. The jaw translation motion is tracked by the changes of the feature point position of jaw. The areas of non-rigid expression motion can be rebuilt by using block-pasting method. The estimation approach of motion parameter error based on the quality of reconstructed image is suggested. and area error function and the error function of contour transition-turn rate are used to be quality criteria. The criteria reflect the image geometric distortion caused by the error of estimated motion parameters properly.
New markets are emerging for digital electronic image device, especially in visualcommunications, PC camera, mobile/cell phone, security system, toys, vehicle image system and computer peripherals for document captur...
详细信息
ISBN:
(纸本)0819437603
New markets are emerging for digital electronic image device, especially in visualcommunications, PC camera, mobile/cell phone, security system, toys, vehicle image system and computer peripherals for document capture. To enable one-chip image system that image sensor is with a full digital interface, can make image capture devices in our daily lives. Adding a color filter to such image sensor in a pattern of mosaics pixel or wide stripes can make image more real and colorful. We can say "color filter makes the life more colorful"! What color filter is? Color filter means can filter image light source except the color with specific wavelength and transmittance that is same as color filter itself. Color filter process is coating and patterning green, red and blue (or cyan, magenta and yellow) mosaic resists onto matched pixel in image sensing array pixels. According to the signal caught from each pixel, we can figure out the environment image picture. Widely use of digital electronic camera and multimedia applications today makes the feature of color filter becoming bright. Although it has challenge but it is very worthy to develop the process of color filter. We provide the best service on shorter cycle time, excellent color quality, high and stable yield. The key issues of advanced color process have to be solved and implemented are planarization and micro-lens technology [2][3][4]. Lots of key points of color filter process technology have to consider will also be described in this paper.
The proceedings contain 85 papers. The topics discussed include: searching and filtering of audio-visual information: technologies, standards, and applications;dequantization bias for JPEG decompression;a probabilisti...
ISBN:
(纸本)0769505406
The proceedings contain 85 papers. The topics discussed include: searching and filtering of audio-visual information: technologies, standards, and applications;dequantization bias for JPEG decompression;a probabilistic image model for smoothing and compression;application specific image compression forvirtual conferencing;media-independent watermarking classification and the need for combining digital video and audio watermarking for media authentication;a spatial digital video watermark that survives MPEG;overview of interference cancellation for CDMA wireless systems;wireless multimedia communication through multichannel multipoint distribution service;advanced signal processing for wireless multimedia communications;simple collosion-secure fingerprinting schemes for images;and digital watermarking using inter-block correlation: extension to JPEG coded domain.
Low bit rate video coding has triggered intensive research in both academia and industry. Again, to address the problem of heterogeneity, and meeting a very small bandwidth of mobile wireless networks, low bit rate sc...
详细信息
Low bit rate video coding has triggered intensive research in both academia and industry. Again, to address the problem of heterogeneity, and meeting a very small bandwidth of mobile wireless networks, low bit rate scalable video compression is becoming more important. This paper proposes to discuss the current approach to wireless visualcommunications for the need of low bit rate scalable video compression, using discrete wavelet transformation technique. The discrete wavelet transform (DWT) has provided efficient framework of multiresolution space-frequency representation with promising applications in imageprocessing. This codec is based on 3D wavelet coding and adaptive quantization of the subbands coefficients. A common problem with some existing quantization methods is that the inherent image structures are severely distorted with course quantization. The Moving Picture Expert Group-4 (MPEG-4), an important component of the ITU standard is referred as the bench-mark of video standard for low bit rate video transmission.
In this paper, we introduce an unequal error protection technique for foveation-based error resilience over highly error-prone mobile networks. For point-to-point visualcommunications, visual quality can be significa...
详细信息
In this paper, we introduce an unequal error protection technique for foveation-based error resilience over highly error-prone mobile networks. For point-to-point visualcommunications, visual quality can be significantly increased by using foveation-based error resilience where each frame is divided into foveated and background layers according to the gaze direction of the human eye, and two bitstreams are generated. In an effort to increase the source throughput of the foveated layer, we employ unequal delay-constrained ARQ and RCPC (rate compatible punctured convolutional) codes in H.223 Annex C. In the simulation, the visual quality is increased in the range of 0.3 dB to 1 dB over channel SNR 5 dB to 15 dB.
Digital watermarking is a very active research area for copyright protection of electronic documents and media. A visual cryptographic approach is used to generate two random shares of a watermark: one is embedded int...
详细信息
Digital watermarking is a very active research area for copyright protection of electronic documents and media. A visual cryptographic approach is used to generate two random shares of a watermark: one is embedded into the cover-image and another is kept as a secret key for the later watermark extraction. The watermark can be extracted by simply superimposing the key share over the stego-image. This asymmetric digital watermark is specially designed and is not easily changed or removed. But, it is very convenient to be extracted. The embedded digital watermark by this approach seems robust after several attacks are performed on the stego-image.
暂无评论