The aim of image restoration is to find an estimate of the ideal image using a priori information about blur and/or noise and/or the ideal image. Classical criteria are least squares, minimum mean square error, or maximum a posteriori probability. The choice of the criterion used to measure the estimation quality is crucial for the design of the restoration algorithm. The authors propose a new criterion based on a visual model, specifically on perceptual masking. They then propose a new restoration algorithm dealing only with additive noise. The perceptual components of the image to be restored that are corrupted by additive noise above a visibility threshold are simply set to zero. Some results obtained for the post-processing of JPEG images are presented.
Proposes a pel-adaptive image restoration algorithm for coded video signals, in which ringing effects near sharp edges are smoothed out and blocking effects are removed at the decoder without blurring edges. The basic idea of the algorithm is that noise in block-coded images is estimated from local image statistics and quantization information, and that each pixel is estimated adaptively using the linear least mean square error method. Since the local mean and local variance for each pixel are the main calculations in the proposed algorithm, it is readily adaptable to real-time image processing. Simulation examples of processed MPEG-coded images are presented to demonstrate that a significant improvement in subjective quality has been achieved.
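The per-pixel estimate described in this abstract can be sketched as a local-statistics linear least mean square error filter. This is a minimal illustration, not the authors' implementation: the function names, the square window, and the `noise_var` parameter are assumptions. In particular, the abstract estimates the noise from local statistics and quantization information, whereas this sketch simply takes the noise variance as an input.

```python
import numpy as np

def local_mean_var(img, radius=1):
    """Local mean and variance over a (2*radius+1)^2 window, with edge padding."""
    img = np.asarray(img, dtype=np.float64)
    k = 2 * radius + 1
    padded = np.pad(img, radius, mode="edge")
    # windows has shape (H, W, k, k): one k-by-k neighbourhood per pixel
    windows = np.lib.stride_tricks.sliding_window_view(padded, (k, k))
    return windows.mean(axis=(-2, -1)), windows.var(axis=(-2, -1))

def llmse_filter(img, noise_var, radius=1):
    """Shrink each pixel toward its local mean by a gain set by local variance.

    noise_var is an assumed input (scalar or per-pixel array); it stands in for
    the noise estimate that the abstract derives from quantization information.
    """
    img = np.asarray(img, dtype=np.float64)
    mean, var = local_mean_var(img, radius)
    signal_var = np.maximum(var - noise_var, 0.0)   # estimated noise-free variance
    gain = signal_var / np.maximum(var, 1e-12)      # close to 0 in flat areas, 1 at edges
    return mean + gain * (img - mean)
```

In flat regions the local variance is close to the noise variance, so the gain approaches zero and the pixel is replaced by the local mean. Near strong edges the local variance dominates, the gain approaches one, and the pixel is left almost unchanged, which is how smoothing can avoid blurring edges.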
We utilize speech information to improve the quality of audio/visual communications, such as videotelephony, videoconferencing, and multimedia. In particular, the marriage of speech processing and image processing can solve problems related to lip synchronization. The two main techniques proposed in this paper are speech-assisted interpolation and speech-assisted coding of talking-head video. Audio/video sequences are presented to demonstrate our techniques.
We present a technique for improving the applicability of complete, nonorthogonal, multiresolution transforms to image coding. As is well known, the L² norm of the quantization error is not preserved by nonorthogonal transforms, so the L² reconstruction error may be unacceptably large. However, given the quantizers and synthesis filters, we show that this artifact can be eliminated by formulating the coding problem as that of minimizing the L² reconstruction error over the set of possible encoded images. This high-dimensional, discrete optimization problem is solved using a multiscale relaxation algorithm. Bounds on the coding gain over the standard coding technique are derived. Experiments using biorthogonal spline filters demonstrate appreciable SNR gains over the standard coding technique, and comparable visual improvements.
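The norm-preservation issue mentioned in this abstract can be illustrated numerically. The sketch below is only a toy demonstration of the problem, not the multiscale relaxation algorithm of the paper: random matrices stand in for the synthesis filters, and the helper name `reconstruction_error` and the quantizer step are assumptions made for illustration.

```python
import numpy as np

def reconstruction_error(synthesis, coeffs, step):
    """Quantize coeffs uniformly and return (coefficient error, signal error) L2 norms."""
    quantized = step * np.round(coeffs / step)
    coeff_err = np.linalg.norm(coeffs - quantized)
    signal_err = np.linalg.norm(synthesis @ coeffs - synthesis @ quantized)
    return coeff_err, signal_err

rng = np.random.default_rng(0)
coeffs = rng.normal(size=8)

# Orthogonal synthesis matrix (Q factor of a random matrix): the error norm is preserved.
orthogonal, _ = np.linalg.qr(rng.normal(size=(8, 8)))
# Generic nonorthogonal synthesis matrix: the error norm is generally changed.
nonorthogonal = rng.normal(size=(8, 8))

print("orthogonal:   ", reconstruction_error(orthogonal, coeffs, 0.5))
print("nonorthogonal:", reconstruction_error(nonorthogonal, coeffs, 0.5))
```

For the nonorthogonal case the signal-domain error is only bounded between the smallest and largest singular values of the synthesis matrix times the coefficient-domain error, so rounding each coefficient independently does not necessarily give the smallest reconstruction error.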
In second-generation image compression techniques, the image to be compressed is first segmented. The pixels are divided into mutually exclusive spatial regions based on some criteria. After segmentation, the image consists of regions separated by contours. The information describing the shapes and interiors of the regions is then coded. The interiors of the regions are usually encoded using polynomials. The objective of this paper is to encode the interiors of the regions by stochastic vector quantization techniques. If the segmentation process is well defined and the resulting regions are homogeneous, it is possible to design a specific codebook suited to the statistics of each region. The approach is to design the codebook according to a previously defined model for the regions of the image found in the segmentation process. If the approach is combined with efficient contour coding techniques, good visual results are obtained at high compression rates.
ISBN: 0780317750 (print)
This paper proposes a 3-D filtering scheme that reduces temporal aliasing effectively and requires much less memory than conventional schemes in hardware implementation. Our 3-D filter is implemented as a space-variant 2-D finite-duration impulse response (FIR) filter in the spatial domain, driven by motion information. The filter requires only one frame and its motion vectors. The simulation result for an image sequence with constant motion shows clearly that our filter has a temporal de-aliasing property and provides 3-D band limitation.
Understanding the problems in robot-vision architecture requires a better comparison of major features. But a zoology of vision machines is called into question when, with the aim of a well-informed choice of architectural features, a rapid presentation of technological trends in the field is proposed. An approach closer to physics then leads to a classification from a control point of view: it reveals some duality between operations and communications. A few visual operations are distinguished, provided technology is not trailing behind. But emphasis should be put on communication networks, although they do not currently show any specificity to image processing, and the conclusion addresses some limits in that respect.
We propose codec designs for image browsing. Image/video browsing is an important requirement in many multimedia applications. In image browsing, a lower-resolution version of the image is viewed first. Upon request, the resolution is further enhanced until an acceptable fidelity is obtained. Recently, many new codecs have been designed for image/video compression. However, these codecs are not directly applicable to image browsing, as they cannot interact with the user. In this paper, we propose a codec design based on Gabor decomposition, which is well adapted to image browsing applications. We note, however, that the generic design can be implemented with any compression algorithm. Gabor decomposition is a promising approach for image browsing. The Gabor basis functions match the human visual profiles and hence result in better subjective quality compared to JPEG. For example, Gabor decomposition does not suffer from the blocking artifacts prevalent in JPEG-DCT. The proposed codec can achieve real-time performance and is simple and modular, and hence can be implemented in VLSI.
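The browsing interaction described in this abstract (view a low-resolution version first, then refine on request) can be sketched as follows. The abstract notes that the generic design can be implemented with any compression algorithm, so this sketch swaps in a plain 2x2 averaging pyramid with residual layers; it does not use Gabor decomposition, and all function names are hypothetical.

```python
import numpy as np

def downsample(img):
    """Halve resolution by averaging 2x2 blocks (height and width must be even)."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(img):
    """Double resolution by pixel replication."""
    return np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)

def encode(img, levels):
    """Return the coarsest image plus residual layers ordered coarse to fine."""
    current = np.asarray(img, dtype=np.float64)
    residuals = []
    for _ in range(levels):
        coarse = downsample(current)
        residuals.append(current - upsample(coarse))  # detail lost at this level
        current = coarse
    return current, residuals[::-1]

def browse(base, residuals):
    """Yield successively finer views, one per user request for more detail."""
    view = base
    yield view
    for residual in residuals:
        view = upsample(view) + residual
        yield view
```

A viewer would call `next()` on the generator returned by `browse` each time the user asks for more detail and stop as soon as the fidelity is acceptable, so the finer residual layers are only needed when requested.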
The quality of visual communications has greatly improved with HDTV. Given the advantages of digital communications and the trend towards them, it is desirable to develop a cost-effective digital codec for HDTV signals. One of the criteria in the transmission of digital signals via satellite is the transmission bandwidth, which dictates the transmission costs. Apart from the development of good compression schemes, due consideration should also be given to ease of hardware implementation, which is crucial in determining factors such as processing delay and the level of encoder/decoder complexity. With these factors in mind, a cost-effective HDTV codec is proposed. Few US organizations have addressed the problem of HDTV distribution via communication satellites. The majority of efforts in the development of an HDTV codec have been aimed at the 100 to 140 Mbits/sec data rate. To facilitate cost-effective satellite transmission, the DS3 rate of 45 Mbits/sec is desirable. The purpose of the present research is to compress the HDTV signals to around 20 Mbits/sec so that 2 HDTV channels can be transmitted through the DS3 rate channel. To achieve this degree of compression, a new scheme is proposed. First, subband coding is used to decompose the image into different frequency bands. Next, motion compensation is applied to the low band, and the resulting interframe difference is discrete cosine transform (DCT) coded. The high bands are also DCT coded. The DCT coefficients of the difference signal of the low band and the DCT coefficients of the high bands are then vector quantized.
We present a computational model of contour processing that was suggested by neurophysiological recordings from the monkey visual cortex. The model employs convolutions and nonlinear operations, but no feedback loops. Contours are defined by the local maxima of the responses of a contour operator that sums a representation of contrast borders and a "grouping signal". The grouping consists of convolving a representation of "key points", such as T-junctions, corners, and line ends, with a set of orientation-selective kernels, followed by a nonlinear pairing operation. The grouping scheme is selective according to whether the configuration of key points is consistent with the interpretation of occlusion. The resulting contour representation includes an indicator of figure-ground direction. We show that the model reproduces illusory contours in accurate agreement with perception, and that it generates representations of occluding contours on images of natural scenes that are more complete and less cluttered by spurious connections of foreground and background than those obtained by conventional edge detection operators.