作者:
ALGAZI, VRKELLY, PLESTES, RRGraphics
Image and Vision Engineering Research Laboratory CIPIC Center for Image Processing Integrated Computing University of California Davis CA USA
Algorithms for preprocessing, global modeling and segmentation, local modeling and representation, and binary encoding of facsimile images are proposed, and their performance is examined. Some binary morphological ope...
详细信息
Algorithms for preprocessing, global modeling and segmentation, local modeling and representation, and binary encoding of facsimile images are proposed, and their performance is examined. Some binary morphological operations that can be used to improve the quality and increase the compressibility of binary images and the modeling of binary images are discussed. The techniques are incorporated into a set of comprehensive encoding schemes, the PCSE codes. Their effectiveness in compressing the International Telegraphy and Telephony Consultative Committee (CCITT) binary images is demonstrated. The PCSE code using preprocessing and color shrinking prior to encoding is shown to outperform the READ code by from 7 to 42% for the standard CCITT test images with a very small change in image quality.< >
Some preliminary results are presented on a simple progressive code for scanned 300 dpi documents. The effectiveness of the standard CCITT codes as a function of document resolution is determined. Extensions to the te...
详细信息
Some preliminary results are presented on a simple progressive code for scanned 300 dpi documents. The effectiveness of the standard CCITT codes as a function of document resolution is determined. Extensions to the technique of document resolution are examined. For higher-resolution scanning, e.g. 300 dpi, a pyramid of the information source which allows for a tradeoff between the compression achieved and the quality of the representation is proposed. A progressive code that first generates a color block image that may be useful for identification of the document is proposed. Progression in quality corresponds to 100 dpi binary images, 100 dpi gray scale images, 300 dpi binary images, and finally 300 dpi gray-scale images. Each stage in the progressive code is based on the data available at the previous stage. The performance of the progressive code is examined.< >
The use of the wideband maximum likelihood estimator (MLE) is shown to improve the color flow mapping of the blood velocity profile. Using a transmitted signal with a significant fractional bandwidth, it is shown that...
详细信息
The use of the wideband maximum likelihood estimator (MLE) is shown to improve the color flow mapping of the blood velocity profile. Using a transmitted signal with a significant fractional bandwidth, it is shown that the wideband MLE improves the quality of the color flow estimate and simultaneously allows an increase in the frame rate. It is the improved global accuracy of the estimate, with the use of a new signaling scheme, that allows an increase in the frame rate of the display. The signaling scheme interleaves transmission of pulses in two or three directories. This interleaving, with nonperiodic transmission, reduces the total number of pulses required to achieve a fixed level of local and global accuracy. To study the overall estimator performance, the authors evaluate the expected estimator output and demonstrate that the width of the mainlobe and height of subsidiary peaks are reduced through the use of this estimator and signaling scheme.< >
We present a framework and a set of techniques for the analysis and display of three-dimensional experimental data or images. We assume that the data are available in the form of two-dimensional cross sections of the ...
详细信息
The application of novel anisotropic filter design techniques based on properties of human vision to the processing of luminance and chrominance components of color images is considered. Applied independently, these a...
详细信息
The application of novel anisotropic filter design techniques based on properties of human vision to the processing of luminance and chrominance components of color images is considered. Applied independently, these anisotropic filters can be used for the sequential digital representation of images by subsampling. By using them with two-dimensional quadrature modulation of chrominance signals, they led to a novel scheme for color composite images in which the skewing of energy due to the anisotropy of the filters improves the juxtaposition of luminance and chrominance in the two-dimensional frequency domain. It is found that the image quality is substantially better than that of NTSC images for either sequential or composite techniques.< >
A mathematical model has been developed for tracking spectral transitions within the spectral envelope of a speech signal. This technique incorporates linguistic knowledge into a mathematical framework to determine ti...
详细信息
A mathematical model has been developed for tracking spectral transitions within the spectral envelope of a speech signal. This technique incorporates linguistic knowledge into a mathematical framework to determine time-varying acoustic-phonetic features and describe formant transitions. The proposed model is quite robust and is capable of extracting not only rapid spectral movement, but also smoother spectral transitions that occur in vowel and sonorant sequences. This basic approach has been previously used to extract steady-state acoustic-phonetic features across spectrally homogeneous regions and to perform speaker dependent recognition in which quite successful results were attained in clean as well as noisy speech. It has now been augmented to capture the dynamics of spectral acoustic-phonetic features.< >
The authors present a computationally efficient encoding scheme for vector quantization. Efficiency is achieved by combining techniques: homes are bucketed into the subset of codewords in the same region as the input ...
详细信息
The authors present a computationally efficient encoding scheme for vector quantization. Efficiency is achieved by combining techniques: homes are bucketed into the subset of codewords in the same region as the input point; the energy of the input point eliminates codewords not in the same energy range; the smallest hyperrectangle parallel to the coordinate axes that bounds the Voronoi region associated with the codeword acts as a discriminant; and approximations to the actual distortion are used to avoid multiplications. Simulations on Gaussian sources with ranges of codebook sizes and block sizes indicate that the encoding time, measured in multiplications, actually falls with increasing codebook size. It is shown that with no increase in signal/noise ratio the algorithm substantially outperforms tree search and binary hyperplane testing search.< >
The authors propose a novel approach to the modeling and estimation of the speech spectral envelope over acoustic subwords that exhibits robust performance in noise. The technique exploits the underlying signal struct...
详细信息
The authors propose a novel approach to the modeling and estimation of the speech spectral envelope over acoustic subwords that exhibits robust performance in noise. The technique exploits the underlying signal structure of speech to improve parameter estimates, and it uses the perceptual properties of hearing to decrease the computational requirements in a perceptually meaningful way. The approach provides a considerable speech quality improvement over other methods.< >
Advances in computer networking have opened opportunities for remote imageprocessing, allowing processing tasks to be distributed among specialized nodes. The integrated programming environment, Davis interactive sys...
详细信息
Advances in computer networking have opened opportunities for remote imageprocessing, allowing processing tasks to be distributed among specialized nodes. The integrated programming environment, Davis interactive system (DAISY) provides a control panel interface on a workstation to a library of imageprocessing applications that run on specialized remote hosts. The workstation provides an interface to a remote image display, image previews, and an image display.< >
暂无评论