ISBN: 0819424358 (print)
Automatic generation of textured object models from a sequence of range and color images requires two major tasks: measurement registration and measurement integration. Measurement registration is the estimation of the current position and orientation of the object in 3D space with respect to an arbitrary fixed reference, given the current measurement and the 3D object model under construction. Measurement integration is the updating of the 3D object model using the current registered measurement. In this paper we present an iterative 3D-3D registration technique that uses both the texture and the shape information available in the 3D object models and the 3D measurements. The proposed technique handles probabilistic models that are potentially incomplete before the measurement integration step. Measurements are acquired via a sensor characterized by a probabilistic sensor model. The object models are constructed automatically without user interaction. Each model is a compact uniform tessellation of 3D space, where each cell of the tessellation represents shape and texture in a probabilistic fashion. Free-form objects are supported and no prior knowledge about the object shape, texture or pose is assumed. Traditional registration methods consider only shape and geometric information. We consider texture information as additional evidence by defining a generalized intercell distance measure that considers both the relative positioning of cells in space and the texture discrepancy between cells. Experimental results demonstrate the efficiency and robustness of the proposed method. The usefulness of texture in registration is highlighted in a comparison with results obtained considering only geometric information.
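The abstract does not give the form of the generalized intercell distance, so the following is only a minimal sketch of the idea. The weighted sum, the `texture_weight` parameter, and the representation of a cell as a (position, texture) pair are all assumptions made for illustration, and the probabilistic cell model is not reproduced.

```python
import math


def intercell_distance(pos_a, pos_b, tex_a, tex_b, texture_weight=0.5):
    """Combine spatial and texture discrepancy between two cells.

    Assumption: a simple weighted sum of two Euclidean distances.
    texture_weight=0 reduces to a purely geometric distance.
    """
    spatial = math.dist(pos_a, pos_b)
    texture = math.dist(tex_a, tex_b)
    return (1.0 - texture_weight) * spatial + texture_weight * texture


def closest_cell(query_pos, query_tex, model_cells, texture_weight=0.5):
    """Return the index of the model cell nearest to the query cell.

    model_cells is a list of (position, texture) tuples (hypothetical layout).
    """
    return min(
        range(len(model_cells)),
        key=lambda i: intercell_distance(
            query_pos, model_cells[i][0], query_tex, model_cells[i][1], texture_weight
        ),
    )


# Two model cells at the same geometric distance but with different texture.
cells = [((1.0, 0.0, 0.0), (0.0,)), ((-1.0, 0.0, 0.0), (1.0,))]
geometry_only = closest_cell((0.0, 0.0, 0.0), (1.0,), cells, texture_weight=0.0)
with_texture = closest_cell((0.0, 0.0, 0.0), (1.0,), cells, texture_weight=0.5)
```

In this toy case geometry alone cannot separate the two candidates, while the texture term picks the cell whose texture matches the measurement, which is the kind of ambiguity the abstract says texture helps resolve.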
The organization of video databases according to the semantic content of the data is a key point in multimedia technologies. In fact, this would allow algorithms such as indexing and retrieval to work more efficiently. As a...
This paper presents a wavelet-based image compression algorithm using subjective thresholding and quantization of the wavelet coefficients. Subjective compression techniques use the properties of the human visual system (HVS) to eliminate redundant information in an image. By combining the wavelet transform coefficients with psychovisual thresholding and quantization schemes, a coding result of 0.31 bits per pixel has been obtained with a weighted PSNR of 39.07 dB.
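To make the thresholding step concrete, here is a minimal one-dimensional sketch. The one-level Haar transform and the fixed `threshold` value are assumptions chosen for brevity; the paper's wavelet filters and its HVS-derived thresholds are not specified in the abstract.

```python
def haar_forward(signal):
    """One-level Haar transform: returns (approximation, detail) lists."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    approx = [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    return approx, detail


def haar_inverse(approx, detail):
    """Invert haar_forward exactly."""
    out = []
    for a, d in zip(approx, detail):
        out.extend([a + d, a - d])
    return out


def hard_threshold(coeffs, threshold):
    """Zero every coefficient whose magnitude is below the threshold."""
    return [c if abs(c) >= threshold else 0.0 for c in coeffs]


row = [10, 12, 50, 50]
approx, detail = haar_forward(row)
# Hypothetical threshold standing in for a perceptually derived one.
compressed_detail = hard_threshold(detail, threshold=2)
lossy_row = haar_inverse(approx, compressed_detail)
exact_row = haar_inverse(approx, detail)
```

Small detail coefficients are discarded, so the reconstruction loses the low-amplitude variation between the first two samples while the large step is preserved.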
Some of the most significant image quality indexes are reviewed and compared with a new method for blockiness distortion evaluation. The paper begins with a brief survey of classical measures based on the numerical difference between the original and reconstructed image data (e.g., MSE, SNR and PSNR) and of advanced methods that aim to consider the perceptual aspects of image degradation (e.g., Hosaka (1986) plots and other methods based on human visual system properties, such as information content or perceptual image distortion). Four innovative methods for blockiness distortion measurement are then proposed: two based on DCT analysis and two on the differential Sobel operator. Results on standard pictures confirm the efficiency of the proposed measures.
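The classical numerical-difference measures named above have standard definitions, sketched below for grayscale images stored as lists of rows. The 8-bit peak value of 255 is an assumption, and the proposed blockiness measures are not reproduced because the abstract does not define them.

```python
import math


def mse(original, reconstructed):
    """Mean squared error between two equally sized grayscale images."""
    flat_o = [p for row in original for p in row]
    flat_r = [p for row in reconstructed for p in row]
    if len(flat_o) != len(flat_r):
        raise ValueError("images must have the same number of pixels")
    return sum((a - b) ** 2 for a, b in zip(flat_o, flat_r)) / len(flat_o)


def psnr(original, reconstructed, peak=255):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    err = mse(original, reconstructed)
    if err == 0:
        return math.inf
    return 10 * math.log10(peak ** 2 / err)


reference = [[0, 0], [0, 0]]
degraded = [[0, 0], [0, 2]]
error = mse(reference, degraded)
quality_db = psnr(reference, degraded)
```

Both measures treat every pixel error alike, which is why they can miss structured artifacts such as block edges.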
ISBN: 0780336763 (print)
In this paper, we incorporate Weber's law into the framework of subband absolute moment block truncation coding (SAMBTC), together with an effective bit-rate control scheme, to greatly improve image quality at or around the same bit rates. Alternatively, the scheme can reduce the bit rate further without incurring noticeable degradation beyond what SAMBTC yields. Experimental results show that the proposed perceptually based SAMBTC distinctly outperforms SAMBTC at very low bit rates.
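For readers unfamiliar with the underlying coder, the sketch below shows plain absolute moment block truncation coding on a single block. It is a baseline illustration only: the subband decomposition, the Weber's law modification, and the bit-rate control described in the abstract are not included.

```python
def ambtc_encode(block):
    """Encode one block as (low_mean, high_mean, bitmap).

    Pixels at or above the block mean are marked 1 and represented by the
    mean of that group; the remaining pixels are represented by their mean.
    """
    pixels = [p for row in block for p in row]
    mean = sum(pixels) / len(pixels)
    high = [p for p in pixels if p >= mean]
    low = [p for p in pixels if p < mean]
    high_mean = sum(high) / len(high)
    # A flat block has no pixel below the mean; reuse the high mean.
    low_mean = sum(low) / len(low) if low else high_mean
    bitmap = [[1 if p >= mean else 0 for p in row] for row in block]
    return low_mean, high_mean, bitmap


def ambtc_decode(low_mean, high_mean, bitmap):
    """Rebuild the block from the two levels and the bitmap."""
    return [[high_mean if bit else low_mean for bit in row] for row in bitmap]


block = [[10, 20], [30, 40]]
low_level, high_level, bits = ambtc_encode(block)
decoded = ambtc_decode(low_level, high_level, bits)
```

Each block is reduced to two gray levels plus one bit per pixel, which is where a perceptual rule can decide how coarsely those two levels may be quantized.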
It has been shown that about 80% of the DC coefficients of a transformed image can be discarded at the encoder and then restored at the decoder using a DC coefficient restoration (DCCR) algorithm. This approach can improve the compression rate of JPEG by over 10% at low bit rates. We extend the DCCR algorithm not only to restore the DC coefficients but also to predict some of the (0, 1)th and (1, 0)th AC coefficients. The prediction allows these AC coefficients to be represented using fewer bits. Experimental results show that, in almost all cases, images coded using the proposed algorithm have higher PSNR and better visual quality than those coded using only the DCCR.
This paper presents a wavelet-based image compression algorithm specifically tailored for facial photographs. The proposed method dramatically reduces memory requirements for facial image databases. The algorithm makes it possible to reach compression rates of 0.25 to 0.1 bpp without compromising the visual quality of the facial features used for identification purposes. This is made possible by a novel strategy for quantizing the wavelet coefficients, in which the spatial content and the frequency distribution of each input image are combined to produce a quantization scheme that is spatially and frequency dependent and different for each image being processed. The main steps of the algorithm are as follows: detection of more and less important spatial areas in the photograph; discrete wavelet transform; space- and frequency-dependent quantization customized for the specific frequency distribution of each image; and entropy encoding.
Summary form only given. In order to maintain the safety of steel tubes that have been used as structural members in large-scale structures for a long period, the inner walls of the tubes must be inspected periodically. There are several inspection methods, including visual inspection with an industrial endoscope, which is practical to use. However, these methods often involve reviewing many video tapes recorded in the field. The aim of the present study is to automate this visual inspection. We have paid attention to the color of the rust that appears on the corroded wall, and have tried to detect corroded regions by using color image processing. Moreover, we have developed a method that can inspect the recorded video tape in real time by using a commercial high-speed image processing board.
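A color-based detector of this kind can be sketched as a per-pixel test in HSV space. The hue, saturation, and value limits below are hypothetical values chosen only to make the example run; the summary does not state the color model or thresholds actually used.

```python
import colorsys


def is_rust(rgb, hue_max_deg=40.0, min_saturation=0.4, min_value=0.2):
    """Classify an 8-bit RGB pixel as rust-colored.

    Assumption: rust is a saturated reddish-brown, i.e. hue between 0 and
    hue_max_deg degrees. Reds that wrap around near 360 degrees are ignored
    in this simplified test.
    """
    r, g, b = (channel / 255.0 for channel in rgb)
    hue, saturation, value = colorsys.rgb_to_hsv(r, g, b)
    return (
        hue * 360.0 <= hue_max_deg
        and saturation >= min_saturation
        and value >= min_value
    )


def rust_fraction(frame):
    """Fraction of pixels in a frame (list of rows of RGB tuples) flagged as rust."""
    pixels = [p for row in frame for p in row]
    return sum(is_rust(p) for p in pixels) / len(pixels)


frame = [[(150, 60, 20), (128, 128, 128)]]
corroded_share = rust_fraction(frame)
```

Because each pixel is tested independently, this style of check maps naturally onto dedicated image processing hardware for real-time use.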
ISBN: 9781614829287 (print)
In late 1996, the Federal Communications Commission ruled that the United States would migrate to a digital television broadcasting system. The Commission chose the Grand Alliance transmission scheme and left the picture scanning format to the marketplace. High definition video at 1920 by 1080 pixels, with interlaced scanning at a nominal 60 Hz field rate, is one format in the infamous Table 3 of the ATSC document and FCC ruling. This picture format is also well on the way to becoming a universal format. A new series of digital systems conforming to the interlaced scanning format of SMPTE 274M-1995 was recently introduced by Sony. This paper deals with the architecture, enabling technologies, and capabilities of a live and post production vision mixer from this family of equipment. A modern, digital, high definition vision mixer, and its associated digital effects and editing equipment, must provide producers, directors, and the artistic and technical staff employing them with capabilities similar to those of existing standard definition equipment: the visual effects expected by a sophisticated audience. These requirements are detailed in this paper, and the vision mixer described is compared to similar standard definition devices. The capabilities of the switcher resulted in an architecture for crosspoints, mix effects systems, key and video processing, control and external interfaces. Operational and maintenance aspects are also key factors in an architecture and include considerations for size, mass, power consumption and heat loads. Picture size and shape may change, but control room and remote broadcast vehicle designs dictate limitations on these specifications. Trade-offs in capability and architecture are enumerated. The data rates employed in HDTV (more than 5 times the rate of SMPTE 125M and SMPTE 259M) required additional new technologies, including the development of a series of Application Specific Integrated Circuits which perform the same functions employed
In this paper a scheme for bank cheque image compression based on layout knowledge is proposed. The layout structure of the cheques is analyzed and the nonessential parts are located. These parts, viz., the background and the printed information, are eliminated from the original image. The resulting image contains some noise, which is eliminated by a filtering operation. The image is then cropped to eliminate some uninformative parts. The final image contains only the filled information. The digitized image can be easily reconstructed by restoring the filled information and summing it with the background and printed information. The proposed compression scheme is tested on Brazilian bank cheques. Comparisons with other compression schemes show that the proposed scheme performs significantly better in terms of compression efficiency while maintaining visual quality.
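The core idea of keeping only the filled information can be sketched as a difference against a blank template. The template image, the `tolerance` parameter, and the use of an overlay in place of the summation mentioned in the abstract are assumptions for illustration; the layout analysis, noise filtering, and cropping steps are omitted.

```python
def extract_filled(cheque, template, tolerance=0):
    """Keep only pixels that differ from the blank template.

    Unchanged pixels are stored as None, so only the filled-in ink remains.
    Both images are lists of rows of gray levels of identical size.
    """
    return [
        [c if abs(c - t) > tolerance else None for c, t in zip(cheque_row, template_row)]
        for cheque_row, template_row in zip(cheque, template)
    ]


def reconstruct(filled, template):
    """Overlay the filled information on the known background and print."""
    return [
        [t if f is None else f for f, t in zip(filled_row, template_row)]
        for filled_row, template_row in zip(filled, template)
    ]


template = [[200, 200], [50, 200]]   # background plus printed text
cheque = [[200, 0], [50, 200]]       # same cheque with one handwritten pixel
filled = extract_filled(cheque, template)
restored = reconstruct(filled, template)
```

Only the sparse `filled` layer needs to be stored per cheque, since the template is shared by every cheque with the same layout.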