A Cabin Car Communication System (CCCS) has the goal of improving the communication among passengers inside vehicles [1]. The lack of visual contact between speakers, the high level of noise and many other factors deg...
详细信息
A Cabin Car Communication System (CCCS) has the goal of improving the communication among passengers inside vehicles [1]. The lack of visual contact between speakers, the high level of noise and many other factors degrade the communications inside vehicles. The CCCS makes use of a set of microphones placed on the overhead to pick up the speech of each passenger, those signals are amplified and played back into the cabin through the car audio loudspeaker system. This system has to deal with two main problems, electro-acoustic coupling and noise amplification. To overcome these problems, CCCS makes use of echo cancellation and noise reduction techniques. In this work a discussion about the echo cancellation implementation with simulation results about the performance of the proposed echo canceller and noise reduction stage are shown.
A new image coding method utilizing the wavelet transform, JPEG2000, has been developed. In this report, we consider several types of visual distortion observed in moving HDTV pictures and ultra-high definition pictur...
详细信息
A new image coding method utilizing the wavelet transform, JPEG2000, has been developed. In this report, we consider several types of visual distortion observed in moving HDTV pictures and ultra-high definition pictures compressed by wavelet transform coding, and describe measures to reduce them. So-called "flicker artifacts" are visually decreased by visual weighting and the mechanism is interpreted by Weber's law. The characteristic distortion named "comb-tooth artifacts" caused by the combination of subband decomposition and the interlaced TV signal structure is discussed, and a preprocessing method is proposed to counter it. The relation between the resolution levels of the subband decomposition and the distortion is investigated by coding experiments on ultra-high definition pictures.
With the rapid development of the Internet, more and more attention is focused on IP video streaming. We introduce an RD optimal macro-block mode decision scheme for the new H.26L video stream. Based on the statistica...
详细信息
With the rapid development of the Internet, more and more attention is focused on IP video streaming. We introduce an RD optimal macro-block mode decision scheme for the new H.26L video stream. Based on the statistical error propagation model of the Internet and the unequal NAL packet of H.26L, our new scheme can be more error robust than those currently adopted in the H.26L test mode.
visual servo control is needed for realizing automatic bio-micromanipulation and increasing the accuracy of micromanipulator. A dual-hand micromanipulation system with micro-visual-servo was developed for automating c...
详细信息
visual servo control is needed for realizing automatic bio-micromanipulation and increasing the accuracy of micromanipulator. A dual-hand micromanipulation system with micro-visual-servo was developed for automating cell manipulation in biotechnology. This paper presents a complete solution for micro-visual-servo of the system, involving micro-visual-servo architecture, control law, modeling of image Jacobian and recognition algorithms of micro objects. The experimental results of micro-circle trajectory tracking and automatic transgenic operation verify the effectiveness of the micro-visual-servo solution.
Presents a contribution in the field of computer vision in dermatology for the follow up of skin lesions. Several known skin lesions in the country are identified. The images of these lesions are captured and stored i...
详细信息
Presents a contribution in the field of computer vision in dermatology for the follow up of skin lesions. Several known skin lesions in the country are identified. The images of these lesions are captured and stored in a computer for further imageprocessing. Several filtering and imageprocessing techniques available in the MATLAB tools are applied to these images to produce their histograms and color distribution particularly in the region concerned. These proposed visual records of medical skin imaging can be analyzed further and served as a visual front end for developing a knowledge-based pre-diagnostics system to aid dermatologist in their work.
This paper aims to device an architecture which uses the capability of asynchronous concurrency of the data flow architecture as well as spatial parallelism of SIMD machines for a class of imageprocessing application...
详细信息
This paper aims to device an architecture which uses the capability of asynchronous concurrency of the data flow architecture as well as spatial parallelism of SIMD machines for a class of imageprocessing applications using reconfigurable processing elements (RPE). Overall processing speed is enhanced by: (a) concurrent functioning of the RPE; and (b) replacing software execution of signal processing functions by hardware approach using FPGA as RPE. Thus, a hybrid architecture, which functions as a data flow machine at a functional level and exploits the capability of spatial parallelism by incorporating modified SIMD concepts is presented.
In this paper, a wavelet-based perceptual watermarking considering human visual system (HVS) that is proposed by Barni et al. (see IEEE Trans. imageprocessing., vol.10, no.5, p.783-791, 2001) for monotone images with...
详细信息
In this paper, a wavelet-based perceptual watermarking considering human visual system (HVS) that is proposed by Barni et al. (see IEEE Trans. imageprocessing., vol.10, no.5, p.783-791, 2001) for monotone images without embedding bit data is extended. In our method, an arbitrary length of bit data (binary pattern) can be embedded in brightness component of color images still keeping the perceptual quality of image. Pseudo random number (PN) sequence is introduced as a key to produce a watermarking code. The availability increases compared with the conventional methods. The robustness of our computationally efficient watermark extraction method to the attacks such as JPEG compression, cropping and color tone modification is shown in simulations.
3D object perception is one of the important issues in the study of human visual functions. It is a seemingly effortless process that requires no conscious thought for the human beings, but a difficult computational p...
详细信息
3D object perception is one of the important issues in the study of human visual functions. It is a seemingly effortless process that requires no conscious thought for the human beings, but a difficult computational problem for machines. The object perception with binocular viewing utilizes the disparities between the two eyes to recover the 3D information of an object, and takes advantage of the fact that human beings have two eyes. Recently, a new visual effect named the mime effect was found, in which an illusory 3D volumetric object is perceived due to some stereoscopically displayed inducing objects (Zhang et al., 1998). Here we propose a processing model with both top-down and bottom-up processes, and discuss the involvement of the early and higher-level visual cortical areas to this 3D volumetric object perception. It is hoped to provide new clues to the understanding of the human 3D visual system.
Blind image quality assessment refers to the problem of evaluating the visual quality of an image without any reference. It addresses a fundamental distinction between fidelity and quality, i.e. human vision system us...
详细信息
Blind image quality assessment refers to the problem of evaluating the visual quality of an image without any reference. It addresses a fundamental distinction between fidelity and quality, i.e. human vision system usually does not need any reference to determine the subjective quality of a target image. In this paper, we propose to appraise the image quality by three objective measures: edge sharpness level, random noise level and structural noise level. They jointly provide a heuristic approach of characterizing the most important aspects of visual quality. We investigate various mathematical tools (analytical, statistical and PDE-based) for accurately and robustly estimating those three levels. Extensive experiment results are used to justify the validity of our approach.
In this paper, we present a method of watershed-based region merging using 'conflicting regions' for segmentation of gray level images. It is obvious that both regions and edges in an image give important clue...
详细信息
In this paper, we present a method of watershed-based region merging using 'conflicting regions' for segmentation of gray level images. It is obvious that both regions and edges in an image give important clues to segmentation in our visual system. So our method uses information from both regions and edges properly. We first obtain initial segments by applying watershed transformation to the image gradient magnitude and then find 'conflicting regions' by using the edges. After that, 'conflicting regions' and region homogeneity guide the iterative merging process. 'Conflicting regions' give a good starting point of merging and make it unnecessary to define termination criterion in advance. Since they serve as seeds, processing time is also inexpensive. The experimental results show that segmentation is visually reasonable.
暂无评论