Man-made environments possess many regularities that simplify otherwise difficult pose estimation and visual reconstruction tasks. The constraints arising from parallel and orthogonal lines and planes can be efficiently exploited at various stages of the vision processing pipeline. In this paper we propose an approach for estimating vanishing points by exploiting the constraints of structured man-made environments, where the majority of lines are aligned with the principal orthogonal directions of the world coordinate frame. We combine efficient image processing techniques used in the line detection and initialization stage with simultaneous grouping and estimation of vanishing directions using the expectation maximization (EM) algorithm. Since we assume an uncalibrated camera, the estimated vanishing points can be used towards partial camera calibration and estimation of the relative orientation of the camera with respect to the scene. The presented approach is computationally efficient and has been verified extensively by experiments.
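The geometric fact underlying the abstract above is that the images of parallel 3D lines meet at a common vanishing point. The following is a minimal sketch of that single step only, in homogeneous coordinates; it does not implement the line detection, grouping, or EM estimation described in the abstract, and the function names are hypothetical.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def line_through(p, q):
    """Homogeneous line through two image points (x, y)."""
    return cross((p[0], p[1], 1.0), (q[0], q[1], 1.0))


def intersection(l1, l2, eps=1e-12):
    """Intersection of two homogeneous lines.

    Returns (x, y), or None when the lines are parallel in the image,
    i.e. the vanishing point lies at infinity.
    """
    x, y, w = cross(l1, l2)
    if abs(w) < eps:
        return None
    return (x / w, y / w)
```

With noisy line segments, more than two lines vote for each vanishing point, which is where a grouping and estimation scheme such as the EM algorithm mentioned in the abstract becomes necessary.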
Objective image/video quality measures play important roles in various image/video processing applications, such as compression, communication, printing, analysis, registration, restoration and enhancement. Most quality assessment approaches proposed in the literature are error sensitivity-based methods. We follow a new philosophy in designing image/video quality metrics, which uses structural distortion as an estimate of perceived visual distortion. We develop a new approach for video quality assessment. Experiments on the video quality experts group (VQEG) test data set show that the new quality measure has higher correlation with subjective quality measurement than the proposed methods in VQEG's Phase I tests for full-reference video quality assessment.
This paper addresses issues relating to software-defined receivers, namely radio frequency (RF) to baseband architectures and the signal processing algorithms involved. Several direct conversion receiver architectures are introduced and their performance is analyzed. Issues relating to quadrature imbalances and DC offset impairments are analyzed in detail. To estimate and compensate for these impairments, several DSP algorithms are proposed and simulated for use with these receivers.
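The abstract above does not say which DSP algorithms are used. As an illustration of the kind of compensation involved, here is a minimal sketch of a common textbook approach: removing the DC offset by mean subtraction, then correcting quadrature gain and phase imbalance by Gram-Schmidt orthogonalization of the Q branch against the I branch. It assumes a zero-mean signal whose ideal I and Q components are uncorrelated and of equal power; the function name `compensate_iq` is hypothetical.

```python
import math


def compensate_iq(i_samples, q_samples):
    """Remove DC offset and quadrature imbalance from I/Q sample lists."""
    n = len(i_samples)

    # Step 1: estimate the DC offset of each branch as its sample mean.
    i_dc = sum(i_samples) / n
    q_dc = sum(q_samples) / n
    i0 = [x - i_dc for x in i_samples]
    q0 = [y - q_dc for y in q_samples]

    # Step 2: second-order statistics of the offset-free branches.
    p_ii = sum(x * x for x in i0) / n
    p_qq = sum(y * y for y in q0) / n
    p_iq = sum(x * y for x, y in zip(i0, q0)) / n

    # Step 3: Gram-Schmidt. Subtract the part of Q that leaks from I,
    # then rescale Q so that it has the same power as I.
    leak = p_iq / p_ii
    scale = math.sqrt(p_qq / p_ii - leak * leak)
    q1 = [(y - leak * x) / scale for x, y in zip(i0, q0)]
    return i0, q1
```

For a received tone with I = cos(wt) and Q = g sin(wt + phi), the leak term estimates g sin(phi) and the scale term estimates g cos(phi), so the corrected Q branch reduces to sin(wt).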
Region-of-interest coding and unequal error protection are two important tools in video communication systems to improve the received visual quality. One common property of the two techniques is that unequal coding or transmission is applied to improve the quality of the most important parts of images. The proposed sub-picture coding technique facilitates both region-of-interest coding and unequal error protection by partitioning images into regions of interest and separating the corresponding coded data units from each other. Simulation results show that the overall subjective quality is considerably improved compared to conventional coding schemes.
In this paper, we present an adaptive two-pass median filter to remove impulsive noise. In two-pass median filtering, an image contaminated by impulsive noise is processed by a median filter twice. Median filtering is a non-reversible process, i.e., useful information discarded by the filter cannot be recovered. This behavior becomes more apparent in two-pass median filtering. To correct this problem, between the two filtering passes we introduce an adaptive process that selectively replaces some pixels with their original values based on the spatial distribution of the estimated impulsive noise. Compared with standard median filtering and two-pass median filtering, better results are obtained in terms of visual quality and mean squared error. We use examples to demonstrate the performance of the method.
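The structure described above (filter, selectively restore original pixels, filter again) can be sketched as follows. This is a simplified illustration, not the authors' algorithm: the abstract does not give the restoration rule, so the sketch assumes a pixel is noise when it differs from the first-pass median by more than a fixed threshold. The names `median3x3` and `adaptive_two_pass` and the default threshold are hypothetical.

```python
from statistics import median


def median3x3(img):
    """3x3 median filter with edge replication on a list-of-lists image."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                img[min(max(y + dy, 0), h - 1)][min(max(x + dx, 0), w - 1)]
                for dy in (-1, 0, 1)
                for dx in (-1, 0, 1)
            ]
            out[y][x] = median(window)
    return out


def adaptive_two_pass(img, threshold=40):
    """Two median passes with an intermediate restoration step."""
    h, w = len(img), len(img[0])

    # Pass 1: plain median filter, used to estimate where the impulses are.
    first = median3x3(img)
    noisy = [
        [abs(img[y][x] - first[y][x]) > threshold for x in range(w)]
        for y in range(h)
    ]

    # Adaptive step (assumed rule): pixels not flagged as noise get their
    # original values back, so detail removed by pass 1 is restored.
    restored = [
        [first[y][x] if noisy[y][x] else img[y][x] for x in range(w)]
        for y in range(h)
    ]

    # Pass 2: filter again, but only overwrite the flagged pixels.
    second = median3x3(restored)
    return [
        [second[y][x] if noisy[y][x] else img[y][x] for x in range(w)]
        for y in range(h)
    ]
```

Restricting the second pass to flagged pixels is a design choice of this sketch; it keeps unflagged pixels bit-identical to the input.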
Many events are captured using multiple cameras today. Frames of each video stream have to be synchronized and aligned to a common time axis before they can be processed. Synchronization of the video streams requires a hardware-based solution that is applied during capture. The alignment problem between the frames of multiple videos can be posed as a search using traditional measures of image similarity. Multiview relations and constraints recently developed in computer vision can provide more elegant solutions to this problem. In this paper, we provide two solutions to the video frame alignment problem using two-view and three-view constraints. We present solutions for the case when the videos are taken using affine cameras and for general projective cameras. Excellent experimental results are achieved by our algorithms.
This paper presents a new human skin color model in YCbCr color space and its application to human face detection. Skin colors are modeled by a set of three Gaussian clusters, each of which is characterized by a centroid and a covariance matrix. The centroids and covariance matrices are estimated from a large set of training samples after a k-means clustering process. Pixels in a color input image can be classified as skin or non-skin based on their Mahalanobis distances to the three clusters. Efficient post-processing techniques, namely noise removal, shape criteria, elliptic curve fitting and face/non-face classification, are proposed in order to further refine the skin segmentation results for the purpose of face detection.
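The classification rule described above can be sketched as a minimum Mahalanobis distance test against the three clusters. The sketch below assumes the clusters live in the two-dimensional (Cb, Cr) chrominance plane, which the abstract does not state. The centroids, covariance matrices, and distance threshold are made-up placeholders, not the trained values from the paper.

```python
# Hypothetical cluster parameters: (centroid, 2x2 covariance) in (Cb, Cr).
# Real values would come from k-means clustering of training samples.
CLUSTERS = [
    ((110.0, 150.0), ((60.0, -20.0), (-20.0, 40.0))),
    ((120.0, 140.0), ((50.0, -10.0), (-10.0, 30.0))),
    ((100.0, 160.0), ((70.0, -25.0), (-25.0, 50.0))),
]


def mahalanobis_sq(point, mean, cov):
    """Squared Mahalanobis distance for a 2D point and a 2x2 covariance."""
    dx = point[0] - mean[0]
    dy = point[1] - mean[1]
    (a, b), (c, d) = cov
    det = a * d - b * c
    # The inverse of [[a, b], [c, d]] is [[d, -b], [-c, a]] / det.
    return (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det


def is_skin(cb, cr, max_dist_sq=9.0):
    """True if (cb, cr) lies within the threshold of any skin cluster."""
    return any(
        mahalanobis_sq((cb, cr), mean, cov) <= max_dist_sq
        for mean, cov in CLUSTERS
    )
```

A threshold of 9.0 on the squared distance corresponds to three standard deviations along each principal axis of a cluster; it is an illustrative choice.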
Address-event-representation (AER) is a communications protocol for transferring images between chips, originally developed for bio-inspired image processing systems. Such systems may consist of a complicated hierarchical structure with many chips that transmit images among them in real time, while performing some processing (for example, convolutions). In developing AER-based systems it is very convenient to have a means of generating AER streams from images stored on a computer. In this paper we present a method for generating AER streams in real time from images stored in a computer's memory. The method exploits the concept of linear feedback shift register random number generators. This method has been tested in software and compared to other possible algorithms for generating AER streams. It has been found that the proposed method yields a minimum error with respect to the ideal situation. A hardware platform that exploits this technique is currently under development.
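To illustrate how a linear feedback shift register (LFSR) can drive AER stream generation, here is a minimal sketch. The abstract does not specify how LFSR output is mapped to events, so the scheme below is an assumption: in each time slot the high byte of a 16-bit LFSR state selects a pixel address and the low byte acts as a random threshold, and an event is emitted when the pixel intensity exceeds the threshold. The function names are hypothetical; the tap positions are those of a widely used maximal-length 16-bit LFSR.

```python
def lfsr16_step(state):
    """Advance a 16-bit Fibonacci LFSR (taps 16, 14, 13, 11) by one step."""
    bit = (state ^ (state >> 2) ^ (state >> 3) ^ (state >> 5)) & 1
    return (state >> 1) | (bit << 15)


def aer_events(pixels, n_slots, seed=0xACE1):
    """Generate a list of AER addresses from a flat list of 8-bit pixels.

    Assumed scheme: a brighter pixel passes the random threshold more
    often, so its address appears in the stream at a higher rate.
    """
    state = seed
    events = []
    for _ in range(n_slots):
        state = lfsr16_step(state)
        address = (state >> 8) % len(pixels)  # which pixel to consider
        threshold = state & 0xFF              # random value in 0..255
        if threshold < pixels[address]:
            events.append(address)
    return events
```

Because the LFSR visits every nonzero 16-bit state exactly once per period, the number of events per pixel over a full period is exactly proportional to its intensity when the pixel count divides 256.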
ISBN: (print) 0780374908
Lossless image compression plays an important role in high-quality image transmission and storage. In a real-time multimedia system, both the compression ratio and the processing speed must be considered. A novel lossless compression algorithm is investigated. A low-complexity predictive model is proposed that uses the correlation between pixels and between color components. In addition, a neural network perceptron is used to adaptively correct the predicted values, which makes the prediction residuals smaller and confines them to a small dynamic range. A color space transform is also used, and good decorrelation is obtained in our algorithm. Compared to the new standard JPEG-LS, this predictive model has lower computational complexity. The experimental comparison shows that our algorithm performs noticeably better than the traditional algorithms. Moreover, it is faster than JPEG-LS with a negligible sacrifice in performance.
This paper presents a steganography method based on a lossy wavelet compression scheme and bit-plane complexity segmentation (BPCS) steganography. This method utilizes the embedded zerotree wavelet (EZW) compression scheme, in which the wavelet coefficients of an image are quantized into a bit-plane structure. The proposed steganography enables us to use lossy compressed images as dummy files in bit-plane-based steganographic algorithms. Large embedding rates of around 25% of the compressed image size were achieved with little noticeable degradation in image quality. The proposed method can be applied to other wavelet-based lossy compression schemes such as SPIHT and JPEG2000, because in these compression schemes the wavelet coefficients are also quantized into a bit-plane structure.
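BPCS steganography rests on deciding which bit-plane blocks are noise-like enough to be replaced by secret data. The sketch below shows the border-based complexity measure commonly described for BPCS: the number of 0/1 transitions between adjacent bits, divided by the maximum possible number. It operates on a plain block of non-negative integers; applying it to quantized wavelet coefficients as in the abstract, and any embedding threshold, are not shown. The function names are hypothetical.

```python
def bit_plane(block, k):
    """Extract bit k (0 = least significant) of each value in a 2D block."""
    return [[(v >> k) & 1 for v in row] for row in block]


def complexity(plane):
    """Border complexity of a binary block, in the range 0.0 to 1.0.

    Counts horizontal and vertical neighbour pairs whose bits differ and
    normalizes by the total number of neighbour pairs in the block.
    """
    h, w = len(plane), len(plane[0])
    changes = 0
    for y in range(h):
        for x in range(w):
            if x + 1 < w and plane[y][x] != plane[y][x + 1]:
                changes += 1
            if y + 1 < h and plane[y][x] != plane[y + 1][x]:
                changes += 1
    return changes / (h * (w - 1) + w * (h - 1))
```

A checkerboard block has the maximum complexity of 1.0 and a uniform block has 0.0; blocks whose complexity exceeds a chosen threshold are the candidates for carrying embedded data.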