Content-based retrieval and recognition of objects represented in images is a challenging problem, which makes it an active research topic. Shape analysis is one of the main approaches to the problem. In this paper we propose the use of a reduced set of features to describe 2D shapes in images. The proposed technique is designed to yield a shape description that is short and simple to extract. We conducted several experiments for both retrieval and recognition tasks, and the results obtained demonstrate usefulness and competitiveness against existing descriptors. For the retrieval experiment, the achieved bull's eye performance is about 60%. Recognition was tested with three different classifiers: decision trees (DT), k-nearest neighbor (kNN) and support vector machines (SVM). Estimated mean accuracies range from 69% to 86% (using 10-fold cross validation). The SVM classifier presents the best performance, followed by the simple kNN classifier.
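The bull's eye measure quoted above is commonly computed as follows: for each query shape, retrieve the top 2n most similar shapes (n being the size of the query's class) and count how many belong to the query's class, then divide the total count by the maximum possible. The sketch below illustrates that common definition only; the function name `bulls_eye_score` and the distance-matrix input are assumptions made for illustration, not the authors' evaluation code.

```python
def bulls_eye_score(distances, labels):
    """Bull's eye retrieval score.

    distances: square list of lists, distances[i][j] is the dissimilarity
               between shape i and shape j (hypothetical input format).
    labels:    class label of each shape.
    The query itself is counted among the retrieved shapes.
    """
    n = len(labels)
    class_size = {}
    for lab in labels:
        class_size[lab] = class_size.get(lab, 0) + 1

    hits = 0
    possible = 0
    for q in range(n):
        size = class_size[labels[q]]
        # Rank all shapes by increasing distance to the query.
        ranked = sorted(range(n), key=lambda j: distances[q][j])
        # Keep the top 2 * (class size) retrieved shapes.
        top = ranked[: 2 * size]
        hits += sum(1 for j in top if labels[j] == labels[q])
        possible += size
    return hits / possible
```

With a standard benchmark layout of equally sized classes, a score of 1.0 means that every query finds all members of its class within the top 2n results.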
General-purpose no-reference image quality assessment approaches still lag behind the advances in full-reference methods. Most no-reference methods are either distortion specific (i.e. they quantify one or more distortions such as blur, blockiness, or ringing), or they train a learning machine on a large number of features. In this work, we propose a support vector machine (SVM) approach based on discrete cosine transform (DCT) statistics that uses only 3 features in the DCT domain. The approach extracts a very small number of features and operates entirely in the DCT domain, making it computationally convenient. The results are shown to correlate highly with human visual perception of quality.
Digital CE development has been fueled by furiously rapid advances in semiconductor technology. In video systems centered around television (TV), digital image processing technology underpins all of content creation, storage, transmission, and display. Today, these systems achieve high performance economically because of leading-edge semiconductor technology. Digital video processing technology and system LSI technology have been the driving forces behind the creation of 3D imaging systems, which until recently were difficult to realize and are now generating new enthusiasm among people throughout the world. 2010 has seen an overwhelming presence of 3D in TV, with Blu-ray Disc™ players and recorders now available to the consumer market, making this decidedly the inaugural year for 3D. Technological innovations in the field of 3D imaging will soon bring about new applications that are hard to imagine today.
This paper describes a method for providing useful intelligence over a low-bandwidth output stream derived from a high-bandwidth input stream. Many intelligence systems take advantage of streaming video and use high-bandwidth data links to supply information. The issue is that high-bandwidth systems typically require larger equipment, better line-of-sight, and more complex gear to keep a reliable link. The intended platform for the system described in this paper is an unmanned aerial vehicle, so the limitations of high-bandwidth equipment become more significant in a tactical scenario. By providing snapshots from the high-definition (HD) video stream, the system is able to send data to users who may not have the capability, for any of a number of reasons, to acquire a full HD video stream. The system was developed using a Gumstix Verdex 600 MHz computer, and the frame acquisition algorithm is written in C. Testing was done using HD video files streamed by VideoLAN Client (VLC). The system is capable of outputting JPEG images at a maximum interval of 30 seconds. The capture system provides a method of reaching more users and providing functional data.
JPEG XR is the newest image coding standard from the JPEG committee. It primarily targets the representation of continuous-tone still images such as photographic images and achieves high image quality, on par with JPEG 2000, while requiring low computational resources and storage capacity. Moreover, it effectively addresses the needs of emerging high dynamic range imagery applications by including support for a wide range of image representation formats.
This paper describes a design method for cosine-modulated filter banks (CMFBs) for efficient coding of images. Whereas the CMFB has the advantages of low design and implementation cost, the subband filters of the CMFB do not have the linear phase property. This prevents the use of a symmetric extension in the transformation process and leads to a degradation of the image compression performance. However, a recently proposed smooth extension alleviates this problem with CMFBs. As a result, well-designed CMFBs can be expected to be good candidates for the transform block in image compression applications. In this paper, we present a novel design approach for regular CMFBs. After introducing a regularity constraint on the lattice parameters of a prototype filter in paraunitary (PU) CMFBs, we also derive a regularity condition for perfect reconstruction (PR) CMFBs. Finally, we design regular 8-channel PUCMFBs and PRCMFBs by an unconstrained optimization of the residual lattice parameters, and simulation results for test images are compared with various transforms to evaluate the proposed image coder based on CMFBs with one degree of regularity. In addition, we report the computational complexity of the designed CMFBs.
Let $X$, $Y$, $Z$ be zero-mean, jointly Gaussian random vectors of dimensions $n_x$, $n_y$, and $n_z$, respectively. Let $\mathcal{P}$ be the set of random variables $W$ such that $W \leftrightarrow Y \leftrightarrow (X,Z)$ is a Markov string. We consider the following optimization problem: $\min_{W \in \mathcal{P}} I(Y;W \mid Z)$ subject to one of the following two possible constraints: 1) $I(X;W \mid Z) \geq R_1$, and 2) the mean squared error between $X$ and $\hat{X} = E(X \mid W, Z)$ is less than $d$. The problem under the first kind of constraint is motivated by multiple-input multiple-output (MIMO) relay channels with an oblivious transmitter and a relay connected to the receiver through a dedicated link, while for the second case, it is motivated by source coding with decoder side information where the sensor observation is noisy. In both cases, we show that jointly Gaussian solutions are optimal. Moreover, explicit water filling interpretations are given for both cases, which suggest transform coding approaches performed in different transform domains, and that the optimal solution for one problem is, in general, suboptimal for the other.
In this paper, a new image coding scheme based on orthogonal polynomials is presented. From a set of orthogonal polynomial functions, we first obtain the polynomial operators and polynomial basis operators of different widths, on which the proposed polynomial transform coding is built. After applying the proposed transformation, the transform coefficients are threshold coded using quantization and bit allocation as in the JPEG baseline system. The performance of the proposed transform coding is reported by computing the peak signal-to-noise ratio (PSNR). The proposed coding scheme is also compared with other transform coding schemes such as the discrete cosine transform and discrete wavelet transforms. (C) 2008 Elsevier B.V. All rights reserved.
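For reference, PSNR is conventionally defined as 10·log10(peak² / MSE), where MSE is the mean squared error between the original and the reconstructed image. The sketch below assumes 8-bit grayscale images given as nested lists; the function name `psnr` and the default peak value of 255 are assumptions made for illustration, not details taken from the paper.

```python
import math


def psnr(original, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Images are lists of rows of pixel values (hypothetical input format).
    Returns math.inf when the images are identical.
    """
    flat_o = [p for row in original for p in row]
    flat_r = [p for row in reconstructed for p in row]
    if not flat_o or len(flat_o) != len(flat_r):
        raise ValueError("images must be non-empty and of equal size")

    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(flat_o, flat_r)) / len(flat_o)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)
```

Higher values indicate a reconstruction closer to the original; comparisons between coders are only meaningful at matched bit rates.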
ISBN: (print) 9781617388767
In this paper, we propose an image coding scheme based on a wavelet-like transform derived from an orthogonal polynomial basis. First, 2D non-separable wavelet functions are derived from a set of bivariate orthogonal polynomials. Then, a wavelet-like transform coding scheme using these wavelet functions is proposed. The motivation behind using orthogonal polynomials is that they exhibit some properties related to the human visual system (HVS) [1]. After applying the proposed transform, the obtained coefficients are threshold coded using quantization and bit allocation as in the JPEG baseline system. The performance of the proposed transform coding method is reported. The proposed coding scheme is also compared with other transform coding methods such as JPEG, JPEG 2000 and JPEG-XR/HDPHOTO.
In this paper, a new image coding scheme based on a wavelet-like transform derived from an orthogonal polynomial basis is presented. From a set of bivariate orthogonal polynomial functions, we first obtain the 2D non-separable wavelet functions, on which the proposed wavelet-like transform coding is built. The motivation behind using orthogonal polynomials is that they exhibit some properties related to the human visual system (HVS). After applying the proposed transformation, the transform coefficients are threshold coded using quantization and bit allocation as in the JPEG baseline system. The performance of the proposed transform coding is reported. The proposed coding scheme is also compared with other transform coding methods such as JPEG, JPEG 2000 and JPEG-XR/HDPHOTO.