In this paper, we address the problem of the efficient encoding of object boundaries. This problem is becoming increasingly important in applications such as content-based storage and retrieval, studies and television...
详细信息
In this paper, we address the problem of the efficient encoding of object boundaries. This problem is becoming increasingly important in applications such as content-based storage and retrieval, studies and television postproduction, and mobile multimedia applications. The MPEG-4 visual standard will allow the transmission of arbitrarily shaped video objects. The techniques developed for shape coding within the MPEG-4 standardization effort are described and compared first. A framework for the representation of shapes using their contours is presented next. Such representations are achieved using curves of various orders, and they are optimal in the rate-distortion sense. Last, conclusions are drawn.
In this paper we present an encoding algorithm for boundary maps containing only closed contours. The method relies on the location and encoding of 'transition points', i.e., points where rows or columns of ed...
详细信息
In this paper we present an encoding algorithm for boundary maps containing only closed contours. The method relies on the location and encoding of 'transition points', i.e., points where rows or columns of edge elements change state. Experimental results show that this technique offers better compression ratios than differential chain-coding for boundary maps having medium or large complexity. (C) 1998 Elsevier Science B.V.
A major problem in. object-oriented video coding and, MPEG-4 is the encoding of object boundaries., Traditionally this problem, is treated separately from the texture encoding problem. In. this paper, we present,a ver...
详细信息
A major problem in. object-oriented video coding and, MPEG-4 is the encoding of object boundaries., Traditionally this problem, is treated separately from the texture encoding problem. In. this paper, we present,a vertex-based shape coding method which is optimal in the operational rate-distortion sense and takes into account the texture information of the video frames. This is accomplished by utilizing a variable-width tolerance band whose Width,is a function of the texture, profile. As an. example, this width is inversely proportional to the magnitude, of the image gradient. Thus, in areas where the confidence in the estimation of the boundary is low and/or coding errors in the boundary will not affect the application (e.g., object-oriented coding and MPEG-4) significantly,. a larger boundary approximation error, is allowed. We present experimental. results which demonstrate the effectiveness of the proposed algorithm.
In this paper, we present a new shape-coding approach, which decouples the shape information into two independent signal data sets;the skeleton and the boundary distance from the skeleton. The major benefit of this ap...
详细信息
In this paper, we present a new shape-coding approach, which decouples the shape information into two independent signal data sets;the skeleton and the boundary distance from the skeleton. The major benefit of this approach is that it allows for a more flexible tradeoff between approximation error and bit budget. Curves of arbitrary order can be utilized for approximating both the skeleton and distance signals. For a given bit budget for a video frame, we solve the problem of choosing the number and location of the control points for all skeleton and distance signals of all boundaries within a frame, so that the overall distortion is minimized. An operational rate-distortion (ORD) optimal approach using Lagrangian relaxation and a four-dimensional Direct Acyclic Graph (DAG) shortest path algorithm is developed for solving the problem. To reduce the computational complexity from O(N-5) to O(N-3), where N is the number of admissible control points for a skeleton, a suboptimal greedy-trellis search algorithm is proposed and compared with the optimal algorithm. In addition, an even more efficient algorithm with computational complexity O(N-2) that finds an ORD optimal solution using a relaxed distortion criterion is also proposed and compared with the optimal solution. Experimental results demonstrate that our proposed approaches outperform existing ORD optimal approaches, which do not follow the same decomposition of the source data.
In this paper, we review a general framework for the optimal bit allocation among dependent quantizers based on the minimum maximum (MINMAX) distortion criterion. Pros and cons of this optimization criterion are discu...
详细信息
In this paper, we review a general framework for the optimal bit allocation among dependent quantizers based on the minimum maximum (MINMAX) distortion criterion. Pros and cons of this optimization criterion are discussed and compared to the well-known Lagrange multiplier method for the minimum average (MINAVE) distortion criterion. We argue that, in many applications, the MINMAX criterion is more appropriate than the more popular MINAVE criterion. We discuss the algorithms for solving the optimal bit allocation problem among dependent quantizers for both criteria and highlight the similarities and differences. We point out that any problem which can be solved with the MINAVE criterion can also be solved with the MINMAX criterion, since both approaches are based on the same assumptions. We discuss uniqueness of the MINMAX solution and the way both criteria can be applied simultaneously within the same optimization framework. Furthermore, we show how the discussed MINMAX approach can be directly extended to result in the lexicographically optimal solution. Finally, we apply the discussed MINMAX solution methods to still image compression, intermode frame compression of H.263, and shape coding applications.
A transform coding scheme for closed image boundaries on a plane is described. The given boundary is approximated by a series of straight line segments. Depending on the shape, the boundary is represented by the (x-y)...
详细信息
A transform coding scheme for closed image boundaries on a plane is described. The given boundary is approximated by a series of straight line segments. Depending on the shape, the boundary is represented by the (x-y) coordinates of the endpoints of the line segments or by the magnitude of the successive radii vectors that are equispaced in angle around the given boundary. Due to the circularity present in the data, the discrete Fourier transform is used to exactly decorrelate the finite boundary data. By fitting a Gaussian circular autoregressive model to represent the boundary data, estimates of the variances of the Fourier coefficients are obtained. Using the variances of the Fourier coefficients and the MAX quantizer, the coding scheme is implemented. The scheme is illustrated by an example.","doi":"10.1109/TPAMI.1984.4767482","publicationTitle":"IEEE Transactions on Pattern Analysis and Machine Intelligence","startPage":"102","endPage":"105","rightsLink":"http://***/AppDispatchServlet?publisherName=ieee&publication=0162-8828&title=Fourier+coding+of+Image+Boundaries&isbn=&publicationDate=Jan.+1984&author=R.+Chellappa&ContentID=10.1109/TPAMI.1984.4767482&orderBeanReset=true&startPage=102&endPage=105&volumeNum=PAMI-6&issueNum=1","displayPublicationTitle":"IEEE Transactions on Pattern Analysis and Machine Intelligence","pdfPath":"/iel5/34/4767466/***","keywords":[{"type":"IEEE Keywords","kwd":["Image coding","Image edge detection","Image segmentation","Cyclones","Area measurement","Satellites","Pixel","Delay","Clouds","Pattern recognition"]},{"type":"Author Keywords ","kwd":["transform coding","boundary coding","circular autoregressive models","image coding"]}],"allowComments":false,"pubLink":"/xpl/***?punumber=34","issueLink":"/xpl/***?isnumber=4767466","standardTitle":"Fourier coding of Image Boundaries
暂无评论