Existence of the optimal prefix codes is shown in this paper. Relationship between the optimal prefix code and the Huffman code is also discussed. We prove that all Huffman codes are optimal prefix codes and conversel...
详细信息
Existence of the optimal prefix codes is shown in this paper. Relationship between the optimal prefix code and the Huffman code is also discussed. We prove that all Huffman codes are optimal prefix codes and conversely optimal prefix codes need not be Huffman codes. Especially, the problem of whether the optimal prefix code has to be maximal is presented. Although for information source alphabets of being not greater than four letters we show that an optimal prefix code must be maximal, it remains to be an open problem in general. As seen from Huffman codes, optimal prefix codes are used not only for statistical modeling but also for dictionary methods. Moreover, it is obtained that the complexity of breaking an optimal prefix code is NP-complete from the viewpoint of computational difficulty.
A framework with two scalar parameters is introduced for various problems of finding a prefixcode minimizing a coding penalty function. The framework encompasses problems previously proposed by Huffman, Campbell, Nat...
详细信息
A framework with two scalar parameters is introduced for various problems of finding a prefixcode minimizing a coding penalty function. The framework encompasses problems previously proposed by Huffman, Campbell, Nath, and Drmota and Szpankowski, shedding light on the relationships among these problems. In particular, Nath's range of problems can be seen as bridging the minimum average redundancy problem of Huffman with the minimum maximum pointwise redundancy problem of Drmota and Szpankowski. Using this framework, two linear-time Huffman-like algorithms are devised for the minimum maximum pointwise redundancy problem, the only one in the framework not previously solved with a Huffman-like algorithm. Both algorithms provide solutions common to this problem and a subrange of Nath's problems, the second algorithm being distinguished by its ability to find the minimum variance solution among all solutions common to the minimum maximum pointwise redundancy and Nath problems. Simple redundancy bounds are also presented.
This paper presents new lower and upper bounds for the compression rate of binary prefixcodes optimized over memoryless sources according to various nonlinear codeword length objectives. Like the most well-known redu...
详细信息
This paper presents new lower and upper bounds for the compression rate of binary prefixcodes optimized over memoryless sources according to various nonlinear codeword length objectives. Like the most well-known redundancy bounds for minimum average redundancy coding-Huffman coding-these are in terms of a form of entropy and/or the probability of an input symbol, often the most probable one. The bounds here, some of which are tight, improve on known bounds of the form L is an element of [H, H + 1), where H is some form of entropy in bits (or, in the case of redundancy objectives, 0) and L is the length objective, also in bits. The objectives explored here include exponential-average length, maximum pointwise redundancy, and exponential-average pointwise redundancy (also called d(th) exponential redundancy). The first of these relates to various problems involving queueing, uncertainty, and lossless communications;the second relates to problems involving Shannon coding and universal modeling. Also explored here for these two objectives is the related matter of individual codeword length.
Whereas Huffman coding finds a prefixcode minimizing mean codeword length for a given finite-item probability distribution, quasiarithmetic or quasilinear coding problems have the goal of minimizing a generalized mea...
详细信息
Whereas Huffman coding finds a prefixcode minimizing mean codeword length for a given finite-item probability distribution, quasiarithmetic or quasilinear coding problems have the goal of minimizing a generalized mean of the form phi(-1)(Sigma(i)p(i)phi(l(i))), where l(i) denotes the length of the ith codeword, P-i denotes the corresponding probability, and phi is a monotonically increasing cost function. Such problems, proposed by Campbell, have a number of diverse applications. Several cost functions are shown here to yield quasiarithmetic problems with simple redundancy bounds in terms of a generalized entropy. A related property, also shown here, involves the existence of optimalcodes: For "well-behaved" cost functions, optimalcodes always exist for (possibly infinite-alphabet) sources having finite generalized entropy. An algorithm is introduced for finding binary codes optimal for convex cost functions. This algorithm, which can be extended to other minimization utilities, can be performed using quadratic time and linear space. This reduces the computational complexity of a problem involving minimum delay in a queue, allows combinations of previously considered problems to be optimized, and greatly expands the set of problems solvable in quadratic time and linear space.
A new type of sufficient condition is provided for a probability distribution on the nonnegative integers to be given an optimal D-ary prefixcode by a Huffman-type algorithm, In the justification of our algorithm, we...
详细信息
A new type of sufficient condition is provided for a probability distribution on the nonnegative integers to be given an optimal D-ary prefixcode by a Huffman-type algorithm, In the justification of our algorithm, we introduce two new (essentially one) concepts as the definition of the ''optimality'' of a prefix D-ary code, which are shown to be equivalent to that defined in the traditional way, These new concepts of the optimality are meaningful even for the case where the Shannon entropy H(P) diverges.
暂无评论