Region-of-interest (ROI) techniques are often utilized to improve coding for detailed regions in natural still-imagecoding standards such as JPEG2000 [1], but no specific method is stated for determining the ROI map....
详细信息
Region-of-interest (ROI) techniques are often utilized to improve coding for detailed regions in natural still-imagecoding standards such as JPEG2000 [1], but no specific method is stated for determining the ROI map. In this paper, an ROI-based method, in which rectangular regions are extracted using documentimage analysis (DIA), is proposed specifically for document image coding. These rectangular regions can be efficiently coded using wavelets, and DIA may also be used to distinguish between important and unwanted foreground regions, allowing further coding gains (as illustrated in one of the example documents in the paper). Compared to multilayer methods currently used for document image coding [2], the method is simpler and scalable, while improving visual quality and the peak-signal-to-noise ratio (PSNR).
This paper discusses image decomposition problem of the 3-layer MRC model based coding of scanned (noisy) documentimages. A widely-used approach for document decomposition is to divide the documentimage into blocks ...
详细信息
ISBN:
(纸本)9781424442195
This paper discusses image decomposition problem of the 3-layer MRC model based coding of scanned (noisy) documentimages. A widely-used approach for document decomposition is to divide the documentimage into blocks and split the pixel histogram of each block into two halves by minimizing the sum of variance of its pixels with the mean of the halms. We propose to split a block by minimizing the variance of one half with its minimum pixel and the variance of the other half with its maximum pixel. Our goal is to increase the gap between the two halves by avoiding splitting of any cluster of pixels into both halves. It should help reduce complexity of the generated mask. Moreover, we do not decompose a block if it has no edge points, again to reduce the mask complexity. We also implement a noise reduction heuristic in the mask layer to correct placement of transition pixels. We provide simple analysis and evaluate block energy in terms of the DCT coefficients of the resulting FG/BG layer blocks. Experimental results show that code size of the mask layer of our test images, obtained using proposed processing is reduced to nearly half of the mask obtained by a straightforward 3-MRC implementation.
暂无评论