High Efficiency Video Coding (HEVC), not only provides a much better coding efficiency than previous video coding standards, but also shows significantly superior performance than other image coding schemes when appli...
详细信息
ISBN:
(纸本)9781479902880
High Efficiency Video Coding (HEVC), not only provides a much better coding efficiency than previous video coding standards, but also shows significantly superior performance than other image coding schemes when applied to image coding. However, the improvement is at the cost of significant increase of encoding complexity. In this paper, we focus on retaining the high coding efficiency provided by HEVC while largely reducing its encoding complexity for image coding. By applying various techniques including optimized coding structure parameters, coding unit early termination, fast intra prediction and transform skip mode decision, we significantly reduce the complexity of HEVC intra coding while keeping most of its coding efficiency. Experimental results show that our light-weight HEVC encoder can save about 82% coding time compared with original HEVC encoder. With a slight loss to the HEVC reference software, the proposed scheme still gains about 19% in BD-BR compared with H.264/AVC.
In this paper we present a novel framework for compressing images using saliency maps and KAZE features. The method involves adapting the quality factor in JPEG compression scheme for each block instead of using the s...
详细信息
ISBN:
(纸本)9781509017461
In this paper we present a novel framework for compressing images using saliency maps and KAZE features. The method involves adapting the quality factor in JPEG compression scheme for each block instead of using the same quality factor for the entire image. This is achieved by adapting JPEG quality parameter based on visual saliency and KAZE keypoints. Subsequently, a piecewise function is used to compress the least important image blocks with higher compression ratio while maintaining the overall perceptual quality and avoiding blocking artifacts. This work introduces use of KAZE keypoints for image compression for the first time in literature. We show that the proposed method outperforms the JPEG compression using PSNR and FSIMc evaluation measures especially at high compression ratios.
In this paper we propose an image coding scheme based on the polynomial transform and multiresolution analysis. The polynomial transform is an image representation model that mimics some properties of the human visual...
详细信息
ISBN:
(纸本)0819424358
In this paper we propose an image coding scheme based on the polynomial transform and multiresolution analysis. The polynomial transform is an image representation model that mimics some properties of the human visual system, and which we use in order to model edges in terms of their characteristic parameters. Based on the polynomial transform, we build a pyramidal hierarchical predictive scheme for image coding. The feature parameters that we encode are: local average, edge orientation, edge position and edge magnitude.
Hyperspectral imaging captures a high number of spectrally narrow bands and provides advantages for image analysis applications such as identification and classification in particular. Hyperspectral images contain a l...
详细信息
ISBN:
(纸本)9781467373869
Hyperspectral imaging captures a high number of spectrally narrow bands and provides advantages for image analysis applications such as identification and classification in particular. Hyperspectral images contain a large amount of bands. processing these images causes the operation load substantially. Improved methods for the classification of hyperspectral image, can not succeed due to the multidimensionality. To overcome this disadvantage made size reduction and to reduce the number of bands. In this study, to hyperspectral image to be consistent with the human visual system, band gaps are selected which red (R), green (G) and blue (B) corresponding to the wave length. In this paper, superpixel approach is proposed to improve the classification performance.
image quality assessment (IQA) is useful in many visualprocessing systems but challenging to perform in line with the human perception. A great deal of recent research effort has been directed towards IQA. In order t...
详细信息
ISBN:
(纸本)9780819482341
image quality assessment (IQA) is useful in many visualprocessing systems but challenging to perform in line with the human perception. A great deal of recent research effort has been directed towards IQA. In order to overcome the difficulty and infeasibility of subjective tests in many situations, the aim of such effort is to assess visual quality objectively towards better alignment with the perception of the Human visual system (HVS). In this work, we review and analyze the recent progress in the areas related to IQA, as well as giving our views whenever possible. Following the recent trends, we discuss the engineering approach in more details, explore the related aspects for feature pooling, and present a case study with machine learning.
Multi-modal medical image registration is an important processing step for extracting the maximum amount of information from multi-modal medical images. In this paper, to perform image registration of CT and MRI data ...
详细信息
ISBN:
(纸本)9781479961399
Multi-modal medical image registration is an important processing step for extracting the maximum amount of information from multi-modal medical images. In this paper, to perform image registration of CT and MRI data volumes, we use the sum-of-conditional variance (SCV) similarity measure which utilizes the joint probability distribution of two images and allows Gauss-Newton optimization to be used. We compare the results from experiments on clinical CT and MRI datasets obtained using the SCV similarity measure, the entropy images on sum-of-squared-difference (eSSD) method and the mutual information (MI) approach. Our results indicate that the proposed SCV approach outperforms the eSSD and MI similarity measure approaches.
Place recognition, also called visual localization, facilitates the autonomous navigation capabilities of the future of driverless cars. This paper proposes a new place recognition algorithm that considers the appeara...
详细信息
ISBN:
(纸本)9781728119045
Place recognition, also called visual localization, facilitates the autonomous navigation capabilities of the future of driverless cars. This paper proposes a new place recognition algorithm that considers the appearancebased methodology to localize the vehicle by utilizing visual route map, i.e. a sequence of images, or sets of features extracted from these images, that were recorded over different times and dates for the route environments. These reference sequences are accurately labeled and annotated using GPS tags or manually using odometry information. The dynamic time warping (DTW) algorithm is used to achieve image sequence alignment and find the best match for each frame from the test sequence. The proposed algorithm considered hand-crafted features like SIFT, HOG, and LDB. Experiments, using common challenging and benchmark datasets, i.e. "UQ St Lucia" and "Nordland", have been conducted, and it has been observed that the proposed technique has significantly improved the performance of well-known appearance-based descriptors SIFT, HOG, and LDB as compared to its individual performance and to some of the state-of-the-art localization and mapping methods such as ABLE (Binary-appearance Loop-closure).
The goal of video stabilization is to remove the unwanted camera motion and obtain stable versions. Theoretically, a good stabilization algorithm should remove the unwanted motion without the loss of image qualities. ...
详细信息
ISBN:
(纸本)9781479902880
The goal of video stabilization is to remove the unwanted camera motion and obtain stable versions. Theoretically, a good stabilization algorithm should remove the unwanted motion without the loss of image qualities. However, due to the lack of ground-truth video frames, the accurate performance evaluation of different algorithms is hard. Most existing evaluation techniques usually synthesize stable videos from shaking ones, but they are not effective enough. Different from previous methods, in this paper we propose a novel method which synthesize shaking videos from stable frames. Based on the synthetic shaking videos, we perform preliminary video stabilization performance assessment on three stabilization algorithms. Our shaking video synthesis method can not only give a benchmark for full-reference video stabilization performance assessment, but also provide a basis for exploring the theoretical bound of video stabilization which may help to improve existing stabilization algorithms.
High Dynamic Range (HDR) images capture the full range of luminance present in real world scenes, and unlike Low Dynamic Range (LDR) images, can simultaneously contain detailed information in the deepest of shadows an...
详细信息
ISBN:
(纸本)9780819482341
High Dynamic Range (HDR) images capture the full range of luminance present in real world scenes, and unlike Low Dynamic Range (LDR) images, can simultaneously contain detailed information in the deepest of shadows and the brightest of light sources. In order to render HDR image on LDR displayers, it is often necessary to create LDR depictions of HDR images at the cost of contrast information loss. To reduce the loss, this paper enables to render HDRI (High Dynamic Range image) with multiple low-bit images periodically. From the viewpoint of a human, the pixel value is fractural. It does not adjust the tones but can reconstruct HDR images.
The objective in developing compact descriptors for visualimage search is building an image retrieval system that works efficiently and effectively under bandwidth and memory constraints. Selecting local descriptors ...
详细信息
ISBN:
(纸本)9781479902880
The objective in developing compact descriptors for visualimage search is building an image retrieval system that works efficiently and effectively under bandwidth and memory constraints. Selecting local descriptors to be processed, and sending them to the server for matching is an integral part of such a system. One such image search and retrieval system is the Compact Descriptors for visual Search (CDVS) standardization test model being developed by MPEG which has an efficient local descriptor selection criteria. However, all the existing selection parameters in CDVS are based on low-level features. In this paper, we propose two "mid-level" local descriptor selection criteria: visual Meaning Score (VMS), and visual Vocabulary Score (VVS) which can be seamlessly integrated into the existing CDVS framework. A mid-level criteria explicitly allows selection of local descriptors closer to a given set of images. Both VMS and VVS are based on visual words (patches) of images, and provide significant gains over the current CDVS standard in terms of matching accuracy, and have very low implementation cost.
暂无评论