Most existing objective quality evaluation metrics for panoramic images are typically derived from peak signal to noise ratio (PSNR) or structural similarity (SSIM). However, they need pristine panoramic images and ar...
详细信息
Most existing objective quality evaluation metrics for panoramic images are typically derived from peak signal to noise ratio (PSNR) or structural similarity (SSIM). However, they need pristine panoramic images and are not highly consistent with human perception. To address the problem, we propose a novel blind panoramic image quality assessment (PIQA) method to predict the visual quality of panoramic images based on asymmetric mechanism of human brain. In the proposed method, the high-frequency feature is extracted by panoramic-weighted local binary pattern and relative total variation is used to clip the high-frequency information in panoramic images, then panoramic-weighted statistic feature can represent the low-frequency feature. Finally, support vector regression (SVR) is adopted to build a quality predictor from feature space to quality score space. The experimental results on the public subjective dataset demonstrate the superiority of our proposed metric compared with state-of-the-art objective PIQA methods.
Bag-of-features (BoF) representation is one of the most popular image representations that is used in visual object classification, owing to its simplicity and good performance. However, the BoF representation always ...
详细信息
image deblurring is an important task for practical applications. However, it remains a challenge due to the serious camera shake. In this paper, we propose a prior named Dual-Exposure Prior (DEP) according to the obs...
详细信息
image deblurring is an important task for practical applications. However, it remains a challenge due to the serious camera shake. In this paper, we propose a prior named Dual-Exposure Prior (DEP) according to the observation that image captured with a relatively short exposure time may achieve sharp edges. Based on the DEP, the blur kernels of the blurry images can be accurately estimated. An effective optimization between blur kernel estimation and intermediate image restoration is established by using the L_2-regularized DEP in the maximum a posterior framework. Each sub-problem of the optimization is convex, it can be solved in close-form with fast Fourier transforms acceleration. The experiments based on real and synthesized blurry images show that the proposed image deblurring algorithm can outperform the state-of-the-art methods in terms of subjective and objective quality. Moreover, the proposed algorithm can be implemented with significantly low computational complexity.
The pixel variation signal extracted from the nasal region of RGB-thermal images can be used to achieve breathing rate (BR) measurement. However, this method fails when the nasal region is not detected in complicated ...
The pixel variation signal extracted from the nasal region of RGB-thermal images can be used to achieve breathing rate (BR) measurement. However, this method fails when the nasal region is not detected in complicated motion scenarios. In this paper, we develop an RGB-thermal imaging system collaborated with marker sticker to achieve unobtrusive and accurate BR measurement. Pixel variation signal of Regions of interest (ROI) is extracted from the thermal video and chest movement signal is extracted from the RGB video with the assistance of marker stickers. Subsequently, a custom-made time-domain signal processing approach is developed for determining BR. We further propose a method of splicing computation to measure the BR after separate processing of signal segments. We construct an RGB-thermal video dataset with different head and body movements to evaluate the effectiveness of the proposed algorithm. After linear regression analysis, the determination coefficient (R~2) of 0.905 has been observed for the estimated and reference BRs, indicating the feasibility of our proposed method in complex motion scenarios.
There are limitations in existing multi-exposure fused image quality assessment (MEF-IQA) approaches. The first is that the existing approaches are hard to estimate the consistency of luminance distribution in fused i...
详细信息
There are limitations in existing multi-exposure fused image quality assessment (MEF-IQA) approaches. The first is that the existing approaches are hard to estimate the consistency of luminance distribution in fused images, which can be easily destroyed in fusion process. The second is that the aesthetics estimation of fused images are neglected in existing approaches, which should be considered for the images produced by fusion algorithms to give people an aesthetic experience. In order to solve these problems, a MEF-IQA approach based on the deep network was proposed. First, we adapted a quad path network as luminance error quality evaluator (LE) to extract the features, which can capture the luminance distribution patterns, from fused images. Then, we trained an aesthetic evaluator (AE) to obtain the feature which can describe the aesthetic attributes of fused image. Finally, the two extracted features are combined for predicting the perceptual quality scores. Experimental result clearly shows that the proposed approach can quantify the image quality reliably, and it is highly correlated with subjective quality scores.
General-purpose forensics on small image patches appears to be feasible and important, but in fact poses a challenge due to insufficient statistics. Furthermore, there is a need to develop a forensic approach that can...
General-purpose forensics on small image patches appears to be feasible and important, but in fact poses a challenge due to insufficient statistics. Furthermore, there is a need to develop a forensic approach that can automatically learn effective and robust features related to image forensics with high parameter efficiency. In this paper, we propose a depthwise separable convolutional neural network (CNN) for the simultaneous detection of eleven types of image manipulations in image patches. Different from the previous CNNs based on standard convolution, depthwise separable convolution is introduced in the proposed CNN to adaptively extract forensics-related features from image patches with better parameter efficiency. When compared with four state-of-the-art methods, experiments demonstrate that the proposed CNN architecture can achieve better performance, e.g., the improvement in terms of accuracy in the detection of 32 × 32 images is up to 7.33%. It also achieves significantly better overall performance for different databases and better robustness against JPEG compression.
In this paper, we build up a new block-based color image compression scheme based on our proposed transform domain down-sampling method and deep convolutional reconstruction algorithm. Specifically, our proposed down-...
In this paper, we build up a new block-based color image compression scheme based on our proposed transform domain down-sampling method and deep convolutional reconstruction algorithm. Specifically, our proposed down-sampling scheme aims to down-sample each N × N transform block into the N/2 × N/2 block for the saving of bit-cost. On the other hand, the proposed deep convolutional reconstruction algorithm is employed to reconstruct the down-sampled block for a full-resolution reconstruction. We apply our proposed methods to both the chrominance components to compress color images. Experimental results show that our proposed method achieves excellent results when used in practice.
A new retinal vessel detection method is introduced using an antagonistic model, consisting of the excitatory and inhibitory regions. It utilizes the model profile containing important image features such as lines and...
详细信息
A new retinal vessel detection method is introduced using an antagonistic model, consisting of the excitatory and inhibitory regions. It utilizes the model profile containing important image features such as lines and curves to detect retinal vessels. The excitatory area is analyzed into several sub-regions with multiple directions to decompose the local vessel structure, resulting in several profiles that include various vessel trees. Also, the strength of the inhibitory region is adaptively varied to differentiate vessel pixels from background while decreasing false detection. Then, each profile is linked to a multi-bound relaxed median operator for noise removal, and a logistic function for providing a soft transition between vessel and non-vessel regions. The final vessel tree is produced by fusing all profiles. The proposed method is evaluated on high-resolution and low-resolution fundus imaging databases (HRF and DRIVE), on which it outperforms state-of-the-art methods.
To replicate human visual perception, we analyze processingimages with optical illusion using edge preserving filters and smoothed local histogram equalization (LHE). images with the optical illusions are good models...
详细信息
ISBN:
(纸本)9781538644584
To replicate human visual perception, we analyze processingimages with optical illusion using edge preserving filters and smoothed local histogram equalization (LHE). images with the optical illusions are good models for gradual/rapid changes in contrast and strong edges, which are good cases for assessing the robustness of image filters. Here, we study and analyze the performance of smoothed LHE filters while processing perceptual illusion. Our studies conclude that, smoothed LHEs are useful in retaining actual edge forms in these images as they can operate using large kernel sizes. These large kernel size filters can construct sawtooth like edge and it corresponds to adequately wide halos. We also demonstrate the usefulness of smoothed LHE like tone mapping techniques in preserving naturalness, and we confirmed it by performing subjective visual test.
High dynamic range (HDR) images capture the luminance information of the real world and have more detailed information than low dynamic range (LDR) images. In this paper, we propose a dual-streams global guided end-to...
High dynamic range (HDR) images capture the luminance information of the real world and have more detailed information than low dynamic range (LDR) images. In this paper, we propose a dual-streams global guided end-to-end learning method to reconstruct HDR image from a single LDR input that combines both global information and local image features. In our framework, global features and local features are separately learned in dual-streams branches. In the reconstructed phase, we use a fusion layer to fuse them so that the global features can guide the local features to better reconstruct the HDR image. Furthermore, we design mixed loss function including multi-scale pixel-wise loss, color similarity loss and gradient loss to jointly train our network. Comparative experiments are carried out with other state-of-the-art methods and our method achieves superior performance.
暂无评论