ISBN: 9781479961399 (print)
Advances in image quality assessment have shown the potential added value of including visual attention aspects in objective quality metrics. Numerous models of visual saliency have been implemented and integrated into different quality metrics; however, their ability to improve a metric's performance in predicting perceived image quality has not been fully investigated. In this paper, we conduct an exhaustive comparison of 20 state-of-the-art saliency models in the context of image quality assessment. Experimental results show that adding computational saliency is generally beneficial to quality prediction. However, the performance gain that can be obtained by adding saliency to a quality metric depends strongly on both the saliency model and the metric.
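One common way to integrate saliency into a quality metric is to weight a local quality map by a saliency map before pooling it into a single score. The sketch below illustrates that idea only; the abstract does not say which integration scheme the compared metrics use, and the function name `saliency_weighted_score` and the toy maps are hypothetical.

```python
def saliency_weighted_score(quality_map, saliency_map, eps=1e-12):
    """Pool a local quality map into one score, weighting each pixel by saliency.

    Both maps are lists of rows with identical shape. This is an assumed,
    generic pooling rule, not the scheme of any specific metric.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for q_row, s_row in zip(quality_map, saliency_map):
        for q, s in zip(q_row, s_row):
            weighted_sum += q * s
            weight_total += s
    if weight_total <= eps:
        # No salient pixels at all: fall back to the plain average.
        flat = [q for row in quality_map for q in row]
        return sum(flat) / len(flat)
    return weighted_sum / weight_total


# Toy 2x2 example: one strongly distorted pixel (local quality 0.2).
quality = [[0.9, 0.9], [0.2, 0.9]]
uniform = [[1, 1], [1, 1]]          # no attention model: plain mean
salient_defect = [[0, 0], [1, 0]]   # all attention falls on the distorted pixel

plain = saliency_weighted_score(quality, uniform)
focused = saliency_weighted_score(quality, salient_defect)
```

With uniform weights the score is the ordinary mean of the map; when the saliency map concentrates on the distorted pixel, the pooled score drops to that pixel's local quality, which shows why the gain depends on the saliency model.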
ISBN: 9780769546001 (print)
A visual navigation system for a mobile patrol robot using image processing on an FPGA and real-time Linux is presented. The CMOS image sensor and the stepper motor driver ICs are connected to external I/O ports of the FPGA. The image processing and motor drive circuits are implemented in the reconfigurable device as original logic. The image capture circuit uses a state machine and a FIFO memory buffer to adjust the timing of pixel data transmission. The motor drive circuit generates clock signals for steps according to the value supplied by the processor in the FPGA. A real-time device driver has been developed to link the flexible hardware circuits with real-time software applications for robot vision.
ISBN: 9781728172064 (print)
In this study, a product suggestion engine has been developed for an e-commerce site portfolio focusing on garment items. Traditional collaborative filtering methods are often inapplicable because of high turnover rates in product lists. By focusing instead on visual similarity using a deep learning technique, successful results were obtained, and it is concluded that this technique is suitable for application to a live e-commerce garment site.
ISBN: 0819444111 (print)
This paper outlines a generalized image reconstruction approach to improve the resolution of an Electro-Optic (EO) imaging sensor using multiple frames of an image sequence. The method assumes only that the constituent video contains some ambient motion between the sensor and a stationary background, and that the optical image is physically captured by a staring focal plane array.
ISBN: 9781665475921 (print)
In this paper, we propose a high-frequency guided CNN for video compression artifact reduction. In the proposed method, the high-frequency component of the Y channel is extracted and used to guide the quality enhancement of all Y, U, and V channels, because the high-frequency component contains the edge and contour information of the objects in the image, which is of vital importance to both subjective and objective quality. The proposed method consists of two modules: the high-frequency guidance module and the quality enhancement module. The high-frequency guidance module uses multiple octave convolutions to extract the high-frequency component of the Y channel and then fuses it into the features of the Y, U, and V channels. In the quality enhancement module, multiple CNN residual blocks are used for the quality enhancement of the Y, U, and V channels. The proposed method was integrated into both HM-16.22 and VTM-16.0. The results on the JVET test sequences under the All Intra configuration show the effectiveness of the proposed method. Compared with HEVC, the proposed method achieves average BD-rate reductions of -12.3%, -22.7% and -23.5% for the Y, U and V channels, respectively. Compared with VVC, the corresponding average BD-rate reductions are -6.7%, -12.3% and -13.2%.
ISBN: 0819424358 (print)
Fractal image compression is computationally expensive. Speed-up techniques are therefore required to achieve time demands comparable to those of other compression techniques. In this paper we combine sequential and parallel techniques suitable for MIMD architectures, which move this compression scheme closer to real-time processing. The algorithms introduced are especially designed for memory-critical environments.
ISBN: 9781479961399 (print)
This paper presents the development of a fast Free-Viewpoint Video (FVV) rendering algorithm that exploits the parallelism offered by General Purpose Graphics Processing Units (GPGPUs). The system generates virtual views using Depth Image-Based Rendering (DIBR) algorithms, implemented with the NVIDIA® Compute Unified Device Architecture (CUDA). A novel reference image brightness adjustment algorithm is also discussed; it exploits the correspondences between matching pixels in the reference images to avoid drastic brightness switching while navigating between views. The developed solution ensures that data transfers are kept to a minimum, thus improving the overall rendering speed. Objective and subjective test results show that, for typical free-view scenarios, the proposed algorithm can be successfully deployed in real-time FVV systems, providing a good Quality of Experience (QoE).
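The core DIBR step can be illustrated with a minimal sketch: each reference pixel is shifted horizontally by a disparity derived from its depth, and a depth test resolves collisions. The sketch assumes rectified, parallel cameras (disparity = focal length × baseline / depth) and a virtual camera placed so that pixels shift left; it is plain Python on one scanline, not the paper's CUDA implementation, and `warp_scanline` and its parameters are hypothetical names.

```python
def warp_scanline(colors, depths, focal_px, baseline, hole=None):
    """Forward-warp one scanline of a reference view into a virtual view.

    Assumes rectified parallel cameras, so a pixel at depth z moves by
    disparity = focal_px * baseline / z pixels (rounded to the nearest pixel).
    """
    width = len(colors)
    out = [hole] * width              # unfilled positions stay as holes
    zbuf = [float("inf")] * width     # nearest depth written so far per target pixel
    for x, (color, z) in enumerate(zip(colors, depths)):
        disparity = focal_px * baseline / z
        xt = x - int(round(disparity))
        # Keep the pixel only if it lands inside the image and is closer
        # to the camera than whatever was written there before.
        if 0 <= xt < width and z < zbuf[xt]:
            out[xt] = color
            zbuf[xt] = z
    return out


# Pixel "c" is close to the camera (depth 2) and moves one pixel to the left,
# occluding the background pixel "b" and leaving a hole where it used to be.
virtual = warp_scanline(["a", "b", "c", "d"], [10, 10, 2, 10],
                        focal_px=4, baseline=0.5)
```

Every source pixel is processed independently, which is what makes this step a natural fit for one-thread-per-pixel GPU execution; on a GPU the depth test would need an atomic operation or a separate resolve pass, and the holes left behind still have to be filled.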
ISBN: 0819450235 (print)
This paper presents a new approach to color image denoising that takes a human visual system (HVS) model into account. The denoising process takes place in the wavelet transform domain. A contrast sensitivity function (CSF) implementation is incorporated into the wavelet-based algorithm, based on an invariant single-factor weighting per subband followed by noise masking. Experimental results show that the new approach performs well in terms of perceptual error metrics and visual effect.
ISBN: 9781479902880 (print)
This paper presents a motion-based depth estimation algorithm for automatic 2D-to-3D video conversion that employs the co-occurrence matrix of motion vectors (MVCM). Video scenes possess distinct MVCM signatures, which makes it possible to exploit the corresponding motion-depth relation for depth generation. The subsequent motion-compensated depth updating scheme provides stable and comfortable 3D visual quality when views are synthesized by depth-image-based rendering. Simulation results on several high-definition image sequences indicate that the proposed algorithm produces better and more reasonable depth than two other motion-based depth estimation algorithms. With the adaptive depth estimation scheme using MVCM, the proposed 2D-to-3D video conversion algorithm can accommodate a great variety of visual content. It thus provides an efficient and reliable solution to the problem of automatic 3D video content creation.
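To make the idea of a motion-vector co-occurrence matrix concrete, the sketch below counts how often pairs of quantized motion magnitudes occur in horizontally adjacent blocks. The abstract does not define the MVCM precisely, so the quantization (magnitude into `levels` bins up to `max_mag`), the neighbor relation, and the function name `mv_cooccurrence` are all assumptions made for illustration.

```python
def mv_cooccurrence(mv_field, levels=4, max_mag=8.0):
    """Count co-occurring motion magnitudes in horizontally adjacent blocks.

    mv_field is a list of rows of (dx, dy) motion vectors, one per block.
    Entry [a][b] of the result counts blocks in bin a whose right-hand
    neighbor falls in bin b. This definition is an assumed, generic one.
    """
    def quantize(mv):
        magnitude = (mv[0] ** 2 + mv[1] ** 2) ** 0.5
        # Map [0, max_mag) onto bins 0..levels-1, clamping larger motion.
        return min(int(magnitude / max_mag * levels), levels - 1)

    matrix = [[0] * levels for _ in range(levels)]
    for row in mv_field:
        bins = [quantize(mv) for mv in row]
        for left, right in zip(bins, bins[1:]):
            matrix[left][right] += 1
    return matrix


# A static background with a moving object entering from the right.
field = [
    [(0, 0), (0, 0), (3, 4)],
    [(0, 0), (3, 4), (3, 4)],
]
mvcm = mv_cooccurrence(field)
```

Mass on the diagonal indicates regions of uniform motion, while off-diagonal counts mark motion boundaries; a scene-level signature of this kind is the sort of cue that could be related to depth ordering.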
ISBN: 9781479961399 (print)
Increasing the spatial resolution is an ongoing research topic in image processing. A recently presented approach applies a non-regular sampling mask to a low-resolution sensor and subsequently reconstructs the masked area via an extrapolation algorithm to obtain a high-resolution image. This paper introduces an acceleration of this approach for use with full-color sensors. Instead of employing the effective, yet computationally expensive, extrapolation algorithm on each of the three RGB channels, a color space conversion is performed and only the luminance channel is reconstructed using this algorithm. As natural images contain much less information in the chrominance channels, a fast linear interpolation technique can be used for them to accelerate the whole reconstruction procedure. Simulation results show that an average speed-up factor of 2.9 is thus achieved, while the loss in visual quality remains imperceptible. Comparisons of PSNR results confirm this.
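The luminance/chrominance split can be sketched as follows. The abstract does not name the color space, so the BT.601 YCbCr conversion is an assumption, and `fill_linear` is a hypothetical 1-D stand-in for the fast linear interpolation applied to the chrominance channels; the expensive extrapolation used for luminance is not reproduced here.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert one RGB pixel to luminance and two chrominance values (BT.601, assumed)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = (b - y) / 1.772
    cr = (r - y) / 1.402
    return y, cb, cr


def fill_linear(samples):
    """Fill masked samples (None) in one row by linear interpolation.

    Gaps at the borders are filled with the nearest known sample.
    """
    known = [i for i, v in enumerate(samples) if v is not None]
    if not known:
        raise ValueError("row contains no known samples")
    out = list(samples)
    for i, value in enumerate(samples):
        if value is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            out[i] = samples[right]
        elif right is None:
            out[i] = samples[left]
        else:
            t = (i - left) / (right - left)
            out[i] = (1 - t) * samples[left] + t * samples[right]
    return out


gray = rgb_to_ycbcr(128, 128, 128)              # neutral gray: chrominance near zero
chroma_row = fill_linear([0.0, None, None, 3.0])
```

Under this scheme the costly reconstruction runs once (on Y) instead of three times (on R, G, and B), with only two cheap interpolations added, which is consistent with a speed-up approaching but not reaching a factor of 3.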