ISBN: 9798331529543; 9798331529550 (print)
Judder is a motion artifact that arises from a perceptual mismatch between the human visual system and discrete movements on a display. It is correlated with video frame rate, motion speed, and brightness, and it significantly degrades perceived video quality. We present a framework for modeling judder in which the input frames are processed to generate optical flow and a sensitivity map. Concurrently, a specifically designed attention map is integrated into the process. Together, these components produce a feature map that is used to compute judder scores. The proposed approach explicitly takes the video frame rate into account, yielding judder scores that accurately predict subjective mean opinion scores. Our evaluation of the framework on an HDR video dataset shows that judder is highly influenced by frame rate and, to some extent, by motion speed and brightness. The prediction of perceived quality scores improves over the baseline framework.
ISBN: 9781665475921 (print)
Space-time video super-resolution aims to increase a given video's frame rate and resolution simultaneously. Although existing approaches have made great progress, most of them still approximate large motions inaccurately or fail to generate temporally consistent motion trajectories. To alleviate these problems, we carefully review the characteristics of different optical flow warping strategies, then integrate and enhance them to handle extreme motions and time-modulated interpolation more robustly. Specifically, we use enhanced backward warping to perform alignment, mine space-time information across the low-resolution input frames, and propose an enhanced forward warping strategy to interpolate arbitrary intermediate frames. Furthermore, the proposed model can be trained end-to-end and can produce intermediate results at any time step while being supervised only at the center moment. Experimental results show that the proposed algorithm performs favorably against state-of-the-art methods in both objective metrics and subjective visual quality.
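As a rough illustration of the backward warping mentioned in the abstract above, the sketch below warps a one-dimensional signal: each target position samples the source at its own position plus the flow, using linear interpolation. This is plain textbook backward warping, not the enhanced strategy the paper proposes; the function name `backward_warp_1d` and the clamping at the borders are assumptions made for the example.

```python
import math


def backward_warp_1d(source, flow):
    """Backward-warp a 1-D signal (hypothetical helper, not from the paper).

    For each target position x, sample `source` at x + flow[x] using linear
    interpolation. Sample positions are clamped to the valid index range.
    """
    n = len(source)
    out = []
    for x, f in enumerate(flow):
        pos = min(max(x + f, 0.0), n - 1.0)  # clamp to [0, n - 1]
        lo = int(math.floor(pos))
        hi = min(lo + 1, n - 1)
        t = pos - lo  # fractional part used as interpolation weight
        out.append((1.0 - t) * source[lo] + t * source[hi])
    return out


source = [0.0, 10.0, 20.0, 30.0]
print(backward_warp_1d(source, [1.0, 1.0, 1.0, 1.0]))  # [10.0, 20.0, 30.0, 30.0]
print(backward_warp_1d(source, [0.5, 0.5, 0.5, 0.5]))  # [5.0, 15.0, 25.0, 30.0]
```

Because every target position reads from the source, backward warping leaves no holes in the output, which is one reason it is commonly used for alignment. Forward warping instead pushes source samples to target positions and has to resolve collisions and gaps.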
ISBN: 0780389387 (print)
This paper proposes a new procedure for generating a mesh structure for image representation. The algorithm extracts a feature map and applies a specific threshold level to place initial nodes densely in regions that contain high-frequency features and coarsely in smooth regions. Many insignificant nodes are then removed using a node elimination scheme. The algorithm is well suited to very low bit-rate image and video coding: the processing results demonstrate that it provides good subjective and objective image quality with fewer required nodes. Moreover, a comparison shows that the proposed method provides comparable image quality while requiring fewer mesh nodes than several existing methods.
ISBN: 0819459763 (print)
This paper presents recursive wavelet filters and an alternative algorithm for implementing the wavelet transform. The recursive filters use previously calculated (past) wavelet coefficients as inputs to calculate the current wavelet coefficient, and they provide the same transform results as convolutional FIR and lifting wavelet filters. The coefficients of the recursive filters are derived from those of conventional FIR wavelet filters. The wavelet transform with recursive filters requires less memory and is easy to implement in hardware. Another important advantage is that perfect reconstruction can be achieved easily with recursive wavelet filters if the sequence of pixels to be transformed is extended by boundary pixel repetition. Boundary pixel repetition can be more efficient than the widely used method of symmetric extension for image and video coding.
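To make the two boundary-handling schemes named in the abstract above concrete, the following sketch pads a short row of pixels both ways. It illustrates only the extension step, not the recursive filters themselves; the function names are hypothetical, and the symmetric variant shown is whole-sample reflection (the boundary pixel is not duplicated), which is one common convention among several.

```python
def extend_by_repetition(pixels, pad):
    """Pad both ends by repeating the boundary pixel `pad` times."""
    if not pixels:
        raise ValueError("pixels must be non-empty")
    return [pixels[0]] * pad + list(pixels) + [pixels[-1]] * pad


def extend_symmetric(pixels, pad):
    """Pad both ends by mirroring around the boundary pixels.

    Whole-sample symmetry: the boundary pixel itself is not duplicated.
    """
    n = len(pixels)
    if n < 2 or pad > n - 1:
        raise ValueError("need len(pixels) >= 2 and pad <= len(pixels) - 1")
    left = [pixels[i] for i in range(pad, 0, -1)]
    right = [pixels[n - 1 - i] for i in range(1, pad + 1)]
    return left + list(pixels) + right


row = [10, 20, 30, 40]
print(extend_by_repetition(row, 2))  # [10, 10, 10, 20, 30, 40, 40, 40]
print(extend_symmetric(row, 2))      # [30, 20, 10, 20, 30, 40, 30, 20]
```

Repetition needs only the single boundary value, whereas symmetric extension has to read back `pad` interior samples at each end.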
ISBN: 9798350350463; 9798350350456 (print)
In this work, we explore the efficacy of graphs for representing video sequences in the context of no-reference video quality assessment (NR-VQA). Specifically, we address the problem of predicting video quality from frame-level spatial quality scores by using them as node features. We implicitly capture temporal dependency among the nodes through an appropriate choice of the graph adjacency matrix. A graph convolutional network (GCN) that accepts the graph video representation as its input is trained to predict mean opinion scores (MOS). To predict spatial quality, we experiment with several NR image quality assessment (NR-IQA) algorithms, including the Natural Image Quality Evaluator (NIQE), the Multi-scale Image Quality Transformer (MUSIQ), Codebook Representation for No-Reference Image Assessment (CORNIA), and Neural Image Assessment (NIMA). The proposed NR-VQA algorithm is evaluated on four user-generated VQA datasets, on which it shows a consistent improvement over directly using spatial quality scores for regression. These improvements suggest that graph-based video representation is promising for the NR-VQA task.
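The graph representation described in the abstract above can be sketched as follows. Frame-level quality scores act as node features, and a banded adjacency matrix links temporally neighbouring frames. The abstract does not specify the adjacency actually used, so the `window` parameter, the self-loops, and the symmetric normalization (the standard form from the GCN literature) are assumptions here. The propagation step also omits the learnable weights and nonlinearity that a trained GCN layer would have.

```python
import math


def temporal_adjacency(num_frames, window=1):
    """Link each frame to frames at most `window` steps away, with self-loops.

    The banded structure is a hypothetical choice for illustration.
    """
    return [[1.0 if abs(i - j) <= window else 0.0 for j in range(num_frames)]
            for i in range(num_frames)]


def normalize(adj):
    """Symmetric normalization D^(-1/2) A D^(-1/2), D being the degree matrix."""
    deg = [sum(row) for row in adj]
    n = len(adj)
    return [[adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def propagate(adj_norm, scores):
    """One weight-free graph-convolution step over scalar node features."""
    return [sum(a * s for a, s in zip(row, scores)) for row in adj_norm]


# Hypothetical per-frame scores, e.g. from an NR-IQA model.
frame_scores = [0.8, 0.6, 0.9, 0.7]
a_hat = normalize(temporal_adjacency(len(frame_scores), window=1))
smoothed = propagate(a_hat, frame_scores)
video_feature = sum(smoothed) / len(smoothed)  # simple mean pooling
print(smoothed, video_feature)
```

Widening `window` lets each frame aggregate information from more distant frames, which is one simple way the adjacency matrix can encode temporal dependency.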
Comments on extended versions of selected research papers from the 1st IEE European Conference on Visual Media Production in London, England. Presentation of novel methods for image and video processing fundamental to many activities in production and post-production; research on the reconstruction of people for animation production; discussion of topics contributing to visual content production.
ISBN: 0819456586 (print)
In [1], image-adaptive linear minimum mean squared error (LMMSE) filtering was proposed as an enhancement-layer color image coding technique that exploits the statistical dependencies among the luminance/chrominance or Karhunen-Loève Transform (KLT) coordinate planes of a lossy compressed color image to enhance the red, green, blue (RGB) color coordinate planes of that image. In the current work, we propose the independent design and application of LMMSE filters on the subbands of a color image as a low-complexity solution. To this end, the support of the filter for each subband includes only those coordinates of the neighbors of the filtered subband coefficient that are sufficiently correlated with the corresponding coordinate of that coefficient. Additionally, each subband LMMSE filter is applied selectively, only to the high-variance regions of the subband. Simulation results show that, at the expense of an insignificant increase in the overhead rate for transmitting the filter coefficients and with about the same enhancement gain, subband LMMSE filtering offers a substantial complexity advantage over fullband LMMSE filtering.
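The idea behind LMMSE enhancement can be illustrated with the simplest possible case, a single-tap affine estimator. The encoder, which has both the original and the decoded coefficients, computes a gain from their covariance and sends it as side information, and the decoder applies it. This is a deliberately reduced sketch: the filters in the abstract above have multi-coefficient supports chosen by correlation, and the names `lmmse_fit` and `lmmse_apply` are hypothetical.

```python
def lmmse_fit(decoded, original):
    """Fit a single-tap affine LMMSE estimator of `original` from `decoded`.

    Returns (gain, mean_original, mean_decoded), with
    gain = cov(original, decoded) / var(decoded).
    """
    n = len(decoded)
    mean_d = sum(decoded) / n
    mean_o = sum(original) / n
    cov = sum((d - mean_d) * (o - mean_o) for d, o in zip(decoded, original)) / n
    var = sum((d - mean_d) ** 2 for d in decoded) / n
    if var == 0:
        raise ValueError("decoded signal has zero variance")
    return cov / var, mean_o, mean_d


def lmmse_apply(decoded, gain, mean_o, mean_d):
    """Apply the fitted estimator at the decoder side."""
    return [mean_o + gain * (d - mean_d) for d in decoded]


def mse(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Made-up coefficient values for illustration only.
original = [3.0, 5.2, 6.9, 9.1]
decoded = [1.0, 2.0, 3.0, 4.0]
gain, mean_o, mean_d = lmmse_fit(decoded, original)
enhanced = lmmse_apply(decoded, gain, mean_o, mean_d)
print(mse(decoded, original), mse(enhanced, original))
```

On the data used to fit it, this estimator cannot increase the mean squared error relative to the unfiltered decoded values, because the identity mapping is itself one of the affine candidates it minimizes over.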
ISBN: 9781728185514 (print)
Two-pass rate control (RC) schemes have proven useful for generating low-bitrate video-on-demand or streaming catalogs. Visually optimized encoding, particularly with latest-generation coding standards like Versatile Video Coding (VVC), is however still a subject of intensive study. This paper describes the two-pass RC method integrated into version 1 of VVenC, an open VVC encoding software. The RC design is based on a novel two-step rate-quantization parameter (R-QP) model to derive the second-pass coding parameters, and it uses the low-complexity XPSNR visual distortion measure to provide numerically as well as visually stable, perceptually R-D optimized encoding results. Random-access evaluation experiments confirm the improved objective and subjective performance of our RC solution.
ISBN: 9781728185514 (print)
Video chat is becoming more and more popular in daily life. However, providing high-quality video chat over limited bandwidth remains a key challenge. In this paper, going beyond state-of-the-art video compression systems, we propose an encoder-decoder joint enhancement algorithm for video chat. In particular, the sparse map of the original frame is extracted at the encoder side and signaled to the decoder, where it is used together with the sparse map of the decoded frame to obtain the boundary transformation map. The boundary transformation map thus represents the key difference between the original frame and the decoded frame and can be used to enhance the decoded frame. Experimental results show that the proposed algorithm brings clear subjective and objective quality improvements. At the same quality, the proposed algorithm achieves 35% bitrate savings compared to VVC.
ISBN: 0819456586 (print)
Media processing, such as real-time compression and decompression of video signals, is now expected to be the driving force in the evolution of media processors. In this paper, a hardware and software co-design approach is introduced for a 32-bit media processor, MediaDsp3201 (MD32), which is implemented in a TSMC 0.18 µm process, runs at 200 MHz, and can achieve 200 million multiply-accumulate (MAC) operations per second. In our design, we have merged RISC and DSP into one processor (RISC/DSP). Based on an analysis of the inherent characteristics of video processing algorithms, media enhancement instructions are added to the MD32 instruction set. The media extension instructions are physically realized in the processor core and improve video processing performance effectively with negligible additional hardware cost (2.7%). Given the high complexity of the operations behind the media instructions, a technique called scalable super pipeline is used to resolve the time-delay problem of the pipeline stages (mainly the EX stage). Simulation results show that our method reduces the instruction count for IDCT by more than 31% and 23% compared with the MMX and SSE implementations [5], respectively, and by 40% for MC compared with the MMX implementation.