Real-time multimedia applications often require efficient bitrate reduction. This is mainly done by requantization, usually in the DCT domain. This work introduces a theoretical rate-distortion analysis that allows straightforward selection of the quantization step needed to achieve a given bitrate. The analysis is based on the Laplace-like distribution of the DCT coefficients and on the structure of the quantizers commonly used in video and image coding. We show that the proposed transcoding design achieves significant compression at relatively low distortion while keeping the computational complexity very low, allowing for real-time implementation.
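The idea of choosing a quantization step from a Laplacian coefficient model can be illustrated with a minimal sketch. It is not the analysis from the abstract: it assumes an ideal Laplacian source with a hypothetical rate parameter `lam`, a plain uniform mid-tread quantizer without the dead zone used by real codecs, and an ideal entropy coder, so the "rate" is simply the entropy of the quantizer output. The function names are illustrative.

```python
import math


def laplacian_entropy_bits(lam: float, step: float) -> float:
    """Entropy (bits per coefficient) of a Laplacian source with rate
    parameter `lam` after uniform mid-tread quantization with `step`."""
    # Probability of the zero bin, which covers [-step/2, step/2].
    p0 = 1.0 - math.exp(-lam * step / 2.0)
    entropy = -p0 * math.log2(p0) if p0 > 0.0 else 0.0
    k = 1
    while True:
        # Probability of ONE of the two symmetric bins with index +k or -k.
        pk = 0.5 * math.exp(-lam * (k - 0.5) * step) * (1.0 - math.exp(-lam * step))
        if pk < 1e-15:  # the remaining tail is negligible
            break
        entropy -= 2.0 * pk * math.log2(pk)
        k += 1
    return entropy


def select_step(lam: float, target_bits: float,
                lo: float = 0.05, hi: float = 1000.0, iters: int = 60) -> float:
    """Bisection on the step size: entropy decreases as the step grows,
    so narrow [lo, hi] until the modeled rate meets the target."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if laplacian_entropy_bits(lam, mid) > target_bits:
            lo = mid  # rate still too high, use a coarser step
        else:
            hi = mid  # rate at or below target, try a finer step
    return hi
```

For example, `select_step(0.1, 2.0)` returns the step at which this idealized model predicts about 2 bits per coefficient. A practical transcoder would also need the distortion side of the trade-off and the actual quantizer structure of the codec.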
Video is expected to be the dominant application by traffic volume over mobile networks in the near future. Mobile network operators are deploying video optimization techniques to enhance the user experience and network utilization for video delivery. Video optimization is typically out-of-network and includes techniques such as transcoding, transrating, time shifting, and pacing that are implemented outside of mobile radio access and core networks. However, such techniques cannot easily exploit information about real-time cell congestion and radio conditions for the video terminals, because it is difficult to obtain such information from the radio access network. We propose a novel in-network video optimization technique, named Adaptive Guaranteed Bit Rate (AGBR), for HTTP-based Adaptive Streaming (HAS) video. This optimization technique is implemented at the base station and can thus exploit knowledge of the radio and congestion conditions. With only limited knowledge of the video stream properties or content, AGBR works by adjusting the throughput delivered to the different HAS clients, which in turn adjust the video quality they request. The optimization algorithm maximizes aggregate quality across multiple video flows served by the base station without starving data clients, thereby improving the overall quality of experience. We demonstrate through extensive analytical modeling and simulations that AGBR can adapt to changing network conditions to support more video sessions at an acceptable quality than alternative algorithms, while enforcing fairness among all users competing for resources within a sector. (c) 2013 Alcatel-Lucent.
Video transcoding is an important step to enable interoperability between different networks, terminals, applications, and services for video communication. This paper studies the influence on subjective quality of two typical video transcoding artifacts: frame rate reduction and drift error. Given a realistic dataset for a DVB-T to DVB-H transcoding scenario, the subjective quality before transcoding is compared with the quality after transcoding. To quantify the influence of both artifacts, a pixel-domain and an open-loop transcoding solution have been considered. Since the strength of both artifacts depends largely on the initial encoding parameters, additional experiments have been conducted to quantify the influence of the distance between I frames and the number of consecutive B frames on the subjective quality.
We present an efficient H.264/AVC block-size partitioning prediction method based on our proposed empirical rate and distortion models. Compared to other state-of-the-art transcoding methods, and for the same rate-distortion performance, our proposed algorithm has the lowest computational complexity, reaching a 73% reduction in variable block-size motion estimation for SDTV sequences and a 71% reduction for CIF sequences.
This work presents a novel implementation of a video transcoder that converts a video sequence encoded with the H.264/AVC standard to a temporally scalable H.264/SVC stream, using a pixel-domain heterogeneous architecture. The input H.264/AVC stream is fully decoded by the transcoder. Macroblock coding modes are extracted from the input stream and reused to encode the output stream. A set of new motion vectors is computed from the coded motion vectors of the input stream and mapped to either the hierarchical B-frame or the zero-delay referencing structure employed by H.264/SVC. These new motion vectors are further subjected to a 3-pixel refinement. As a result, a significant decrease in computational complexity is achieved while maintaining close-to-optimum compression efficiency.
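The refinement step can be pictured with a minimal sketch. It is an illustration under stated assumptions, not the transcoder described above: the "3 pixel refinement" is read here as an integer-pel search within plus or minus 3 pixels of the mapped motion vector, the matching cost is the sum of absolute differences (SAD), and frames are plain lists of rows. The names `sad` and `refine_mv` are hypothetical.

```python
def sad(cur, ref, bx, by, mvx, mvy, size):
    """Sum of absolute differences between the current block at (bx, by)
    and the reference block displaced by the motion vector (mvx, mvy)."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(cur[by + y][bx + x] - ref[by + y + mvy][bx + x + mvx])
    return total


def refine_mv(cur, ref, bx, by, mv, size=4, radius=3):
    """Search the (2*radius + 1)^2 integer offsets around the starting
    vector `mv` and return the (vector, cost) pair with the lowest SAD."""
    height, width = len(ref), len(ref[0])
    best_mv, best_cost = None, None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            mvx, mvy = mv[0] + dx, mv[1] + dy
            # Skip candidates whose reference block would leave the frame.
            if bx + mvx < 0 or by + mvy < 0:
                continue
            if bx + mvx + size > width or by + mvy + size > height:
                continue
            cost = sad(cur, ref, bx, by, mvx, mvy, size)
            if best_cost is None or cost < best_cost:
                best_mv, best_cost = (mvx, mvy), cost
    return best_mv, best_cost
```

Restricting the search to a small window around a vector inherited from the input stream is what keeps the cost far below a full motion search; the window size and cost measure here are placeholders.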
While the deployment of wireless channels for mobile applications has become widespread and is growing fast, transmitting data over these error-prone networks can be very unreliable and challenging due to time-varying interference and channel errors. Many error-resilient algorithms have been proposed to protect video data from channel errors. However, these algorithms often aim to achieve the optimal decoded video quality in terms of mean square error without any consideration for the visual quality. In this paper, we present a perceptually error-resilient method for video transcoding based on the attention-based intra refresh technique and the characteristics of the human visual system to enhance the perceptual performance of the transcoded video. Specifically, the foveated just noticeable distortion and visual attention models are employed to estimate the perceptual loss impact due to error propagation for allocating intra-refreshed macroblocks in the transcoded video. Experimental results show that the proposed method achieves much better performance than the existing methods in terms of both visual quality and the perceptual quality measure.
This paper reports on a system that delivers customized video content to mobile devices. Constructed from open source components, it can stream transcoded video to mobile devices in real time. In addition, it allows publishers to add content to their video streams on the fly. As a demonstration of this system's capabilities, an application was designed to deliver transcoded Adobe Flash content to Apple iPhones in real time with the insertion of randomly selected commercial content.
Efficient motion re-estimation schemes for H.264 B- and P-frame transcoding are proposed in this paper. The proposed methods compose new motion vectors for the outgoing video bitstream from the pre-coded bitstream by using maximum likelihood. The experimental results show that the proposed methods improve performance and efficiency compared to previous approaches.
Wyner-Ziv (WZ) to scalable video coding (SVC) transcoding can offer a suitable framework to support scalable video communications between low-cost devices. In addition, the video delivery provided by SVC covers the needs of a wide range of homogeneous networks and different devices. Despite the advantages of this video transcoding framework, the transcoder accumulates high complexity, which must be reduced to avoid excessive delays in communication. In this article, an approach for WZ to SVC transcoding is presented. The information generated during the first stage is reused during the second one, and as a consequence the time taken by the transcoding is reduced by around 77.77%, with a negligible rate-distortion penalty.
ISBN: 9781424459421 (print)
The recently developed video compression standard H.264/AVC surpasses the performance of previous video standards, such as MPEG-2, MPEG-4 (Part 2), and H.263, and is therefore expected to be selected as the video standard for most digital video applications. The widely distributed infrastructure, however, continues to use the previous standards. Heterogeneous video transcoding offers a key to resolving this problem. This paper proposes a new algorithm for H.264/AVC to MPEG-2 transcoding that uses motion vector clustering to reduce the computation time with no loss of quality. The clustering reduces the number of candidate motion vectors gathered during the H.264 decoding stage. These candidate motion vectors take into account the correlation between the direction and distance of the motion vectors in the variable-size blocks of H.264/AVC. The candidate motion vector with the least distortion is then selected in the MPEG-2 encoder, which can therefore use the best motion vector without carrying out computations for motion estimation. The experimental results show that the proposed method maintains a good level of video quality while reducing the computational complexity by 64%, on average, compared to a cascade transcoder.
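The cluster-then-select idea can be sketched as follows. This is a simplified stand-in, not the algorithm of the paper: it assumes a greedy clustering on plain Euclidean distance between vectors (the abstract mentions direction and distance correlation, which is not modeled here), uses the rounded cluster mean as the representative, and takes the distortion measure as a caller-supplied function. All names and the `threshold` value are hypothetical.

```python
import math
from typing import Callable, List, Tuple

MV = Tuple[int, int]


def cluster_mvs(mvs: List[MV], threshold: float = 1.5) -> List[MV]:
    """Greedily group motion vectors: a vector joins the first cluster
    whose current centroid lies within `threshold`, otherwise it starts
    a new cluster. Returns one rounded centroid per cluster."""
    clusters: List[List[MV]] = []
    for mv in mvs:
        for members in clusters:
            cx = sum(m[0] for m in members) / len(members)
            cy = sum(m[1] for m in members) / len(members)
            if math.hypot(mv[0] - cx, mv[1] - cy) <= threshold:
                members.append(mv)
                break
        else:
            clusters.append([mv])
    representatives = []
    for members in clusters:
        cx = sum(m[0] for m in members) / len(members)
        cy = sum(m[1] for m in members) / len(members)
        representatives.append((round(cx), round(cy)))
    return representatives


def select_mv(candidates: List[MV], distortion: Callable[[MV], float]) -> MV:
    """Pick the candidate with the least distortion, so no full motion
    search is needed in the target encoder."""
    return min(candidates, key=distortion)
```

With sub-block vectors such as `[(4, 4), (5, 4), (4, 5), (-8, 0), (-8, 1)]`, the clustering leaves two candidates, and the encoder only has to evaluate the distortion of those two instead of all five.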