ISBN: (print) 9781728180687
Previous research has shown that the decoding energy demand of several video codecs can be estimated accurately using bit stream feature-based models. Building on this, we show in this paper that visualization with the Decoding Energy Estimation Tool (DENESTO) can help to improve the understanding of the energy demand of the decoder.
ISBN: (print) 9781665475921
To improve coding efficiency beyond versatile video coding (VVC), we propose an extended geometric partitioning mode (GPM). GPM is a new inter prediction mode in VVC and is applied to the object boundary between a foreground and a background with different motions. Specifically, GPM partitions a rectangular coding block into two regions using one of 64 predefined types of straight lines, generates inter prediction samples for each partitioned region, and then blends them with a fixed boundary width to obtain the final prediction samples. However, the fixed boundary width of GPM is not always optimal for diverse video content. To solve this problem, the proposed method allows GPM to select among multiple boundary widths by block-wise signaling. Furthermore, the proposed method restricts the selectable boundary widths according to the short side of the block to reduce the encoding time spent on selecting the optimal width. Experimental results following the common test conditions in JVET showed an improvement in coding efficiency, with bitrate savings of 0.11% for camera-captured content and 3.20% for pure screen or video game content, compared with the VVC reference software.
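The blending step described above can be illustrated with a minimal sketch. It is not the VVC or the proposed implementation: the function name `blend_two_predictions`, the floating-point linear ramp, and the parameters `angle_deg`, `offset`, and `blend_width` are assumptions made for illustration, whereas the standard uses predefined integer weights and a fixed table of partition lines.

```python
import math

def blend_two_predictions(pred0, pred1, angle_deg, offset, blend_width):
    """Blend two prediction blocks across a straight partition line.

    Hypothetical sketch: the line has normal direction `angle_deg` and lies at
    signed distance `offset` from the block centre. Samples farther than
    `blend_width` from the line take one prediction unchanged; samples inside
    the band are mixed with a linear ramp. `blend_width` must be positive.
    """
    height, width = len(pred0), len(pred0[0])
    theta = math.radians(angle_deg)
    cx, cy = (width - 1) / 2, (height - 1) / 2
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            # Signed distance from the sample to the partition line.
            d = (x - cx) * math.cos(theta) + (y - cy) * math.sin(theta) - offset
            # Weight of pred0 rises linearly from 0 to 1 over [-blend_width, +blend_width].
            w0 = min(1.0, max(0.0, 0.5 + d / (2.0 * blend_width)))
            row.append(w0 * pred0[y][x] + (1.0 - w0) * pred1[y][x])
        out.append(row)
    return out

if __name__ == "__main__":
    p0 = [[100.0] * 8 for _ in range(8)]
    p1 = [[0.0] * 8 for _ in range(8)]
    for bw in (1.0, 4.0):  # a narrow and a wide blending band
        print(bw, blend_two_predictions(p0, p1, 0.0, 0.0, bw)[0])
```

Running the sketch with a small and a large `blend_width` shows the trade-off the abstract points at: a narrow band keeps a sharp edge between the two regions, while a wide band smooths the transition.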
ISBN: (print) 0819456586
To achieve consumer-level quality, media systems must process continuous streams of audio and video data while maintaining exacting tolerances on sampling rate, jitter, synchronization, and latency. While it is relatively straightforward to design fixed-function hardware implementations to satisfy worst-case conditions, there is a growing trend to utilize programmable multi-tasking solutions for media applications. The flexibility of these systems enables support for multiple current and future media formats, which can reduce design costs and time-to-market. This paper provides practical engineering solutions to achieve robust media processing on such systems, with specific attention given to power-constrained platforms. The techniques covered rely on the fundamental concepts of algorithm and software optimization, software/hardware partitioning, stream buffering, hierarchical prioritization, and system resource and power management. A novel enhancement that dynamically adjusts processor voltage and frequency based on buffer fullness to reduce system power consumption is examined in detail. These techniques are applied in a case study of a portable video player implementation based on a general-purpose processor running a non-real-time operating system, which achieves robust playback of synchronized H.264 video and MP3 audio from local storage and streaming over 802.11.
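The idea of adjusting processor frequency from buffer fullness can be sketched as a simple hysteresis controller. The abstract does not give the actual policy, so the function name `select_frequency_index`, the thresholds `low` and `high`, and the frequency table below are all hypothetical.

```python
def select_frequency_index(buffer_fullness, num_levels, current_index,
                           low=0.25, high=0.75):
    """Pick the next frequency level from decoded-frame buffer occupancy.

    Hypothetical sketch: `buffer_fullness` is the occupied fraction (0.0 to
    1.0) of the buffer between decoder and renderer. Index 0 is the slowest,
    lowest-power level. Between the two thresholds the level is left alone.
    """
    if buffer_fullness < low and current_index < num_levels - 1:
        return current_index + 1  # buffer is draining: decode faster
    if buffer_fullness > high and current_index > 0:
        return current_index - 1  # decoder is ahead: slow down to save power
    return current_index

if __name__ == "__main__":
    levels_mhz = [200, 400, 600]  # assumed operating points
    index = 1
    for fullness in (0.9, 0.8, 0.5, 0.2, 0.1):
        index = select_frequency_index(fullness, len(levels_mhz), index)
        print(f"fullness={fullness:.1f} -> {levels_mhz[index]} MHz")
```

The dead band between the two thresholds is a design choice in this sketch: it avoids switching the operating point on every small fluctuation of the buffer level.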
ISBN: (print) 0819459763
In video-on-demand (VOD) systems, an efficient video delivery scheme can greatly increase service capacity by reducing resource requirements. Most existing schemes are not suited to workloads whose characteristics change greatly over time. This paper proposes a workload-adaptive scheme that combines ideas from two efficient schemes: fast broadcasting and patching. The scheme adaptively allocates the number of channels according to the request rate to minimize the required bandwidth. We show how to change the number of channels seamlessly while guaranteeing that clients currently viewing a video do not experience any disruption. The scheme can provide zero-delay VOD service. Simulation results show that the scheme adapts well to a changing request rate and significantly improves VOD performance in terms of total server bandwidth requirement.
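For orientation, the classic fast broadcasting component can be sketched as follows. This covers only the textbook scheme, in which a video is cut into 2^k - 1 equal segments over k channels; it does not reproduce the adaptive channel allocation or the patching part of the proposed scheme, and all function names are illustrative.

```python
def fast_broadcast_schedule(num_channels):
    """Segments (1-based) that each channel repeats in a loop.

    Channel i (0-based) cycles through segments 2**i .. 2**(i + 1) - 1,
    so k channels cover 2**k - 1 equal-length segments.
    """
    return [list(range(2 ** i, 2 ** (i + 1))) for i in range(num_channels)]

def max_wait_seconds(video_seconds, num_channels):
    """Worst-case start-up delay: the duration of one segment."""
    return video_seconds / (2 ** num_channels - 1)

def channels_for_wait(video_seconds, target_wait_seconds):
    """Smallest channel count whose worst-case delay meets the target."""
    k = 1
    while max_wait_seconds(video_seconds, k) > target_wait_seconds:
        k += 1
    return k

if __name__ == "__main__":
    print(fast_broadcast_schedule(3))
    print(max_wait_seconds(7200, 4))
    print(channels_for_wait(7200, 60))
```

Because the worst-case delay shrinks exponentially with the channel count, adding or removing a single channel changes the bandwidth cost substantially, which is one reason to adapt the channel count to the request rate.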
ISBN: (print) 0819456586
This article attempts to identify some of the technology and research challenges that the digital media industry will face in the future. We first discuss several trends in the industry, such as the rapid growth of broadband Internet networks and the emergence of networking and media-capable devices in the home. Next, we present technical challenges that result from these trends, such as effective media interoperability in devices, and provide a brief overview of Windows Media, which is one of the technologies in the market attempting to address these challenges. Finally, given these trends and the state of the art, we argue that further research on data compression, encoder optimization, and multi-format transcoding can potentially make a significant technical and business impact in digital media. We also explore the reasons why research on related techniques such as wavelets or scalable video coding is having a relatively minor impact on today's practical digital media systems.
ISBN: (print) 0819456586
Online media server scheduling algorithms in distributed video-on-demand (VoD) systems are studied in this work. We first identify the failure rate and the server-side network bandwidth consumption as the two main cost factors in a distributed VoD service model. The proposed distributed server scheduler consists of two parts: the request migration scheme and the dynamic content update strategy. By improving the random early migration (REM) scheme, we propose a cost-aware REM (CAREM) scheme to reduce the network bandwidth consumed by the migration process. Furthermore, to accommodate changes in video popularity and/or client population, we use the server-video affinity to measure the potential server-side bandwidth cost of placing a specific video copy on a given server. The dynamic content update strategy uses the server-video affinity to reconfigure video copies on media servers. We conduct extensive simulations to evaluate the performance of the proposed algorithm. The results show that CAREM, together with the dynamic content update strategy, can improve system performance by reducing both the request failure rate and the server bandwidth consumption.
ISBN: (print) 0819459763
The extension of H.264/AVC hybrid video coding towards scalable video coding (SVC) using motion-compensated temporal filtering (MCTF) is presented. Utilizing the lifting approach to implement MCTF, the motion compensation features of H.264/AVC can be re-used for the MCTF prediction step and extended in a straightforward way for the MCTF update step. The MCTF extension of H.264/AVC is also incorporated into a video codec that provides SNR, spatial, and (similar to hybrid video coding) temporal scalability. The paper describes these techniques and presents experimental results that validate their efficiency. In addition, applications of SVC to video transmission and video surveillance are described.
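The prediction and update steps of lifting-based temporal filtering can be shown with a Haar sketch on a pair of frames. Motion compensation is deliberately left out (treated as identity), so this is only the lifting skeleton and not the H.264/AVC-based MCTF described in the paper; frames are flat lists of sample values and the function names are illustrative.

```python
def haar_lifting_forward(frame_a, frame_b):
    """Split two consecutive frames into a low-pass and a high-pass frame.

    Prediction step: high = b - a (frame a predicts frame b).
    Update step:     low  = a + high / 2 (the average of the two frames).
    In real MCTF both steps would operate on motion-compensated samples.
    """
    high = [b - a for a, b in zip(frame_a, frame_b)]
    low = [a + h / 2 for a, h in zip(frame_a, high)]
    return low, high

def haar_lifting_inverse(low, high):
    """Undo the update step, then the prediction step, in reverse order."""
    frame_a = [l - h / 2 for l, h in zip(low, high)]
    frame_b = [h + a for a, h in zip(frame_a, high)]
    return frame_a, frame_b

if __name__ == "__main__":
    low, high = haar_lifting_forward([10.0, 20.0], [14.0, 20.0])
    print(low, high)
    print(haar_lifting_inverse(low, high))
```

The inverse simply replays the two steps backwards with the signs flipped, which is why a lifting structure stays invertible regardless of which predictor is plugged into the prediction step.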
ISBN: (print) 0819456586
In the scalable video coder MC-EZBC (1), scalability for motion vectors was not provided, which greatly impacts its performance when scaling down to very low bit rates and resolutions. Here we enhance MC-EZBC with scalable motion vector coding using the Context-based Adaptive Binary Arithmetic Coder (CABAC) (2). Both a layered structure for motion vector coding and an alphabet general partition (AGP) (3) of the motion vector symbols are employed for SNR and resolution scalability of the motion vector bitstream. With these two new features and a careful arrangement of the motion vector bitstream output from the existing MC-EZBC, we obtain temporal, SNR, and resolution scalability for motion vectors. This significantly improves both visual and objective performance at low bit rates and resolutions, with only a slight PSNR loss (about 0.05 dB) but no detectable visual loss at high bit rates.
A novel method to detect smoke and/or flame by processing the video data generated by an ordinary camera monitoring a scene is proposed. It is assumed the camera is stationary. Since the smoke is semi-transparent, edg...
ISBN: (print) 9781728180687
360-degree videos have drawn increasing attention from both industry and research communities with the popularity of virtual reality applications. A 360-degree video is originally represented as a sphere and provides an omnidirectional view. Equirectangular projection is widely applied to convert the 360-degree video from the 3D sphere to a 2D plane for the purposes of compression and storage. However, the content of the projected video is distorted, which poses challenges for conventional video coding methods. In this paper, a novel intra prediction algorithm is proposed for 360-degree video coding. The proposed algorithm handles the distortion by adapting the intra prediction process to the spherical domain. The proposed algorithm is implemented in the High Efficiency Video Coding (HEVC) test model HM16.16. Experimental results show that the proposed algorithm achieves a 0.2% BD-rate reduction on average for 360-degree video coding.
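The equirectangular mapping and the distortion it introduces can be made concrete with a short sketch. The coordinate convention (longitude in [-pi, pi), latitude in [-pi/2, pi/2], origin at the top-left of the picture) and the function names are assumptions for illustration; the proposed intra prediction algorithm itself is not reproduced here.

```python
import math

def sphere_to_erp(lon, lat, width, height):
    """Map a point on the sphere to continuous equirectangular coordinates.

    Assumed convention: longitude 0 maps to the horizontal centre and
    latitude +pi/2 (north pole) maps to the top row.
    """
    u = (lon / (2 * math.pi) + 0.5) * width
    v = (0.5 - lat / math.pi) * height
    return u, v

def horizontal_stretch(lat):
    """Factor by which content at latitude `lat` is widened in the picture.

    Every row of the projection has the same width, but a circle of latitude
    on the sphere shrinks with cos(lat), so rows are stretched by 1 / cos(lat).
    Undefined exactly at the poles.
    """
    return 1.0 / math.cos(lat)

if __name__ == "__main__":
    print(sphere_to_erp(0.0, 0.0, 4096, 2048))
    for deg in (0, 30, 60, 80):
        print(deg, round(horizontal_stretch(math.radians(deg)), 3))
```

The stretch factor grows quickly towards the poles, which gives a sense of why prediction tools designed for planar video fit the top and bottom of an equirectangular picture poorly.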