Video can be streamed live with different applications (e.g. YouTube Live, Periscope). Typically, the video content is adapted for end users based on receiving client's capabilities, and network bandwidth. The ada...
详细信息
Video can be streamed live with different applications (e.g. YouTube Live, Periscope). Typically, the video content is adapted for end users based on receiving client's capabilities, and network bandwidth. The adaptation is realized with different video representations, which are created by transcoding the original video content. When video is streamed live, transcoding has to be completed within real time constraints, which is a computationally demanding process. Particularly, live transcoding should be enabled efficiently by a content distributor to minimize resource provisioning costs. The contribution of this paper is an architecture for predicting live video transcoding performance on a Docker-based platform. Particularly, cloud resource management for live video transcoding has been focused on. A model was trained based on measurements in different transcoding configurations. Offline evaluation results indicate that live transcoding speed or CPU usage can be predicted with 3-8 % accuracy. When video is transcoded on virtual machines based on predictions in a prototype system (live), live transcoding speed prediction accuracy is within a similar range as the offline performance, but worse for CPU usage prediction (5-15%). In most cases the specified range for transcoding speed and CPU usage can be achieved at least with a precision of 76%.
360° Virtual Reality (VR) services with resolutions of 8K and beyond are a challenging task due to limits of both decoding complexity and constrained public internet bandwidth of consumer devices. Also, general s...
详细信息
360° Virtual Reality (VR) services with resolutions of 8K and beyond are a challenging task due to limits of both decoding complexity and constrained public internet bandwidth of consumer devices. Also, general streaming servers cannot service these large-resolution video streams to many clients because of bandwidth limitation. In this paper, we propose a distributed video transcoding system for achieving viewport adaptive streaming, which is known as tiled streaming, of 8K 360° VR video. The proposed system consists of many motion-constrained High Efficiency Video Coding (HEVC) encoders, a Hadoop/Spark-based distributed computing platform, light-weight bitstream stitcher, and dual HEVC decoders. Experimental results show that 8K 360° videos which are split by 8×8 tiles, respectively, can be encoded at 99 fps, and 4×4 tiles are stitched at 9,585 fps, on average.
As an emerging video codec standard for multi-view video and corresponding depth maps, three-dimensional extension of High Efficiency Video Coding (3D-HEVC) has broad application prospect. In order to make the bit-str...
详细信息
ISBN:
(纸本)9781538649923
As an emerging video codec standard for multi-view video and corresponding depth maps, three-dimensional extension of High Efficiency Video Coding (3D-HEVC) has broad application prospect. In order to make the bit-stream generated by 3D-HEVC suitable for emerging terminal devices, reducing computational complexity is necessary for downsizing resolution transcoding. This article aims to speed up 3D-HEVC transcoding. This paper proposes a fast transcoding algorithm on clipping unnecessary traversal depth level for each CTU. Experimental results show that almost 26% of computation time can be saved for the proposed transcoding algorithm with the certain increase of BDBR.
This paper introduces a robust data embedding method under transcoding and presents several possible hypermedia applications based on the data embedding method. When a JPEG image is transmitted or uploaded through ins...
详细信息
This paper introduces a robust data embedding method under transcoding and presents several possible hypermedia applications based on the data embedding method. When a JPEG image is transmitted or uploaded through instant messaging services (IMS) or social networks services (SNS), transcoding is normally applied to the image. With transcoding, all metadata in the image is removed and sometimes pixel data itself can be modified. The data embedding method in this paper can be robust under transcoding of some IMS and SNS. Additionally, this paper introduces three applications, which are able to be operated on IMS and SNS, with the data embedding method on the JPEG image file; Image Scramble, Photo Message, and Audible Message.
Screen Content Coding (SCC) is an extension of the High-Efficiency Video Coding (HEVC) for encoding screen content videos. However, there are many legacy screen content videos already encoded by HEVC. To efficiently m...
详细信息
Screen Content Coding (SCC) is an extension of the High-Efficiency Video Coding (HEVC) for encoding screen content videos. However, there are many legacy screen content videos already encoded by HEVC. To efficiently migrate screen content videos from the existing HEVC to the emerging SCC, a machine learning based fast transcoding algorithm is proposed by using decision trees in this paper. To speed up the transcoding process, the intermediate data from both the HEVC decoder side and the SCC encoder side are jointly analyzed. Then the optimal coding unit (CU) sizes are mapped from HEVC to SCC while the mode candidates are adaptively checked according to the decision tree outcomes in the re-encoding process. Experimental results show that an average of 48.20% re-encoding time reduction is achieved with only 1.47% Bjontegaard delta bitrate loss using All Intra (AI) configuration.
Media transcoding is an important job in the streaming media processing. Huge amounts of media transcoding, a kind of data and computationally intensive work, are performed as a distributed way. In this paper, we mode...
详细信息
ISBN:
(纸本)9781509044993
Media transcoding is an important job in the streaming media processing. Huge amounts of media transcoding, a kind of data and computationally intensive work, are performed as a distributed way. In this paper, we model a distributed media transcoding system which is based on Hadoop and FFmpeg. To minimize the total completion time (i.e., makespan) and fully utilize resources, we propose a method to configure the container resources in different clusters based on the requirements of a sample split. The proposed method decreases the makespan by approximately 25% compared to Hadoop FIFO scheduler and could be applied to both virtual clusters and physical-machine clusters.
Dynamic adaptive streaming over HTTP (DASH) requires a video server to transcode each original video object to all the possible bit-rate versions, resulting in high CPU power consumption. To address this, we propose a...
详细信息
ISBN:
(纸本)9781538629376
Dynamic adaptive streaming over HTTP (DASH) requires a video server to transcode each original video object to all the possible bit-rate versions, resulting in high CPU power consumption. To address this, we propose a new scheme that balances quality-of-experience (QoE) against transcoding energy. We start by introducing the concept of transcoding gain to express QoE achieved as a result of transcoding operation, by taking version popularity and QoE into account. We then propose two algorithms that determine which versions should be transcoded with the aim of maximizing the overall QoE. Experimental results show that the proposed scheme can effectively limit the amount of transcoding energy consumption(1).
Given the constantly growing demand for live streaming services, live transcoding has become compulsory and very challenging. So far, investigations have been confined to satisfy a huge number of users for ensuring th...
详细信息
ISBN:
(纸本)9781509050192
Given the constantly growing demand for live streaming services, live transcoding has become compulsory and very challenging. So far, investigations have been confined to satisfy a huge number of users for ensuring the Quality of Experience (QoE). The aim of this paper is to propose a framework architecture following ESTI-NFV (Network Function Virtualization) model [1], whereby the transcoding and streaming Virtual Network Functions (VNFs) would be running on top of multiple cloud domains. By respecting ESTI-NFV model, we ensure the flexibility of our virtual delivery platform that scales up/down and in/out relative to the changing demands of the end-users in order to reduce cost. For this purpose, this paper presents a new framework for managing the virtual live transcoding and streaming VNFs on top of multiple cloud domains for ensuring the QoE while reducing the cost. In order to develop such a framework, we have done a set of experimental benchmarking of transcoding and streaming VNFs using variant flavors (i.e., in terms of CPU and Memory resources). The obtained results will be explored later for developing an intelligent algorithm that will be integrated with the proposed framework in managing different transcoding and streaming VNFs in an efficient manner.
In 2011, six academics gathered over 90,000 authentic text messages (SMS) in French from the general public, in compliance with French law (http://***, 'anckhurst et al., 2013). The SMS 'donors' were also ...
详细信息
In 2011, six academics gathered over 90,000 authentic text messages (SMS) in French from the general public, in compliance with French law (http://***, 'anckhurst et al., 2013). The SMS 'donors' were also invited to fill out a sociolinguistic questionnaire (see Figure A1, Moise, 2013, Panckhurst and Moi se, 2014). The 'sud4science' project is part of a vast international initiative, entitled 'sms4science' (http://***/, Fairon et al., 2006, Cougnon and Fairon, 2014, Cougnon, 2015), which aims to build a worldwide database and analyse authentic text messages in different languages. After the 'sud4science' SMS data collection, a pre-processing phase of checking and eliminating any spurious information and a three-step semi-automatic anonymization phase were conducted (Accorsi et al., 2014, Patel et al., 2013). Two extracts were transcoded into standardized French (1,000 SMS) and annotated (100 SMS). The finalized digital resource of 88,000 anonymized French text messages, the '88milSMS' corpus, the extracts and the sociolinguistic questionnaire data are currently available for all to download, from the Huma-Num web service (http://***, Panckhurst et al., 2014). The 88milSMS corpus has also recently become available via a Creative Commons Attribution 4.0 International licence on the 'Ortolang' platform (https://***/11403/comere/cmr-88milsms/cmr-88milsms-tei-v1, Panckhurst et al., in Chanier (ed), 2016). In this paper, first the authors briefly situate the project and describe the anonymization process. Then, they focus on why they decided to exclude full 'transcoding' and linguistic annotation in the first version of the final corpus.
For H.264 to high efficiency video coding (HEVC) transcoding, this paper proposes a hierarchical Long Short-Term Memory (LSTM) method to predict coding unit (CU) splitting. Specifically, we first analyze the correlati...
详细信息
ISBN:
(纸本)9781538604625
For H.264 to high efficiency video coding (HEVC) transcoding, this paper proposes a hierarchical Long Short-Term Memory (LSTM) method to predict coding unit (CU) splitting. Specifically, we first analyze the correlation between CU splitting patterns and H.264 features. Upon our analysis, we further propose a hierarchical LSTM architecture for predicting CU splitting of HEVC, with regard to the explored H.264 features. The features of H.264, including residual, macroblock (MB) partition and bit allocation, are employed as the input to our LSTM method. Experimental results demonstrate that the proposed method outperforms the state-of-the-art H.264 to HEVC transcoding methods, in terms of both complexity reduction and PSNR performance.
暂无评论