In this paper, a fast CU partition algorithm for H.264 to HEVC transcoding based on Fisher Discriminant Analysis is proposed. Using the classification model built with the extracted features from H.264 bitstream, the ...
详细信息
ISBN:
(纸本)9781509053162
In this paper, a fast CU partition algorithm for H.264 to HEVC transcoding based on Fisher Discriminant Analysis is proposed. Using the classification model built with the extracted features from H.264 bitstream, the CU splitting of depth 0 and 1 can be directly determined without rate distortion optimization process, and a simple mode mapping method is used to determine CU splitting in depth 2. To ensure the accuracy of classification model, an online learning strategy is designed to update the model thresholds and weight vectors in time. The experimental results show that the proposed algorithm obtains a speed-up to 1.90x on average with 2.75% BD-rate loss under the low-delay P configuration.
The latest High Efficiency Video Coding (HEVC) standard achieves a significant compression efficiency improvement over the H.264/Advanced Video Coding (AVC) standard, but with a much higher computational complexity. I...
详细信息
The latest High Efficiency Video Coding (HEVC) standard achieves a significant compression efficiency improvement over the H.264/Advanced Video Coding (AVC) standard, but with a much higher computational complexity. In this paper, we propose a novel framework for software-based H.264/AVC to HEVC transcoding, integrated with tools such as wavefront parallel processing that are useful for achieving higher levels of parallelism on multicore processors and distributed systems. By utilizing information extracted from the input H.264/AVC bitstream, the transcoding process can be greatly accelerated with a visual quality loss that is modest for many applications. Based on the HEVC HM 14.0 reference software and using standard HEVC test bitstreams, the proposed transcoder can achieve up to 60x speedup on a Quad Core 8-thread server over decoding-re-encoding based on FFMPEG and the HM software with a BD-rate loss of 15%-20%. By implementing a group of picture-level task distribution on a distributed system with nine processing units, the proposed software transcoder can achieve a speed for transcoding 720 p at 30 Hz in real time.
The arbitrary ratio transcoding usually introduces coding block grid misalignment, which results in difficulties to utilize the decoding information in coding blocks of source videos during the encoding phase. To redu...
详细信息
ISBN:
(纸本)9781509053179
The arbitrary ratio transcoding usually introduces coding block grid misalignment, which results in difficulties to utilize the decoding information in coding blocks of source videos during the encoding phase. To reduce the large computation load of HEVC downscale transcoding in such situation, we propose an efficient transcoding method that we refer the decoded coding unit (CU) partitioning to accelerate partition decision, which takes up the most complexity in the encoding phase as well as the whole transcoding. First, we predict CU depth in pixel level according to decoded partitioning of source videos. Then, we propose adaptive rules to determine CU partitioning of target videos based on the prediction, so that we can make early CU splitting or pruning decision without complex recursive search. Experiments demonstrate that the proposed method achieves about 74% time reduction on average with acceptable BD-rate increase in the encoding phase compared to the encoder in reference software HM13.0.
According to the Interpretive Approach, interpreting is understaning the sense of original language and reproducing the sense in another language. The target of interpreting is sense, rather than words or sentences;an...
详细信息
According to the Interpretive Approach, interpreting is understaning the sense of original language and reproducing the sense in another language. The target of interpreting is sense, rather than words or sentences;and process is transcoding, combin-ing decoding and recoding together. Thus there are two types of triangle system in interpreting:source language, sense, plus target language;and the speaker, interpreter plus listener. The following passage is an analysis of the feasibility and convenience of the transcoding theory and also its application in interpreting.
In this paper, we pose a new problem of video enhancement transcoding, which converts the compressed dark video into compressed normal-lighting one. Distinct statistics of dark and normal videos result in quite differ...
详细信息
ISBN:
(纸本)9781467399623
In this paper, we pose a new problem of video enhancement transcoding, which converts the compressed dark video into compressed normal-lighting one. Distinct statistics of dark and normal videos result in quite different coding modes, which thus enforces latent constraints on mode conversion during transcoding. Following this idea, we propose a fast mode decision algorithm to speed up computation while maintaining rate-distortion (RD) performance. Experimental results show that our method saves the computational complexity nearly 70%, without noticeable RD loss in comparison with the cascaded decoder-encoder approach.
The requirement for real-time video transcoding systems has significantly increased due to the easy and widely available access to high resolution video streams and large-scale applications in recent years. In this pa...
详细信息
ISBN:
(纸本)9781509015719
The requirement for real-time video transcoding systems has significantly increased due to the easy and widely available access to high resolution video streams and large-scale applications in recent years. In this paper, we propose a real-time distributed video transcoding system working on heterogeneous environment to tackle the high requirement of such applications. It allows multiple computers to be networked together to execute the same transcoding task so that the system can process more video streams in real time. Most importantly, the proposed method emphasizes on the velocity of the video data which involves the continuous input video stream and outcomes of transcoded video output stream that is accessible on-the-fly in contrast to the batch-oriented approach such as the MapReduce framework, where output latency can be significant. The performance of the proposed system can be further improved by using a more intelligent scheduler for video frames distribution.
Driven by the advances in personal computing devices and the prevalence of broadband network and wireless mobile network accesses, Crowdsourced Live Streaming (CLS) platforms have emerged in recent years, through whic...
详细信息
Driven by the advances in personal computing devices and the prevalence of broadband network and wireless mobile network accesses, Crowdsourced Live Streaming (CLS) platforms have emerged in recent years, through which numerous broadcasters lively stream their video content, e.g., live events or online game scenes, to fellow viewers. Compared to professional video producers and broadcasters, these new generation broadcasters are highly heterogenous in terms of the network/system configurations and therefore the generated video quality, which calls for massive encoding and transcoding in order to unify the video sources and serve multiple quality versions to viewers with different configurations. On the other hand, with the rapid evolution in the hardware industry, high performance processors (e.g., Intel Core i7-4790K CPU) become mainstream in personal computer market. More end devices can easily transcode high quality videos in realtime. We witness huge computational resource among the massive fellow viewers that could potentially be used for transcoding. In this paper, inspired by fog computing, we propose Crowd-transcoding, a novel framework for CLS systems that offloads the transcoding assignment to the massive viewers. We identify that the key challenges in Crowdtranscoding are to detect qualified stable viewers and to properly assign them to the source channels. We put forward Viewer Crowdsourcing Transcode Scheduler (VCTS) to smartly schedule the workload assignment. Our solution has been evaluated under diverse viewer/channel conditions as well as different parameter settings. The trace-driven simulation confirms the superiority of CrowdTranscoder, while our PlanetLab-based and real world end-viewer experiments show the practical performance of our approach, which also give hint to the further enhancement.
Adaptive bitrate streaming (ABR) has been widely adopted to support video streaming services over heterogeneous devices and varying network conditions. With ABR, each video content is transcoded into multiple represen...
详细信息
ISBN:
(纸本)9781467399562
Adaptive bitrate streaming (ABR) has been widely adopted to support video streaming services over heterogeneous devices and varying network conditions. With ABR, each video content is transcoded into multiple representations in different bitrates and resolutions. However, video transcoding is computing intensive, which requires the transcoding service providers to deploy a large number of servers for transcoding the video contents published by the content producers. As such, a natural question for the transcoding service provider is how to provision the computing resource for transcoding the video contents while maximizing service profit. To address this problem, we design a cloud video transcoding system by taking the advantage of cloud computing technology to elastically allocate computing resource. We propose a method for jointly considering the task scheduling and resource provisioning problem in two timescales, and formulate the service profit maximization as a two-timescale stochastic optimization problem. We derive some approximate policies for the task scheduling and resource provisioning. Based on our proposed methods, we implement our open source cloud video transcoding system Morph and evaluate its performance in a real environment. The experiment results demonstrate that our proposed method can reduce the resource consumption and achieve a higher profit compared with the baseline schemes.
Software Defined Networking (SDN) and Network Function Virtualization (NFV) provide an appealing 5G vision of how content distribution networks can be enhanced with in-service transcoding and adaptation processes. Thi...
详细信息
ISBN:
(纸本)9781467384100
Software Defined Networking (SDN) and Network Function Virtualization (NFV) provide an appealing 5G vision of how content distribution networks can be enhanced with in-service transcoding and adaptation processes. This paper describes a proof-of-concept experimental implementation of SDN/NFV-enabled network domain towards providing an agile video transcoding process for maintaining the QoE level of a media service when network congestion occurs. The paper shows how SDN/NFV techniques are facilitating the dynamic deployment of a transcoder VNF upon a triggering event of a congested bottleneck at the access link of the end-user. Video quality measurements demonstrate the efficiency of the VNF deployment and the seamless applicability of the transcoding process without experiencing any service interruption.
There is an inevitable requirement of transcoding from MPEG-2 to H.264/AVC in many video applications. Due to the use of variable block sizes and rate-distortion optimization techniques in H.264, there is extremely hi...
详细信息
There is an inevitable requirement of transcoding from MPEG-2 to H.264/AVC in many video applications. Due to the use of variable block sizes and rate-distortion optimization techniques in H.264, there is extremely high computational complexity in the reference cascaded transcoder. To reduce the complexity, this paper presents a fast inter mode decision algorithm for P frames in MPEG-2 to H.264/AVC transcoding. Firstly, with the statistical analysis of the correlation between the MB's coding mode in MPEG-2 and the corresponding mode in H.264/AVC, the proposed algorithm directly determines the candidate modes for some special MBs based on the mode mapping rules. Then for other MBs, machine learning tools are used to exploit the correlation between the residual MB information in MPEG-2 and the coding mode in H.264/AVC, and the decision tree is derived to determine the candidate modes. The proposed algorithm efficiently reduces the number of candidate modes in H.264/AVC encoder. Experimental results show that the proposed algorithm can achieve 60 % of transcoding time saving on average with the less PSNR degradation and bitrate increase, comparing with the reference cascaded transcoder.
暂无评论