This paper proposes an efficient inter/intra mode decision for H.264/AVC frame-skipping transcoding. In the proposed algorithm, all information, including modes and motion vectors, is reused when the selected best frame is not skipped, while an efficient mode decision is employed when the best frame is dropped. The experimental results show that the proposed H.264 frame-skipping transcoding algorithm saves an average of 70% of computation time compared with a full decoding/encoding procedure. The degradation in rate-distortion performance is fairly small.
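The reuse rule in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' algorithm: `Macroblock`, `choose_mode`, the candidate mode list, and the `cost` callback are hypothetical names, "selected best frame" is read here as the reference frame chosen by the original encoder, and the fallback search stands in for whatever reduced mode decision the paper actually uses.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Set, Tuple


@dataclass(frozen=True)
class Macroblock:
    mode: str              # coding mode decoded from the incoming bitstream
    mv: Tuple[int, int]    # decoded motion vector
    ref_frame: int         # index of the reference frame the encoder selected


def choose_mode(
    mb: Macroblock,
    dropped: Set[int],
    candidates: Sequence[str],
    cost: Callable[[Macroblock, str], float],
) -> Tuple[str, bool]:
    """Return (mode, reused). Hypothetical helper, not from the paper."""
    if mb.ref_frame not in dropped:
        # Reference frame survives frame skipping: reuse the decoded decision.
        return mb.mode, True
    # Reference frame was dropped: fall back to a (placeholder) reduced search.
    return min(candidates, key=lambda m: cost(mb, m)), False
```

The second element of the returned tuple only records whether the decoded decision was reused, which makes it easy to count how often the cheap path is taken.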
Receiving Internet streaming services on various mobile devices is becoming increasingly popular, and cloud platforms are gradually being employed to deliver streaming services to mobile devices. While a number of studies have been conducted at the client side to understand and characterize Internet mobile streaming delivery, little is known about the server side, particularly for recent cloud-based Internet mobile streaming delivery. In this work, we investigate the Internet mobile streaming service at the server side. For this purpose, we have collected a 4-month server-side log on the cloud (with 1,002 TB of delivered video traffic) from a top Internet mobile streaming service provider serving mobile users worldwide. Through trace analysis, we find that 1) a major challenge for providing Internet mobile streaming services stems from the heterogeneity of mobile device hardware and software. In this workload, we find over 3,400 different hardware models with more than 100 different screen resolutions, running 14 different mobile operating systems, three audio codecs, and four video codecs. 2) To deal with the device heterogeneity, CPU-intensive transcoding is used on the cloud to customize the video to the appropriate versions at runtime for different devices. A video clip can be transcoded into more than 40 different versions to serve requests from different devices. 3) Compared to videos in traditional Internet streaming, mobile streaming videos are typically much smaller (a median of 1.68 MBytes) and shorter (a median of 2.7 minutes). Furthermore, daily mobile user accesses are more skewed, following a Zipf-like distribution, but users' interests also shift quickly. Considering the huge demand for CPU cycles for online transcoding, we further examine server-side caching to reduce the total CPU cycle demand on the cloud. We show that a policy considering different versions of a video altogether outperforms other intuitive ones when the ca
ISBN: (print) 9781424459421; 9781424459438
As a result of improvements in wireless communication technologies, multimedia streaming services can be provided to mobile clients such as PDAs and cellular phones. Because mobile devices have low computing power and operate on low network bandwidth, transcoding technology is needed to adapt the original streaming media for mobile environments. To provide quality of service, the originally encoded media must be transcoded within a limited time and transmitted to clients without interruption or jitter. In particular, when a large number of mobile clients demand streaming services, the load distribution method used among distributed transcoding servers has a tremendous impact on the total number of streams that can be served with quality of service. In this paper, a new load distribution method is proposed based on transcoding time estimation using the transcoding servers' information, movie information, and the target transcoding bit rate. In experiments, the proposed method achieves the best performance scalability as the number of transcoding servers increases.
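The idea of distributing load by estimated transcoding time can be sketched as a greedy "earliest estimated finish" assignment. This is a simplified illustration and not the paper's method: the `Server` type, the `speed` field, and the linear toy cost model in `estimate_seconds` are assumptions made here, since the abstract does not give the actual estimation formula.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Server:
    name: str
    speed: float                # hypothetical relative processing speed
    busy_seconds: float = 0.0   # estimated work already queued on this server


def estimate_seconds(server: Server, duration_s: float, target_kbps: float) -> float:
    # Toy model (assumption): work grows with movie duration and target bit rate
    # and shrinks with server speed. The paper's real estimator is not given here.
    return duration_s * target_kbps / (1000.0 * server.speed)


def assign(servers: List[Server], duration_s: float, target_kbps: float) -> Server:
    # Pick the server whose queue would finish this job earliest.
    def finish(s: Server) -> float:
        return s.busy_seconds + estimate_seconds(s, duration_s, target_kbps)

    best = min(servers, key=finish)
    best.busy_seconds = finish(best)
    return best
```

Because the estimate includes the work already queued, a fast server stops attracting jobs once its backlog outweighs its speed advantage.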
Scalable video coding is a recent extension of the advanced video coding H.264/AVC standard developed jointly by ISO/IEC and ITU-T, which allows the bitstream to be adapted easily by dropping parts of it called layers. This adaptation makes it possible for a single bitstream to meet the requirements for reliable delivery of video to diverse clients over heterogeneous networks using temporal, spatial, or quality scalability, combined or separately. Since the scalable video coding design requires scalability to be provided at the encoder side, existing content cannot benefit from it. Efficient techniques for converting content without scalability to a scalable format are therefore desirable. In this paper, an approach for temporal scalability transcoding from H.264/AVC to scalable video coding in the baseline and main profiles is presented, and the impact of the GOP size is analyzed. Independently of the GOP size chosen, time savings of around 63% for the baseline profile and 60% for the main profile are achieved while maintaining coding efficiency.
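To make "dropping layers" concrete for temporal scalability, the sketch below assigns a temporal layer to each frame and then extracts a lower frame rate. It assumes a dyadic hierarchical prediction structure with a power-of-two GOP size, which is a common arrangement but is not stated in the abstract; `temporal_layer` and `extract` are hypothetical helper names.

```python
from typing import Iterable, List


def temporal_layer(frame_idx: int, gop_size: int) -> int:
    """Temporal layer of a frame in a dyadic hierarchy (assumed structure)."""
    if gop_size < 1 or gop_size & (gop_size - 1):
        raise ValueError("gop_size must be a power of two")
    levels = gop_size.bit_length() - 1          # log2(gop_size)
    pos = frame_idx % gop_size
    if pos == 0:
        return 0                                # key frames form the base layer
    trailing_zeros = (pos & -pos).bit_length() - 1
    return levels - trailing_zeros


def extract(frame_indices: Iterable[int], gop_size: int, max_layer: int) -> List[int]:
    # Dropping every frame above max_layer halves the frame rate per layer removed.
    return [i for i in frame_indices if temporal_layer(i, gop_size) <= max_layer]
```

With a GOP size of 8, frames 0 to 8 fall into layers `[0, 3, 2, 3, 1, 3, 2, 3, 0]`, so keeping layers 0 and 1 leaves frames 0, 4, and 8.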
Owing to recent developments in mobile network technologies, users can enjoy streaming media services anytime, anywhere using personal mobile communication devices. Wireless mobile streaming media services, however, suffer from problems such as delay, congestion, and crosstalk due to narrow network bandwidth and limited resources. These problems degrade not only Quality of Service (QoS), but also the responsiveness and reliability of the streaming media service. To solve these problems, this paper proposes a Fuzzy Similarity-Based Transcoding Proxy (FSTP) mechanism. The proposed FSTP mechanism analyzes the fuzzy similarity of partitioned segment versions of media objects to construct a Fuzzy Similarity-Based Transcoding Graph (FSTG). The constructed FSTG determines the transcoding of the partitioned media object segment versions. The determined transcoding improves the Delay Saving Ratio (DSR), Cache Hit Precision Ratio (CHPR), and Cache Hit Recall Ratio (CHRR). The more relevant media object segment versions, as determined by the fuzzy similarity, are therefore transcoded and cached first to reduce the start delay time and network traffic. The proposed mechanism is simulated to evaluate various performance parameters. The simulation results show that the proposed mechanism achieves improved performance in DSR, CHPR, and CHRR compared with other existing mechanisms.
ISBN: (print) 9781479923410
The emerging video coding standard, HEVC, was developed to replace the current standard, H.264/AVC. However, to promote interoperability with existing systems that use H.264/AVC, transcoding from H.264/AVC to HEVC is highly needed. This paper presents a transcoding solution that uses machine learning techniques to map H.264/AVC macroblocks into HEVC coding units (CUs). Two alternatives for building the machine learning model are evaluated. The first uses static training, where the model is built offline and used to transcode any video sequence. The other uses dynamic training, with two well-defined stages: a training stage and a transcoding stage. In the training stage, full re-encoding is performed while the H.264/AVC and HEVC information is gathered. This information is then used to build a model, which is used in the transcoding stage to classify the HEVC CU partitioning. Both solutions are tested with well-known video sequences and evaluated in terms of rate-distortion (RD) performance and complexity. The proposed method is on average 2.26 times faster than the trivial transcoder using fast motion estimation, while yielding an RD loss of only 3.6% in terms of bitrate.
ISBN: (print) 9781467355506
In the context of mobile Web conferencing, slide documents are generally transcoded into JPEG format and wrapped into a Web page prior to delivery. Given the diversity of mobile devices and their networks, dynamically identifying the optimal transcoding parameters is very challenging, as the number of transcoding parameter combinations can be very high. Current solutions use the resolution of the target mobile device and a fixed quality factor as transcoding parameters. However, this technique allows no control over the resulting file size, which, if too large, might increase the delivery time and negatively affect the user experience. Another solution (content selection), which leads to better quality, consists of creating several versions and selecting the best one at delivery time. However, such a solution is computationally expensive. In this paper, we propose a prediction-based framework which computes near-optimal transcoding parameters dynamically with far fewer computations. We propose five methods based on this framework. The first predicts near-optimal transcoding parameters, while the others improve their accuracy. On the set of documents tested, two of the proposed methods reach optimality 14% and 30% of the time, respectively. Moreover, the average deviation from optimality for the proposed methods varies from 6% to 3%, with a complexity varying from 1 to 5 transcoding operations.
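The general shape of prediction-based parameter selection can be sketched as: predict file size and quality for each candidate parameter pair, discard pairs that exceed the size budget, and keep the best predicted quality without performing any transcoding. This is a rough sketch under assumptions, not the paper's framework: `pick_parameters`, the two predictor callbacks, and the size budget `max_bytes` are all hypothetical, and real predictors would have to be learned or measured.

```python
from itertools import product
from typing import Callable, Optional, Sequence, Tuple

Params = Tuple[int, float]  # (JPEG quality factor, scaling factor)


def pick_parameters(
    quality_factors: Sequence[int],
    scales: Sequence[float],
    predict_size: Callable[[int, float], float],
    predict_quality: Callable[[int, float], float],
    max_bytes: float,
) -> Optional[Params]:
    """Choose the candidate with the best predicted quality within the size budget."""
    feasible = [
        (qf, z)
        for qf, z in product(quality_factors, scales)
        if predict_size(qf, z) <= max_bytes
    ]
    if not feasible:
        return None  # no candidate fits; the caller must relax the budget
    return max(feasible, key=lambda p: predict_quality(*p))
```

A refinement step in the spirit of the abstract would transcode with the predicted pair, measure the real size and quality, and adjust, at the cost of a few extra transcoding operations.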
ISBN: (print) 9781467344715
In this paper, we describe a new source code transformation called dynamic loop reversal that can increase temporal and spatial locality. We also describe a formal method for predicting cache behaviour and evaluate the accuracy of the model with measurements on a cache monitor. Comparisons of the measured numbers of cache misses with the numbers estimated by the model indicate that the model is relatively accurate and can be used in practice.
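The abstract does not define dynamic loop reversal, so the sketch below rests on an assumed reading: the inner loop's direction is flipped on every other pass, so each pass starts with the elements the previous pass touched last. The LRU simulator, `lru_misses`, and `sweeps` are illustrative helpers written for this example and are not the paper's cache model.

```python
from collections import OrderedDict
from typing import Iterable, Iterator


def lru_misses(accesses: Iterable[int], capacity: int) -> int:
    """Count misses of a fully associative LRU cache holding `capacity` items."""
    cache: "OrderedDict[int, None]" = OrderedDict()
    misses = 0
    for a in accesses:
        if a in cache:
            cache.move_to_end(a)           # refresh recency on a hit
        else:
            misses += 1
            cache[a] = None
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used item
    return misses


def sweeps(n: int, passes: int, reverse_alternate: bool) -> Iterator[int]:
    """Index stream for `passes` traversals of an n-element array."""
    for p in range(passes):
        idx = range(n)
        if reverse_alternate and p % 2 == 1:
            idx = reversed(idx)            # flip direction on odd passes
        yield from idx
```

For 3 passes over 8 elements with a 4-element cache, the forward-only order misses on all 24 accesses, while the alternating order misses 16 times, because each reversed pass begins with the 4 elements still resident.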
ISBN: (print) 9780769551241
One intermediate goal towards achieving universal secure voice interoperability in the Department of Defense (DoD) has been the development of a universal voice encoder (vocoder). This vocoder would seamlessly encode speech at a wide range of interoperable variable and fixed data rates to suit a wide range of DoD communication equipment. This paper describes the most important features of such a vocoder and recent advancements in achieving this goal. Specifically, this paper describes three main areas: 1) a summary of the variable data rate (VDR) algorithm and the most recent improvements to the vocoder; 2) methods for transcoding between fixed-rate and variable data rate vocoding modes; and 3) a summary of heavily bit-error-protected, fixed-rate modes.
ISBN: (print) 9783037858585
The use of FFmpeg technology overcomes some defects of current media asset management and allows media assets to play their role better. The system provides comprehensive management of the storage, inventory management, query, editing, material transcoding, and information dissemination of various types of digital media data (video, audio, text, pictures, and others), as well as of equipment and fixed assets. The results show that technology-based mass media resources are a driving force for the development of the media industry, and that content in any form is the media's core asset. The media asset design solves media transmission and conversion and ultimately guarantees effective and complete data.