ISBN: (print) 9781538621905; 9781538621899
The H.264 standard is widely deployed for high-quality consumer video applications such as DVD and HD video on the internet. Now that videos are easily shared live on social media and mobile messaging apps, consumers need to be able to establish ownership of these videos when they are distributed illegally without proper acknowledgement of the source, and to take action against those who spread false videos. Robust watermarking schemes are a popular technique for addressing these problems. Such schemes must remain robust against modification attacks on the embedded watermark. In this paper, we focus on a recent robust watermarking scheme and analyse its robustness. We present attacks that enable malicious buyers to modify the watermarks embedded by the owner.
ISBN: (print) 9781509041183
In many practical transcoding applications, such as video surveillance, the source videos are often contaminated by noise. The presence of noise not only results in poor compression efficiency and visual quality, but also degrades the performance of subsequent video analysis tasks. It is therefore necessary to denoise the video. In this paper, we propose an efficient mode decision method for H.264 noisy video transcoding, based on an analysis of the effect of noise on mode decision. The algorithm uses the information available from previously decoded macroblocks (MBs) to decide which modes can be skipped with little loss in rate-distortion performance. Experimental results show that our method reduces computational complexity by nearly 65%, without noticeable rate-distortion (R-D) loss in comparison with the "Full-coding" (cascaded transcoding) method.
Number processing disorder is an acquired deficit in mathematical skills commonly observed in Alzheimer's disease (AD), usually as a consequence of neurological dysfunction. Common impairments include syntactic errors (800012 instead of 8012) and intrusion errors (8 thousand and 12 instead of eight thousand and twelve) in number transcoding tasks. This study aimed to characterize AD-related number processing disorder in an alphabetic language (English) and an ideographic language (Chinese), and to investigate the differences between alphabetic and ideographic language processing. Chinese-speaking AD patients were hypothesized to make significantly more intrusion errors than English-speaking ones, due to the ideographic nature of both Chinese characters and Arabic numbers. A simplified number transcoding test derived from the EC301 battery was administered to AD patients. Chinese-speaking AD patients made significantly more intrusion errors (p=0.001) than English speakers. This demonstrates that number processing in an alphabetic language such as English does not function in the same manner as in Chinese. Impaired inhibition capability likely contributes to these observations, given the competing lexical representations in the brain for Chinese speakers.
ISBN: (print) 9781467399616
In this paper, we pose a new problem of video enhancement transcoding, which converts a compressed dark video into a compressed normal-lighting one. The distinct statistics of dark and normal videos result in quite different coding modes, which imposes latent constraints on mode conversion during transcoding. Following this idea, we propose a fast mode decision algorithm to speed up computation while maintaining rate-distortion (RD) performance. Experimental results show that our method reduces computational complexity by nearly 70%, without noticeable RD loss in comparison with the cascaded decoder-encoder approach.
ISBN: (print) 9781509020003
Multimedia content is now routinely distributed between devices across global networks. These devices differ in their video rendering capability in terms of frame rate, quality, and spatial resolution. To facilitate content exchange between such devices it is necessary to transcode the video format; otherwise no exchange can take place. To prolong battery life on mobile devices, transcoding may take place remotely in a cloud data center, in which case content protection is advisable. This paper presents an effective multimedia content protection technique that removes the need to decrypt the video prior to transcoding. It does this by partially encrypting the compressed video in such a way that it remains decoder-format compliant. The demonstrated scheme allows the transcoder to transrate the video to a desired bit-rate without spending time on encryption/decryption before decoding the video. In this way, the content and decryption keys are not exposed to third-party software at the remote cloud data center, and there is no need for complex key management software in the cloud. Consequently, the proposed scheme significantly simplifies cloud-based processing compared to previous schemes.
We compared French- and English-speaking fifth-grade (10-year-old) children's performance in number transcoding. Whereas English two-digit number names follow the decimal structure (base 10), French two-digit number words over 60 follow a vigesimal structure (base 20). Children undertook two number transcoding tasks. While children were generally successful at the tasks, English-speaking children significantly outperformed French-speaking children for numbers that follow a vigesimal structure in French but a decimal structure in English (i.e., numbers >60). Our findings show that verbal number name structures influence children's performance in numerical tasks, even though fifth-grade children are well past the initial stage of acquiring transcoding skills for two-digit numbers. These findings highlight the importance of language specificities in children's number transcoding.
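To make the decimal versus vigesimal contrast concrete, the following minimal sketch splits a two-digit number the way each naming system does. The function names are hypothetical, and the rule "use base 20 above 60" is a simplification taken from the abstract's wording (for instance, 97 is named as 4×20+17 in standard French, "quatre-vingt-dix-sept"). It illustrates the structure only and is not the task used in the study.

```python
def decimal_parts(n):
    """Split a two-digit number into (tens, units), as English names do."""
    if not 10 <= n <= 99:
        raise ValueError("expected a two-digit number")
    return divmod(n, 10)


def vigesimal_parts(n):
    """Split a two-digit number into (twenties, remainder)."""
    if not 10 <= n <= 99:
        raise ValueError("expected a two-digit number")
    return divmod(n, 20)


def french_style_parts(n):
    """Assumed simplification: base 20 above 60, base 10 otherwise."""
    if n > 60:
        return ("base20",) + vigesimal_parts(n)
    return ("base10",) + decimal_parts(n)


if __name__ == "__main__":
    for number in (42, 75, 97):
        print(number, decimal_parts(number), french_style_parts(number))
```

The remainder in the base-20 split can itself be a teen (as in 75 = 3×20+15), which is the extra step a child has to handle when transcoding such numbers.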
ISBN: (print) 9781509019977
In this paper, we present a simple but effective video watermarking scheme that is robust against scalable lossy recompression and transcoding. Most video watermarking algorithms use the DCT or DWT, but few of these methods can withstand scalable recompression or transcoding. DWT- and DCT-based algorithms usually depend on a specific encoding format, so the watermark is lost when the watermarked video is converted to another encoding format. In our method, scene changes are used to select the frames in which the watermark is embedded, so the scheme can resist indiscriminate frame-dropping attacks. We use the Arnold transformation to enhance the imperceptibility of the watermark. The watermark is a fixed-size binary image; after the Arnold transformation, it is embedded uniformly into the chosen frames by a spatial random mapping algorithm. Once embedded in a frame, the watermark is highly imperceptible. The proposed algorithm allows blind detection, without use of the original video. The embedded watermark can be extracted from a low-resolution video recompressed from a watermarked high-resolution video, or from a watermarked video transcoded from another format.
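The Arnold transformation mentioned above can be sketched as follows. This is the textbook Arnold cat map on a square binary image, mapping pixel (x, y) to ((x + y) mod N, (x + 2y) mod N); the abstract does not state which variant or how many iterations the authors use, so the function names and the `iterations` parameter are assumptions for illustration.

```python
def arnold_scramble(img, iterations=1):
    """Scramble a square image (list of rows) with the Arnold cat map."""
    n = len(img)
    if any(len(row) != n for row in img):
        raise ValueError("the Arnold transform needs a square image")
    for _ in range(iterations):
        out = [[0] * n for _ in range(n)]
        for x in range(n):
            for y in range(n):
                # Forward map: (x, y) -> (x + y, x + 2y) mod n
                out[(x + y) % n][(x + 2 * y) % n] = img[x][y]
        img = out
    return img


def arnold_unscramble(img, iterations=1):
    """Invert arnold_scramble using the inverse map."""
    n = len(img)
    if any(len(row) != n for row in img):
        raise ValueError("the Arnold transform needs a square image")
    for _ in range(iterations):
        out = [[0] * n for _ in range(n)]
        for x in range(n):
            for y in range(n):
                # Inverse map: (x, y) -> (2x - y, y - x) mod n
                out[(2 * x - y) % n][(y - x) % n] = img[x][y]
        img = out
    return img


if __name__ == "__main__":
    watermark = [
        [1, 1, 0, 0],
        [1, 0, 0, 0],
        [0, 0, 0, 0],
        [0, 0, 0, 1],
    ]
    scrambled = arnold_scramble(watermark, iterations=2)
    restored = arnold_unscramble(scrambled, iterations=2)
    print(scrambled)
    print(restored == watermark)
```

The map is a permutation of pixel positions, so the number of set bits is unchanged. It is also periodic in the number of iterations, so an iteration count equal to the period for a given image size would leave the watermark unscrambled.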
ISBN: (print) 9781509024537
Video streams, whether on-demand or live, usually have to be converted (i.e., transcoded) to match the characteristics (e.g., spatial resolution) of clients' devices. Transcoding is a computationally expensive operation; therefore, streaming service providers currently store numerous transcoded versions of the same video to serve different types of client devices. However, recent studies show that access to video streams follows a long-tail distribution. That is, a few popular videos are accessed frequently, while the majority are accessed infrequently. The idea we propose in this research is to transcode the infrequently accessed videos in an on-demand (i.e., lazy) manner. Due to the cost of maintaining infrastructure, streaming service providers (e.g., Netflix) commonly use cloud services. However, the challenge in utilizing cloud services for video transcoding is how to deploy cloud resources in a cost-efficient manner without any major impact on the quality of video streams. To address this challenge, we present an architecture for on-demand transcoding of video streams. The architecture provides a platform for streaming service providers to utilize cloud resources in a cost-efficient manner while respecting the Quality of Service (QoS) requirements of video streams. In particular, the architecture includes a QoS-aware scheduling component that efficiently maps video streams to cloud resources, and a cost-efficient dynamic (i.e., elastic) resource provisioning policy that adapts resource acquisition to the video streaming QoS requirements.
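The lazy transcoding idea can be illustrated with a small sketch: versions of popular videos are transcoded ahead of time and stored, while requests for anything else trigger transcoding at request time. The class name, its methods, and the stand-in `transcode_fn` are all hypothetical; the QoS-aware scheduler and elastic provisioning policy described in the abstract are not modelled here.

```python
class OnDemandTranscoder:
    """Serve stored versions when available; otherwise transcode lazily."""

    def __init__(self, transcode_fn):
        # transcode_fn(video_id, resolution) is a stand-in for a real transcoder.
        self._transcode = transcode_fn
        self._store = {}  # (video_id, resolution) -> stored transcoded version
        self.on_demand_calls = 0

    def pretranscode(self, video_id, resolutions):
        """Transcode and store versions of a (presumably popular) video."""
        for resolution in resolutions:
            key = (video_id, resolution)
            self._store[key] = self._transcode(video_id, resolution)

    def request(self, video_id, resolution):
        """Return a stored version, or transcode on demand without storing."""
        key = (video_id, resolution)
        if key in self._store:
            return self._store[key]
        self.on_demand_calls += 1
        return self._transcode(video_id, resolution)


if __name__ == "__main__":
    def fake_transcode(video_id, resolution):
        return f"{video_id}@{resolution}"

    service = OnDemandTranscoder(fake_transcode)
    service.pretranscode("popular_clip", ["720p", "1080p"])
    print(service.request("popular_clip", "720p"), service.on_demand_calls)
    print(service.request("rare_clip", "480p"), service.on_demand_calls)
```

The trade-off this exposes is storage against compute: every stored version costs space, whereas every on-demand request costs transcoding time that must still meet the stream's QoS deadline.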
ISBN: (print) 9781509037247
In this work, a single input multiple output (SIMO) transcoding architecture is proposed. SIMO will benefit mobile edge computing (for example, HTTP Live Streaming, which requires multiple copies of the video stream at different quality levels) without resorting to legacy transcoding, in which the video stream is completely decoded and re-encoded multiple times without exploiting the compressed information. By leveraging the information encoded in the existing video stream, we can reduce the search candidates when transcoding the high-quality bitstream to versions with reduced quality. As a first step, we have demonstrated the SIMO idea in a scenario limited to bit-rate shaping (i.e., bit-rate transcoding). It shows more than 2x complexity reduction without quality loss under the common test conditions.
ISBN: (print) 9781509053162
In this paper, a fast CU partition algorithm for H.264 to HEVC transcoding based on Fisher Discriminant Analysis is proposed. Using a classification model built from features extracted from the H.264 bitstream, CU splitting at depths 0 and 1 can be determined directly, without the rate-distortion optimization process, and a simple mode mapping method is used to determine CU splitting at depth 2. To ensure the accuracy of the classification model, an online learning strategy is designed to update the model thresholds and weight vectors in a timely manner. The experimental results show that the proposed algorithm achieves an average speed-up of 1.90x with a 2.75% BD-rate loss under the low-delay P configuration.
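As a rough illustration of how a Fisher discriminant yields a weight vector and a threshold for a split / no-split decision, here is a minimal two-class, two-feature sketch. The features, the training values, and the midpoint threshold rule are assumptions chosen for illustration; the abstract does not list the actual H.264 features, and the online update of thresholds and weights is not shown.

```python
def mean(rows):
    """Component-wise mean of a list of 2-D feature vectors."""
    n = len(rows)
    return [sum(r[i] for r in rows) / n for i in range(2)]


def scatter(rows, m):
    """2x2 scatter matrix of rows around the mean m."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for r in rows:
        d = [r[0] - m[0], r[1] - m[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s


def fit_fisher(no_split, split):
    """Return (w, threshold) with w = Sw^-1 (m1 - m0)."""
    m0, m1 = mean(no_split), mean(split)
    s0, s1 = scatter(no_split, m0), scatter(split, m1)
    sw = [[s0[i][j] + s1[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    if abs(det) < 1e-12:
        raise ValueError("within-class scatter matrix is singular")
    diff = [m1[0] - m0[0], m1[1] - m0[1]]
    # Multiply diff by the closed-form inverse of the 2x2 matrix sw.
    w = [(sw[1][1] * diff[0] - sw[0][1] * diff[1]) / det,
         (-sw[1][0] * diff[0] + sw[0][0] * diff[1]) / det]
    # Assumed rule: threshold at the projection of the class-mean midpoint.
    mid = [(m0[0] + m1[0]) / 2, (m0[1] + m1[1]) / 2]
    threshold = w[0] * mid[0] + w[1] * mid[1]
    return w, threshold


def predict_split(w, threshold, x):
    """True if the projected feature vector falls on the split side."""
    return w[0] * x[0] + w[1] * x[1] > threshold


if __name__ == "__main__":
    # Hypothetical feature vectors, e.g. (residual bits, motion variance).
    no_split = [(1, 2), (2, 1), (2, 3), (3, 2)]
    split = [(6, 7), (7, 6), (7, 8), (8, 7)]
    w, t = fit_fisher(no_split, split)
    print(w, t, predict_split(w, t, (7, 7)), predict_split(w, t, (2, 2)))
```

Once `w` and the threshold are known, each decision costs one dot product and one comparison, which is why such a classifier can replace a full rate-distortion search at the shallow CU depths.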