检索结果-内蒙古大学图书馆

Object-Based video coding by Visual Saliency and Temporal Correlation

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING 2020年第1期8卷 168-178页

作者： Ogasawara, Kazuya Miyazaki, Tomo Sugaya, Yoshihiro Omachi, Shinichiro Tohoku Univ Grad Sch Engn Sendai Miyagi 9808579 Japan

When a disaster occurs, video communication is an effective way to disseminate large quantities of important information. However, video coding standards such as High Efficiency video coding (HEVC) compress entire videos, whatever the contents are;at low bit rates, the quality of significant objects deteriorates. In this paper, an object-based video coding method is proposed to address this problem. The proposed method extracts objects on the basis of visual saliency and temporal correlation between frames. Subsequently, we execute pre-processing which degrades the background quality before encoding the video with HEVC. This method can reduce the bit rate while preserving target object quality. Experimental comparison with HEVC demonstrates the superior performance of the proposed method.

关键词： Encoding Image color analysis video coding Visualization Correlation Standards Bit rate Object-based coding object extraction visual saliency background blurring

来源：评论

学校读者我要写书评

暂无评论

Fast mode decision based on texture-depth correlation and motion prediction for multiview depth video coding

引用

JOURNAL OF REAL-TIME IMAGE PROCESSING 2016年第1期11卷 27-36页

作者： Pan, Zhaoqing Zhang, Yun Kwong, Sam City Univ Hong Kong Dept Comp Sci Kowloon Hong Kong Peoples R China Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen Peoples R China

The motion estimation and disparity estimation are used to remove the temporal and inter-view redundancies in multiview plus depth video coding, however, the variable block-size ME and DE make the computational complexity increase dramatically. This drawback limits it to be applied in real-time applications. In this paper, based on the mode correlations between depth video and its corresponding texture video, motion prediction and coded block pattern, we propose a fast mode decision algorithm to reduce the computational complexity of multiview depth video coding. Experimental results show that the proposed algorithm can achieve 67.18 and 69.90 % encoding time saving for even and odd views, respectively, while maintaining a comparable rate-distortion performance. In addition, with the dramatic encoding time reduction, the proposed algorithm becomes more suitable for real-time applications.

关键词： Three-dimensional video Fast mode decision Multiview depth video coding video coding

来源：评论

学校读者我要写书评

暂无评论

Overview of research efforts on media ISA extensions and their usage in video coding

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR video TECHNOLOGY 2002年第8期12卷 660-670页

作者： Lappalainen, V Hämäläinen, TD Liuha, P Nokia Res Ctr FIN-33721 Tampere Finland Tampere Univ Technol Inst Digital & Comp Syst FIN-33101 Tampere Finland

This paper summarizes the results of over 25 research groups or individual researchers that have presented video coding implementations on general-purpose processors with the new single instruction multiple data media instruction set architecture extensions. The extensions are briefly introduced and the fundamentals for extensions, as well as some inherent problems, are explained. The reported attempts to utilize the extensions are divided into kernel- and application-level, as well as platform dependent and independent optimizations. Optimized applications include, in addition to some proprietary methods, all of the major video coding standards such as H.261, H.263, MPEG-4, MPEG-1, and MPEG-2. These optimized implementations include a complete video codec, several decoders, and several encoders. Additionally, a performance comparison is given for four representative encoder implementations based on the reported results. Also included is an overview of future trends for new instructions and architectural speed-up techniques.

关键词： H.263 MPEG-4 multimedia instructions single instruction multiple data (SIMD) media instruction set architecture (ISA) extensions video coding

来源：评论

学校读者我要写书评

暂无评论

Improved parabolic prediction-based fractional search for H.264/AVC video coding

引用

IET IMAGE PROCESSING 2009年第5期3卷 261-271页

作者： Lin, Y. Wang, Y. C. Natl Cent Univ Dept Commun Engn Chungli 32054 Taiwan

In this study, the authors propose an efficient fractional pixel search algorithm for H.264/AVC video coding to reduce the computational complexity in half/quarter-pixel motion estimation. A prediction for the optimal motion vector is derived under the assumption that the sum of absolute transform differences error surface is a symmetric parabolic function. With the optimal region, a decision rule for half-pixel search is proposed. The experimental result shows that approximately two search points are required. Based on the half-pixel search, a hierarchical quarter-pixel search is described and an extra three search points, on average, are consumed for each quarter-pixel search. Experimental results show that significant reduction in computation can be achieved, while maintaining high coding efficiency.

关键词： video coding decision rule Searches complexity classes parabolic function motion vector motion estimation error curve code efficiency

来源：评论

学校读者我要写书评

暂无评论

Fast Intra Mode Decision for High Efficiency video coding (HEVC)

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR video TECHNOLOGY 2014年第4期24卷 660-668页

作者： Zhang, Hao Ma, Zhan Cent S Univ Sch Informat Sci & Engn Changsha 410083 Hunan Peoples R China Samsung Res Amer Dallas Richardson TX 75082 USA

The latest High Efficiency video coding (HEVC) standard only requires 50% bit-rate of the H.264/AVC at the same perceptual quality, but with a significant encoder complexity increase. Hence, it is necessary and inevitable to develop fast HEVC encoding algorithms for its potential market adoption. In this paper, we propose a fast intra mode decision for the HEVC encoder. The overall fast intra mode decision algorithm consists of both micro- and macro-level schemes. At the micro-level, we propose the Hadamard cost-based progressive rough mode search (pRMS) to selectively check the potential modes instead of traversing all candidates (i.e., up to 35 in HEVC). Fewer effective candidates will be chosen by the pRMS for the subsequent rate-distortion optimized quantization (RDOQ) to derive the rate-distortion (R-D) optimal mode. An early RDOQ skip method is also introduced to further the complexity reduction. At the macro-level, we introduce the early coding unit (CU) split termination if the estimated R-D cost [through aggregated R-D costs of (partial) sub-CUs] is already larger than the R-D cost of the current CU. On average, the proposed fast intra mode decision provides about 2.5 x speedup (without any platform or source code level optimization) with just a 1.0% Bjontegaard delta rate (BD-rate) increase using the HEVC common test condition. Moreover, our proposed solution also demonstrates the state-of-the-art performance in comparison with other works.

关键词： High Efficiency video coding (HEVC) intra prediction mode decision video coding

来源：评论

学校读者我要写书评

暂无评论

Neural Network-Based Enhancement to Inter Prediction for video coding

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR video TECHNOLOGY 2022年第2期32卷 826-838页

作者： Wang, Yang Fan, Xiaopeng Xiong, Ruiqin Zhao, Debin Gao, Wen Harbin Inst Technol Dept Comp Sci & Technol Harbin 150001 Peoples R China Peng Cheng Lab Shenzhen 518055 Peoples R China Peking Univ Dept Elect Engn & Comp Sci Beijing 100871 Peoples R China

Inter prediction is a crucial part of hybrid video coding frameworks, utilized to exploit the temporal redundancy in video sequences and improve the coding performance. During inter prediction, a predicted block is typically derived from reference pictures using motion estimation and motion compensation. To improve the coding performance of inter prediction, a neural network based enhancement to inter prediction (NNIP) is proposed in this paper. NNIP is composed of three networks, namely residue estimation network, combination network, and deep refinement network. Specifically, first, a residue estimation network is designed to estimate the residue between current block and its predicted block using their available spatial neighbors. Second, the feature maps of the estimated residue and the predicted block are extracted and concatenated in a combination network. Finally, the concatenated feature maps are fed into a deep refinement network to generate a refined residue, which is added back to the predicted block to derive a more accurate predicted block. NNIP is integrated in HEVC to evaluate its efficiency. The experimental results demonstrate that NNIP can achieve 4.6%, 3.0%, and 2.7% BD-rate reduction on average under LDP, LDB, and RA configurations compared to HEVC.

关键词： Artificial neural networks Estimation Motion compensation video coding Interpolation Encoding Deep learning video coding inter prediction NNIP HEVC deep learning

来源：评论

学校读者我要写书评

暂无评论

Distortion variation minimization in real-time video coding

引用

SIGNAL PROCESSING-IMAGE COMMUNICATION 2006年第4期21卷 273-279页

作者： Chen, Zhenzhong Ngan, King Ngi Chinese Univ Hong Kong Dept Elect Engn Shatin Hong Kong Peoples R China

In this paper, we review the rate distortion tradeoff issues in real-time video coding and introduce a minimum variation (MINVAR) distortion criterion based approach. The MINVAR based rate distortion tradeoff framework provides a local optimization strategy as a rate control mechanism in real-time video coding applications by minimizing the distortion variation while the corresponding bit rate fluctuation is limited by utilizing the encoder buffer. The proposed approach aims to achieve a smooth decoded picture quality for pleasing human visual experience. The performance of the proposed method is evaluated with H.264. The experimental results demonstrate that using the proposed approach, the decoded picture quality is smoother than the traditional H.264 joint model (JM) rate control without sacrificing global quality such that a better subjective visual quality is guaranteed. (c) 2005 Elsevier B.V. All rights reserved.

关键词： minimum variation rate distortion rate control video coding

来源：评论

学校读者我要写书评

暂无评论

Cogeneration of Fast Motion Estimation Processors and Algorithms for Advanced video coding

引用

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS 2012年第3期20卷 437-448页

作者： Nunez-Yanez, Jose L. Nabina, Atukem Hung, Eddie Vafiadis, George Univ Bristol Dept Elect Engn Bristol BS8 1UB Avon England Univ British Columbia Dept Elect & Comp Engn Vancouver BC V6T 1Z4 Canada

This paper presents a flexible and scalable motion estimation processor capable of supporting the processing requirements for high-definition (HD) video using the H. 264 Advanced video Codec, which is suited for FPGA implementation. Unlike most previous work, our core is optimized to execute all existing fast block matching algorithms, which we show to match or exceed the inter-frame prediction performance of traditional full-search approaches at the HD resolutions commonly in use today. Using our development tools, such algorithms can be described using a C-style syntax which is compiled into our custom instruction set. We show that different HD sequences exhibit different characteristics which necessitate a flexible and configurable solution when targeting embedded applications. This is supported in our core and toolset by allowing designers to modify the number of functional units to be instantiated. All processor instances remain binary compatible so recompilation of the motion estimation algorithm is not required. Due to this optimization process, it is possible to match the processing requirements of the selected motion estimation algorithm to the hardware microarchitecture leading to a very efficient implementation.

关键词： Field-programmable gate array (FPGA) H.264 motion estimation reconfigurable processor video coding

来源：评论

学校读者我要写书评

暂无评论

Transcoder architectures for video coding

引用

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS 1998年第1期44卷 88-98页

作者： Bjork, N Christopoulos, C Ericsson Telecom AB Compress Lab HF ETX PN XML S-12625 Stockholm Sweden

This paper discusses the problem of transcoding H.263-based video streams. Two different models for transcoding are examined, rate reduction and resolution reduction. Results will show that the computational complexity of the basic transcoding model can be reduced for each model by, on average, 39% and 23% without significant lose in quality. Comparisons with the scaleable coding model are also shown.

关键词： video coding Transcoding video compression Bit rate Decoding Computational complexity Relays Communication standards Standards development Telecommunications

来源：评论

学校读者我要写书评

暂无评论

Benefits and costs of scalable video coding for internet streaming

引用

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION 2005年第4-5期16卷 397-411页

作者： Narroschke, M Leibniz Univ Hannover Inst Theoret Nachrichtentech & InformatVerarbeitu D-30167 Hannover Germany

Benefits and costs of scalable hybrid video coding techniques are analyzed with respect to internet streaming. Temporal, spatial, amplitude scalability, and combinations as described in MPEG-4 are considered. Benefits are a reduction of the server storage capacity, a reduction of the netload for multicast delivery and a graceful degradation in case of transmission errors. Costs are ail increasing netload for unicast delivery and ail increasing computational expense in the decoder. The result of an evaluation shows that temporal scalability has minimum costs among all analyzed techniques. It increases the netload for unicast only marginally with no additional Computational expense in the decoder. Temporal scalability provides a reduction of the server storage capacity and netload for multicast by about 30%, and two steps of graceful degradation. All other known standardized and nonstandardized techniques of spatial and amplitude scalability are associated with costs that appear too high to be attractive for internet streaming. Therefore, only temporal scalability is used at the present. Some of the scalable video coding techniques may become of interest for other applications where the investigated costs are less relevant. (c) 2005 Elsevier Inc. All rights reserved.

关键词： video coding scalable video coding scalability MPEG-4

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：