检索结果-内蒙古大学图书馆

Visibility Rendering Order: Improving Energy Efficiency on Mobile GPUs through Frame Coherence

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2019年第2期30卷 473-485页

作者： de Lucas, Enrique Marcuello, Pedro Parcerisa, Joan-Manuel Gonzalez, Antonio Polytech Univ Catalonia UPC Dept Comp Architecture Campus NordCalle Jordi Girona 1-3 Barcelona 08034 Spain

During real-time graphics rendering, objects are processed by the GPU in the order they are submitted by the CPU, and occluded surfaces are often processed even though they will end up not being part of the final image, thus wasting precious time and energy. To help discard occluded surfaces, most current GPUs include an Early-Depth test before the fragment processing stage. However, to be effective it requires that opaque objects are processed in a front-to-back order. Depth sorting and other occlusion culling techniques at the object level incur overheads that are only offset for applications having substantial depth and/or fragment shading complexity, which is often not the case in mobile workloads. We propose a novel architectural technique for GPUs, Visibility Rendering Order (VRO), which reorders objects front-to-back entirely in hardware by exploiting the fact that the objects in graphics animated applications tend to keep its relative depth order across consecutive frames (temporal coherence). Since order relationships are already tested by the Depth Test, VRO incurs minimal energy overheads because it just requires adding a small hardware to capture that information and use it later to guide the rendering of the following frame. Moreover, unlike other approaches, this unit works in parallel with the graphics pipeline without any performance overhead. We illustrate the benefits of VRO using various unmodified commercial 3D applications for which VRO achieves 27 percent speed-up and 15.8 percent energy reduction on average over a state-of-the-art mobile GPU.

关键词： GPU graphics pipeline energy-efficiency rasterization rendering fragment processing pixel shading occlusion culling visibility tile based deferred rendering tile based rendering topological order

来源：评论

学校读者我要写书评

暂无评论

Multi-fragment Effects on the GPU using the k-Buffer 07

Multi-Fragment Effects on the GPU using the <i>k</i>-Buffer

引用

Symposium on Interactive 3D Graphics and Games

作者： Bavoil, Louis Callahan, Steven P. Lefohn, Aaron Comba, Joao L. D. Silva, Claudio T. Univ Utah Sci Comp & Imaging Inst Salt Lake City UT 84112 USA

ISBN: (纸本)9781595936288

Many interactive rendering algorithms require operations on multiple fragments (i.e., ray intersections) at the same pixel location;however, current Graphics processing Units (GPUs) capture only a single fragment per pixel. Example effects include transparency, translucency, constructive solid geometry, depth-of-field, direct volume rendering, and isosurface visualization. With current GPUs, programmers implement these effects using multiple passes over the scene geometry, often substantially limiting performance. This paper introduces a generalization of the Z-buffer, called the k-buffer, that makes it possible to efficiently implement such algorithms with only a single geometry pass, yet requires only a small, fixed amount of additional memory. The k-buffer uses framebuffer memory as a read-modify-write (RMW) pool of k entries whose use is programmatically defined by a small k-buffer program. We present two proposals for adding k-buffer support to future GPUs and demonstrate numerous multiple-fragment, single-pass graphics algorithms running on both a software-simulated k-buffer and a k-buffer implemented with current GPUs. The goal of this work is to demonstrate the large number of graphics algorithms that the k-buffer enables and that the efficiency is superior to current multi-pass approaches.

关键词： fragment processing graphics hardware visibility ordering blending volume rendering transparency CSG

来源：评论

学校读者我要写书评

暂无评论

DISTRIBUTED QUERY-processing

引用

COMPUTING SURVEYS 1984年第4期16卷 399-433页

作者： YU, CT CHANG, CC Univ. of Illinois at Chicago Chicago Univ. of Illinois at Chicago Chicago

In this paper various techniques for optimizing queries in distributed databases are presented. Although no attempt is made to cover all proposed algorithms on this topic, quite a few ideas extracted from existing algorithms are outlined. It is hoped that large- scale experiments will be conducted to verify the usefulness of these ideas and that they will be integrated to construct a powerful algorithm for distributed query processing. [ABSTRACT FROM AUTHOR]

关键词： Communication cyclic queries distributed query processing fragment processing heuristics join optimization performance semijoin tree queries

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：