Search results - Inner Mongolia University Library

29th IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Authors: Bethel, E. Wes; Camp, David; Donofrio, David; Howison, Mark. Lawrence Berkeley Natl Lab, Berkeley, CA 94720, USA; Brown Univ, Providence, RI 02912, USA

ISBN: (print) 9781467376846

Many data-intensive algorithms, particularly in visualization, image processing, and data analysis, operate on structured data, that is, data organized in multidimensional arrays. While many of these algorithms are quite numerically intensive, by and large, their performance is limited by the cost of memory accesses. As we move towards the exascale regime of computing, one central research challenge is finding ways to minimize data movement through the memory hierarchy, particularly within a node in a shared-memory parallel setting. We study the effects that an alternative in-memory data layout format has in terms of runtime performance gains resulting from reducing the amount of data moved through the memory hierarchy. We focus the study on shared-memory parallel implementations of two algorithms common in visualization and analysis: a stencil-based convolution kernel, which uses a structured memory access pattern, and raycasting volume rendering, which uses a semi-structured memory access pattern. The question we study is to what degree an alternative memory layout, when used by these key algorithms, will result in improved runtime performance and memory system utilization. Our approach uses a layout based on a Z-order (Morton-order) space-filling curve data organization, and we measure and report runtime and various metrics and counters associated with memory system utilization. Our results show nearly uniform improved runtime performance and improved utilization of the memory hierarchy across varying levels of concurrency for the applications we tested. This approach is complementary to other memory optimization strategies like cache blocking, but may also be more general and widely applicable to a diverse set of applications.

Keywords: memory layout; data intensive algorithms; image analysis; visualization; multi-core CPUs; GPU algorithms; performance optimization; space-filling curve; memory locality; shared-memory parallel; data-intensive applications; stencil operation; volume rendering
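
The Z-order (Morton-order) layout named in the abstract above can be illustrated with a small sketch. This is not the authors' implementation: the function names `part1by1`, `morton2d`, and `row_major` are hypothetical, and the example assumes a 2D array with coordinates below 2^16. It shows how interleaving the bits of `x` and `y` yields an index under which every aligned 2x2 block of cells occupies four consecutive positions, whereas row-major order places vertical neighbors a full row apart.

```python
def part1by1(n: int) -> int:
    """Spread the low 16 bits of n so that a zero bit follows each original bit."""
    n &= 0x0000FFFF
    n = (n | (n << 8)) & 0x00FF00FF
    n = (n | (n << 4)) & 0x0F0F0F0F
    n = (n | (n << 2)) & 0x33333333
    n = (n | (n << 1)) & 0x55555555
    return n


def morton2d(x: int, y: int) -> int:
    """Z-order index: bits of x in even positions, bits of y in odd positions."""
    return part1by1(x) | (part1by1(y) << 1)


def row_major(x: int, y: int, width: int) -> int:
    """Conventional row-major index, shown for comparison."""
    return y * width + x


if __name__ == "__main__":
    width = 4
    for y in range(width):
        # Print the Z-order index of every cell in a 4x4 grid, row by row.
        print([morton2d(x, y) for x in range(width)])
```

In a 4x4 grid, the cells (0,0), (1,0), (0,1), (1,1) map to indices 0 through 3 under `morton2d`, while `row_major` maps them to 0, 1, 4, 5. That tighter clustering of spatial neighbors is the locality property a Z-order layout aims to exploit.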

Real-time water waves with wave particles

Author: Yuksel, Cem. Texas A&M University

Degree level: Ph.D.

This dissertation describes the wave particles technique for simulating water surface waves and two-way fluid-object interactions for real-time applications, such as video games. Water exists in various forms in our environment and it is important to develop the necessary technologies to incorporate all these forms in real-time virtual environments. Handling the behavior of large bodies of water, such as an ocean, lake, or pool, has been computationally expensive with traditional techniques even for offline graphics applications, because of the high resolution requirements of these simulations. A significant portion of water behavior for large bodies of water is the surface wave phenomenon. This dissertation discusses how water surface waves can be simulated efficiently and effectively at real-time frame rates using a simple particle system that we call "wave particles." This approach offers a simple, fast, and unconditionally stable solution to wave simulation. Unlike traditional techniques that try to simulate the water body (or its surface) as a whole with numerical techniques, wave particles merely track the deviations of the surface due to waves, forming an analytical solution. This allows simulation of seemingly infinite water surfaces, like an open ocean. Both the theory and implementation of wave particles are discussed in great detail. Two-way interactions of floating objects with water are explained, including generation of waves due to object interaction and proper simulation of the effect of water on the object motion. Timing studies show that the method is scalable, allowing simulation of wave interaction with several hundred objects at real-time rates.

Keywords: wave particles; waves; real-time simulation; fluid-object interaction; GPU algorithms; Book; Thesis

A Partitioning GPU-based Algorithm for Processing the k Nearest-Neighbor Query

12th International Conference on Management of Digital EcoSystems

Authors: Velentzas, Polychronis; Vassilakopoulos, Michael; Corral, Antonio. Univ Thessaly, Dept Elect & Comp Engn, Data Structuring & Engn Lab, Volos, Greece; Univ Almeria, Dept Informat, Almeria, Spain

ISBN: (print) 9781450381154

The k Nearest-Neighbor (k-NN) query is a common spatial query that appears in several big data applications. Typically, GPU devices have much larger numbers of processing cores than CPUs and faster device memory than the main memory accessed by CPUs, thus providing higher computing power. We propose and implement a new GPU-based partitioning algorithm for the k-NN query, using the CUDA runtime API. Due to partitioning, this algorithm avoids calculating distances for the whole dataset. Using synthetic and real datasets, we present an extensive experimental performance comparison against six existing algorithms. These algorithms are based on calculating distances for the whole in-memory dataset. This comparison shows that the new algorithm excels in all the conducted experiments and outperforms these six algorithms.

Keywords: k Nearest-Neighbor Query; GPU algorithms; Spatial query; Partitioning algorithms; Parallel computing
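
The abstract above states that partitioning lets the algorithm avoid computing distances for the whole dataset. The following CPU-side sketch illustrates that general idea only; it is not the authors' CUDA algorithm. The function `knn_partitioned`, its `num_parts` parameter, and the choice of x-sorted strips are all assumptions made for illustration. Partitions are visited in order of their minimum possible distance to the query, and the scan stops once that lower bound exceeds the current k-th best distance.

```python
import math


def knn_partitioned(points, query, k, num_parts=4):
    """Return (k nearest points, number of distances computed).

    Points are sorted by x and cut into contiguous strips; a strip is
    skipped when its x-gap to the query already exceeds the k-th best
    distance found so far.
    """
    pts = sorted(points)
    if not pts:
        return [], 0
    size = math.ceil(len(pts) / num_parts)
    parts = [pts[i:i + size] for i in range(0, len(pts), size)]
    qx = query[0]

    def gap(part):
        # Lower bound on the distance from the query to any point in the strip.
        lo, hi = part[0][0], part[-1][0]
        if qx < lo:
            return lo - qx
        if qx > hi:
            return qx - hi
        return 0.0

    best = []       # sorted list of (distance, point), at most k entries
    examined = 0
    for part in sorted(parts, key=gap):
        if len(best) == k and gap(part) > best[-1][0]:
            break   # every remaining strip is at least this far away
        for p in part:
            examined += 1
            best.append((math.dist(p, query), p))
        best.sort()
        del best[k:]
    return [p for _, p in best], examined


if __name__ == "__main__":
    data = [(float(x), 0.0) for x in range(100)]
    neighbors, examined = knn_partitioned(data, (10.2, 0.0), k=3)
    print(neighbors, "distances computed:", examined, "of", len(data))
```

With the sample data, only the strip containing the query is scanned, so far fewer than 100 distances are computed while the result matches a brute-force search.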

Fast GPU ray tracing of dynamic meshes using geometry images

Proceedings of Graphics Interface 2006

Authors: Nathan A. Carr; Jared Hoberock; Keenan Crane; John C. Hart. Adobe Corp; University of Illinois Urbana-Champaign

ISBN: (print) 9781568813080

Using the GPU to accelerate ray tracing may seem like a natural choice due to the highly parallel nature of the problem. However, determining the most versatile GPU data structure for scene storage and traversal is a challenge. In this paper, we introduce a new method for quick intersection of triangular meshes on the GPU. The method uses a threaded bounding volume hierarchy built from a geometry image, which can be efficiently traversed and constructed entirely on the GPU. This acceleration scheme is highly competitive with other GPU ray tracing methods, while allowing for both dynamic geometry and an efficient level of detail scheme at no extra cost.

Keywords: GPU algorithms; geometry images; ray tracing; mesh parameterization

Dual scattering approximation for fast multiple scattering in hair

ACM SIGGRAPH 2008 papers

Authors: Arno Zinke; Cem Yuksel; Andreas Weber; John Keyser. Universität Bonn; Texas A&M University

ISBN: (print) 9781450301121

When rendering light-colored hair, multiple fiber scattering is essential for the right perception of the overall hair color. In this context, we present a novel technique to efficiently approximate multiple fiber scattering for a full head of human hair or a similar fiber-based geometry. In contrast to previous ad-hoc approaches, our method relies on the physically accurate concept of the Bidirectional Scattering Distribution Functions and gives physically plausible results with no need for parameter tweaking. We show that complex scattering effects can be approximated very well by using aggressive simplifications based on this theoretical model. When compared to unbiased Monte-Carlo path tracing, our approximations preserve photo-realism in most settings but with rendering times at least two orders of magnitude lower. Time and space complexity are much lower compared to photon mapping-based techniques and we can even achieve realistic results in real time on a standard PC with consumer graphics hardware.

Keywords: hair rendering; multiple scattering; GPU algorithms

An Improved GPU-based Algorithm for Processing the k Nearest Neighbor Query (2020)

24th Pan-Hellenic Conference on Informatics

Authors: Polychronis Velentzas; Panagiotis Moutafis; George Mavrommatis. University of Thessaly, Greece; Hellenic Open University, Greece

ISBN: (print) 9781450388979

The k Nearest Neighbor (k-NN) query is a common spatial query that appears in several big data applications. We propose and implement a new GPU-based algorithm for the k-NN query, which improves our previous Symmetric Progression Partitioning method (SPP) by adding a heap buffer. We experimentally show that the addition of the heap speeds up the k-NN query, especially for larger values of k. Using random, synthetic and real datasets, we present an extensive experimental performance comparison against two of our algorithms. This comparison shows that the new algorithm excels in all the conducted experiments.

Keywords: GPU algorithms; Partitioning algorithms; Nearest Neighbors; Parallel computing; Spatial query
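
The abstract above credits a heap buffer with speeding up the k-NN query for larger k. As a minimal CPU-side sketch of why a heap helps (not the authors' GPU implementation, and with the hypothetical function name `knn_heap`), the code below keeps the k best candidates in a bounded max-heap. Each new point is compared against the current worst candidate in constant time, and a replacement costs O(log k) instead of re-sorting the candidate list.

```python
import heapq
import math


def knn_heap(points, query, k):
    """Return the k points nearest to query, closest first.

    heapq is a min-heap, so distances are negated: the heap root then
    holds the farthest of the current k candidates.
    """
    heap = []  # entries are (-distance, point)
    for p in points:
        d = math.dist(p, query)
        if len(heap) < k:
            heapq.heappush(heap, (-d, p))
        elif d < -heap[0][0]:
            # Closer than the current worst candidate: swap it out.
            heapq.heapreplace(heap, (-d, p))
    # Largest negated distance first, i.e. smallest distance first.
    return [p for _, p in sorted(heap, reverse=True)]


if __name__ == "__main__":
    pts = [(0.0, 0.0), (3.0, 4.0), (1.0, 1.0), (6.0, 8.0), (0.5, 0.0)]
    print(knn_heap(pts, (0.0, 0.0), k=2))
```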

Deep opacity maps

Computer Graphics Forum, 2008, vol. 27, no. 2, pp. 675-680

Authors: Yuksel, Cem; Keyser, John. Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843, USA

We present a new method for rapidly computing shadows from semi-transparent objects like hair. Our deep opacity maps method extends the concept of opacity shadow maps by using a depth map to obtain a per pixel distribution of opacity layers. This approach eliminates the layering artifacts of opacity shadow maps and requires far fewer layers to achieve high quality shadow computation. Furthermore, it is faster than the density clustering technique, and produces less noise with comparable shadow quality. We provide qualitative comparisons to these previous methods and give performance results. Our algorithm is easy to implement, faster, and more memory efficient, enabling us to generate high quality hair shadows in real-time using graphics hardware on a standard PC.

Keywords: shadow maps; semi-transparent shadows; hair shadows; real-time shadows; GPU algorithms

Comparing the speed and accuracy of approaches to betweenness centrality approximation

Computational Social Networks, 2019, vol. 6, no. 1, p. 1

Authors: Matta, John; Ercal, Gunes; Sinha, Koushik. Southern Illinois University Edwardsville, Edwardsville, IL, United States; Southern Illinois University Carbondale, Carbondale, IL, United States

Background: Many algorithms require doing a large number of betweenness centrality calculations quickly, and accommodating this need is an active open research area. There are many different ideas and approaches to speeding up these calculations, and it is difficult to know which approach will work best in practical situations. Methods: The current study attempts to judge performance of betweenness centrality approximation algorithms by running them under conditions that practitioners are likely to experience. For several approaches to approximation, we run two tests, clustering and immunization, on identical hardware, along with a process to determine appropriate parameters. This allows an across-the-board comparison of techniques based on the dimensions of speed and accuracy of results. Results: Overall, the running time of betweenness centrality calculations can be reduced by several orders of magnitude by using approximation algorithms. We find that the speeds of individual algorithms can vary widely based on input parameters. The fastest algorithms utilize parallelism, either in the form of multi-core processing or GPUs. Interestingly, getting fast results does not require an expensive GPU. Conclusions: The methodology presented here can guide the selection of a betweenness centrality approximation algorithm depending on a practitioner’s needs and can also be used to compare new methods to existing ones. © 2019, The Author(s).

Keywords: Approximation algorithms; Betweenness centrality; GPU algorithms
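
To make the idea of betweenness centrality approximation concrete, here is a minimal sketch of one well-known family of methods, pivot (source) sampling on top of Brandes-style accumulation. It is an illustration under stated assumptions, not necessarily one of the approaches evaluated in the study above: the function name `approx_betweenness`, its `num_pivots` and `seed` parameters, and the adjacency-dictionary input are all hypothetical, and the graph is assumed to be undirected and unweighted.

```python
import random
from collections import deque


def approx_betweenness(adj, num_pivots, seed=0):
    """Estimate betweenness by running shortest-path accumulation from a
    random sample of source nodes and scaling up to the full node count."""
    nodes = list(adj)
    n = len(nodes)
    if n == 0:
        return {}
    rng = random.Random(seed)
    pivots = rng.sample(nodes, min(num_pivots, n))
    bc = {v: 0.0 for v in nodes}
    for s in pivots:
        sigma = {v: 0 for v in nodes}   # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        preds = {v: [] for v in nodes}  # predecessors on shortest paths
        order = []
        queue = deque([s])
        while queue:                    # breadth-first search from s
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in nodes}
        for w in reversed(order):       # accumulate dependencies backwards
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # Extrapolate from the sample; halve because each undirected pair
    # would be counted from both endpoints in the exact computation.
    scale = n / len(pivots) / 2
    return {v: b * scale for v, b in bc.items()}


if __name__ == "__main__":
    path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
    print(approx_betweenness(path, num_pivots=5))  # all nodes used: exact
    print(approx_betweenness(path, num_pivots=2))  # sampled estimate
```

Using every node as a pivot reproduces the exact values, while fewer pivots trade accuracy for speed, which is the kind of speed-versus-accuracy trade-off the study measures.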
