Search Results - Inner Mongolia University Library

Symposium on Interactive 3D Graphics and Games

Authors: Horn, Daniel Reiter; Sugerman, Jeremy; Houston, Mike; Hanrahan, Pat (Stanford Univ, Stanford, CA 94305, USA)

ISBN: (print) 9781595936288

Over the past few years, the powerful computation rates and high memory bandwidth of GPUs have attracted efforts to run raytracing on GPUs. Our work extends Foley et al.'s GPU k-d tree research. We port their kd-restart algorithm from multi-pass, using CPU load balancing, to single pass, using current GPUs' branching and looping abilities. We introduce three optimizations: a packetized formulation, a technique for restarting partially down the tree instead of at the root, and a small, fixed-size stack that is checked before resorting to restart. Our optimized implementation achieves 15-18 million primary rays per second and 16-27 million shadow rays per second on our test scenes. Our system also takes advantage of GPUs' strengths at rasterization and shading to offer a mode where rasterization replaces eye ray scene intersection, and primary hits and local shading are produced with standard Direct3D code. For 1024x1024 renderings of our scenes with shadows and Phong shading, we achieve 12-18 frames per second. Finally, we investigate the efficiency of our implementation relative to the computational resources of our GPUs and also compare it against conventional CPUs and the Cell processor, which both have been shown to raytrace well.

Keywords: Programmable Graphics Hardware; data parallel computing; Stream computing; GPU computing; Brook

AN ASSESSMENT OF THE CONNECTION MACHINE

International Journal of High Speed Computing, 1993, Vol. 5, No. 4, pp. 523-535

Author: Robert Schreiber (Research Institute for Advanced Computer Science, Mail Stop T045–1, NASA Ames Research Center, Mountain View, CA 94035, USA)

The CM-2 is an example of a connection machine. The strengths and problems of this implementation are considered. Important issues in the architecture and programming environment of connection machines in general are ...

Keywords: Connection machine; SIMD; virtual processor; data parallel computing

Application of Top-N Rule-based Optimal Recommendation System for Language Education Content based on parallel computing

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, Vol. 14, No. 6, pp. 1027-1037

Author: Hu, Nan (Nanyang Med Coll, Publ Teaching Dept, Nanyang 473000, Henan, Peoples R China; Wuhan Univ, Sch Publ Hlth, Wuhan 430000, Hubei, Peoples R China)

In recent years, personalized recommendation services have been applied to many areas of society, typically e-commerce, short videos, and similar fields. In response to the serious performance problems of content recommendation on current online language education platforms, this paper designs a new online English education model that gives university students fuller, more three-dimensional training in English language learning. Based on the MU platform, this paper obtains data from the platform and uses crawler technology to sample and standardize the learning resources for online education. User information, such as explicit and implicit ratings of courses, is then selected as the main basis for training a user interest preference model. Next, a PRF algorithm combining data-parallel and task-parallel optimization is implemented on Apache Spark to optimize data accuracy and the content recommendation method. Finally, the top-N recommendation rule is used to propose a dynamic evolutionary process that identifies students' preferences or learning habits from the results of the preceding data analysis, so as to make more accurate course content recommendations and give better learning content guidance for students' English learning. The online three-dimensional teaching model proposed in this paper focuses more on time-series research than traditional algorithms do, and can more accurately capture the dynamic changes in students' learning abilities.

Keywords: data parallel computing; cloud computing; data crawlers; top-N rules; PRF algorithm

Brook for GPUs: Stream computing on graphics hardware

ACM TRANSACTIONS ON GRAPHICS, 2004, Vol. 23, No. 3, pp. 777-786

Authors: Buck, I; Foley, T; Horn, D; Sugerman, J; Fatahalian, K; Houston, M; Hanrahan, P (Stanford Univ, Stanford, CA 94305, USA)

In this paper, we present Brook for GPUs, a system for general-purpose computation on programmable graphics hardware. Brook extends C to include simple data-parallel constructs, enabling the use of the GPU as a streaming coprocessor. We present a compiler and runtime system that abstracts and virtualizes many aspects of graphics hardware. In addition, we present an analysis of the effectiveness of the GPU as a compute engine compared to the CPU, to determine when the GPU can outperform the CPU for a particular algorithm. We evaluate our system with five applications, the SAXPY and SGEMV BLAS operators, image segmentation, FFT, and ray tracing. For these applications, we demonstrate that our Brook implementations perform comparably to hand-written GPU code and up to seven times faster than their CPU counterparts.
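SAXPY, one of the benchmarks named in the abstract above, is the BLAS operation y ← a·x + y. As a rough illustration of why it suits a data-parallel model, the sketch below writes it in plain Python as a per-element kernel mapped over two input streams. This is not Brook code and not the authors' implementation; the names `saxpy_kernel` and `saxpy` are hypothetical and chosen only for this example.

```python
def saxpy_kernel(a, x_i, y_i):
    """Per-element kernel: reads one element from each input stream
    and produces one output element, with no dependence on neighbours."""
    return a * x_i + y_i


def saxpy(a, x, y):
    """Apply the kernel to every pair of elements.

    Each output depends only on the inputs at the same index, so a
    data-parallel runtime would be free to evaluate all elements
    concurrently. Here the map is executed sequentially.
    """
    if len(x) != len(y):
        raise ValueError("input streams must have the same length")
    return [saxpy_kernel(a, x_i, y_i) for x_i, y_i in zip(x, y)]


if __name__ == "__main__":
    print(saxpy(2.0, [1.0, 2.0, 3.0], [10.0, 20.0, 30.0]))
```

The point of the kernel/map split is that the kernel carries no state between elements, which is the property a streaming coprocessor relies on.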

Keywords: programmable graphics hardware; data parallel computing; stream computing; GPU computing; Brook

ClawHMMER: A Streaming HMMer-Search Implementation

Proceedings of the 2005 ACM/IEEE conference on Supercomputing

Authors: Daniel Reiter Horn; Mike Houston; Pat Hanrahan (Stanford University)

ISBN: (print) 9781595930613

The proliferation of biological sequence data has motivated the need for an extremely fast probabilistic sequence search. One method for performing this search involves evaluating the Viterbi probability of a hidden Markov model (HMM) of a desired sequence family for each sequence in a protein database. However, one of the difficulties with current implementations is the time required to search large databases. Many current and upcoming architectures offering large amounts of compute power are designed with data-parallel execution and streaming in mind. We present a streaming algorithm for evaluating an HMM's Viterbi probability and refine it for the specific HMM used in biological sequence search. We implement our streaming algorithm in the Brook language, allowing us to execute the algorithm on graphics processors. We demonstrate that this streaming algorithm on graphics processors can outperform available CPU implementations. We also demonstrate this implementation running on a 16 node graphics cluster.
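The Viterbi probability mentioned in the abstract above is the probability of the single most likely hidden-state path that explains an observed sequence. The sketch below shows the textbook Viterbi recurrence in log space for a generic HMM, in plain Python. It is not the streaming formulation from the paper and does not use the specific HMM topology used for biological sequence search; the two-state model, its probabilities, and all names (`viterbi_log_prob`, `STATES`, `START_P`, `TRANS_P`, `EMIT_P`) are hypothetical and serve only as a toy example.

```python
import math


def viterbi_log_prob(obs, states, start_p, trans_p, emit_p):
    """Return the log-probability of the most likely state path for obs."""
    if not obs:
        raise ValueError("observation sequence must be non-empty")
    # Initialisation: start in each state and emit the first symbol.
    score = {s: math.log(start_p[s]) + math.log(emit_p[s][obs[0]])
             for s in states}
    # Recurrence: best predecessor for each state, then emit the symbol.
    for symbol in obs[1:]:
        score = {
            s: max(score[p] + math.log(trans_p[p][s]) for p in states)
            + math.log(emit_p[s][symbol])
            for s in states
        }
    # Termination: best score over all final states.
    return max(score.values())


# Hypothetical toy model (not a profile HMM).
STATES = ("Healthy", "Fever")
START_P = {"Healthy": 0.6, "Fever": 0.4}
TRANS_P = {"Healthy": {"Healthy": 0.7, "Fever": 0.3},
           "Fever": {"Healthy": 0.4, "Fever": 0.6}}
EMIT_P = {"Healthy": {"normal": 0.5, "cold": 0.4, "dizzy": 0.1},
          "Fever": {"normal": 0.1, "cold": 0.3, "dizzy": 0.6}}

if __name__ == "__main__":
    # Toy "database": every sequence is scored independently of the others.
    database = [["normal", "cold", "dizzy"], ["dizzy", "dizzy"]]
    for seq in database:
        print(seq, viterbi_log_prob(seq, STATES, START_P, TRANS_P, EMIT_P))
```

Because each database sequence is scored independently, the loop over sequences has no cross-iteration dependence, which is the kind of structure a data-parallel implementation can exploit.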

Keywords: data parallel computing; Brook; Bio Science; Stream computing; Programmable Graphics Hardware; GPU computing
