We study the parallel scheduling problem for a new modality of parallel computing: having one workstation "steal cycles" from another. We focus on a draconian mode of cycle-stealing, in which the owner of workstation B allows workstation A to take control of B's processor whenever it is idle, with the promise of relinquishing control immediately upon demand. The typically high communication overhead for supplying workstation B with work and receiving its results militates in favor of supplying B with large amounts of work at a time; the risk of losing work in progress when the owner of B reclaims the workstation militates in favor of supplying B with a sequence of small packets of work. The challenge is to balance these two pressures in a way that maximizes the amount of work accomplished. We formulate two models of cycle-stealing. The first attempts to maximize the expected work accomplished during a single episode, when one knows the probability distribution of the return of B's owner. The second attempts to match the productivity of an omniscient cycle-stealer, when one knows how much work that stealer can accomplish. We derive optimal scheduling strategies for sample scenarios within each of these models. Perhaps our most important discovery is the as-yet unexplained coincidence that two quite distinct scenarios lead to almost identical unique optimizing schedules. One scenario falls within our first model; it assumes that the probability of the return of B's owner is uniform across the lifespan of the episode; the optimizing schedule maximizes the expected amount of work accomplished during the episode. The other scenario falls within our second model; it assumes that B's owner will interrupt our cycle-stealing at most once during the lifespan of the opportunity; the optimizing schedule maximizes the amount of work that one is guaranteed to accomplish during the lifespan.
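The trade-off described in the abstract (large packets amortize communication overhead; small packets limit the work lost on interruption) can be made concrete with a small Monte Carlo sketch. The model below is an illustrative simplification, not the paper's formal model: each packet pays a fixed dispatch overhead, completed packets are saved, and the packet in progress is lost if the owner returns before it finishes; the return time is uniform over the episode lifespan. All parameter names and values are hypothetical.

```python
import random

def expected_work(packet_sizes, overhead, lifespan, trials=100_000):
    """Monte Carlo estimate of expected work accomplished when the
    owner's return time is uniform over [0, lifespan].
    Each packet costs `overhead` to dispatch; a packet's work counts
    only if the packet completes before the owner returns
    (illustrative model, not the paper's exact formulation)."""
    total = 0.0
    for _ in range(trials):
        t_return = random.uniform(0.0, lifespan)
        elapsed, done = 0.0, 0.0
        for w in packet_sizes:
            elapsed += overhead + w      # dispatch, then compute
            if elapsed > t_return:       # interrupted mid-packet: lost
                break
            done += w                    # packet completed and saved
        total += done
    return total / trials

# Same total work, two schedules: many small packets vs. a few large ones.
small = expected_work([1.0] * 10, overhead=0.2, lifespan=12.0)
large = expected_work([5.0] * 2, overhead=0.2, lifespan=12.0)
```

Varying the packet sizes in such a simulation exhibits exactly the tension the paper studies: shrinking packets raises total overhead, while growing them raises the expected loss at interruption.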
ISBN: (Print) 9781479928996
In this digital world, more than 90% of desktop and notebook computers have integrated Graphics Processing Units (GPUs) for better graphics processing. The Graphics Processing Unit is useful not only for graphics applications but also for non-graphics applications. In the past few years, the programmable graphics processor has evolved into an increasingly convincing computational resource. But the GPU sits idle if the graphics job queue is empty, which decreases the GPU's efficiency. This paper focuses on various tactics to overcome this problem and to make CPU-GPU processing more powerful and efficient. The programmable graphics processor, or Graphics Processing Unit, is especially well suited to problem sets expressed as data-parallel computations, with the same program executed on many data elements concurrently. The objective of this paper is to increase the capabilities and flexibility of recent GPU hardware combined with high-level GPU programming languages: to accelerate the building of images in a frame buffer intended for output to a display, and to provide tremendous acceleration for numerically intensive scientific applications. This paper also sheds some light on major application areas where the GPU is in use and where the GPU can be used in the future.
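The "same program executed on many data elements concurrently" pattern can be sketched without GPU hardware: the classic SAXPY kernel applies one operation to every element of an array. On a GPU, each element would map to a thread; the NumPy version below expresses the same data-parallel structure on the CPU (a minimal illustration, not taken from the paper).

```python
import numpy as np

def saxpy(a, x, y):
    """a*x + y applied elementwise: the canonical data-parallel kernel.
    On a GPU, one thread would handle one element; NumPy applies the
    same operation across the whole array in a single vectorized call."""
    return a * x + y

x = np.arange(1_000_000, dtype=np.float32)
y = np.ones_like(x)
z = saxpy(np.float32(2.0), x, y)   # one "program", a million data elements
```

The absence of cross-element dependencies is what makes such kernels embarrassingly parallel, and hence a natural fit for GPU offloading.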
Associative computation is characterized by the intertwining of search by content and data-parallel computation. This intertwining facilitates the integration of knowledge retrieval and data-parallel computation. Th...
Load balancing and data locality are the two most important factors affecting the performance of parallel programs running on distributed-memory multiprocessors. A good balancing scheme should evenly distribute the workload among the available processors and locate the tasks close to their data to reduce communication and idle time. In this paper, we study the load balancing problem of data-parallel loops with predictable neighborhood data references. The loops are characterized by variable and unpredictable execution time due to dynamic external workload. Nevertheless, the data referenced by each loop iteration exploits the spatial locality of stencil references. We combine an initial static BLOCK scheduling with a dynamic scheduling based on work stealing. Data locality is preserved by careful restrictions on the tasks that can be migrated. Experimental results on a network of workstations are reported.
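The hybrid scheme the abstract describes — a static BLOCK partition followed by restricted work stealing — can be sketched in a few lines. In this simplification, each processor starts with a contiguous block of loop iterations, and an idle processor may steal only from the far end of a victim's block, so the victim keeps the iterations adjacent to its remaining data (the stealing policy and parameters below are illustrative assumptions, not the paper's exact algorithm).

```python
from collections import deque

def make_schedule(n_iters, n_procs):
    """Initial static BLOCK partition: each processor owns a contiguous
    chunk of iterations, so stencil neighbors are mostly local."""
    size = -(-n_iters // n_procs)  # ceiling division
    return [deque(range(p * size, min((p + 1) * size, n_iters)))
            for p in range(n_procs)]

def steal(queues, thief, max_steal):
    """Dynamic phase: an idle processor steals only from the tail of the
    most loaded victim's block, preserving the victim's locality
    (illustrative locality restriction)."""
    victim = max(range(len(queues)), key=lambda p: len(queues[p]))
    if victim == thief or len(queues[victim]) <= 1:
        return
    for _ in range(min(max_steal, len(queues[victim]) // 2)):
        queues[thief].append(queues[victim].pop())  # take from the far end

queues = make_schedule(16, 4)   # blocks: 0-3, 4-7, 8-11, 12-15
queues[0].clear()               # processor 0 finished early
steal(queues, thief=0, max_steal=2)
```

Restricting steals to block boundaries is what keeps the interior of each block, where every stencil reference is local, on its original processor.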