ISBN: (Print) 9781509001859
The proceedings contain 14 papers. The topics discussed include: on the fence: an offload approach to ordering one-sided communication; caching puts and gets in a PGAS language runtime; impact of frequency scaling on one-sided remote memory accesses; implementing high-performance geometric multigrid solver with naturally grained messages; an evaluation of anticipated extensions for Fortran coarrays; preliminary implementation of Coarray Fortran translator based on Omni XcalableMP; using the Parallel Research Kernels to study PGAS models; PHLAME: hierarchical locality exploitation using the PGAS model; a compiler transformation to overlap communication with dependent computation; toward a data-centric profiler for PGAS applications; scaling HabaneroUPC++ on heterogeneous supercomputers; PySHMEM: a high-productivity OpenSHMEM interface for Python; and ISx: a scalable integer sort for co-design in the exascale era.
A subset of the Parallel Research Kernels (PRK), simplified parallel application patterns, is used to study the behavior of different runtimes implementing the PGAS programming model. The goal of this paper is to show that such an approach is practical and effective as we approach the exascale era. Our experimental results indicate that, for the kernels we selected, MPI with two-sided communication outperforms the PGAS runtimes SHMEM, UPC, Grappa, and MPI-3 with RMA extensions.
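The PRK are deliberately small application patterns rather than full applications. As a flavor of what the runtimes are compared on, here is a serial Python sketch of a radius-1 star stencil, one of the PRK patterns; the grid size and weight are illustrative, not taken from the paper:

```python
# Serial sketch of a PRK-style star stencil (radius 1).
# Grid size and weight below are illustrative choices.

def stencil_step(grid):
    """Apply a radius-1 star stencil to the interior of a square grid."""
    n = len(grid)
    out = [row[:] for row in grid]
    w = 0.25  # uniform weight for the four neighbors (illustrative)
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            out[i][j] = w * (grid[i - 1][j] + grid[i + 1][j] +
                             grid[i][j - 1] + grid[i][j + 1])
    return out

grid = [[float(i + j) for j in range(8)] for i in range(8)]
result = stencil_step(grid)
```

In a distributed version, each rank owns a tile of the grid and must obtain neighbor "halo" values, either via two-sided messages or via one-sided gets; that communication step is what the paper's MPI-versus-PGAS comparison exercises.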
We investigated a software cache for PGAS PUT and GET operations. The cache is implemented as a software write-back cache with dirty bits, local memory consistency operations, and programmer-guided prefetch. This cache supports programmer productivity while enabling communication aggregation and overlap. We evaluated an implementation of this cache for remote data within the Chapel programming language. The cache provides a 2x speedup for several distributed-memory application benchmarks written in Chapel across a variety of network configurations. In addition, we observed that improvements to compiler optimization did not remove the benefit of the cache.
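The core mechanism, a write-back cache whose dirty lines are flushed at a consistency point, can be sketched in a few lines of Python. This is a minimal model, not the Chapel implementation: a dict stands in for remote memory, and all names are illustrative.

```python
# Minimal sketch of a write-back cache for PGAS-style PUT/GET.
# `remote` (a dict) stands in for another locale's memory; names are illustrative.

class PutGetCache:
    def __init__(self, remote):
        self.remote = remote      # stands in for remote (another node's) memory
        self.lines = {}           # addr -> locally cached value
        self.dirty = set()        # addrs modified locally but not written back

    def get(self, addr):
        if addr not in self.lines:           # miss: fetch from remote once
            self.lines[addr] = self.remote[addr]
        return self.lines[addr]             # hit: no communication

    def put(self, addr, value):
        self.lines[addr] = value             # write locally only
        self.dirty.add(addr)                 # mark line for later write-back

    def fence(self):
        # Memory-consistency point: write back all dirty lines at once,
        # aggregating many small PUTs instead of issuing one per store.
        for addr in self.dirty:
            self.remote[addr] = self.lines[addr]
        self.dirty.clear()

remote = {0: 10, 1: 20}
cache = PutGetCache(remote)
cache.put(0, 99)          # buffered locally; remote[0] is still 10
cache.fence()             # write-back; remote[0] becomes 99
```

The aggregation at the fence is what lets many fine-grained PUTs travel as fewer, larger messages; the actual Chapel cache additionally overlaps these transfers with computation and supports programmer-guided prefetch.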
ISBN: (Print) 9781450300445
The Message Passing Interface (MPI) is one of the most widely used programming models for parallel computing. However, the amount of memory available to an MPI process is limited by the amount of local memory within a compute node. Partitioned global address space (PGAS) models such as Unified Parallel C (UPC) are growing in popularity because of their ability to provide a shared global address space that spans the memories of multiple compute nodes. However, taking advantage of UPC can require a large recoding effort for existing parallel applications. In this paper, we explore a new hybrid parallel programming model that combines MPI and UPC. This model allows MPI programmers incremental access to a greater amount of memory, enabling memory-constrained MPI codes to process larger data sets. In addition, the hybrid model offers UPC programmers an opportunity to create static UPC groups that are connected over MPI. As we demonstrate, the use of such groups can significantly improve the scalability of locality-constrained UPC codes. This paper presents a detailed description of the hybrid model and demonstrates its effectiveness in two applications: a random access benchmark and the Barnes-Hut cosmological simulation. Experimental results indicate that the hybrid model can greatly enhance performance; using hybrid UPC groups that span two cluster nodes, RA performance increases by a factor of 1.33, and using groups that span four cluster nodes, Barnes-Hut experiences a twofold speedup at the expense of a 2% increase in code size.
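The memory argument behind the hybrid model can be made concrete with a toy Python model: ranks are partitioned into static groups, every member of a group addresses one pooled shared heap (the UPC part), and groups exchange data by explicit messages (the MPI part). All names and sizes here are illustrative, not from the paper.

```python
# Toy model of the hybrid MPI+UPC grouping idea; sizes are illustrative.

PER_NODE_MEM = 4  # memory units each rank's node contributes

class HybridGroup:
    """A static UPC group: all member ranks address one shared heap."""
    def __init__(self, ranks):
        self.ranks = ranks
        self.shared_heap = {}                 # addressable by every member

    def addressable_memory(self):
        # Each member can address the pooled memory of the whole group.
        return PER_NODE_MEM * len(self.ranks)

def make_groups(n_ranks, group_size):
    """Partition ranks 0..n_ranks-1 into static groups of group_size."""
    return [HybridGroup(list(range(start, start + group_size)))
            for start in range(0, n_ranks, group_size)]

# 8 MPI ranks, with UPC groups spanning 2 nodes each:
groups = make_groups(8, 2)
mem_per_rank = groups[0].addressable_memory()   # 2 * PER_NODE_MEM
```

Under pure MPI each process addresses only its node's 4 units; grouping two nodes doubles the data set a memory-constrained code can hold, which is the "incremental access to a greater amount of memory" the abstract describes.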
Structured grid linear solvers often require manual packing and unpacking of communication data to achieve high performance. Performing this process efficiently is challenging, labor-intensive, and potentially error-prone. In this paper, we explore an alternative approach that communicates the data with naturally grained message sizes, without manual packing and unpacking. This approach is the distributed analogue of shared-memory programming, taking advantage of the global address space in PGAS languages to provide substantial programming ease. However, its performance may suffer from the large number of small messages. We investigate the runtime support required in the UPC++ library for this naturally grained version to close the performance gap between the two approaches and attain comparable performance at scale, using the High-Performance Geometric Multigrid (HPGMG-FV) benchmark as a driver.
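The trade-off between the two approaches can be sketched by counting messages for a ghost-zone exchange. In this Python model a list of message sizes stands in for the network, and the face size is an illustrative choice:

```python
# Sketch of the packing trade-off: one aggregated message vs. naturally
# grained messages for the same ghost face. The "network" is a list of
# message sizes; dimensions are illustrative.

def send(msg_log, payload):
    """Record one message of len(payload) elements on the mock network."""
    msg_log.append(len(payload))

def exchange_packed(face, msg_log):
    # Manual approach: copy the ghost face into one buffer, send once.
    buffer = [v for row in face for v in row]   # pack
    send(msg_log, buffer)

def exchange_fine_grained(face, msg_log):
    # Naturally grained approach: each row goes as its own small message,
    # as stores through a global address space would naturally generate.
    for row in face:
        send(msg_log, row)

face = [[1.0] * 16 for _ in range(16)]   # a 16x16 ghost face
packed, fine = [], []
exchange_packed(face, packed)            # 1 message of 256 elements
exchange_fine_grained(face, fine)        # 16 messages of 16 elements
```

Both variants move the same 256 elements, but the naturally grained version trades away packing code for 16x the message count; the runtime support studied in the paper aims to close the resulting performance gap inside UPC++.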
Partitioned global address space (PGAS) and one-sided communication models allow shared data to be transparently and asynchronously accessed by any process within a parallel computation. In order to ensure that updates are performed in the intended order, the programmer must either use potentially slower ordered communication or perform operations that order unordered communication, such as a fence in the OpenSHMEM model. Often, implementations of such ordering mechanisms require blocking until pending operations have completed remotely before allowing new operations to be issued. In this work, we present a new queuing technique for the implementation of one-sided communication ordering that is nonblocking and ensures asynchronous progress for pending communication operations. We describe an implementation of this approach that uses Portals triggered operations to offload queuing of communication operations across ordering boundaries. By eliminating blocking for ordered communication, this approach is able to provide automatic overlap of communication and computation. We demonstrate the benefit of this technique on several applications and measure performance improvements in the 10%-15% range from allowing computation to progress while ordered communication operations are pending.
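The queuing idea can be modeled in a few lines of Python: a fence records an ordering boundary and returns immediately, and puts issued after it are queued and released once earlier puts complete, instead of the caller blocking at the fence. This is a toy single-target model with illustrative names, not the Portals implementation.

```python
# Toy model of nonblocking ordering for one-sided puts: fence() returns
# immediately, and post-fence puts are deferred until earlier puts complete
# (mimicking queuing offloaded to triggered operations). Names illustrative.

class NonblockingOrderedChannel:
    def __init__(self):
        self.in_flight = 0      # puts issued before the current boundary
        self.deferred = []      # puts queued behind the boundary
        self.fenced = False
        self.delivered = []     # completion order observed at the target

    def put(self, op):
        if self.fenced and self.in_flight > 0:
            self.deferred.append(op)     # queue; do NOT block the caller
        else:
            self.in_flight += 1
            self.delivered.append(op)    # issued immediately

    def fence(self):
        self.fenced = True               # just records the boundary

    def on_complete(self):
        # Invoked as an earlier put completes remotely; once all have, the
        # queued puts are released in order -- the role triggered operations
        # play in the offloaded implementation.
        self.in_flight -= 1
        if self.in_flight == 0:
            self.fenced = False
            pending, self.deferred = self.deferred, []
            for op in pending:
                self.put(op)

ch = NonblockingOrderedChannel()
ch.put("A")
ch.fence()        # nonblocking: the caller keeps computing
ch.put("B")       # queued behind the fence, not yet delivered
ch.on_complete()  # "A" completes; "B" is released, preserving order
```

Because the caller never waits at the fence, computation proceeds while "A" is in flight; that overlap is the source of the 10%-15% improvements the abstract reports.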