ISBN:
(Print) 9781665481069
Distributed shared memory (DSM) systems can handle data-intensive applications and have recently been receiving more attention. A majority of existing DSM implementations are based on write-invalidation (WI) protocols, which achieve sub-optimal performance when the cache size is small. Specifically, the vast majority of invalidation messages become useless when evictions are frequent. This problem is troublesome given the scarcity of memory resources in data centers. To this end, we propose Falcon, a self-invalidation protocol that eliminates invalidation messages. It relies on per-operation timestamps to achieve the global memory order required by sequential consistency (SC). Furthermore, we conduct a comprehensive comparison of the two protocols with an emphasis on the impact of cache size. We also implement both protocols atop a recent DSM system, Grappa. The evaluation shows that the optimal protocol improves the performance of a KV database by 27% and a graph processing application by 71.4% over the vanilla cache-free scheme.
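As a rough illustration of the self-invalidation idea, the C++ sketch below caches values under a fixed-length lease derived from a per-operation logical clock: when the lease expires, the reader silently discards its copy and re-fetches from the home node, so no invalidation messages are needed. The class names, the lease policy, and the toy home-node store are illustrative assumptions, not Falcon's actual design.

```cpp
// A minimal single-node sketch of timestamp-driven self-invalidation.
// SelfInvCache, kLease, and the home-node callbacks are illustrative
// assumptions, not the protocol's real interface.
#include <cstdint>
#include <iostream>
#include <unordered_map>

struct CacheLine {
    uint64_t value;
    uint64_t lease_until; // logical timestamp after which the copy is stale
};

class SelfInvCache {
    std::unordered_map<uint64_t, CacheLine> lines_; // addr -> cached copy
    uint64_t now_ = 0;                              // per-operation logical clock
    static constexpr uint64_t kLease = 8;           // assumed fixed lease length

public:
    // Every operation advances the logical clock, giving the global order
    // that sequential consistency requires.
    uint64_t read(uint64_t addr, uint64_t (*fetch_home)(uint64_t)) {
        ++now_;
        auto it = lines_.find(addr);
        if (it != lines_.end() && it->second.lease_until >= now_)
            return it->second.value;          // lease still valid: local hit
        // Lease expired: self-invalidate and re-fetch from the home node.
        // No invalidation message was ever needed.
        uint64_t v = fetch_home(addr);
        lines_[addr] = {v, now_ + kLease};
        return v;
    }

    void write(uint64_t addr, uint64_t v, void (*write_home)(uint64_t, uint64_t)) {
        ++now_;
        write_home(addr, v);                  // writes go to the home node
        lines_[addr] = {v, now_ + kLease};    // refresh the local copy's lease
    }
};

// Toy "home node" backing store for the sketch.
static std::unordered_map<uint64_t, uint64_t> home;
static uint64_t fetch(uint64_t a) { return home[a]; }
static void store(uint64_t a, uint64_t v) { home[a] = v; }

int main() {
    SelfInvCache c;
    c.write(0x10, 42, store);
    std::cout << c.read(0x10, fetch) << "\n"; // 42, served from the local copy
}
```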
ISBN:
(Print) 9781665440660
The Euler tour technique is a classical tool for designing parallel graph algorithms, originally proposed for the PRAM model. We ask whether it can be adapted to run efficiently on GPUs. We focus on two established applications of the technique: (1) finding lowest common ancestors (LCA) of pairs of nodes in trees, and (2) finding bridges in undirected graphs. In our experiments, we compare theoretically optimal algorithms based on the Euler tour technique against simpler heuristics that are expected to perform particularly well on typical instances. We show that the Euler tour-based algorithms not only fulfill their theoretical promises and outperform the practical heuristics on hard instances, but also perform on par with them on easy instances.
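To make the technique concrete, here is a small sequential C++ sketch of the classical Euler tour construction the paper builds on: each undirected tree edge becomes two directed arcs, and the successor of arc (u,v) is (v,w), where w is the neighbor that follows u in v's cyclic adjacency list. A GPU implementation would compute all successors in parallel and rank the resulting linked list; the sequential walk and all names here are only illustrative.

```cpp
// Sequential Euler tour construction on a small tree.
#include <cstdio>
#include <map>
#include <utility>
#include <vector>

int main() {
    // Tree edges: 0-1, 0-2, 1-3 (undirected), stored as adjacency lists.
    std::vector<std::vector<int>> adj = {{1, 2}, {0, 3}, {0}, {1}};

    // pos[{v,u}] = index of u in adj[v], so we can find "the neighbor after u".
    std::map<std::pair<int, int>, int> pos;
    for (int v = 0; v < (int)adj.size(); ++v)
        for (int i = 0; i < (int)adj[v].size(); ++i)
            pos[{v, adj[v][i]}] = i;

    // successor(u,v) = (v, next neighbor of v after u, cyclically).
    auto succ = [&](std::pair<int, int> arc) {
        auto [u, v] = arc;
        int i = (pos[{v, u}] + 1) % (int)adj[v].size();
        return std::make_pair(v, adj[v][i]);
    };

    // Following the successor pointers from arc (0,1) visits every
    // directed arc exactly once: that walk is the Euler tour.
    std::pair<int, int> arc = {0, 1};
    for (int k = 0; k < 2 * 3; ++k) { // 2 * (#edges) arcs in total
        std::printf("(%d,%d) ", arc.first, arc.second);
        arc = succ(arc);
    }
    std::printf("\n"); // (0,1) (1,3) (3,1) (1,0) (0,2) (2,0)
}
```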
ISBN:
(Print) 9781665435772
With parallel and distributed computing (PDC) now widespread, modern computing programs must incorporate PDC within the curriculum. The ACM and IEEE Computer Society's Computer Science curricular guidelines have recommended exposure to PDC concepts since 2013. More recently, a variety of initiatives have made PDC curricular content, lectures, and labs freely available to undergraduate computer science programs. Despite these efforts, progress in ensuring that computer science students graduate with sufficient PDC exposure has been uneven. This paper discusses the impact of ABET's revised criteria, which have required exposure to PDC for the accreditation of computer science programs since 2018. The authors reviewed 20 top ABET-accredited computer science programs and analyzed how they covered the required PDC components in their curricula. Using their own institutions as case studies, the authors examine in detail how three different ABET-accredited computer science programs covered PDC using different approaches while still meeting the PDC requirements of the ABET criteria. The paper also shows how the ACM/IEEE Computer Society curricular guidelines for computer engineering and software engineering programs, along with ABET accreditation criteria, can cover PDC.
ISBN:
(Digital) 9798350371284
ISBN:
(Print) 9798350371291
Distributed deep learning framework tools should aim at high efficiency in the training and inference of distributed exascale deep learning algorithms. There are three major challenges in this endeavor: scalability, adaptivity, and efficiency. Any future framework will need to adapt to a variety of heterogeneous hardware and network environments, and will thus be required to scale from a single compute node up to large clusters. Further, it should integrate efficiently with popular frameworks such as TensorFlow, PyTorch, etc. This paper proposes a dynamic hybrid (hierarchical) distribution structure for distributed deep learning that takes advantage of flexible synchronization on both centralized and decentralized architectures, implementing multi-level fine-grained parallelism on distributed platforms. It is scalable as the number of compute nodes increases, and can also adapt to various compute capabilities, memory structures, and communication costs.
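The following toy C++ sketch illustrates the general shape of such a hybrid scheme: gradients are averaged synchronously within each group of workers (the centralized level), and group leaders then average among themselves (the decentralized level). The group layout, the scalar "gradients", and the two-level averaging are assumptions made for illustration; they are not the paper's actual implementation.

```cpp
// Two-level hierarchical gradient averaging, sketched with scalars.
#include <iostream>
#include <numeric>
#include <vector>

// Average a set of per-worker gradients (one double per worker, for brevity).
static double average(const std::vector<double>& g) {
    return std::accumulate(g.begin(), g.end(), 0.0) / g.size();
}

int main() {
    // Two groups of workers, e.g. two machines with several GPUs each.
    std::vector<std::vector<double>> groups = {{1.0, 3.0}, {5.0, 7.0}};

    // Level 1 (centralized within a group): synchronous intra-group reduce.
    std::vector<double> leader_grad;
    for (const auto& g : groups) leader_grad.push_back(average(g));

    // Level 2 (decentralized across groups): leaders average among
    // themselves. In a real system this step could be asynchronous gossip;
    // here it is a single all-to-all average for clarity.
    double global = average(leader_grad);

    std::cout << "global gradient = " << global << "\n"; // 4.0
}
```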
ISBN:
(Print) 9781728168760
Cache partitioning in tile-based CMP architectures is a challenging problem because of i) the need to determine capacity allocations with low computational overhead and ii) the need to place allocations close to where they are used, in order to reduce access latency. Although previous solutions have addressed the problems of reducing computational overhead and incorporating locality-awareness, they suffer from the overheads of centrally determining allocations. In this paper, we propose DELTA, a novel distributed and locality-aware cache partitioning solution that works by exchanging asynchronous challenges among cores. The distributed nature of the algorithm, coupled with its low computational complexity, allows for frequent reconfigurations at negligible cost and enables the scheme to be implemented directly in hardware. The allocation algorithm is supported by an enforcement mechanism that enables locality-aware placement of data. We evaluate DELTA on 16- and 64-core tiled CMPs with multi-programmed workloads. Our evaluation shows that DELTA improves performance by 9% and 16%, respectively, on average, compared to an unpartitioned shared last-level cache.
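A minimal software sketch of challenge-based capacity exchange in this spirit is shown below: a core challenges a victim for one cache way, and the way changes hands only if the challenger's marginal utility exceeds the victim's. The diminishing-returns utility model and all names are assumptions for illustration, not DELTA's hardware algorithm.

```cpp
// Challenge-based exchange of cache ways between two cores.
#include <cstdio>
#include <vector>

struct Core {
    int ways; // current allocation in the shared cache
    // Marginal utility of gaining (or cost of losing) one way, assumed to
    // shrink as the allocation grows (diminishing returns).
    double marginal_utility() const { return 100.0 / (ways + 1); }
};

// One asynchronous challenge: the challenger asks the victim for one way.
static bool challenge(Core& challenger, Core& victim) {
    if (victim.ways == 0) return false;
    // Transfer only if the challenger gains more than the victim loses.
    if (challenger.marginal_utility() > victim.marginal_utility()) {
        --victim.ways;
        ++challenger.ways;
        return true;
    }
    return false;
}

int main() {
    std::vector<Core> cores = {{2}, {10}}; // unbalanced initial partition
    // Repeated local challenges converge toward a balanced partition
    // without any central allocator.
    while (challenge(cores[0], cores[1])) {}
    std::printf("core0=%d ways, core1=%d ways\n", cores[0].ways, cores[1].ways);
    // Prints: core0=6 ways, core1=6 ways
}
```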
ISBN:
(Print) 9781665422352
Many graph processing systems have recently been developed for many-core processors. However, for iterative graph processing, due to the dependencies between vertices' states, the propagations of vertices' new states are inherently conducted along graph paths sequentially and are also dependent on each other. Despite years of research effort, existing solutions still severely underutilize many-core processors in quickly propagating the new states of vertices, and suffer from slow convergence. In this paper, we propose a dependency-driven programmable accelerator, DepGraph, which couples with the core architecture of the many-core processor and can fundamentally alleviate the challenge of dependencies, enabling faster state propagation. Specifically, we propose an effective dependency-driven asynchronous execution approach realized in novel microarchitecture designs. DepGraph prefetches vertices for the core on the fly along the dependency chains between their states and the active vertices' new states, aiming to effectively accelerate the propagation of the active vertices' new states and also ensure better data locality. By transforming the dependency chains along frequently-used paths into direct ones at runtime and maintaining these calculated direct dependencies as a set of fast shortcuts, called the hub index, DepGraph further accelerates most state propagations. Also, many propagations do not need to wait for the completion of other propagations, which enables more propagations to be conducted along the paths with a higher degree of parallelism. The experimental results show that for iterative graph processing on a simulated 64-core processor, a cutting-edge software graph processing system achieves a 5.0-22.7 times speedup after integrating with our DepGraph, while incurring only 0.6% area cost. In comparison with three state-of-the-art hardware solutions, i.e., HATS, Minnow, and PHI, DepGraph improves the performan...
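The hub-index idea can be illustrated in software: a frequently traversed dependency chain is collapsed into one direct shortcut with a precomputed combined effect, so a new state reaches the chain's tail in a single step instead of hopping through every intermediate vertex. The additive state update and all names in the C++ sketch below are illustrative assumptions, not DepGraph's microarchitecture.

```cpp
// Hub-index shortcut vs. hop-by-hop state propagation along a chain.
#include <cstdio>
#include <map>
#include <utility>
#include <vector>

int main() {
    // Dependency chain 0 -> 1 -> 2 -> 3 with a per-edge increment on the
    // propagated state: next[v] = (destination, increment).
    std::vector<std::pair<int, int>> next = {{1, 5}, {2, 7}, {3, 2}};

    // Hub index: a shortcut from the chain's head straight to its tail,
    // with the precomputed combined effect of the whole chain (5+7+2 = 14).
    std::map<int, std::pair<int, int>> hub = {{0, {3, 14}}};

    int src_state = 10;

    // Slow path: hop vertex by vertex along the dependency chain.
    int v = 0, state = src_state;
    while (v < (int)next.size()) {
        state += next[v].second;
        v = next[v].first;
    }
    std::printf("chain walk:   vertex %d gets state %d\n", v, state);

    // Fast path: one hub-index lookup yields the same result in one step.
    auto [tail, delta] = hub.at(0);
    std::printf("hub shortcut: vertex %d gets state %d\n", tail, src_state + delta);
}
```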