Chip-multiprocessors (CMPs) have become the mainstream parallel architecture in recent years; for scalability reasons, designs with high core counts tend towards tiled CMPs with physically distributed shared caches. This naturally leads to a Non-Uniform Cache Access (NUCA) design, where on-chip access latencies depend on the physical distances between requesting cores and the home cores where the data is cached. Improving data locality is thus key to performance, and several studies have addressed this problem using data replication and data migration. In this paper, we consider another mechanism, hardware-level thread migration. This approach, we argue, can better exploit shared data locality for NUCA designs by effectively replacing multiple round-trip remote cache accesses with a smaller number of migrations. High migration costs, however, make it crucial to use thread migrations judiciously; we therefore propose a novel, on-line prediction scheme which decides, at the instruction level, whether to perform a remote access (as in traditional NUCA designs) or a thread migration. For a set of parallel benchmarks, our thread migration predictor improves performance by 24% on average over a shared-NUCA design that uses only remote accesses.
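The decision the abstract describes can be illustrated with a minimal sketch: a per-instruction (PC-indexed) predictor that counts consecutive accesses to the same remote home core and, past a threshold, predicts that migrating the thread is cheaper than paying repeated round trips. The class name, table layout, and threshold below are illustrative assumptions, not the paper's actual mechanism.

```python
# Hypothetical per-PC migration predictor: a run of consecutive accesses to
# the same remote home core suggests migrating the thread there instead of
# issuing more round-trip remote accesses. Threshold and state are assumptions.

from collections import defaultdict

THRESHOLD = 3  # consecutive same-remote-core accesses before predicting "migrate"

class MigrationPredictor:
    def __init__(self):
        # per-PC state: (last remote home core seen, current run length)
        self.table = defaultdict(lambda: (None, 0))

    def access(self, pc, home_core, current_core):
        """Return 'local', 'remote', or 'migrate' for one memory access."""
        if home_core == current_core:
            self.table[pc] = (None, 0)      # local hit: reset the run
            return "local"
        last_core, run = self.table[pc]
        run = run + 1 if home_core == last_core else 1
        self.table[pc] = (home_core, run)
        return "migrate" if run >= THRESHOLD else "remote"

p = MigrationPredictor()
# Three back-to-back accesses from core 0 to data homed on core 5:
decisions = [p.access(pc=0x400, home_core=5, current_core=0) for _ in range(3)]
print(decisions)  # the third access crosses the threshold and predicts migration
```

In this toy model the first two accesses stay remote and only the third, once the run is established, triggers a migration, which mirrors the abstract's goal of replacing many round trips with one move.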
For certain applications on chip multiprocessors with more than 16 cores, a directoryless architecture with fine-grained, partial-context thread migration can outperform directory-based coherence, reducing both on-chip traffic and verification complexity.
The explosive spread of broadband in recent years has increased the exchange of various types of data via the Internet, leading to an annual increase of 160% in the volume of data used in e-mail and movie content. For storage in IT systems, the issues attracting the most attention are sudden increases in data volume, management complexity, and the impact of shutdowns. This paper discusses solutions to these issues, including "iStorage D8", featuring scalability, manageability and availability, and "iStorage D1/D3", featuring high cost efficiency, easy introduction and a space-saving design.
ISBN: (print) 9781581138399
The growing dominance of wire delays at future technology points renders a microprocessor communication-bound. Clustered microarchitectures allow most dependence chains to execute without being affected by long on-chip wire latencies. They also allow faster clock speeds and reduce design complexity, thereby emerging as a popular design choice for future microprocessors. However, a centralized data cache threatens to be the primary bottleneck in highly clustered systems. This paper attempts to identify the most complexity-effective approach to alleviating this bottleneck. While decentralized cache organizations have been proposed, they introduce excessive logic and wiring complexity. The paper evaluates whether the performance gains of a decentralized cache are worth the increase in complexity. We also introduce and evaluate the behavior of Cluster Prefetch, the forwarding of data values to a cluster through accurate address prediction. Our results show that the success of this technique depends on accurate speculation across unresolved stores. The technique applies to a wide class of processor models and, most importantly, allows high performance even while employing a simple centralized data cache. We conclude that address prediction holds more promise for future wire-delay-limited processors than decentralized cache organizations.
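The address prediction underlying a Cluster Prefetch-style scheme can be sketched with a classic per-PC stride predictor: once a load's address stride repeats, the next address can be predicted and its value forwarded toward the consuming cluster ahead of time. The table layout and saturating-confidence rule below are illustrative assumptions, not the paper's exact design.

```python
# Hedged sketch of per-PC stride-based address prediction, the kind of
# mechanism an accurate-address-prediction prefetcher relies on.
# Confidence threshold and saturation limit are illustrative.

class StridePredictor:
    def __init__(self):
        self.table = {}  # pc -> [last_addr, stride, confidence]

    def observe(self, pc, addr):
        """Record an executed load; return a predicted next address or None."""
        if pc not in self.table:
            self.table[pc] = [addr, 0, 0]
            return None
        entry = self.table[pc]
        stride = addr - entry[0]
        if stride == entry[1]:
            entry[2] = min(entry[2] + 1, 3)   # saturating confidence counter
        else:
            entry[1], entry[2] = stride, 0    # new stride: restart confidence
        entry[0] = addr
        # Predict (and thus prefetch) only once the stride has repeated.
        return addr + entry[1] if entry[2] >= 2 else None

sp = StridePredictor()
pred = None
for a in (0x1000, 0x1040, 0x1080, 0x10C0):
    pred = sp.observe(pc=0x400, addr=a)
print(hex(pred))  # after a stable 0x40 stride, predicts 0x1100
```

Note that a real implementation must also respect the paper's caveat: a predicted address is only safe to prefetch across unresolved stores if speculation on those stores is accurate, which this sketch does not model.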