Processing data in storage is an energy-efficient way to examine massive datasets. However, general incarnations of this well-known task-offloading model have been unsuccessful in real systems, owing not only to poor performance but also to practical challenges such as limited processing capabilities and high vulnerability at the storage level. We propose DockerSSD, a fully flexible in-storage processing (ISP) model that can run a variety of applications near flash without source-level modification. Specifically, it enables lightweight OS-level virtualization in modern SSDs, which allows the storage intelligence to harmonize with existing computing environments and makes ISP even faster. Instead of requiring a vendor-specific ISP stack for offloading, DockerSSD can reuse existing Docker images, create containers as self-governing execution objects in storage, and process data in real time, directly where they reside. To this end, we design a new communication method and virtual firmware that operate together to download Docker images and manage container execution without changing the existing storage interface and runtime. We further accelerate ISP and reduce execution latency by automating the container-related network and I/O data paths in hardware. Our evaluation shows that DockerSSD is 2.0× faster than state-of-the-art ISP models for workloads with a high volume of system calls or file accesses. Moreover, it reduces power and energy consumption by 1.6× and 2.3×, respectively.
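Since DockerSSD reuses standard Docker images and runtimes rather than a vendor-specific offload API, the usage model can be sketched with the stock Docker Python SDK. The endpoint name below is a hypothetical placeholder for the device-side Docker-compatible endpoint, not an interface the abstract specifies:

```python
# Minimal sketch of the DockerSSD usage model, assuming (hypothetically)
# that the SSD exposes a Docker-compatible API endpoint named "nvme0-isp".
# Only the endpoint differs from a normal host-side deployment: the same
# unmodified image runs near flash as a self-governing container.
import docker

client = docker.DockerClient(base_url="tcp://nvme0-isp:2375")  # hypothetical

container = client.containers.run(
    "python:3.11-slim",                       # reuse a stock Docker image
    command=["python", "-c", "print(sum(range(10**6)))"],
    detach=True,
)
container.wait()                              # the ISP task runs in storage
print(container.logs().decode())
```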
We present GraphTensor, a comprehensive open-source framework that supports efficient parallel neural network processing on large graphs. GraphTensor offers a set of easy-to-use programming primitives that appreciate both graph and neural network execution behaviors from the beginning (graph sampling) to the end (dense data processing). Our framework runs diverse graph neural network (GNN) models in a destination-centric, feature-wise manner, which can significantly shorten training times on a GPU. In addition, GraphTensor rearranges multiple GNN kernels based on their system hyperparameters in a self-governing manner, further reducing processing dimensionality and latency. From the end-to-end execution viewpoint, GraphTensor significantly shortens service-level GNN latency by applying pipeline parallelism to graph dataset preprocessing. Our evaluation shows that GraphTensor delivers 1.4× better training performance than emerging GNN frameworks on large-scale, real-world graph workloads. For end-to-end services, GraphTensor reduces the training latency of an advanced version of these GNN frameworks (optimized for multi-threaded graph sampling) by 2.4×, on average.
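As a rough, framework-agnostic illustration of destination-centric, feature-wise aggregation (the general GNN pattern the abstract refers to, not GraphTensor's actual kernels), one layer can be sketched in NumPy; all sizes and the mean aggregator are assumptions:

```python
import numpy as np

# Toy one-layer GNN step: group messages by destination node, average the
# in-neighbors' features (feature-wise), then apply a dense transformation.
rng = np.random.default_rng(0)
num_nodes, feat_dim, hid_dim = 6, 4, 8
features = rng.normal(size=(num_nodes, feat_dim)).astype(np.float32)
edges = np.array([[0, 1], [2, 1], [3, 4], [1, 4], [5, 0]])  # (src, dst)
W = rng.normal(size=(feat_dim, hid_dim)).astype(np.float32)

agg = np.zeros_like(features)
deg = np.zeros(num_nodes)
for src, dst in edges:                 # destination-centric: keyed by dst
    agg[dst] += features[src]
    deg[dst] += 1
mask = deg > 0
agg[mask] /= deg[mask][:, None]        # feature-wise mean over in-neighbors
hidden = np.maximum(agg @ W, 0)        # dense stage with ReLU
print(hidden.shape)                    # (6, 8)
```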
ISBN (print): 9781939133120
We propose DC-Store, a storage framework that offers deterministic I/O performance for a multi-container execution environment. DC-Store's hardware-level design implements multiple NVM sets on a shared storage pool, each providing deterministic SSD access times by removing internal resource conflicts. In parallel, DC-Store's software support makes the Linux kernel aware of the NVM sets so that it can isolate noisy neighbor containers, which perform page-frame reclaiming, from their peers. We prototype both the hardware and software counterparts of DC-Store and evaluate them in a real system. The evaluation results demonstrate that containerized data-intensive applications on DC-Store exhibit 31% shorter execution time, on average, than those on a baseline system.
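The placement idea can be pictured with the Docker SDK: give each container exclusive access to a different NVMe namespace, here assumed (hypothetically) to map one-to-one onto the NVM sets DC-Store implements, so a reclaim-heavy neighbor cannot perturb its peers. Image names and device paths are illustrative:

```python
import docker

# Hypothetical 1:1 mapping of NVMe namespaces onto DC-Store's NVM sets.
nvm_sets = ["/dev/nvme0n1", "/dev/nvme0n2"]

client = docker.from_env()
for image, dev in zip(["noisy-app", "latency-sensitive-app"], nvm_sets):
    client.containers.run(
        image,
        detach=True,
        devices=[f"{dev}:{dev}:rwm"],  # expose only this set's namespace
    )
```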
ISBN (print): 9781939133120
NVMe is designed to unshackle flash from a traditional storage bus by allowing hosts to employ many threads to achieve higher bandwidth. While NVMe enables users to fully exploit all levels of parallelism offered by modern SSDs, current firmware designs are not scalable and have difficulty handling a large number of I/O requests in parallel due to their limited computation power and many hardware contentions. We propose DeepFlash, a novel manycore-based storage platform that can process more than one million I/O requests per second (1M IOPS) while hiding the long latencies imposed by its internal flash media. Inspired by parallel data analysis systems, we design the firmware around a many-to-many threading model that can be scaled horizontally. DeepFlash can extract the maximum performance of the underlying flash memory complex by concurrently executing multiple firmware components across many cores within the device. To show its extreme parallel scalability, we implement DeepFlash on a manycore prototype processor that employs dozens of lightweight cores, analyze the new challenges of parallel I/O processing, and address them by applying concurrency-aware optimizations. Our comprehensive evaluation reveals that DeepFlash can serve around 4.5 GB/s while minimizing CPU demand on microbenchmarks and real server workloads.
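The many-to-many threading model can be sketched in miniature: each firmware stage gets its own worker pool, stages communicate through queues, and any stage is scaled horizontally by adding workers. Stage names, counts, and the toy work functions below are illustrative, not the paper's firmware components:

```python
import queue, threading

def make_stage(workers, inq, outq, fn):
    """Spawn a worker pool that drains inq, applies fn, and feeds outq."""
    def loop():
        while True:
            item = inq.get()
            if item is None:          # shutdown marker: re-queue for siblings
                inq.put(None)
                break
            outq.put(fn(item))
    return [threading.Thread(target=loop, daemon=True) for _ in range(workers)]

q_in, q_mid, q_out = queue.Queue(), queue.Queue(), queue.Queue()
pool  = make_stage(2, q_in,  q_mid, lambda lba: ("translated", lba))
pool += make_stage(4, q_mid, q_out, lambda msg: ("done", msg[1]))
for t in pool:
    t.start()

for lba in range(8):                  # submit eight toy I/O requests
    q_in.put(lba)
print([q_out.get() for _ in range(8)])
q_in.put(None)                        # release stage-1 workers
q_mid.put(None)                       # release stage-2 workers
```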
ISBN (digital): 9781728161495
ISBN (print): 9781728161501
General-purpose hardware accelerators have become major data processing resources in many computing domains. However, the processing capability of hardware accelerators is often limited by costly software intervention and the memory copies needed to move data between different processors and solid-state drives (SSDs), which in turn wastes a significant amount of energy in modern accelerated systems. In this work, we propose DRAM-less, a hardware automation approach that precisely integrates many state-of-the-art phase-change memory (PRAM) modules into its data processing network to dramatically reduce unnecessary data copies with minimal software modification. We implement a new memory controller that connects a real 3x nm multi-partition PRAM to 28 nm FPGA logic cells, and we integrate the design into a real PCIe accelerator emulation platform. The evaluation results reveal that DRAM-less achieves, on average, 47% better performance than advanced acceleration approaches that use peer-to-peer DMA.
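A back-of-envelope comparison shows why eliminating the bounce copy matters; the numbers below are assumptions for illustration, not measurements from the paper:

```python
# Illustrative numbers (assumptions, not measurements): a conventional
# path bounces data through host DRAM (two transfers), while a direct
# data path between PRAM and the accelerator moves the payload once.
payload_gb = 4.0          # data shipped to the accelerator
link_gbps  = 12.0         # transfer throughput, assumed equal on both hops

t_bounce = 2 * payload_gb / link_gbps   # SSD -> host DRAM -> accelerator
t_direct = 1 * payload_gb / link_gbps   # hardware-automated direct path
print(f"bounce: {t_bounce:.2f}s, direct: {t_direct:.2f}s "
      f"({t_bounce / t_direct:.1f}x less data movement)")
```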
ISBN (digital): 9781728146614
ISBN (print): 9781728146621
We propose ZnG, a new GPU-SSD integrated architecture that maximizes the memory capacity in a GPU and addresses the performance penalties imposed by an SSD. Specifically, ZnG replaces all of the GPU's internal DRAM with an ultra-low-latency SSD to maximize GPU memory capacity. ZnG further removes the SSD's performance bottleneck by replacing its flash channels with a high-throughput flash network and integrating the SSD firmware into the GPU's MMU to reap the benefits of hardware acceleration. Although the flash arrays within the SSD can deliver high aggregate bandwidth, only a small fraction of that bandwidth can be utilized by the GPU's memory requests due to a mismatch in access granularity. To address this, ZnG employs a large L2 cache and flash registers to buffer the memory requests. Our evaluation results indicate that ZnG can achieve 7.5× higher performance than prior work.
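The granularity mismatch can be made concrete with assumed, illustrative sizes: a GPU memory transaction is on the order of a cache line, while flash reads whole pages, so without buffering most of each flash read is discarded:

```python
# Illustrative sizes (assumptions, not from the paper): a GPU memory
# transaction is roughly a cache line, while flash reads whole pages.
gpu_req_bytes = 128
flash_page    = 16 * 1024

print(f"utilization without buffering: {gpu_req_bytes / flash_page:.1%}")
# With ZnG's L2 cache / flash registers holding the fetched page, later
# requests to the same page hit the buffer instead of re-reading flash.
print(f"requests servable per buffered page: {flash_page // gpu_req_bytes}")
```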
ISBN (digital): 9781665423243
We propose Automatic-SSD, which converts all storage management logic into hardware, enabling energy-efficient, high-performance block storage built on fast memory. To achieve low operating power, Automatic-SSD reads and writes host-side data directly to the underlying backend storage media, without internal DRAM caches. To realize this DRAM-less approach with better performance and energy efficiency, Automatic-SSD also removes the internal processor(s) and the firmware execution therein by fully automating backend request management and data transfers across pipelined hardware modules. We prototype Automatic-SSD on a custom mid-range FPGA board, employing a large number of phase-change memory devices as representatives of new memory technologies. Our evaluation results show that, compared to a conventional firmware-based approach, Automatic-SSD delivers 28.8× better bandwidth and 25.4× better latency while consuming only 5% of the total energy, on average.
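The benefit of replacing serialized firmware execution with a fully pipelined datapath can be sketched with textbook pipeline arithmetic; the stage names and cycle counts below are illustrative assumptions:

```python
# Textbook pipeline arithmetic with assumed stages: a serialized,
# firmware-driven path finishes one request per full pass, whereas a
# filled hardware pipeline retires one request per cycle.
stages = ["decode", "translate", "media-access", "complete"]
n_reqs = 1000

serial_cycles    = n_reqs * len(stages)
pipelined_cycles = len(stages) + (n_reqs - 1)   # fill once, then 1/cycle
print(f"serial: {serial_cycles} cycles, pipelined: {pipelined_cycles} "
      f"({serial_cycles / pipelined_cycles:.1f}x)")
```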