ISBN:
(Print) 9781931971331
Traditionally, distributed graph processing systems have largely focused on scalability through optimizing inter-node communication and load balance. However, they often deliver unsatisfactory overall processing efficiency compared with shared-memory graph computing frameworks. We analyze the behavior of several graph-parallel systems and find that the overhead added for achieving scalability becomes a major limiting factor for efficiency, especially with modern multi-core processors and high-speed interconnection networks. Based on our observations, we present Gemini, a distributed graph processing system that applies multiple optimizations targeting computation performance to build scalability on top of efficiency. Gemini adopts (1) a sparse-dense signal-slot abstraction to extend the hybrid push-pull computation model from shared-memory to distributed scenarios, (2) a chunk-based partitioning scheme enabling low-overhead scale-out designs and locality-preserving vertex accesses, (3) a dual representation scheme to compress accesses to vertex indices, (4) NUMA-aware sub-partitioning for efficient intra-node memory accesses, plus (5) locality-aware chunking and fine-grained work-stealing for improving inter-node and intra-node load balance, respectively. Our evaluation on an 8-node high-performance cluster (using five widely used graph applications and five real-world graphs) shows that Gemini significantly outperforms all well-known existing distributed graph processing systems, delivering up to 39.8× (from a minimum of 8.91×) improvement over the fastest among them.
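As a rough illustration of the chunk-based partitioning idea mentioned in the abstract, the sketch below splits a contiguous vertex ID range into per-node chunks of roughly balanced cost, so each node owns a dense local slice of vertex data. This is a minimal Python sketch, not Gemini's implementation: the cost function alpha + out_degree(v) and the name chunk_partition are assumptions made for illustration only.

```python
# Minimal sketch (not Gemini's code): contiguous chunk partitioning of vertices.
# Cost per vertex is assumed to be alpha + out_degree; the real system's
# balancing heuristic may differ.

def chunk_partition(out_degrees, num_nodes, alpha=8.0):
    """Split vertices [0, V) into num_nodes contiguous chunks of similar cost."""
    costs = [alpha + d for d in out_degrees]
    target = sum(costs) / num_nodes

    boundaries = [0]   # chunk i owns vertices [boundaries[i], boundaries[i+1])
    acc = 0.0
    for v, c in enumerate(costs):
        acc += c
        # Close the current chunk once it reaches its cost target,
        # leaving room for the remaining chunks.
        if acc >= target and len(boundaries) < num_nodes and v + 1 < len(costs):
            boundaries.append(v + 1)
            acc = 0.0
    boundaries.append(len(out_degrees))
    return boundaries


if __name__ == "__main__":
    degrees = [1, 5, 2, 0, 7, 3, 3, 1]              # toy out-degree array
    print(chunk_partition(degrees, num_nodes=2))    # -> [0, 5, 8]
```

Because each chunk is a contiguous ID range, a node can store its vertex state in a plain array indexed by (vertex_id - chunk_start), which preserves access locality and keeps partition metadata to a handful of boundary values.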
ISBN:
(Print) 9781509024537
The importance of large-scale data analysis has recently increased in a wide variety of areas, such as natural language processing, sensor data analysis, and scientific computing. Such an analysis application typically reuses existing programs as components and is often required to continuously process new data with low latency while handling large-scale data on distributed computation nodes. However, existing frameworks for combining programs into a parallel data analysis pipeline (e.g., a workflow) are plagued by the following issues: (1) Most frameworks are oriented toward high-throughput batch processing, which leads to high latency. (2) A specific language is often imposed for the composition, and/or a specific structure such as a simple unidirectional dataflow among the constituent tasks. (3) A program used as a component often takes a long time to start up due to heavy initialization, which is referred to as the startup overhead. Our solution to these problems is a remote procedure call (RPC)-based composition, which is achieved by our middleware, Rapid Service Connector (RaSC). RaSC can easily wrap an ordinary program and make it accessible as an RPC service, called a RaSC service. Using such component programs as RaSC services enables us to integrate them into one program with low latency, without being restricted to a specific workflow language or dataflow structure. In addition, a RaSC service masks the startup overhead of a component program by keeping the processes of the component program alive across RPC requests. We also propose an architecture that automatically manages the number of processes to maximize throughput. Experimental results show that our approach excels in overall throughput as well as latency, despite its RPC overhead. We also show that our approach can adapt to runtime changes in the throughput requirements.
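To make the RPC-based composition concrete, the sketch below shows one way an ordinary line-oriented program could be kept alive behind an RPC endpoint so that callers pay its startup cost only once. This is a minimal Python sketch using the standard xmlrpc module, not the actual RaSC middleware; the names WrappedProgram and wrap_and_serve are hypothetical.

```python
# Minimal sketch (not RaSC itself): expose an ordinary line-oriented program
# as a long-lived RPC service, masking its per-invocation startup overhead.

import subprocess
from xmlrpc.server import SimpleXMLRPCServer

class WrappedProgram:
    """Keep one worker process alive and feed it requests line by line."""

    def __init__(self, command):
        # Start the component program once; it stays alive across RPC calls.
        self.proc = subprocess.Popen(
            command, stdin=subprocess.PIPE, stdout=subprocess.PIPE, text=True
        )

    def process(self, line):
        # Forward one request to the component program and return its reply.
        self.proc.stdin.write(line + "\n")
        self.proc.stdin.flush()
        return self.proc.stdout.readline().rstrip("\n")


def wrap_and_serve(command, port=8080):
    server = SimpleXMLRPCServer(("0.0.0.0", port), allow_none=True)
    server.register_instance(WrappedProgram(command))
    server.serve_forever()


if __name__ == "__main__":
    # Expose a toy line-oriented filter (here `cat`) as an RPC service.
    wrap_and_serve(["cat"], port=8080)
```

A client could then invoke it with, for example, xmlrpc.client.ServerProxy("http://host:8080").process("some input line"); scaling the number of worker processes to match the request rate, as the paper describes, is omitted from this sketch.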
ISBN:
(Print) 9781509033157
The proceedings contain 39 papers. The topics discussed include: DVS: dynamic variable-width striping RAID for shingled write disks; cooperative bandwidth sharing for 5G heterogeneous network using game theory; active burst-buffer: in-transit processing integrated into hierarchical storage; DS-Index: a distributed search solution for federated cloud; assessing advanced technology in CENATE; efficient parity update for scaling RAID-like storage systems; a stripe-oriented write performance optimization for RAID-structured storage systems; CircularCache: scalable and adaptive cache management for massive storage systems; a kind of FTL scheme which keeps the high performance and lowers the capacity of RAM occupied by mapping table; distributed slot scheduling algorithm for hybrid CSMA/TDMA MAC in wireless sensor networks; correlating hardware performance events to CPU and DRAM power consumption; dynamic power-performance adjustment on clustered multi-threading processors; a high-performance persistent identification concept; hybrid replication: optimizing network bandwidth and primary storage performance for remote replication; GPU-ABFT: optimizing algorithm-based fault tolerance for heterogeneous systems with GPUs; and improving read performance of SSDs via balanced redirected read.