Search Results - Inner Mongolia University Library

34th International Conference on High Performance Computing (ISC High Performance)

Authors: Dong, Bin; Wu, Kesheng; Byna, Suren; Tang, Houjun (Lawrence Berkeley Natl Lab, 1 Cyclotron Rd, Berkeley, CA 94720, USA)

ISBN: (print) 9783030206567; 9783030206550

MapReduce brought on the Big Data revolution. However, its impact on scientific data analyses has been limited because of fundamental limitations in its data and programming models. Scientific data is typically stored as multidimensional arrays, while MapReduce is based on key-value (KV) pairs. Applying MapReduce to analyze array-based scientific data requires a conversion of arrays to KV pairs. This conversion incurs a large storage overhead and loses structural information embedded in the array. For example, analysis operations, such as convolution, are defined on the neighbors of an array element. Accessing these neighbors is straightforward using array indexes, but requires complex and expensive operations like self-join in the KV data model. In this work, we introduce a novel 'structural locality'-aware programming model (SLOPE) to compose data analysis directly on multidimensional arrays. We also develop a parallel execution engine for SLOPE to transparently partition the data, to cache intermediate results, to support in-place modification, and to recover from failures. Our evaluations with real applications show that SLOPE is over ninety thousand times faster than Apache Spark and is 38% faster than TensorFlow.

Keywords: Multidimensional array; Programming model; Structural locality; Composable data analysis; User-defined function; arrayUDF; Apache Spark; TensorFlow; MapReduce; array cache
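
The abstract's contrast between index-based neighbor access and a KV-style self-join can be illustrated with a small sketch. This is not SLOPE's API or the authors' code. The function names `stencil_array` and `stencil_kv` are hypothetical, and the example assumes a 1-D array, a 3-point average, and zero padding at the edges.

```python
# Illustrative only: not SLOPE's API, just the array-vs-KV contrast.

def stencil_array(a):
    """3-point average using direct index arithmetic on a 1-D array."""
    n = len(a)
    out = []
    for i in range(n):
        # Neighbors are found by offsetting the index; missing ones count as 0.
        left = a[i - 1] if i > 0 else 0.0
        right = a[i + 1] if i < n - 1 else 0.0
        out.append((left + a[i] + right) / 3.0)
    return out


def stencil_kv(pairs):
    """Same average on (index, value) pairs, emulating a self-join by key.

    Assumes the keys are the dense indices 0..n-1 (hypothetical layout).
    """
    n = len(pairs)
    grouped = {}
    # Each element is re-emitted under every key whose window contains it,
    # so the data is replicated three times before grouping.
    for k, v in pairs:
        for offset in (-1, 0, 1):
            target = k + offset
            if 0 <= target < n:
                grouped.setdefault(target, []).append(v)
    # Reduce step: combine the values that were grouped under each key.
    return [sum(grouped[k]) / 3.0 for k in range(n)]


data = [3.0, 6.0, 9.0, 12.0]
print(stencil_array(data))
print(stencil_kv(list(enumerate(data))))
```

Both functions compute the same values, but the KV version has to replicate and regroup every element to recover the neighborhood that the array index provides directly.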

Issues and challenges in the performance analysis of real disk arrays

IEEE Transactions on Parallel and Distributed Systems, 2004, vol. 15, no. 6, pp. 559-574

Authors: Varki, E; Merchant, A; Xu, JZ; Qiu, XZ (Univ New Hampshire, Dept Comp Sci, Durham, NH 03824, USA; Hewlett Packard Labs, Storage Syst Dept, Palo Alto, CA 94304, USA; Falconstor Software Inc, Melville, NY 11747, USA)

The performance modeling and analysis of disk arrays is challenging due to the presence of multiple disks, large array caches, and sophisticated array controllers. Moreover, storage manufacturers may not reveal the internal algorithms implemented in their devices, so real disk arrays are effectively black boxes. We use standard performance techniques to develop an integrated performance model that incorporates some of the complexities of real disk arrays. We show how measurement data and baseline performance models can be used to extract information about the various features implemented in a disk array. In this process, we identify areas for future research in the performance analysis of real disk arrays.

Keywords: RAID; analytical performance model; array cache; parallel I/O; enterprise storage systems; I/O performance evaluation; disk array

Performance analysis of RAID in different workload

Open Cybernetics and Systemics Journal, 2015, vol. 9, no. 1, pp. 324-328

Authors: Dule, Zhang; Xiaoyun, Ji; Miao, He; Huaijie, Zhu (China Petroleum Pipeline Engineering Corporation, Langfang, Hebei 065000, China)

A performance evaluation model of the RAID system is built using a queuing network. Using the MVA method, we develop, validate, and apply an analytic performance model for disk arrays configured as RAID 5. The results show ...

Keywords: Analytical performance model; array cache; Disk array; I/O performance evaluation; Parallel I/O; RAID
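
For readers unfamiliar with the MVA method named in the abstract, the following is a minimal sketch of the generic textbook exact Mean Value Analysis recursion for a closed, single-class queueing network. It is not the paper's RAID 5 model. The function name `exact_mva`, the service demands, and the customer count are all hypothetical, and every station is assumed to be a simple queueing station.

```python
def exact_mva(demands, customers, think_time=0.0):
    """Exact MVA for a closed single-class network of queueing stations.

    demands:    service demand per station (hypothetical values, in seconds)
    customers:  number of circulating requests
    think_time: delay outside the stations (0 for a pure batch workload)
    Returns (throughput, per-station response times, per-station queue lengths).
    """
    queue = [0.0] * len(demands)
    response = [0.0] * len(demands)
    throughput = 0.0
    for n in range(1, customers + 1):
        # Arrival theorem: an arriving request sees the queue lengths of the
        # same network with one fewer request in it.
        response = [d * (1.0 + q) for d, q in zip(demands, queue)]
        # Little's law applied to the whole network.
        throughput = n / (think_time + sum(response))
        # Little's law applied to each station.
        queue = [throughput * r for r in response]
    return throughput, response, queue


# Two stations with equal (hypothetical) demands and two circulating requests.
x, r, q = exact_mva([1.0, 1.0], 2)
print(x, r, q)
```

In a disk-array setting the stations would typically stand for components such as disks and controllers, but how the paper maps RAID 5 operations onto stations is not stated in the truncated abstract.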
