Search results - Inner Mongolia University Library

7th International Conference on Parallel and Distributed Systems Workshops (ICPADS 2000)

Authors: Imasaki, K; Dandamudi, S (Carleton Univ, Sch Comp Sci, Ctr Parallel & Distributed Comp, Ottawa, ON K1S 5B6, Canada)

ISBN: (print) 0769505686

Networks of workstations (NOWs) are attractive for parallel processing due to their cost advantage. This paper investigates the performance issues in processing join operations and the inherent tradeoff in the networked workstation environment. Specifically, we look at the performance of the nested-loop join algorithm. Since NOWs are heterogeneous in nature, load sharing is important for their performance. We evaluated the performance of three load sharing methods: static equal, static proportional, and dynamic scheduling with fixed chunk size. The three scheduling methods are evaluated on an experimental heterogeneous network of workstations with non-query background loads. Our experimental results suggest that, when there is no background load, dynamic scheduling outperforms static equal scheduling (by up to 40%) and is marginally better (about 10% better speedup) than static proportional scheduling. When there is dynamic background load on nodes, dynamic scheduling provides substantial performance improvement over static proportional scheduling (up to 50%) and static equal scheduling (up to about 100%). In all cases, selection of an appropriate chunk size is important in dynamic scheduling.

Keywords: network of workstations; parallel database; query processing; dynamic load sharing; parallel join algorithm
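The three load sharing methods named in the abstract can be compared with a small simulation. This is only an illustrative sketch, not the authors' experimental setup: the function names, the node speeds, and the model (divisible work, no background load, no communication cost) are all assumptions made here.

```python
import heapq
from typing import List


def static_equal(n_units: int, speeds: List[float]) -> float:
    """Finish time when every node receives the same share of work units."""
    share = n_units / len(speeds)
    # The slowest node determines when the whole join finishes.
    return max(share / s for s in speeds)


def static_proportional(n_units: int, speeds: List[float]) -> float:
    """Finish time when each node's share is proportional to its speed."""
    total_speed = sum(speeds)
    return max((n_units * s / total_speed) / s for s in speeds)


def dynamic_fixed_chunk(n_units: int, speeds: List[float], chunk: int) -> float:
    """Finish time when idle nodes repeatedly pull fixed-size chunks."""
    # Heap of (time at which the node becomes idle, node index).
    idle_at = [(0.0, i) for i in range(len(speeds))]
    heapq.heapify(idle_at)
    remaining = n_units
    finish = 0.0
    while remaining > 0:
        t, node = heapq.heappop(idle_at)
        size = min(chunk, remaining)
        remaining -= size
        done = t + size / speeds[node]
        finish = max(finish, done)
        heapq.heappush(idle_at, (done, node))
    return finish


if __name__ == "__main__":
    speeds = [1.0, 2.0, 4.0]  # hypothetical relative node speeds
    print(static_equal(700, speeds))
    print(static_proportional(700, speeds))
    print(dynamic_fixed_chunk(700, speeds, chunk=10))
```

In this simplified model static proportional scheduling is already optimal, because the speeds are known and constant. The advantage of dynamic scheduling reported in the abstract comes from conditions the sketch does not model, such as background load that changes a node's effective speed during the run.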

UNIFORM PARTITIONING OF RELATIONS USING HISTOGRAM EQUALIZATION FRAMEWORK - AN EFFICIENT PARALLEL HASH-BASED JOIN

INFORMATION PROCESSING LETTERS, 1995, vol. 55, no. 5, pp. 283-289

Authors: PARK, UK; CHOI, HK; KIM, TG (KOREA ADV INST SCI & TECHNOL, DEPT ELECT ENGN, YUSONG GU, TAEJON 305701, SOUTH KOREA; KANGWEON NATL UNIV, DEPT COMP ENGN, CHUNCHON 200701, SOUTH KOREA)

Many parallel join algorithms have been proposed for parallel relational database systems. Among them, the parallel hash-based join algorithm (PHJA) has been found to be superior to other join algorithms for uniformly distributed data. In real databases, it is often found that certain values for a given attribute occur more frequently than other values. This phenomenon is referred to as data skew. An efficient algorithm, called the skew resolution join algorithm, is proposed for parallel join operations with skewed data. A methodology is proposed for partitioning relations evenly across all processors in a parallel database system. Using the histogram equalization technique, the framework transforms the histogram of skewed data into a uniform distribution that corresponds to the relative power of the node processors in the system. The proposed algorithm exhibits better performance than the conventional PHJA in the presence of data skew, with negligible overhead in the absence of data skew.

Keywords: DATABASES; PARALLEL JOIN ALGORITHM; DATA SKEW; HISTOGRAM EQUALIZATION
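The idea of partitioning a skewed relation so that each processor's share matches its relative power can be sketched as follows. This is not the paper's skew resolution join algorithm. It only illustrates the general technique of cutting the cumulative histogram of the join attribute at power-proportional points. The names `equalized_boundaries` and `assign`, and the use of integer join keys, are assumptions made for this example.

```python
from bisect import bisect_left
from collections import Counter
from typing import List, Sequence


def equalized_boundaries(keys: Sequence[int], power: Sequence[float]) -> List[int]:
    """Return len(power) - 1 split keys chosen from the cumulative histogram.

    Node i is meant to receive roughly power[i] / sum(power) of the tuples.
    """
    hist = sorted(Counter(keys).items())  # (value, frequency) in key order
    total = len(keys)
    total_power = sum(power)

    # Cumulative tuple counts at which a partition should end.
    targets = []
    acc = 0.0
    for p in power[:-1]:
        acc += p
        targets.append(total * acc / total_power)

    boundaries = []
    running = 0
    pending = iter(targets)
    target = next(pending, None)
    for value, freq in hist:
        running += freq
        # One very frequent value may satisfy several targets at once;
        # range partitioning cannot split a single value across nodes.
        while target is not None and running >= target:
            boundaries.append(value)
            target = next(pending, None)
    return boundaries


def assign(key: int, boundaries: Sequence[int]) -> int:
    """Node index for a key: the first node whose boundary is >= key."""
    return bisect_left(boundaries, key)


if __name__ == "__main__":
    keys = [1] * 6 + [2] * 2 + [3] * 2 + [4] * 2  # value 1 is heavily skewed
    bounds = equalized_boundaries(keys, power=[1.0, 1.0])
    print(bounds, Counter(assign(k, bounds) for k in keys))
```

Both relations in a join would have to be routed with the same boundaries so that matching keys meet on the same node.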

Dynamic load balancing in multicomputer database systems using partition tuning

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1995, vol. 7, no. 6, pp. 968-983

Authors: Hua, KA; Lee, C; Hua, CM (NATL CHENG KUNG UNIV, INST INFORMAT ENGN, TAINAN 70101, TAIWAN; HBO & CO, LONGWOOD, FL 32750, USA)

Shared-nothing multiprocessor architecture is known to be more scalable for supporting very large databases. Compared to other join strategies, a hash-based join algorithm is particularly efficient and easily parallelized for this computation model. However, this hardware structure is very sensitive to skew in the tuple distribution. Unless the parallel hash join algorithm includes some dynamic load balancing mechanism, the skew effect can severely degrade system performance. In this paper, we investigate this issue. In particular, three parallel hash join algorithms are presented. We implement a simulator to study the effectiveness of these schemes. The simulation model is validated by comparing the simulation results to those produced by an actual implementation of the algorithms running on a multiprocessor system. Our performance study indicates that a naive approach is not able to provide tangible savings. However, the carefully designed strategies can offer substantial improvement over conventional techniques for a wide range of skew conditions.

Keywords: database machine; load balancing; parallel join algorithm; query processing; relational database
