Search results - Inner Mongolia University Library

PARALLEL SORT-HASH OBJECT-ORIENTED COLLECTION JOIN ALGORITHMS FOR SHARED-MEMORY MACHINES

Parallel Algorithms and Applications, 2002, Vol. 17, No. 2, pp. 85-126

Authors: David Taniar [a] (E-mail: David.Taniar@infotech.monash.edu.au), J. Wenny Rahayu [b]. [a] School of Business Systems, Monash University, Clayton, Vic, Australia; [b] Department of Computer Science and Computer Engineering, La Trobe University, Bundoora, Vic, Australia

Collection join queries are join queries based on collection attributes (i.e. non-atomic attributes), which are common in object-oriented databases. We have identified three different kinds of collection join queries, namely collection-equijoin, collection-intersect join, and sub-collection join. In this paper, we propose parallel join algorithms for these three collection join query types based on a combination of sort and hash methods, which we call parallel sort-hash collection join algorithms. The proposed join algorithms play an important role in parallel object-oriented query processing, due to their superiority over the conventional join methods, which are usually in the form of relational division, and to the inefficiency of the original join predicate processing. In our implementation of these algorithms on a shared-memory machine, we show that the combination of sort and hash methods performs better than conventional sort-merge and nested-loop based parallel join processing.

Keywords: parallel query processing; parallel join; object-oriented join queries; collection types; object-oriented databases; parallel databases
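
As a rough illustration of the three query types named in the abstract, the following Python sketch treats every collection attribute as a set and evaluates the three predicates serially. The function names, the (id, collection) input layout, and the reading of "sort-hash" as "sort each collection into a canonical key, then hash on it" are assumptions made for this example; the paper's parallel algorithms and its treatment of other collection types are not reproduced here.

```python
from collections import defaultdict

def canonical(coll):
    # Sort step (assumed): order the distinct elements so that equal sets
    # always produce the same hashable key.
    return tuple(sorted(set(coll)))

def collection_equijoin(r, s):
    """Pair objects whose collections contain exactly the same elements."""
    table = defaultdict(list)
    for s_id, s_coll in s:
        table[canonical(s_coll)].append(s_id)  # hash step on the sorted key
    return [(r_id, s_id)
            for r_id, r_coll in r
            for s_id in table.get(canonical(r_coll), [])]

def collection_intersect_join(r, s):
    """Pair objects whose collections share at least one element."""
    owners = defaultdict(set)  # element -> ids of s objects containing it
    for s_id, s_coll in s:
        for element in s_coll:
            owners[element].add(s_id)
    result = []
    for r_id, r_coll in r:
        matches = set()
        for element in r_coll:
            matches |= owners.get(element, set())
        result.extend((r_id, s_id) for s_id in sorted(matches))
    return result

def sub_collection_join(r, s):
    """Pair objects where the r collection is contained in the s collection."""
    s_sets = [(s_id, set(s_coll)) for s_id, s_coll in s]
    return [(r_id, s_id)
            for r_id, r_coll in r
            for s_id, s_set in s_sets
            if all(element in s_set for element in r_coll)]
```

Each input is a list of (object id, collection) pairs, for example `[("r1", [3, 1, 2]), ("r2", [4, 5])]`.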

Performance analysis of parallelization models for path expression queries

INFORMATION SCIENCES, 1999, Vol. 117, No. 1-2, pp. 107-142

Authors: Taniar, D; Rahayu, JW. RMIT Univ, Dept Comp Sci, Melbourne, Vic, Australia; La Trobe Univ, Sch Comp Sci & Comp Engn, Bundoora, Vic 3083, Australia

In this paper, parallelization models for path expression queries are studied. Path expression queries involve multiple classes along aggregation/association hierarchies. The parallelization models for path expression queries are "inter-object parallelization" and "inter-class parallelization". Inter-object parallelization exploits the associativity within complex objects, whereas inter-class parallelization imposes upon process independence. The behaviours of these parallelization models are described in terms of analytical models. Performance evaluation is also performed to confirm the results from the quantitative analysis. (C) 1999 Elsevier Science Inc. All rights reserved.

Keywords: path expression queries; parallel query processing; inter-object parallelization; inter-class parallelization; parallel object-oriented databases

Semantic parallelism for documentary database systems

International Conference and Exhibition on High-Performance Computing and Networking

Author: Biscondi, N. Inst Natl Sci Appl, LISI, F-69621 Villeurbanne, France

ISBN: (print) 3540644431

As data volume and query processing loads increase, companies that provide information retrieval services are turning to high-performance parallel computing, storage and searching. In this paper we present a new paradigm of semantic parallelism dedicated to documentary databases. Based on existing parallel database techniques, our approach uses particular features of documentary databases to retrieve semantically relevant information more rapidly and efficiently. Thus it can greatly alleviate both the information overload and the vocabulary problems of information retrieval. Extensive simulation results confirm the efficiency of our approach.

Keywords: parallel query processing; parallel algorithms; information retrieval; semantic analysis

On applying hash filters to improving the execution of multi-join queries

VLDB Journal, 1997, Vol. 6, No. 2, pp. 121-131

Authors: Chen, Ming-Syan; Hsiao, Hui-I; Yu, Philip S. Electrical Engineering Department, National Taiwan University, Taipei, Taiwan; IBM T.J. Watson Research Center, Yorktown, NY 10598, P.O. Box 704, United States

In this paper, we explore an approach of interleaving a bushy execution tree with hash filters to improve the execution of multi-join queries. Similar to semi-joins in distributed query processing, hash filters can be applied to eliminate non-matching tuples from joining relations before the execution of a join, thus reducing the join cost. Note that hash filters built in different execution stages of a bushy tree can have different costs and effects. The effect of hash filters is evaluated first. Then, an efficient scheme to determine an effective sequence of hash filters for a bushy execution tree is developed, where hash filters are built and applied based on the join sequence specified in the bushy tree so that not only is the reduction effect optimized but the associated cost is also minimized. Various schemes using hash filters are implemented and evaluated via simulation. It is experimentally shown that the application of hash filters is in general a very powerful means to improve the execution of multi-join queries, and the improvement becomes more prominent as the number of relations in a query increases.

Keywords: Bushy trees; Hash filters; parallel query processing; Sort-merge joins
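
To make the idea of a hash filter concrete, here is a minimal Python sketch, assuming a filter implemented as a fixed-size bit array over the join attribute of one relation and applied to the other relation before a plain hash join. The function names, the dictionary row layout, and the `n_bits` parameter are hypothetical choices for this example; the paper's scheme for choosing a sequence of filters in a bushy tree is not shown.

```python
def build_hash_filter(rows, key, n_bits=64):
    """Mark one bit for every join-attribute value present in `rows`."""
    bits = [False] * n_bits
    for row in rows:
        bits[hash(row[key]) % n_bits] = True
    return bits

def apply_hash_filter(rows, key, bits):
    """Drop rows whose join value cannot match; false positives may remain."""
    n_bits = len(bits)
    return [row for row in rows if bits[hash(row[key]) % n_bits]]

def hash_join(left, right, key):
    """Plain in-memory hash join: build on `right`, probe with `left`."""
    table = {}
    for row in right:
        table.setdefault(row[key], []).append(row)
    return [{**l_row, **r_row}
            for l_row in left
            for r_row in table.get(l_row[key], [])]
```

Filtering never changes the join result, because a surviving false positive is still rejected by the join itself; the filter only reduces the number of tuples that reach the join.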

APPLYING SEGMENTED RIGHT-DEEP TREES TO PIPELINING MULTIPLE HASH JOINS

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1995, Vol. 7, No. 4, pp. 656-668

Authors: CHEN, MS; LO, ML; YU, PS; YOUNG, HC. UNIV MICHIGAN, DEPT EECS, ANN ARBOR, MI 48109; IBM CORP, ALMADEN RES CTR, SAN JOSE, CA 95120

The pipelined execution of multijoin queries in a multiprocessor-based database system is explored in this paper. Using hash-based joins, multiple joins can be pipelined so that the early results from a join, before the whole join is completed, are sent to the next join for processing. The execution of a query is usually denoted by a query execution tree. To improve the execution of pipelined hash joins, an innovative approach to query execution tree selection is proposed to exploit segmented right-deep trees, which are bushy trees of right-deep subtrees. We first derive an analytical model for the execution of a pipeline segment, and then, in light of the model, develop heuristic schemes to determine the query execution plan based on a segmented right-deep tree so that the query can be efficiently executed. As shown by our simulation, the proposed approach, without incurring additional overhead on plan execution, possesses more flexibility in query plan generation, and can lead to query plans of better performance than those achievable by the previous schemes using right-deep trees.

Keywords: PIPELINING; parallel query processing; BUSHY TREES; RIGHT-DEEP TREES; HASH JOINS
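
The pipelining described in the abstract can be sketched for a single pipeline segment as follows. This is a minimal single-process Python illustration, assuming that all build relations of the segment fit in memory: their hash tables are built first, and each probe tuple then flows through every join before the next tuple is read. The function names and the `stages` parameter are hypothetical; the paper's analytical model and plan-selection heuristics are not reproduced.

```python
def build_table(rows, key):
    """Build an in-memory hash table on the join attribute `key`."""
    table = {}
    for row in rows:
        table.setdefault(row[key], []).append(row)
    return table

def probe(stream, table, key):
    """Lazily join a stream of rows against one prebuilt hash table."""
    for row in stream:
        for match in table.get(row[key], []):
            yield {**row, **match}

def pipelined_joins(probe_rows, stages):
    """Chain the joins of one right-deep segment.

    `stages` is a list of (build_rows, join_key) pairs, one per join.
    """
    # Build phase: every hash table of the segment is built up front.
    tables = [(build_table(rows, key), key) for rows, key in stages]
    # Probe phase: generators pass each result to the next join as soon
    # as it is produced, without materialising intermediate relations.
    stream = iter(probe_rows)
    for table, key in tables:
        stream = probe(stream, table, key)
    return stream
```

Because the stages are generators, the first output row can be produced before the probe relation has been fully read.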

A SELF-ADJUSTING DATA DISTRIBUTION MECHANISM FOR MULTIDIMENSIONAL LOAD BALANCING IN MULTIPROCESSOR-BASED DATABASE-SYSTEMS

INFORMATION SYSTEMS, 1994, Vol. 19, No. 7, pp. 549-567

Authors: LEE, C; HUA, KA. NATL CHENG KUNG UNIV, INST INFORMAT ENGN, TAINAN, TAIWAN; UNIV CENT FLORIDA, DEPT COMP SCI, ORLANDO, FL 32816

With the advent of micro-processor, memory, and communication technology, it is economically feasible to develop a parallel database computer system to improve the performance of database systems. Relations in such an environment are usually partitioned and distributed across computing units. To achieve the optimal performance, it is essential for each unit to have a perfectly balanced load (i.e., identical amount of data). However, fragment sizes may vary due to insertions to and deletions from a relation. To retain good performance, the system needs to periodically rebalance the load of the processors by redistributing data among computing units. Traditionally, the redistribution is performed by reshuffling tuples among processors through a relation repartitioning (e.g., rehashing) process. The computation of this process is at the tuple level. In this paper, we present a self-adjusting data distribution scheme which balances computer workload at a cell (coarser grain than tuple) level during query processing to minimize redistribution cost. The entire scheme is built on top of the popular grid file structure. The adaptivity of the scheme and its relevant features are discussed. The cost of load rebalancing is estimated. The result shows that under our assumptions, it is always beneficial to rebalance computer workload before performing a join on skewed data.

Keywords: parallel query processing; LOAD BALANCING; DATA SKEW; GRID FILE
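
The contrast between tuple-level and cell-level redistribution can be illustrated with a small Python sketch. It assumes a generic greedy policy that repeatedly moves one whole cell from the most loaded to the least loaded processor while that move still reduces the imbalance. This policy, the function name, and the parameters are assumptions made for illustration only; they do not describe the paper's grid-file-based mechanism.

```python
def rebalance(assignment, cell_sizes, processors):
    """Greedily move whole cells from the heaviest to the lightest processor.

    assignment: dict mapping cell -> processor
    cell_sizes: dict mapping cell -> number of tuples in the cell
    processors: list of all processors, including empty ones
    Returns the new assignment and the list of (cell, source, target) moves.
    """
    assignment = dict(assignment)
    moves = []
    while True:
        load = {p: 0 for p in processors}
        for cell, proc in assignment.items():
            load[proc] += cell_sizes[cell]
        heavy = max(processors, key=lambda p: load[p])
        light = min(processors, key=lambda p: load[p])
        gap = load[heavy] - load[light]
        # Only a cell smaller than the gap narrows the difference between
        # the two processors when it is moved.
        candidates = [c for c, p in assignment.items()
                      if p == heavy and 0 < cell_sizes[c] < gap]
        if not candidates:
            return assignment, moves
        # Prefer the cell whose size is closest to half of the gap.
        cell = min(candidates, key=lambda c: (abs(cell_sizes[c] - gap / 2), c))
        assignment[cell] = light
        moves.append((cell, heavy, light))
```

Only a handful of cell moves are recorded, rather than one decision per tuple, which is the granularity difference the abstract points to.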

DATA-FLOW QUERY EXECUTION IN A PARALLEL MAIN-MEMORY ENVIRONMENT

DISTRIBUTED AND PARALLEL DATABASES, 1993, Vol. 1, No. 1, pp. 103-128

Authors: WILSCHUT, AN; APERS, PMG. University of Twente, P.O. Box 217, 7500 AE Enschede, The Netherlands

In this paper, the performance and characteristics of the execution of various join-trees on a parallel DBMS are studied. The results of this study are a step toward the design of a query optimization strategy that is fit for parallel execution of complex queries. Among other things, synchronization issues are identified as limiting the performance gain from parallelism. A new hash-join algorithm is introduced that has fewer synchronization constraints than the known hash-join algorithms. Also, the behavior of individual join operations in a join-tree is studied in a simulation experiment. The results show that the introduced Pipelining hash-join algorithm yields better performance for multi-join queries. The format of the optimal join-tree appears to depend on the size of the operands of the join: a multi-join between small operands performs best with a bushy schedule; larger operands are better off with a linear schedule. The results from the simulation study are confirmed with an analytic model for dataflow query execution.

Keywords: parallel query processing; MULTI-JOIN QUERIES; SIMULATION; ANALYTICAL MODELING
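
The abstract does not spell out the new algorithm. The Python sketch below assumes the symmetric, two-table form in which a hash table is kept for each operand and every arriving tuple is first inserted into its own table and then probed against the other, so neither operand has to be read completely before results appear. The function name and the (side, row) input layout are hypothetical.

```python
def symmetric_hash_join(arrivals, key):
    """Join two interleaved tuple streams without a separate build phase.

    arrivals: iterable of (side, row) pairs, where side is "L" or "R"
    key: name of the join attribute shared by both operands
    """
    tables = {"L": {}, "R": {}}
    for side, row in arrivals:
        other = "R" if side == "L" else "L"
        # Insert the tuple into the hash table of its own operand.
        tables[side].setdefault(row[key], []).append(row)
        # Probe the other operand's table with the same tuple.
        for match in tables[other].get(row[key], []):
            left, right = (row, match) if side == "L" else (match, row)
            yield {**left, **right}
```

A result is emitted as soon as both matching tuples have arrived, whichever side arrives second.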

DATA-FLOW QUERY EXECUTION IN A PARALLEL MAIN-MEMORY ENVIRONMENT

1991 International Conference on Parallel and Distributed Information Systems

Authors: WILSCHUT, AN; APERS, PMG

ISBN: (print) 0818622954

Keywords: parallel query processing; MULTI-JOIN QUERIES; SIMULATION; ANALYTICAL MODELING
