One of the differences between relational and object-oriented databases (OODB) is that attributes in OODB can be of a collection type (e.g. sets, lists, arrays, and bags) as well as a simple type (e.g. integer, string...
详细信息
One of the differences between relational and object-oriented databases (OODB) is that attributes in OODB can be of a collection type (e.g. sets, lists, arrays, and bags) as well as a simple type (e.g. integer, string). Consequently, explicit joinqueries in OODB may be based on collection attributes. We call this type of join Collection joinqueries. There are three different kinds of collection joinqueries, namely: Collection-Equi join, Collection-Intersect join, and Sub-Collection join. Basically, a collection-equi join query checks an equality of both collection operands, whereas a collection-intersect join query checks whether there is an intersection between the two join collection attributes. Sub-collection joinqueries check whether one collection is a sub-collection of the other. In this paper, we present parallel join algorithms for the above three collection join query types based on the sort-merge technique. Sonic of the proposed algorithms employ a nested-loop construct as well. We also outline the complexity of collection merging in the algorithm. Parallel join algorithms are normally composed of two stages, data partitioning and local join. For the data partitioning stage in collection-intersect and sub-collection join algorithms, we propose a 'Divide and Partial Broadcast' partitioning. The proposed join algorithms play an important role in parallel object-oriented query processing, due to their superiority over the conventional join methods, which are usually in a form of relational division, and also the inefficiency of original join predicates processing.
Collection joinqueries are joinqueries based on collection attributes (i.e. non-atomic attributes), which are common in object-oriented databases. We have identified three different kinds of collection joinqueries,...
详细信息
Collection joinqueries are joinqueries based on collection attributes (i.e. non-atomic attributes), which are common in object-oriented databases. We have identified three different kinds of collection joinqueries, namely;cullection-equijoin,collection-intersectjoin, andsub-collectionjoin. In this paper, we propose parallel join algorithms for these three collection join query types based on a combination of sort and hash methods, which we callparallel sort-hash, collection join algorithms. The proposed join algorithms play an important role in parallel object-oriented query processing, due to their superiority over the conventional join methods which are usually in a form of relational division, and also the inefficiency of the original join predicate processing. In our implementation of these algorithms on a shared-memory machine, we show that the combination between sort and hash methods is proven to be better than the conventional sort-merge and nested-loop based parallel join processing
暂无评论