Group-by clauses are pervasive in real-world SQL queries. Such queries are often used for strategic decision making, because they partition raw data into designated groups and typically compute aggregate functions within each group. For strategic decision making, parallelizing these queries becomes necessary to reduce query response time. The trend in parallel computing has now shifted to cluster architectures. In this paper, we propose a technique for parallelizing group-by queries on cluster architectures. A specialized algorithm for cluster architectures is needed because most existing work on parallel group-by query processing is based on a shared-nothing architecture and cannot be implemented efficiently on cluster platforms.