检索结果-内蒙古大学图书馆

Computation World - Future Computing, Service Computation, Cognitive, Adaptive, Content, Patterns Conference

作者： Yu, Qicheng McCann, Julie A. Cai, Fang Fang London Metropolitan Univ Fac Comp London England Imperial Coll Dept Comp London SW7 2AZ England

ISBN: (纸本)9781424451661

Data warehousing continues to play an important role in global information systems for businesses. Meanwhile, applications of data warehousing have evolved from reporting and decision support systems to mission critical decision making systems. This requires data warehouses to combine both historical and current data from operational systems. Since a join operation is one of the most expensive operations in query processing, it is vital to develop effective and efficient join techniques for a distributed warehouse environment. In this paper, we propose an agent-based adaptive join algorithm called Ajoin for effective and efficient online join operations in distributed data warehouses. Ajoin utilises intelligent agents for dynamic optimisation and coordination of join processing at run time. Key aspects of the Ajoin algorithm have been implemented and evaluated against other modern adaptive join algorithms. It has been shown that Ajoin exhibits better performance under various distributed and dynamic data warehouse environments in our study.

关键词： Adaptive join software agents data warehousing join algorithm

来源：评论

学校读者我要写书评

暂无评论

Multi-Core, Main-Memory joins: Sort vs. Hash Revisited

引用

PROCEEDINGS OF THE VLDB ENDOWMENT 2013年第1期7卷 85-96页

作者： Balkesen, Cagri Alonso, Gustavo Teubner, Jens Oezsu, M. Tamer ETH Syst Grp Zurich Switzerland TU Dortmund Univ Dortmund Germany Univ Waterloo Waterloo ON Canada

In this paper we experimentally study the performance of main-memory, parallel, multi-core join algorithms, focusing on sort-merge and (radix-) hash join. The relative performance of these two join approaches have been a topic of discussion for a long time. With the advent of modern multicore architectures, it has been argued that sort-merge join is now a better choice than radix-hash join. This claim is justified based on the width of SIMD instructions (sort-merge outperforms radix-hash join once SIMD is sufficiently wide), and NUMA awareness (sort-merge is superior to hash join in NUMA architectures). We conduct extensive experiments on the original and optimized versions of these algorithms. The experiments show that, contrary to these claims, radix-hash join is still clearly superior, and sort-merge approaches to performance of radix only when very large amounts of data are involved. The paper also provides the fastest implementations of these algorithms, and covers many aspects of modern hardware architectures relevant not only for joins but for any parallel data processing operator.

关键词： Computer architecture Data handling Software architecture Hardware architecture join algorithm Large amounts of data Multicore architectures Numa architectures Parallel data processing Relative performance SIMD instructions

来源：评论

学校读者我要写书评

暂无评论

Efficiently discovering critical workflows in scientific explorations

引用

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE 2009年第5期25卷 577-585页

作者： Shao, Qihong Sun, Peng Chen, Yi Arizona State Univ Dept Comp Sci & Engn Tempe AZ 85281 USA

Existing workflow management systems assume that scientists have a well-specified workflow design before the execution. In reality, a lot of scientific discoveries are made as a result of a dynamic process, where scientists keep proposing new hypotheses and verifying them through multiple tries of various experiments before achieving successful experimental results. Consequently, not all the experiments in a workflow execution have necessarily contributed to the final result. In this paper, we investigate the problem of effectively reproducing the results of previous scientific workflow executions by discovering the critical experiments leading to the success and the logical constraints on their execution order. Relational schema and SQL queries have been designed for effectively recording the workflow execution log, efficiently identifying the critical experiments from the log, and recommending experiment reproduction strategies to users. Furthermore, we propose optimization techniques for evaluating such SQL queries according to the unique characteristics of the log data. Experimental evaluations demonstrate the performance speedup of our approach. (C) 2008 Elsevier B.V. All rights reserved.

关键词： Scientific workflow Log join algorithm Database system

来源：评论

学校读者我要写书评

暂无评论

Effectively decreasing the maintenance overhead of highly dynamic Chord system

Effectively decreasing the maintenance overhead of highly dy...

引用

10th International Conference on Advanced Communication Technology

作者： Ren, Xiao-Jin Gu, Zhi-Min Ding, Xiao-Guang Duan, Zhao-Lei Beijing Inst Technol Sch Comp Sci & Technol Beijing 100081 Peoples R China

ISBN: (纸本)9788955191356

P2P systems are highly dynamic in nature. Nodes may join in or leave the P2P system at any moment. Frequently joining or leaving must increase the maintenance overhead greatly in DHT-based P2P system. The main reason of causing the cost is the lookup cost that nodes build their fingers. In this paper we introduce an iterative join algorithm for Chord that is suitable for highly dynamic environments. Iterative join algorithm builds the finger of node by iterative lookup and by the help of fingers information of nodes in the lookup path. Theory analysis and simulation show that Iterative join algorithm decreases efficiently the maintenance overhead and improve the lookup performance.

关键词： P2P iterative join algorithm

来源：评论

学校读者我要写书评

暂无评论

Performance analysis of three text-join algorithms

引用

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 1998年第3期10卷 477-492页

作者： Meng, WY Yu, C Wang, W Rishe, N SUNY Binghamton Dept Comp Sci Binghamton NY 13902 USA Univ Illinois Dept Elect Engn & Comp Sci Chicago IL 60607 USA Univ Calif Los Angeles Dept Comp Sci Los Angeles CA 90024 USA Florida Int Univ Sch Comp Sci Miami FL 33199 USA

When a multidatabase system contains textual database systems (i.e., information retrieval systems), queries against the global schema of the multidatabase system may contain a new type of joins-joins between attributes of textual type. Three algorithms for processing such a type of joins are presented and their I/O costs are analyzed in this paper. Since such a type of joins often involves document collections of very large size, it is very important to find efficient algorithms to process them. The three algorithms differ on whether the documents themselves or the inverted files on the documents are used to process the join. Our analysis and the simulation results indicate that the relative performance of these algorithms depends on the input document collections, system characteristics, and the input query. For each algorithm, the type of input document collections with which the algorithm is likely to perform well is identified. An integrated algorithm that automatically selects the best algorithm to use is also proposed.

关键词： query processing textual database information retrieval join algorithm multidatabase

来源：评论

学校读者我要写书评

暂无评论

SELECT-PARTITIONED join - AN IMPROVED PARTITION-BASED join algorithm

引用

INFORMATION SYSTEMS 1991年第2期16卷 199-209页

作者： HO, C JONG, SP MYUNGHWAN, K CORNELL UNIV SCH ELECT ENGN ITHACA NY 14853 USA

We propose a new partition-based join algorithm, called select-partitioned join, which performs better than the sort-merge and hash-partitioned join. The proposed select-partitioned join algorithm consists of three major steps. The first step is to determine a partitioning pattern by which the total join cost can be minimized and choose the bound values of the buckets by using a selection algorithm. The second is to partition both relations into ranged buckets according to the partitioning pattern and the bound values chosen in the previous step. And the last step is to apply the nested-block join on the partitioned bucket pairs. The selection algorithm is based on the cumulative distribution function and it is performed by a single scan of the smaller relation. The performance of the select-partitioned join is analyzed in terms of the number of I/Os and compared with the sort-merge and hash-partitioned join algorithms. Our join algorithm is better than a hash-partitioned join algorithm, which is, in general, known to be better for the join operation. Simulation experiments are conducted for the join algorithms.

关键词： join algorithm RELATIONAL DATABASE SELECTION algorithm CUMULATIVE DISTRIBUTION FUNCTION

来源：评论

学校读者我要写书评

暂无评论

HYBRID join - AN IMPROVED SORT-BASED join algorithm

引用

INFORMATION PROCESSING LETTERS 1989年第2期32卷 51-56页

作者： CHOI, HK KIM, M Dep. Electr. Eng. Korea Adv. Inst. Sci. and Technol. P.O. Box 150 Cheongryang Seoul 130-650 Rep. Korea

This paper proposes an algorithm that improves the sort-based join method. Unlike the sort-based join, it employs both sorting and partitioning for avoiding two complete sorts of both relations, thus it will be referred to as hybrid join. The algorithm consists of completely sorting only the smaller relation and partitioning the other one into ranged buckets according to the order statistics of the sorted relation. The final join is performed on the sorted relation and the ranged buckets.

关键词： join algorithm relational database

来源：评论

学校读者我要写书评

暂无评论

join DURING MERGE - AN IMPROVED SORT BASED algorithm

引用

INFORMATION PROCESSING LETTERS 1985年第1期21卷 11-16页

作者： NEGRI, M PELAGATTI, G Dipartimento di Elettronica Politecnico di Milano Piazza Leonardo da Vinci 32 20133 Milano Italy

A join operation consists of selecting from the set of all pairs of records of 2 files those pairs that possess some matching property. Because the join operation is so important in database applications, several algorithms have been proposed for performing it efficiently. A sort-based join is performed by completely sorting both files, then joining the 2 sorted files in one pass. The one-way join-during-merge algorithm consists of completely sorting one file and only partially sorting the other file and then performing the join. Two join-during-merge algorithms are described and analyzed and their superiority with respect to the traditional sort-based algorithm is shown. It is suggested that the join-during-merge algorithm be used in all cases in which the sort-based algorithm was considered convenient. Since it is possible to choose the optimal algorithm before performing the join, the stated gains can be achieved in reality.

关键词： Relational join join algorithm

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：